Improved Regional Proposal Generation and Proposal Selection Method for Weakly Supervision Object detection

Authors

  • Yujiao Wang
  • Hua Huo

DOI:

https://doi.org/10.54097/ajst.v5i3.7806

Keywords:

Weakly supervised, Object detection, Proposals.

Abstract

 In recent years, object detection has made great progress with the continuous development of deep neural network. At present, there are many different fully supervised object detection algorithms in the field of computer vision, which are basically saturated, while object detection in a weakly supervised manner is more challenging than strongly supervised object detection. Since nowadays mature object detection algorithms rely heavily on strongly labeled datasets, but strong labeled datasets are very expensive and require huge datasets to support in order to train a better object detection model, weakly supervised object detection has received more and more attention. In this paper, a new module can be embedded in the framework of weakly supervised object detection, three modules are introduced into the weakly supervised object detection framework, which is used to generate high-quality proposals and screen these proposals, and finally selecting more accurate proposal boxes that are beneficial for subsequent training, and demonstrate their effectiveness on the PASCAL VOC2007 and PASCAL VOC2012 datasets, in which this paper achieves a significant improvement over the existing classic weakly supervised object detection algorithms with significant improvements.

Downloads

Download data is not yet available.
<br data-mce-bogus="1"> <br data-mce-bogus="1">

References

Tang P, Wang X, Bai S, et al. Pcl: Proposal cluster learning for weakly supervised object detection[J]. IEEE transactions on pattern analysis and machine intelligence, 2018, 42(1): 176-191.

Tang P, Wang X, Bai X, et al. Multiple instance detection network with online instance classifier refinement [C]// Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2843-2851.

Jifeng Dai, Yi Li, Kaiming He, and Jian Sun. R-fcn: Object detection via region-based fully convolutional networks. In NIPS, pages 379–387, 2016.

Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, and Yichen Wei. Deformable convolutional networks. In ICCV, pages 764–773, 2017.

H. Bilen and A. Vedaldi. Weakly supervised deep detection networks. In Proc. CVPR, 2016.

Mark Everingham, SM Ali Eslami, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. The pascal visual object classes challenge: A retrospective. International journal of computer vision, 111(1):98–136, 2015.

Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. The pascal visual object classes (voc) challenge. IJCV, 88(2):303–338, 2010.

Ross Girshick. Fast r-cnn. In Proceedings of the IEEE international conference on computer vision, pages 1440–1448, 2015.

Ross Girshick, Jeff Donahue, Trevor Darrell, and Jitendra Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 580–587, 2014.

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.

Zeng Z, Liu B, Fu J, et al. Wsod2: Learning bottom-up and top-down objectness distillation for weakly-supervised object detection[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2019: 8292-8300.

Diba, A., Sharma, V., Pazandeh, A., Pirsiavash, H., Van Gool, L.: Weakly supervised cascaded convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017).

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097–1105, 2012.

Yann LeCun, L´eon Bottou, Yoshua Bengio, Patrick Haffner, et al. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.

Jie, Z., Wei, Y., Jin, X., Feng, J., Liu, W.: Deep self-taught learning for weakly supervised object localization. In: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR) (2017).

Ren, Z., Yu, Z., Yang, X., Liu, M.Y., Lee, Y.J., Schwing, A.G., Kautz, J.: Instanceaware, context-focused, and memory-efficient weakly supervised object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020).

Tsung-Yi Lin, Piotr Doll´ar, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. Feature pyramid networks for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2117–2125, 2017.

Kaiming He, Georgia Gkioxari, Piotr Doll´ar, and Ross Girshick. Mask r-cnn. In Proc. ICCV, 2017.

Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. Ssd: Single shot multibox detector. In European conference on computer vision, pages 21–37. Springer, 2016.

Alina Kuznetsova, Hassan Rom, Neil Alldrin, Jasper Uijlings, Ivan Krasin, Jordi Pont-Tuset, Shahab Kamali, Stefan Popov, Matteo Malloci, Tom Duerig, and Vittorio Ferrari. The open images dataset v4: Unified image classification, object detection, and visual relationship detection at scale. arXiv:1811.00982, 2018.

Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems, pages 91–99, 2015.

Yang K, Li D, Dou Y. Towards precise end-to-end weakly supervised object detection network[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019: 8372-8381.

Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Doll´ar, and C Lawrence Zitnick. Microsoft coco: Common objects in context. In ECCV, pages 740–755, 2014.

Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, et al. Imagenet large scale visual recognition challenge. International journal of computer vision, 115(3):211–252, 2015.

Vadim Kantorov, Maxime Oquab, Minsu Cho, and Ivan Laptev. Contextlocnet: Context-aware deep network models for weakly supervised localization. In Proc. ECCV, 2016.

Wan F, Wei P, Jiao J, et al. Min-entropy latent model for weakly supervised object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 1297-1306.

Hei Law and Jia Deng. Cornernet: Detecting objects as paired keypoints. In Proc. ECCV, 2018.

Gao M, Li A, Yu R, et al. C-wsl: Count-guided weakly supervised localization[C]//Proceedings of the European conference on computer vision (ECCV). 2018: 152-168.

Zhou B, Khosla A, Lapedriza A, et al. Learning deep features for discriminative localization[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 2921-2929.

Uijlings J R R, Van De Sande K E A, Gevers T, et al. Selective search for object recognition[J]. International journal of computer vision, 2013, 104: 154-171.

Chattopadhay A, Sarkar A, Howlader P, et al. Grad-cam++: Generalized gradient-based visual explanations for deep convolutional networks[C]//2018 IEEE winter conference on applications of computer vision (WACV). IEEE, 2018: 839-847.

Arun A, Jawahar C V, Kumar M P. Dissimilarity coefficient based weakly supervised object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019: 9432-9441.

Shen Y, Ji R, Wang Y, et al. Cyclic guidance for weakly supervised joint detection and segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019: 697-707.

Li X, Kan M, Shan S, et al. Weakly supervised object detection with segmentation collaboration[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2019: 9735-9744.

Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, and Ali Farhadi. You only look once: Unified, real-time object detection. In Proc. CVPR, 2016.

Chen Z, Fu Z, Jiang R, et al. Slv: Spatial likelihood voting for weakly supervised object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 12995-13004.

Lin C, Wang S, Xu D, et al. Object instance mining for weakly supervised object detection[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2020, 34(07): 11482-11489.Zhang D, Han J, Zhang Y. Supervision by fusion: towards unsupervised learning of deep salient object detector[A]. IEEE International Conference on Computer Vision[C]. Honolulu, Hawaii, USA: IEEE, 2017. 4048 - 4056.

Wang L, Lu H, Wang Y, et al. Learning to detect salient objects with image-level supervision[A]. IEEE Conference on Computer Vision and Pattern Recognition[C]. Honolu⁃lu, HI, USA: IEEE, 2017. 136 - 145.

Woo S, Park J, Lee JY, et al. CBAM: Convolutional block attention module[C]. Proceedings of the European Conference on Computer Vision(ECCV), 2018, 3-19.

Zhang Y, Bai Y, Ding M, et al. W2f: A weakly-supervised to fully-supervised framework for object detection [C] //Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 928-936.

Zitnick C L, Dollár P. Edge boxes: Locating object proposals from edges[C]. Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer International Publishing, 2014: 391-405.

Alexe B, Deselaers T, Ferrari V. What is an object?[C]//2010 IEEE computer society conference on computer vision and pattern recognition. IEEE, 2010: 73-80.

Downloads

Published

24-04-2023

How to Cite

Wang, Y., & Huo, H. (2023). Improved Regional Proposal Generation and Proposal Selection Method for Weakly Supervision Object detection. Academic Journal of Science and Technology, 5(3), 137–148. https://doi.org/10.54097/ajst.v5i3.7806

Issue

Section

Articles