• Acta Optica Sinica
  • Vol. 39, Issue 6, 0615001 (2019)
Jiangrong Xie1、2、3, Fanming Li1、3、*, Hong Wei1, Bing Li1, and Baotai Shao1、2、3
Author Affiliations
  • 1 Shanghai Institute of Technical Physics, Chinese Academy of Sciences, Shanghai 200083, China
  • 2 University of Chinese Academy of Sciences, Beijing 100049, China
  • 3 Key Laboratory of Infrared System Detection and Imaging Technology, Chinese Academy of Sciences, Shanghai 200083, China
  • show less
    DOI: 10.3788/AOS201939.0615001 Cite this Article Set citation alerts
    Jiangrong Xie, Fanming Li, Hong Wei, Bing Li, Baotai Shao. Enhancement of Single Shot Multibox Detector for Aerial Infrared Target Detection[J]. Acta Optica Sinica, 2019, 39(6): 0615001 Copy Citation Text show less
    References

    [1] Erhan D, Szegedy C, Toshev A et al. Scalable object detection using deep neural networks. [C]//2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA. New York: IEEE, 2155-2162(2014).

         Erhan D, Szegedy C, Toshev A et al. Scalable object detection using deep neural networks. [C]//2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA. New York: IEEE, 2155-2162(2014).

    [2] Borji A, Cheng M M, Jiang H Z et al. Salient object detection: a benchmark[J]. IEEE Transactions on Image Processing, 24, 5706-5722(2015). http://link.springer.com/chapter/10.1007/978-3-642-33709-3_30?CFID=555422687&CFTOKEN=46090569

         Borji A, Cheng M M, Jiang H Z et al. Salient object detection: a benchmark[J]. IEEE Transactions on Image Processing, 24, 5706-5722(2015). http://link.springer.com/chapter/10.1007/978-3-642-33709-3_30?CFID=555422687&CFTOKEN=46090569

    [3] Singla N. Motion detection based on frame difference method[J]. International Journal of Information & Computation Technology, 4, 1559-1565(2014).

         Singla N. Motion detection based on frame difference method[J]. International Journal of Information & Computation Technology, 4, 1559-1565(2014).

    [4] Horn B K P, Schunck B G. Determining optical flow[J]. Artificial Intelligence, 17, 185-203(1981).

         Horn B K P, Schunck B G. Determining optical flow[J]. Artificial Intelligence, 17, 185-203(1981).

    [5] Barinova O, Lempitsky V, Kholi P. On detection of multiple object instances using hough transforms[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34, 1773-1784(2012). http://europepmc.org/abstract/MED/22450818

         Barinova O, Lempitsky V, Kholi P. On detection of multiple object instances using hough transforms[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34, 1773-1784(2012). http://europepmc.org/abstract/MED/22450818

    [6] Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 60, 91-110(2004). http://doi.ieeecomputersociety.org/resolve?ref_id=doi:10.1023/B:VISI.0000029664.99615.94&rfr_id=trans/tp/2008/10/ttp2008101683.htm

         Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 60, 91-110(2004). http://doi.ieeecomputersociety.org/resolve?ref_id=doi:10.1023/B:VISI.0000029664.99615.94&rfr_id=trans/tp/2008/10/ttp2008101683.htm

    [7] Dalal N, Triggs B. Histograms of oriented gradients for human detection. [C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), June 20-25, 2005, San Diego, CA, USA. New York: IEEE, 886-893(2005).

         Dalal N, Triggs B. Histograms of oriented gradients for human detection. [C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), June 20-25, 2005, San Diego, CA, USA. New York: IEEE, 886-893(2005).

    [8] Burges C J C. A tutorial on support vector machines for pattern recognition[J]. Data Mining and Knowledge Discovery, 2, 121-167(1998). http://www.emeraldinsight.com/servlet/linkout?suffix=b1&dbid=16&doi=10.1108%2Filt-02-2013-0020&key=10.1023%2FA%3A1009715923555

         Burges C J C. A tutorial on support vector machines for pattern recognition[J]. Data Mining and Knowledge Discovery, 2, 121-167(1998). http://www.emeraldinsight.com/servlet/linkout?suffix=b1&dbid=16&doi=10.1108%2Filt-02-2013-0020&key=10.1023%2FA%3A1009715923555

    [9] Hastie T, Rosset S, Zhu J et al. Multi-class AdaBoost[J]. Statistics and Its Interface, 2, 349-360(2009).

         Hastie T, Rosset S, Zhu J et al. Multi-class AdaBoost[J]. Statistics and Its Interface, 2, 349-360(2009).

    [10] Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017). http://www.ncbi.nlm.nih.gov/pubmed/27295650

         Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017). http://www.ncbi.nlm.nih.gov/pubmed/27295650

    [11] Redmon J. -04-08)[2018-12-01][EB/OL]. Farhadi A. YOLOv3: an incremental improvement., org/abs/1804, 02767(2018). https://arxiv.

         Redmon J. -04-08)[2018-12-01][EB/OL]. Farhadi A. YOLOv3: an incremental improvement., org/abs/1804, 02767(2018). https://arxiv.

    [12] Liu W, Anguelov D, Erhan D et al. SSD: single shot multibox dtector[M]. //Leibe B, Matas J, Sebe N, et al. Computer Vision: ECCV 2016. Cham: Springer, 9905, 21-37(2016).

         Liu W, Anguelov D, Erhan D et al. SSD: single shot multibox dtector[M]. //Leibe B, Matas J, Sebe N, et al. Computer Vision: ECCV 2016. Cham: Springer, 9905, 21-37(2016).

    [13] Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 640-651(2017). http://www.tandfonline.com/servlet/linkout?suffix=CIT0044&dbid=16&doi=10.1080%2F15481603.2018.1426091&key=10.1109%2FCVPR.2015.7298965

         Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 640-651(2017). http://www.tandfonline.com/servlet/linkout?suffix=CIT0044&dbid=16&doi=10.1080%2F15481603.2018.1426091&key=10.1109%2FCVPR.2015.7298965

    [14] He K M, Gkioxari G, Dollár P et al. Mask R-CNN[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2844175(2018).

         He K M, Gkioxari G, Dollár P et al. Mask R-CNN[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2844175(2018).

    [15] Li Z X. -05-17)[2018-12-01][EB/OL]. Zhou F Q. FSSD: feature fusion single shot multibox detector., org/abs/1712, 00960(2018). https://arxiv.

         Li Z X. -05-17)[2018-12-01][EB/OL]. Zhou F Q. FSSD: feature fusion single shot multibox detector., org/abs/1712, 00960(2018). https://arxiv.

    [16] Cao G M, Xie X M, Yang W Z et al. Feature-fused SSD: fast detection for small objects[J]. Proceedings of SPIE, 10615, 106151E(2018). http://arxiv.org/abs/1709.05054

         Cao G M, Xie X M, Yang W Z et al. Feature-fused SSD: fast detection for small objects[J]. Proceedings of SPIE, 10615, 106151E(2018). http://arxiv.org/abs/1709.05054

    [17] Fu C Y, Liu W, Ranga A et al. -01-23)[2018-12-01], org/abs/1701, 06659(2017). https://arxiv.

         Fu C Y, Liu W, Ranga A et al. -01-23)[2018-12-01], org/abs/1701, 06659(2017). https://arxiv.

    [18] Jeong J, Park H. -05-26)[2018-12-01][EB/OL]. Kwak N. Enhancement of SSD by concatenating feature maps for object detection., org/abs/1705, 09587(2017). https://arxiv.

         Jeong J, Park H. -05-26)[2018-12-01][EB/OL]. Kwak N. Enhancement of SSD by concatenating feature maps for object detection., org/abs/1705, 09587(2017). https://arxiv.

    [19] Tang C, Ling Y S, Zheng K D et al. Object detection method of multi-view SSD based on deep learning[J]. Infrared and Laser Engineering, 47, 126003(2018).

         Tang C, Ling Y S, Zheng K D et al. Object detection method of multi-view SSD based on deep learning[J]. Infrared and Laser Engineering, 47, 126003(2018).

    [20] Simonyan K. -04-10)[2018-12-01][EB/OL]. Zisserman A. Very deep convolutional networks for large-scale image recognition., org/abs/1409, 1556(2015). https://arxiv.

         Simonyan K. -04-10)[2018-12-01][EB/OL]. Zisserman A. Very deep convolutional networks for large-scale image recognition., org/abs/1409, 1556(2015). https://arxiv.

    [21] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition. [C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 770-778(2016).

         He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition. [C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 770-778(2016).

    [22] Lin T Y, Goyal P, Girshick R et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2999-3007(2017). http://ieeexplore.ieee.org/document/8417976/

         Lin T Y, Goyal P, Girshick R et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2999-3007(2017). http://ieeexplore.ieee.org/document/8417976/

    [23] Xin P, Xu Y L, Tang H et al. Fast airplane detection based on multi-layer feature fusion of fully convolutional networks[J]. Acta Optica Sinica, 38, 0315003(2018).

         Xin P, Xu Y L, Tang H et al. Fast airplane detection based on multi-layer feature fusion of fully convolutional networks[J]. Acta Optica Sinica, 38, 0315003(2018).

    [24] Zhang Z S, Qiao S Y, Xie C H et al. Single-shot object detection with enriched semantics. [C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE, 5813-5821(2018).

         Zhang Z S, Qiao S Y, Xie C H et al. Single-shot object detection with enriched semantics. [C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE, 5813-5821(2018).

    [25] Feng X Y, Mei W, Hu D S. Aerialtarget detection based on improved faster R-CNN[J]. Acta Optica Sinica, 38, 0615004(2018).

         Feng X Y, Mei W, Hu D S. Aerialtarget detection based on improved faster R-CNN[J]. Acta Optica Sinica, 38, 0615004(2018).

    [26] Wang W X, Fu Y T, Dong F et al. Infrared ship target detection method based on deep convolution neural network[J]. Acta Optica Sinica, 38, 0712006(2018).

         Wang W X, Fu Y T, Dong F et al. Infrared ship target detection method based on deep convolution neural network[J]. Acta Optica Sinica, 38, 0712006(2018).

    Jiangrong Xie, Fanming Li, Hong Wei, Bing Li, Baotai Shao. Enhancement of Single Shot Multibox Detector for Aerial Infrared Target Detection[J]. Acta Optica Sinica, 2019, 39(6): 0615001
    Download Citation