Enhancement of Single Shot Multibox Detector for Aerial Infrared Target Detection

Jiangrong Xie; Fanming Li; Hong Wei; Bing Li; Baotai Shao

doi:10.3788/AOS201939.0615001

[1] Erhan D, Szegedy C, Toshev A et al. Scalable object detection using deep neural networks. [C]//2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA. New York: IEEE, 2155-2162(2014).

Erhan D, Szegedy C, Toshev A et al. Scalable object detection using deep neural networks. [C]//2014 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2014, Columbus, OH, USA. New York: IEEE, 2155-2162(2014).

[2] Borji A, Cheng M M, Jiang H Z et al. Salient object detection: a benchmark[J]. IEEE Transactions on Image Processing, 24, 5706-5722(2015). http://link.springer.com/chapter/10.1007/978-3-642-33709-3_30?CFID=555422687&CFTOKEN=46090569

Borji A, Cheng M M, Jiang H Z et al. Salient object detection: a benchmark[J]. IEEE Transactions on Image Processing, 24, 5706-5722(2015). http://link.springer.com/chapter/10.1007/978-3-642-33709-3_30?CFID=555422687&CFTOKEN=46090569

[3] Singla N. Motion detection based on frame difference method[J]. International Journal of Information & Computation Technology, 4, 1559-1565(2014).

Singla N. Motion detection based on frame difference method[J]. International Journal of Information & Computation Technology, 4, 1559-1565(2014).

[4] Horn B K P, Schunck B G. Determining optical flow[J]. Artificial Intelligence, 17, 185-203(1981).

Horn B K P, Schunck B G. Determining optical flow[J]. Artificial Intelligence, 17, 185-203(1981).

[5] Barinova O, Lempitsky V, Kholi P. On detection of multiple object instances using hough transforms[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34, 1773-1784(2012). http://europepmc.org/abstract/MED/22450818

Barinova O, Lempitsky V, Kholi P. On detection of multiple object instances using hough transforms[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34, 1773-1784(2012). http://europepmc.org/abstract/MED/22450818

[6] Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 60, 91-110(2004). http://doi.ieeecomputersociety.org/resolve?ref_id=doi:10.1023/B:VISI.0000029664.99615.94&rfr_id=trans/tp/2008/10/ttp2008101683.htm

Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 60, 91-110(2004). http://doi.ieeecomputersociety.org/resolve?ref_id=doi:10.1023/B:VISI.0000029664.99615.94&rfr_id=trans/tp/2008/10/ttp2008101683.htm

[7] Dalal N, Triggs B. Histograms of oriented gradients for human detection. [C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), June 20-25, 2005, San Diego, CA, USA. New York: IEEE, 886-893(2005).

Dalal N, Triggs B. Histograms of oriented gradients for human detection. [C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), June 20-25, 2005, San Diego, CA, USA. New York: IEEE, 886-893(2005).

[8] Burges C J C. A tutorial on support vector machines for pattern recognition[J]. Data Mining and Knowledge Discovery, 2, 121-167(1998). http://www.emeraldinsight.com/servlet/linkout?suffix=b1&dbid=16&doi=10.1108%2Filt-02-2013-0020&key=10.1023%2FA%3A1009715923555

Burges C J C. A tutorial on support vector machines for pattern recognition[J]. Data Mining and Knowledge Discovery, 2, 121-167(1998). http://www.emeraldinsight.com/servlet/linkout?suffix=b1&dbid=16&doi=10.1108%2Filt-02-2013-0020&key=10.1023%2FA%3A1009715923555

[9] Hastie T, Rosset S, Zhu J et al. Multi-class AdaBoost[J]. Statistics and Its Interface, 2, 349-360(2009).

Hastie T, Rosset S, Zhu J et al. Multi-class AdaBoost[J]. Statistics and Its Interface, 2, 349-360(2009).

[10] Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017). http://www.ncbi.nlm.nih.gov/pubmed/27295650

Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1137-1149(2017). http://www.ncbi.nlm.nih.gov/pubmed/27295650

[11] Redmon J. -04-08)[2018-12-01][EB/OL]. Farhadi A. YOLOv3: an incremental improvement., org/abs/1804, 02767(2018). https://arxiv.

Redmon J. -04-08)[2018-12-01][EB/OL]. Farhadi A. YOLOv3: an incremental improvement., org/abs/1804, 02767(2018). https://arxiv.

[12] Liu W, Anguelov D, Erhan D et al. SSD: single shot multibox dtector[M]. //Leibe B, Matas J, Sebe N, et al. Computer Vision: ECCV 2016. Cham: Springer, 9905, 21-37(2016).

Liu W, Anguelov D, Erhan D et al. SSD: single shot multibox dtector[M]. //Leibe B, Matas J, Sebe N, et al. Computer Vision: ECCV 2016. Cham: Springer, 9905, 21-37(2016).

[13] Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 640-651(2017). http://www.tandfonline.com/servlet/linkout?suffix=CIT0044&dbid=16&doi=10.1080%2F15481603.2018.1426091&key=10.1109%2FCVPR.2015.7298965

Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 640-651(2017). http://www.tandfonline.com/servlet/linkout?suffix=CIT0044&dbid=16&doi=10.1080%2F15481603.2018.1426091&key=10.1109%2FCVPR.2015.7298965

[14] He K M, Gkioxari G, Dollár P et al. Mask R-CNN[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2844175(2018).

He K M, Gkioxari G, Dollár P et al. Mask R-CNN[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2844175(2018).

[15] Li Z X. -05-17)[2018-12-01][EB/OL]. Zhou F Q. FSSD: feature fusion single shot multibox detector., org/abs/1712, 00960(2018). https://arxiv.

Li Z X. -05-17)[2018-12-01][EB/OL]. Zhou F Q. FSSD: feature fusion single shot multibox detector., org/abs/1712, 00960(2018). https://arxiv.

[16] Cao G M, Xie X M, Yang W Z et al. Feature-fused SSD: fast detection for small objects[J]. Proceedings of SPIE, 10615, 106151E(2018). http://arxiv.org/abs/1709.05054

Cao G M, Xie X M, Yang W Z et al. Feature-fused SSD: fast detection for small objects[J]. Proceedings of SPIE, 10615, 106151E(2018). http://arxiv.org/abs/1709.05054

[17] Fu C Y, Liu W, Ranga A et al. -01-23)[2018-12-01], org/abs/1701, 06659(2017). https://arxiv.

Fu C Y, Liu W, Ranga A et al. -01-23)[2018-12-01], org/abs/1701, 06659(2017). https://arxiv.

[18] Jeong J, Park H. -05-26)[2018-12-01][EB/OL]. Kwak N. Enhancement of SSD by concatenating feature maps for object detection., org/abs/1705, 09587(2017). https://arxiv.

Jeong J, Park H. -05-26)[2018-12-01][EB/OL]. Kwak N. Enhancement of SSD by concatenating feature maps for object detection., org/abs/1705, 09587(2017). https://arxiv.

[19] Tang C, Ling Y S, Zheng K D et al. Object detection method of multi-view SSD based on deep learning[J]. Infrared and Laser Engineering, 47, 126003(2018).

Tang C, Ling Y S, Zheng K D et al. Object detection method of multi-view SSD based on deep learning[J]. Infrared and Laser Engineering, 47, 126003(2018).

[20] Simonyan K. -04-10)[2018-12-01][EB/OL]. Zisserman A. Very deep convolutional networks for large-scale image recognition., org/abs/1409, 1556(2015). https://arxiv.

Simonyan K. -04-10)[2018-12-01][EB/OL]. Zisserman A. Very deep convolutional networks for large-scale image recognition., org/abs/1409, 1556(2015). https://arxiv.

[21] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition. [C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 770-778(2016).

He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition. [C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 770-778(2016).

[22] Lin T Y, Goyal P, Girshick R et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2999-3007(2017). http://ieeexplore.ieee.org/document/8417976/

Lin T Y, Goyal P, Girshick R et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2999-3007(2017). http://ieeexplore.ieee.org/document/8417976/

[23] Xin P, Xu Y L, Tang H et al. Fast airplane detection based on multi-layer feature fusion of fully convolutional networks[J]. Acta Optica Sinica, 38, 0315003(2018).

Xin P, Xu Y L, Tang H et al. Fast airplane detection based on multi-layer feature fusion of fully convolutional networks[J]. Acta Optica Sinica, 38, 0315003(2018).

[24] Zhang Z S, Qiao S Y, Xie C H et al. Single-shot object detection with enriched semantics. [C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE, 5813-5821(2018).

Zhang Z S, Qiao S Y, Xie C H et al. Single-shot object detection with enriched semantics. [C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE, 5813-5821(2018).

[25] Feng X Y, Mei W, Hu D S. Aerialtarget detection based on improved faster R-CNN[J]. Acta Optica Sinica, 38, 0615004(2018).

Feng X Y, Mei W, Hu D S. Aerialtarget detection based on improved faster R-CNN[J]. Acta Optica Sinica, 38, 0615004(2018).

[26] Wang W X, Fu Y T, Dong F et al. Infrared ship target detection method based on deep convolution neural network[J]. Acta Optica Sinica, 38, 0712006(2018).

Wang W X, Fu Y T, Dong F et al. Infrared ship target detection method based on deep convolution neural network[J]. Acta Optica Sinica, 38, 0712006(2018).