Dynamic Receptive Field-Based Object Detection in Aerial Imaging

Xueli Xie; Chuanxiang Li; Xiaogang Yang; Jianxiang Xi; Tong Chen

doi:10.3788/AOS202040.0415001

[1] Aguilar W G, Luna M A, Moya J F et al. Pedestrian detection for UAVs using cascade classifiers with meanshift. [C]∥2017 IEEE 11th International Conference on Semantic Computing (ICSC), January 30-February 1, 2017, San Diego, CA, USA. New York: IEEE, 509-514(2017).

[2] Yuan C, Liu Z X, Zhang Y M. UAV-based forest fire detection and tracking using image processing techniques. [C]∥2015 International Conference on Unmanned Aircraft Systems (ICUAS), June 9-12, 2015, Denver, CO, USA. New York: IEEE, 639-643(2015).

[3] Xu Y Z, Yu G Z, Wang Y P et al. Car detection from low-altitude UAV imagery with the faster R-CNN[J]. Journal of Advanced Transportation, 2017, 2823617(2017).

[4] Dalal N, Triggs B. Histograms of oriented gradients for human detection. [C]∥2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), June 20-25, 2005, San Diego, CA, USA. New York: IEEE, 8588935(2005).

[5] Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 60, 91-110(2004). http://doi.ieeecomputersociety.org/resolve?ref_id=doi:10.1023/B:VISI.0000029664.99615.94&rfr_id=trans/tp/2008/10/ttp2008101683.htm

[6] Viola P, Jones M J. Robust real-time face detection[J]. International Journal of Computer Vision, 57, 137-154(2004).

[7] Ren S Q, He K M, Girshick R et al. Faster R-CNN: towards real-time object detection with region proposal networks. [C]∥Advances in Neural Information Processing Systems, December 7-12, 2015, Montreal, Quebec, Canada. Canada: NIPS, 91-99(2015).

[8] Dai J, Li Y, He K M et al. R-FCN: object detection via region-based fully convolutional networks. [C]∥Advances in Neural Information Processing Systems, December 5-10, 2016, Barcelona, Spain. Canada: NIPS, 379-387(2016).

[9] Redmon J, Divvala S, Girshick R et al. You only look once: unified, real-time object detection. [C]∥2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA. New York: IEEE, 779-788(2016).

[10] Redmon J, Farhadi A. YOLO9000: better, faster, stronger. [C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA. New York: IEEE, 6517-6525(2017).

[11] Redmon J. -04-08)[2019-08-28][J/OL]. Farhadi A. Yolov3: an incremental improvement., top/abs/1804, 02767(2018). https://arxiv.xilesou.

[12] Liu W, Anguelov D, Erhan D et al. SSD: single shot MultiBox detector[M]. ∥Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science. Cham: Springer, 9905, 21-37(2016).

[13] Fu C Y, Liu W, Ranga A et al. -01-23)[2019-08-28], top/abs/1701, 06659(2017). https://arxiv.xilesou.

[14] Li Z. -05-17)[2019-08-28][J/OL]. Zhou F. FSSD: feature fusion single shot MultiBox detector., top/abs/1712, 00960(2018). https://arxiv.xilesou.

[15] Lin T Y, Goyal P, Girshick R et al. Focal loss for dense object detection. [C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 2999-3007(2017).

[16] Ou P, Zhang Z, Lu K et al. Object detection in of remote sensing images based on convolutional neural networks[J]. Laser & Optoelectronics Progress, 56, 051002(2019).

[17] Wang J Q, Li J S, Zhou X W et al. Improved SSD algorithm and its performance analysis of small target detection in remote sensing images[J]. Acta Optica Sinica, 39, 0628005(2019).

[18] Liang X, Zhang J, Zhuo L et al. Small object detection in unmanned aerial vehicle images using feature fusion and scaling-based single shot detector with spatial context analysis[J]. IEEE Transactions on Circuits and Systems for Video Technology(2019).

[19] Xie S N, Girshick R, Dollar P et al. Aggregated residual transformations for deep neural networks. [C]∥2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA. New York: IEEE, 5987-5995(2017).

[20] Hu J, Shen L, Sun G. Squeeze-and-excitation networks. [C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE, 7132-7141(2018).

[21] Liu S, Qi L, Qin H F et al. Path aggregation network for instance segmentation. [C]∥2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE, 8759-8768(2018).

[22] Li H, Xiong P, An J et al. -11-25)[2019-08-28], top/abs/1805, 10180(2018). https://arxiv.xilesou.

[23] Cao Y, Xu J, Lin S et al. -04-25)[2019-08-28][J/OL]. beyond., top/abs/1904, 11492(2019). https://arxiv.xilesou.

[24] Li Y, Chen Y, Wang N et al. -08-20)[2019-08-28], top/abs/1901, 01892(2019). https://arxiv.xilesou.

[25] Zhu P, Wen L, Bian X et al. -04-23)[2019-08-28], top/abs/1804, 07437(2018). https://arxiv.xilesou.

[26] Lin T Y, Maire M, Belongie S et al. Microsoft COCO: common objects in context[M]. ∥Fleet D, Pajdla T, Schiele B, et al. Computer vision-ECCV 2014. Lecture notes in computer science. Cham: Springer, 8693, 740-755(2014).

[27] Bodla N, Singh B, Chellappa R et al. Soft-NMS: improving object detection with one line of code. [C]∥2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE, 5562-5570(2017).