UAV Aerial Image Object Detection Based on Improved YOLOv5s

NING Tao; FU Shimo; CHANG Qing; WANG Yaoli

doi:10.3969/j.issn.1671-637x.2024.12.007

[2] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Columbus: IEEE, 2014: 580-587.

[3] GIRSHICK R. Fast R-CNN[C]//IEEE International Conference on Computer Vision (ICCV). Santiago: IEEE, 2015: 1440-1448.

[4] REN S Q, HE K M，GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.

[5] REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas: IEEE, 2016: 779-788.

[6] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multi box detector[C]//European Conference on Computer Vision (ECCV). Cham: Springer, 2016: 21-37.

[10] ZHU X K, LU S C, WANG X, et al. TPH-YOLOv5: improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios[C]//IEEE/CVF International Conference on Computer Vision Workshops (ICCVW). Montreal: IEEE, 2021: 2778-2788.

[12] WANG C Y, BOCHKOVSKIY A, LIAO H Y M. Scaled YOLOv4: scaling cross stage partial network[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Nashville: IEEE, 2021: 13024-13033.

[13] SRINIVAS A, LIN T Y, PARMAR N, et al. Bottleneck transformers for visual recognition[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Nashville: IEEE, 2021: 16519-16529.

[14] YU G H, CHANG Q Y, LV W Y, et al. PP-PicoDet: a better real-time object detector on mobile devices[R]. Los Alamos: arXiv Preprint, 2021: arXiv: 2111.00902.

[15] REDMON J, FARHADI A. YOLOv3: an incremental improvement[R]. Los Alamos: arXiv Preprint, 2018: arXiv: 1804.02767.

[16] JOCHER G, STOKEN A, BOROVEC J, et al. Ultralytics/YOLOv5: v5.0-YOLOv5-P6 1280 models, AWS, Supervise.ly and YouTube integrations[EB/OL].(2023-12-05)[2021-04-11]. https://zenodo.org/records/4679653.

[17] WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Vancouver: IEEE, 2023: 7464-7475.

[18] LI C Y, LI L, JIANG H L, et al. YOLOv6: a single-stage object detection framework for industrial applications[R]. Los Alamos: arXiv Preprint, 2022: arXiv: 2209.02976.

微信扫一扫：分享

微信扫一扫：分享