[8] Y Li, Y Chen, N Wang et al. Scale-Aware Trident Networks for Object Detection, 6053-6062(2019).
[13] H Hu, J Gu, Z Zhang et al. Relation Networks for Object Detection, 3588-3597(2018).
[14] T Xu, D K Du, Z He et al. PyramidBox: A Context-assisted Single Shot Face Detector(2018).
[15] S Zhang, X Zhu, Z Lei et al. S3FD: Single Shot Scale-invariant Face Detector, 192-201(2017).
[17] J Yu, Y Jiang, Z Wang et al. UnitBox: An Advanced Object Detection Network, 516-520(2016).
[26] Z Liu, H Mao, C Y Wu et al. A ConvNet for the 2020s, 11966-11976(2022).
[29] B Hui, Z Song, H Fan et al. A Dataset for Infrared Image Dim-Small Aircraft Target Detection and Tracking under Ground / Air Background. Science Data Bank(2019).
[30] A Dosovitskiy, L Beyer, A Kolesnikov et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale(2021).