Review of Computer Vision Based Object Counting Methods

Ni Jiang; Haiyang Zhou; Feihong Yu

doi:10.3788/LOP202158.1400002

[1] Zhan B B, Monekosso D N, Remagnino P et al. Crowd analysis: a survey[J]. Machine Vision and Applications, 19, 345-357(2008).

[2] Saleh S A M, Suandi S A, Ibrahim H. Recent survey on crowd density estimation and counting for visual surveillance[J]. Engineering Applications of Artificial Intelligence, 41, 103-114(2015). http://smartsearch.nstl.gov.cn/paper_detail.html?id=08d17b00673da83fb820d2c040776f2c

[3] Zhang J J, Shi Z G, Li J C. Current researches and future perspectives of crowd counting and crowd density estimation technology[J]. Computer Engineering & Science, 40, 282-291(2018).

[4] Sindagi V A, Patel V M. A survey of recent advances in CNN-based single image crowd counting and density estimation[J]. Pattern Recognition Letters, 107, 3-16(2018). http://www.sciencedirect.com/science/article/pii/S0167865517302398

[5] Gao G S, Gao J Y, Liu Q J et al. CNN-based density estimation and crowd counting: a survey[EB/OL]. (2020-03-01)[2020-10-10]. https://arxiv.org/abs/2003.12783

[6] Shang C, Ai H Z, Bai B. End-to-end crowd counting via joint learning local and global count[C]. //2016 IEEE International Conference on Image Processing (ICIP), September 25-28, 2016, Phoenix, AZ, USA, 1215-1219(2016).

[7] Szegedy C, Liu W, Jia Y Q et al. Going deeper with convolutions[C]. //2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA, 1-9(2015).

[8] Walach E, Wolf L. Learning to count with CNN boosting[M]. //Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science, 9906, 660-676(2016).

[9] Marsden M, McGuinness K, Little S et al. People, penguins and petri dishes: adapting object counting models to new visual domains and object types without forgetting[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA, 8070-8079(2018).

[10] Zhang S H, Wu G H, Costeira J P et al. FCN-rLSTM: deep spatio-temporal neural networks for vehicle counting in city cameras[C]. //2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy, 3687-3696(2017).

[11] Oñoro-Rubio D, López-Sastre R J. Towards perspective-free object counting with deep learning[M]. //Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science, 9911, 615-629(2016).

[12] Zhang S H, Wu G H, Costeira J P et al. Understanding traffic density from large-scale web camera data[C]. //2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 21-26, 2017, Honolulu, HI, USA, 4264-4273(2017).

[13] Zhang C, Li H S, Wang X G et al. Cross-scene crowd counting via deep convolutional neural networks[C]. //2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA, 833-841(2015).

[14] Xu C Y. Research on automated cervical cytological smears interpretation method[D](2014).

[15] Maitra M, Gupta R K, Mukherjee M. Detection and counting of red blood cells in blood cell images using Hough transform[J]. International Journal of Computer Applications, 53, 13-17(2012). http://adsabs.harvard.edu/abs/2012IJCA...53p..13M

[16] Kothari S, Chaudry Q, Wang M D. Automated cell counting and cluster segmentation using concavity detection and ellipse fitting techniques[C]. //2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, June 28-July 1, 2009, Boston, MA, USA, 795-798(2009).

[17] Lempitsky V, Zisserman A. Learning to count objects in images[C]. //Advances in neural information processing systems. December 6-9, 2010, Vancouver, BC, 1324-1332(2010).

[18] Xie W D, Noble J A, Zisserman A. Microscopy cell counting and detection with fully convolutional regression networks[J]. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization, 6, 283-292(2018).

[19] Zhang Y Y, Zhou D S, Chen S Q et al. Single-image crowd counting via multi-column convolutional neural network[C]. //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA, 589-597(2016).

[20] Ma Z, Yu L, Chan A B. Small instance detection by integer programming on object density maps[C]. //2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 7-12, 2015, Boston, MA, USA, 3689-3697(2015).

[21] Fiaschi L, Koethe U, Nair R et al. Learning to count with regression forest and structured labels[C]. //Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), November 11-15, 2012, Tsukuba, Japan, 2685-2688(2012).

[22] Sommer C, Straehle C, Köthe U et al. Ilastik: interactive learning and segmentation toolkit[C]. //2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, March 30-April 2, 2011, Chicago, IL, USA, 230-233(2011).

[23] Borstel M, Kandemir M, Schmidt P et al. Gaussian process density counting from weak supervision[M]. //Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science, 9905, 365-380(2016).

[24] Deb D, Ventura J. An aggregated multicolumn dilated convolution network for perspective-free counting[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), June 18-22, 2018, Salt Lake City, UT, USA, 308-309(2018).

[25] Wang Y J, Hu S Y, Wang G D et al. Multi-scale dilated convolution of convolutional neural network for crowd counting[J]. Multimedia Tools and Applications, 79, 1057-1073(2020). http://link.springer.com/article/10.1007/s11042-019-08208-6

[26] Yu F, Koltun V. Multi-scale context aggregation by dilated convolutions[EB/OL]. (2015-11-23)[2020-10-10]. https://arxiv.org/abs/1511.07122

[27] Li Y H, Zhang X F, Chen D M. CSRNet: dilated convolutional neural networks for understanding the highly congested scenes[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA, 1091-1100(2018).

[28] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[EB/OL]. (2014-09-01)[2020-10-10]. https://arxiv.org/abs/1409.1556

[29] Marsden M, McGuinness K, Little S et al. Fully convolutional crowd counting on highly congested scenes[C]. //Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, February 27-March 1, 2017, Porto, Portugal, 27-33(2017).

[30] Amirgholipour S, He X J, Jia W J et al. A-CCNN: adaptive CCNN for density estimation and crowd counting[C]. //2018 25th IEEE International Conference on Image Processing (ICIP), October 7-10, 2018, Athens, Greece, 948-952(2018).

[31] Xie Y P, Xing F Y, Kong X F et al. Beyond classification: structured regression for robust cell detection using convolutional neural network[M]. //Navab N, Hornegger J, Wells W M, et al. Medical image computing and computer-assisted intervention-MICCAI 2015. Lecture notes in computer science, 9351, 358-365(2015).

[32] Xue Y, Ray N, Hugh J et al. Cell counting by regression using convolutional neural network[M]. //Hua G, Jégou H. Computer vision-ECCV 2016 workshops. Lecture notes in computer science, 9913, 274-290(2016).

[33] He K M, Zhang X Y, Ren S Q et al. Deep residual learning for image recognition[C]. //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA, 770-778(2016).

[34] Liu X. Object counting in surveillance video[D](2018).

[35] Liu J, Gao C Q, Meng D Y et al. DecideNet: counting varying density crowds through attention guided detection and density estimation[C]. //2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA, 5197-5206(2018).

[36] Chen X Y, Bin Y R, Sang N et al. Scale pyramid network for crowd counting[C]. //2019 IEEE Winter Conference on Applications of Computer Vision (WACV), January 7-11, 2019, Waikoloa, HI, USA, 1941-1950(2019).

[37] Rad R M, Saeedi P, Au J et al. Blastomere cell counting and centroid localization in microscopic images of human embryo[C]. //2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP), August 29-31, 2018, Vancouver, BC, Canada, 1-6(2018).

[38] Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation[M]. //Navab N, Hornegger J, Wells W M, et al. Medical image computing and computer-assisted intervention-MICCAI 2015. Lecture notes in computer science, 9351, 234-241(2015).

[39] He K M, Zhang X Y, Ren S Q et al. Identity mappings in deep residual networks[M]. //Leibe B, Matas J, Sebe N, et al. Computer vision-ECCV 2016. Lecture notes in computer science, 9908, 630-645(2016).

[40] Cohen J P, Boucher G, Glastonbury C A et al. Count-ception: counting by fully convolutional redundant counting[C]. //2017 IEEE International Conference on Computer Vision Workshops (ICCVW), October 22-29, 2017, Venice, Italy, 18-26(2017).

[41] Szegedy C, Vanhoucke V, Ioffe S et al. Rethinking the inception architecture for computer vision[C]. //2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27-30, 2016, Las Vegas, NV, USA, 2818-2826(2016).

[42] Rad R M, Saeedi P, Au J et al. Cell-net: embryonic cell counting and centroid localization via residual incremental atrous pyramid and progressive upsampling convolution[J]. IEEE Access, 7, 81945-81955(2019).

[43] Zhang Y M, Zhou C L, Chang F L et al. Attention to head locations for crowd counting[M]. //Zhao Y, Barnes N, Chen B Q, et al. ICIG 2019: image and graphics. Lecture notes in computer science, 11902, 727-737(2019).

[44] Gao J Y, Wang Q, Yuan Y. SCAR: spatial-/channel-wise attention regression networks for crowd counting[J]. Neurocomputing, 363, 1-8(2019). http://www.sciencedirect.com/science/article/pii/S0925231219311373

[45] Guo Y, Stein J, Wu G R et al. SAU-net: a universal deep network for cell counting[C]. //Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Niagara Falls, NY, USA, 299-306(2019).

[46] Hossain M, Hosseinzadeh M, Chanda O et al. Crowd counting using scale-aware attention networks[C]. //2019 IEEE Winter Conference on Applications of Computer Vision (WACV), January 7-11, 2019, Waikoloa, HI, USA, 1280-1288(2019).

[47] Zhang A R, Shen J Y, Xiao Z H et al. Relational attention network for crowd counting[C]. //2019 IEEE/CVF International Conference on Computer Vision (ICCV), October 27-November 2, 2019, Seoul, Korea (South), 6787-6796(2019).

[48] Jiang X H, Zhang L, Xu M L et al. Attention scaling for crowd counting[C]. //2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 13-19, 2020, Seattle, WA, USA, 4705-4714(2020).

[49] Wang Y J, Zhang W, Liu Y Y et al. Two-branch fusion network with attention map for crowd counting[J]. Neurocomputing, 411, 1-8(2020). http://www.sciencedirect.com/science/article/pii/S0925231220310055

[50] Liu L B, Chen J Q, Wu H F et al. Efficient crowd counting via structured knowledge transfer[EB/OL]. (2020-08-11)[2020-10-10]. https://arxiv.org/pdf/2003.10120.pdf

[51] Luo H L, Sang J, Wu W Q et al. A high-density crowd counting method based on convolutional feature fusion[J]. Applied Sciences, 8, 2367(2018). http://www.researchgate.net/publication/329158291_A_High-Density_Crowd_Counting_Method_Based_on_Convolutional_Feature_Fusion

[52] Yang B, Cao J M, Wang N et al. Counting challenging crowds robustly using a multi-column multi-task convolutional neural network[J]. Signal Processing: Image Communication, 64, 118-129(2018).

[53] Jiang X H, Zhang L, Zhang T Z et al. Density-aware multi-task learning for crowd counting[J]. IEEE Transactions on Multimedia, 23, 443-453(2021). http://ieeexplore.ieee.org/document/9037113/

[54] Idrees H, Tayyab M, Athrey K et al. Composition loss for counting, density map estimation and localization in dense crowds[M]. //Ferrari V, Hebert M, Sminchisescu C, et al. Computer vision-ECCV 2018. Lecture notes in computer science, 11206, 544-559(2018).

[55] Zhang L, Shi M J, Chen Q B. Crowd counting via scale-adaptive convolutional neural network[C]. //2018 IEEE Winter Conference on Applications of Computer Vision (WACV), March 12-15, 2018, Lake Tahoe, NV, USA, 1113-1121(2018).

[56] Sang J, Wu W Q, Luo H L et al. Improved crowd counting method based on scale-adaptive convolutional neural network[J]. IEEE Access, 7, 24411-24419(2019). http://ieeexplore.ieee.org/document/8643345/

[57] Zhang Y M, Zhou C L, Chang F L et al. Multi-resolution attention convolutional neural network for crowd counting[J]. Neurocomputing, 329, 144-152(2019). http://www.zhangqiaokeyan.com/academic-journal-foreign_other_thesis/0204112896348.html

[58] Zhu L, Zhao Z J, Lu C et al. Dual path multi-scale fusion networks with attention for crowd counting[EB/OL]. (2019-02-01)[2020-10-10]. https://arxiv.org/abs/1902.01115v1

[59] Yu S Y, Pu J. Aggregated context network for crowd counting[J]. Frontiers of Information Technology & Electronic Engineering, 21, 1626-1638(2020).

[60] Chen J W, Su W, Wang Z F. Crowd counting with crowd attention convolutional neural network[J]. Neurocomputing, 382, 210-220(2020). http://www.sciencedirect.com/science/article/pii/S0925231219316662

[61] Cao J M, Yang B, Nan W et al. Robust crowd counting based on refined density map[J]. Multimedia Tools and Applications, 79, 2837-2853(2020). http://link.springer.com/article/10.1007%2Fs11042-019-08467-3

[62] Lu E, Xie W D, Zisserman A. Class-agnostic counting[M]. //Jawahar C V, Li H D, Mori G, et al. Computer vision-ACCV 2018. Lecture notes in computer science, 11363, 669-684(2019).

[63] Akram S U, Kannala J, Eklund L et al. Cell segmentation proposal network for microscopy image analysis[M]. //Carneiro G, Mateus D, Peter L, et al. Deep learning and data labeling for medical applications. Lecture notes in computer science, 10008, 21-29(2016).

[64] Liu X P. A research on automatic cell counting method in fluorescence microimaging based on deep learning[D](2020).

[65] Chan A B, Liang Z S, Vasconcelos N. Privacy preserving crowd monitoring: counting people without people models or tracking[C]. //2008 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2008, Anchorage, AK, USA, 1-7(2008).

[66] Chen K, Loy C C, Gong S G et al. Feature mining for localised crowd counting[EB/OL]. [2020-10-10]. http://www.bmva.org/bmvc/2012/BMVC/paper021/index.html

[67] Idrees H, Saleemi I, Seibert C et al. Multi-source multi-scale counting in extremely dense crowd images[C]. //2013 IEEE Conference on Computer Vision and Pattern Recognition, June 23-28, 2013, Portland, OR, USA, 2547-2554(2013).

[68] Kainz P, Urschler M, Schulter S et al. You should use regression to detect cells[M]. //Navab N, Hornegger J, Wells W M, et al. Medical image computing and computer-assisted intervention-MICCAI 2015. Lecture notes in computer science, 9351, 276-283(2015).

[69] Lonsdale J, Thomas J, Salvatore M et al. The genotype-tissue expression (GTEx) project[J]. Nature Genetics, 45, 580-585(2013).
