Improved spatio-temporal graph convolutional networks for video anomaly detection

Hongmin Zhang; Dingding Yan; Qianqian Tian

doi:10.12086/oee.2024.240034

Journals >Opto-Electronic Engineering >Volume 51 >Issue 5 >Page 240034 > Article

Opto-Electronic Engineering
Vol. 51, Issue 5, 240034 (2024)

Improved spatio-temporal graph convolutional networks for video anomaly detection

Hongmin Zhang^*, Dingding Yan, and Qianqian Tian

Author Affiliations

School of Electrical and Electronic Engineering, Chongqing University of Technology, Chongqing 400054, China

show less

DOI: 10.12086/oee.2024.240034 Cite this Article

Hongmin Zhang, Dingding Yan, Qianqian Tian. Improved spatio-temporal graph convolutional networks for video anomaly detection[J]. Opto-Electronic Engineering, 2024, 51(5): 240034 Copy Citation Text

show less

Fig. 1. Improved spatio-temporal graph convolutional network model framework

Download full size | View in the Article

Fig. 2. Comparison between GCN module and CRF-GCN module. (a) GCN module; (b) CRF-GCN module

Download full size | View in the Article

Fig. 3. Flowchart of mean-field inference for CRF-GCN

Download full size | View in the Article

Fig. 4. Test results of UCSD Ped2 dataset. (a) Test003; (b) Test012

Download full size | View in the Article

Fig. 5. Test results of ShanghaiTech dataset. (a) 04_0004; (b) 12_0173

Download full size | View in the Article

Fig. 6. Test results of IITB-Corridor dataset. (a) Test000228; (b) Train000139 (Normal)

Download full size | View in the Article

Fig. 7. Noised experiments. (a) AUC loss for training with noise-added data; (b) ACC loss for training with noise-added

Download full size | View in the Article

数据集	帧数	年份	标注	分辨率	异常类型
UCSD Ped2	4560	2010	Frame-level	360×240	骑自行车、小型车辆
ShanghaiTech	317398	2016	Frame-level	480×856	骑自行车、逃票、打架
IITB-Corridor	483566	2020	Frame-level	1920×1080	抗议、打斗、追逐等

Table 1. UCSD Ped2, ShanghaiTech and IITB-Corridor datasets

View in the Article

监督方式	对比方法	特征提取方式	AUC/%	准确率/%
无监督方式	Hasan的方法^[28]	-	90.0	89.5
	Gong的方法^[29]	-	94.1	-
	Yu的方法^[30]	-	97.3	95.6
	Taghinezhad的方法^[31]	Encoder	97.6	-
弱监督方式	GCN-Anomaly^[27]	TSN	93.2	90.3
	Sultani的方法^[7]	I3D	92.3	-
	RTFM^[32]	TSN	96.5	-
	Chen的方法^[33]	C3D	97.4	96.1
	Wang的方法^[34]	Encoder	97.7	93.4
	本文方法	C3D	97.7	96.5

Table 2. Comparison results of different methods on UCSD Ped2 dataset

View in the Article

监督方式	对比方法	特征提取方式	AUC/%	准确率/%
无监督方式	Hasan的方法^[28]	-	60.8	60.1
	Gong的方法^[29]	-	71.2	-
	Yu的方法^[30]	-	74.4	72.6
	Tur的方法^[35]	3D-ResNet18	76.1	-
弱监督方式	GCN-Anomaly^[27]	TSN	84.4	82.6
	Sultani的方法^[7]	I3D	86.3	-
	Zhou的方法^[12]	I3D	89.8	-
	Acsintoae的方法^[36]	-	83.7	86.1
	Wang的方法^[34]	Encoder	71.3	82.6
	本文方法	C3D	90.4	88.6

Table 3. Comparison results of different methods on ShanghaiTech dataset

View in the Article

监督方式	对比方法	特征提取方式	AUC/%
无监督方式	Zeng的方法^[37]	-	73.9
弱监督方式	Li的方法^[38]	C3D	72.2
	Cao的方法^[39	CVAE	73.6
	Royston的方法^[26]	I3D	67.1
	Majhi的方法^[40]	I3D	84.1
	本文方法	C3D	86.0

Table 4. Comparison results of different methods on IITB-Corridor dataset

View in the Article

分类	对比方法	MACs/G	Params/M
基于其他框架的方法	Sultani的方法^[7]	154.22	63.33
基于其他框架的方法	Feng的方法^[19]	156.86	34.75
基于图卷积的方法	GCN-Anomaly^[27]	154.22	63.38
	Chen的方法^[33]	154.23	63.90
	本文方法	109.14	19.90

Table 5. Comparison results of different methods on complexity

View in the Article

时间依赖图	空间相似图	图融合方式	CRF	AUC/%	准确率/%
√		-		96.6	96.2
	√	-		97.1	96.1
√	√	平均融合^[29]		89.2	86.9
√	√	自适应时空融合		96.1	94.2
√	√	自适应时空融合	√	97.7	96.5

Table 6. Results of ablation experiments

Hongmin Zhang, Dingding Yan, Qianqian Tian. Improved spatio-temporal graph convolutional networks for video anomaly detection[J]. Opto-Electronic Engineering, 2024, 51(5): 240034

Download Citation

Tools

Save the article for my favorites

Paper Information

微信扫一扫：分享

微信扫一扫：分享