A Survey on Self-Supervised Learning-Based Video Anomaly Detection


  • Mengjie Hu
  • Qingtao Wu




Video Anomaly Detection (VAD); Self-Supervised Learning; Machine Learning Standpoint; Commonly Utilized Datasets in VAD; Promote the Future Development of VAD.


Video anomaly detection (VAD) exhibits promising applications across diverse domains, bolstering intelligence, security, and operational efficiency, thereby catalyzing industry growth. This paper begins by examining the research background and significance of VAD, providing an in-depth analysis of its relevance across various sectors. Subsequently, from a machine learning standpoint, recent advancements in self-supervised learning (SSL)-based VAD models are systematically categorized and summarized, elucidating their underlying principles and deployment scenarios. Additionally, commonly utilized datasets in VAD are introduced to facilitate readers' understanding of model assessment and comparative analysis. Lastly, discussions on future trajectories and extant challenges in VAD are undertaken to foster deeper exploration and propel the advancement of this domain.


Download data is not yet available.


Xiang T, Gong S. Video behavior profiling for anomaly detection[J]. IEEE transactions on pattern analysis and machine intelligence, 2008, 30(5): 893-908.

Sánchez F L, Hupont I, Tabik S, et al. Revisiting crowd behaviour analysis through deep learning: Taxonomy, anomaly detection, crowd emotions, datasets, opportunities and prospects[J]. Information Fusion, 2020, 64: 318-335.

Xia X, Pan X, Li N, et al. GAN-based anomaly detection: A review[J]. Neurocomputing, 2022, 493: 497-535.

Santhosh K K, Dogra D P, Roy P P. Anomaly detection in road traffic using visual surveillance: A survey[J]. ACM Computing Surveys (CSUR), 2020, 53(6): 1-26.

Wang L, Tian J, Zhou S, et al. Memory-augmented appearance-motion network for video anomaly detection[J]. Pattern Recognition, 2023, 138: 109335.

Xing P, Li Z. Visual anomaly detection via partition memory bank module and error estimation[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2023.

Hyun W, Nam W J, Lee S W. Dissimilate-and-assimilate strategy for video anomaly detection and localization[J]. Neurocomputing, 2023, 522: 203-213.

Hao Y, Li J, Wang N, et al. Spatiotemporal consistency-enhanced network for video anomaly detection[J]. Pattern Recognition, 2022, 121: 108232.

Li Z, Zhao M, Zeng X, et al. Memory-Augmented Spatial-Temporal Consistency Network for Video Anomaly Detection [C]//Chinese Conference on Pattern Recognition and Computer Vision (PRCV). Singapore: Springer Nature Singapore, 2023: 95-107.

Chen M, Wei F, Li C, et al. Frame-wise action representations for long videos via sequence contrastive learning[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 13801-13810.

Dwibedi D, Aytar Y, Tompson J, et al. Temporal cycle-consistency learning[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019: 1801-1810.

Dvornik M, Hadji I, Derpanis K G, et al. Drop-dtw: Aligning common signal between sequences while dropping outliers[J]. Advances in Neural Information Processing Systems, 2021, 34: 13782-13793.

Wang X, Zhu L, Wu Y, et al. Symbiotic attention for egocentric action recognition with object-centric alignment[J]. IEEE transactions on pattern analysis and machine intelligence, 2020.

Huang C, Wen J, Xu Y, et al. Self-supervised attentive generative adversarial networks for video anomaly detection[J]. IEEE transactions on neural networks and learning systems, 2022.

Chen D, Yue L, Chang X, et al. NM-GAN: Noise-modulated generative adversarial network for video anomaly detection[J]. Pattern Recognition, 2021, 116: 107969.

Wang T, Qiao M, Lin Z, et al. Generative neural networks for anomaly detection in crowded scenes[J]. IEEE Transactions on Information Forensics and Security, 2018, 14(5): 1390-1399.

Wang S, Miao Z. Anomaly detection in crowd scene[C]//IEEE 10th International Conference on Signal Processing Proceedings. IEEE, 2010: 1220-1223.

Ravanbakhsh M, Nabi M, Mousavi H, et al. Plug-and-play cnn for crowd motion analysis: An application in abnormal event detection[C]//2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 2018: 1689-1698.

Luo W, Liu W, Gao S. A revisit of sparse coding based anomaly detection in stacked rnn framework[C]//Proceedings of the IEEE international conference on computer vision. 2017: 341-349.

Lu C, Shi J, Jia J. Abnormal event detection at 150 fps in matlab[C]//Proceedings of the IEEE international conference on computer vision. 2013: 2720-2727.

Ramachandra B, Jones M. Street scene: A new dataset and evaluation protocol for video anomaly detection[C]// Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2020: 2569-2578.






How to Cite

A Survey on Self-Supervised Learning-Based Video Anomaly Detection. (2024). Academic Journal of Science and Technology, 11(2), 41-44. https://doi.org/10.54097/etr5a113

Similar Articles

1-10 of 789

You may also start an advanced similarity search for this article.