Open Access Open Access  Restricted Access Subscription Access

Real-Time CCTV Anomaly Detection Using a Hybrid ResNet + Bidirectional LSTM Spatio-Temporal Framework: A Deployable Web-Based Surveillance System

Anish Dubey, Md. Abdul Raheem Alam, Dr. K. Madhubabu, Dr. M. Mamatha

Abstract


The rapid proliferation of CCTV surveillance cameras across public and private spaces has rendered continuous manual monitoring inefficient, labour-intensive, and prone to operator fatigue. This work presents a real-time CCTV anomaly detection framework that automatically classifies surveillance video as Normal or Anomalous using a hybrid spatio-temporal deep learning pipeline. The proposed model couples a ResNet-based convolutional encoder for per-frame spatial feature extraction with a Bidirectional Long Short-Term Memory (Bi-LSTM) network for forward-and-backward temporal sequence modelling, operating on uniformly sampled sequences of 20 frames at a resolution of 224×224. The pipeline is trained on the UCF-Crime Anomaly Detection Dataset (Kaggle), comprising 13 anomaly categories alongside normal activity videos. The trained model is deployed as a full-stack web application: a Flask REST API backend (port 5000) serves the inference engine and exposes endpoints for both video-file upload and live webcam ingestion, while a Vue.js 3 frontend (Vite, port 5173) delivers a dual-mode user interface with drag-and-drop upload, WebRTC live capture, real-time backend health monitoring, and colour-coded result rendering. The system operates without specialized GPU hardware and avoids cloud dependence, making it suitable for low-cost local deployment. Experimental validation on the UCF-Crime test set confirms reliable binary classification with high inference responsiveness, and the modular two-tier architecture (app.py for routing, inference.py for model logic) supports straightforward future model upgrades. Compared to existing benchmark approaches such as the Sultani et al. MIL framework and frame-only CNN detectors, the proposed system simultaneously delivers spatio-temporal modelling, dual-mode input, and an accessible web interface in one deployable package.


Full Text:

PDF

References


W. Sultani, C. Chen, and M. Shah, "Real-World Anomaly Detection in Surveillance Videos," Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), pp. 6479-6488, 2018.

C. Affonso et al., "AI for Anomaly Detection Using Security Cameras," IEEE Access, vol. 9, pp. 1-12, 2021.

M. S. Sarfraz et al., "Real-Time Anomaly Detection and Classification," Journal of Ambient Intelligence and Humanized Computing, vol. 12, pp. 1-15, 2021.

S. Mane et al., "MobileNet + Bi-LSTM Framework for Anomaly Detection," Journal of Intelligent Systems, Springer/IEEE, pp. 1-10, 2024.

S. A. Qadri et al., "YOLO + Vision Transformer for CCTV Anomaly Detection," International Journal of Computer Vision, IEEE/Elsevier, pp. 1-12, 2024.

M. Mariswari and V. Narayani, "DBN + GAN Hybrid Anomaly Detection," Procedia Computer Science, vol. 232, pp. 1-10, 2024.

D. Manju et al., "MRCNN-LSTM Early Anomaly Detection," Proc. International Conference on Electronics and Communication, pp. 1-8, 2025.

S. Gandapur and E. Verdu, "Video Anomaly Detection using ConvGRU-CNN," IEEE Transactions on Multimedia, vol. 25, pp. 1-12, 2023.

K. Chidananda and R. Siva Kumar, "VidAnomalyNet: A Custom CNN Architecture for Surveillance," Pattern Recognition Letters, vol. 178, pp. 1-9, 2024.

S. Biswas et al., "Attention-LSTM Surveillance Framework," International Journal of Multimedia Information Retrieval, vol. 10, pp. 1-10, 2021.


Refbacks

  • There are currently no refbacks.