Voice Activity Detection in Cellular Networks: An Ekolama Loss–Optimised Deep Q-Network Approach for Adaptive Bandwidth Allocation and Speech Quality Enhancement
Abstract
Voice Activity Detection (VAD) plays a critical role in enhancing cellular network efficiency by suppressing silent intervals and conserving bandwidth. However, traditional and conventional deep learning-based VAD methods often fail to adapt to dynamic network conditions, resulting in suboptimal performance under variable noise, jitter, and packet loss. This study proposes the Ekolama Loss–Optimised Deep Q-Network (ELO-DQN), a reinforcement learning model designed to improve detection accuracy, bandwidth usage, and speech quality across heterogeneous networks from GSM to 5G. The ELO-DQN integrates a novel composite loss function by leveraging the potentials of mean squared error, mean absolute error, and Huber loss combined with an adaptive exponential weighting mechanism, to enhance training stability and robustness under non-stationary conditions. The proposed model was evaluated through extensive simulations across GSM, LTE, VoLTE, and VoNR network environments, integrating realistic network impairments such as latency (45–320 ms), jitter (2–95 ms), and packet loss (up to 5%). Comparative analysis demonstrated that ELO-DQN significantly outperformed conventional Deep leaning and baseline VAD approaches. It achieved a 67% reduction in mean latency (from 185 ± 62 ms to 60 ± 58 ms), a 27% decrease in jitter (from 49 ± 18 ms to 36 ± 15 ms), and a 28% improvement in packet loss reduction (from 3.9% to 2.8%), with corresponding gains in the 95th percentile values. Furthermore, ELO-DQN improved detection accuracy up to (94.8% as against 87.2% baseline), precision of (94.1% as against 85.5% bseline), recall of (95.3% as against 84.7% baseline), and F1-score of (94.7% as against 85.1% baseline) traditional VAD, and Deep learning VAD approaches. Statistical validation via paired t-tests and Wilcoxon signed-rank tests confirmed the significance of these improvements (p < 0.05). The model also enhanced bandwidth efficiency and maintained higher Mean Opinion Score (MOS) and Perceptual Evaluation of Speech Quality (PESQ) metrics, demonstrating superior resilience in low-SNR and high-traffic scenarios. These findings establish ELO-DQN as a robust, adaptive solution for voice activity detection in modern mobile networks, offering substantial benefits in spectral efficiency, user experience, and network scalability.
References
Ball, J. (2023). Voice Activity Detection (VAD) in Noisy Environments. arXiv preprint arXiv:2312.05815.
Odida, M. O. (2024). The evolution of mobile communication: A comprehensive survey on 5g technology. J Sen Net Data Comm, 4(1), 01-11.
Agarwal, B., Togou, M. A., Ruffini, M., & Muntean, G. M. (2022). Qoe-driven optimization in 5g o-ran-enabled hetnets for enhanced video service quality. IEEE Communications Magazine, 61(1), 56-62.
Patil, R. M., & Patil, C. M. (2024, July). Unveiling the state-of-the-art: A comprehensive survey on voice activity detection techniques. In 2024 Asia Pacific Conference on Innovation in Technology (APCIT) (pp. 1-5). IEEE.
Yadav, S., Legaspi, P. A. D., Oude Alink, M. S., Kokkeler, A. B., & Nauta, B. (2022). Hardware implementations for voice activity detection: Trends, challenges and outlook. IEEE transactions on circuits and systems I: regular papers, 70(3), 1083-1096.
Ahmad, I., Shahabuddin, S., Malik, H., Harjula, E., Leppänen, T., Loven, L., ... & Riekki, J. (2020). Machine learning meets communication networks: Current trends and future challenges. IEEE access, 8, 223418-223460.
Gonzalez-Lopez, J. A., Gomez-Alanis, A., Doñas, J. M. M., Pérez-Córdoba, J. L., & Gomez, A. M. (2020). Silent speech interfaces for speech restoration: A review. IEEE access, 8, 177995-178021.
Che, W. (2022). World Of 5g, The-Volume 1: Internet Of Everything. World Scientific.
Ahmad, N. (2023). Exploring QoS-QoE: Importance of Cross-Correlated Quality of Service Metrics in Evaluating the User Experience for Various Multimedia Applications.
Osee, K. M. T., Nkemeni, V., & Sone, M. E. (2025). Quality of Experience Management in Mobile Networks: Techniques, Constraints, and Emerging Trends for Value-Added Services. Computer Networks and Communications, 21-58.
Refbacks
- There are currently no refbacks.