Open Access Open Access  Restricted Access Subscription Access

AI-Assisted Diagnosis of Oral Cancer: A Deep Learning Approach for Early Detection Using Histopathological Imaging

Chandana B, Gajanan M Naik

Abstract


The prevalence of OSCC (oral squamous cell carcinoma) continues to be a leading cause of death from cancer world-wide. Even though substantial developments have occurred in medicine, OSCC can often be diagnosed too late. In addition, due to the paucity of screening facilities are available, especially in under-developed areas, Through the utilization of invasive diagnostic procedures such as biopsies continue to be the primary means of diagnosing OSCC. As such, the need for developing non-invasive diagnostic tools continues to grow. In this paper, we present an artificial intelligence-based deep learning framework that uses histopathological images to detect and classify OSCC. As such, the imperative of developing non-invasive diagnostic tools continues to grow. This model is founded upon the principle of transfer learning from the pre-trained CNNs (Convolutional Neural Networks) MobileNet, VGG16, and ResNet5, and were trained on curated datasets of histopathological images of oral tissues classified into normal, pre-cancerous and cancerous. We also employed numerous image preprocessing techniques (resizing, normalization, rotation, flipping, and brightness adjustments) to augment the features of our dataset, which was necessary to strengthen the learning The diagnostic utility of the proposed model and to increase its feature robustness. To further improve the accuracy and reduce false negative results, we implemented a soft voting ensemble method. Soft voting is a weighted version of hard voting and combines the probabilistic output values of all of the separate individual CNN models. Commonly adopted performance metrics for classification models, namely accuracy,F1-score,recall, precision and ROC-AUC were implemented to assess the model’s overall performance. Additionally, Grad- CAM (Gradient-weighted Class Activation Mapping) was leveraged to visually demonstrate the most critical lesion regions with an intention to enhance the interpretability and clinical trustworthiness of the proposed model. The model was developed using both TensorFlow and PyTorch on a Google Colab GPU environment and presents a non-invasive, cost-effective, and explainable diagnostic tool to assist clinicians in performing early oral cancer screening and risk assessments by integrating AI-driven image analysis with clinical oncology. This work provides a foundation for future multimodal and transformer-based diagnostic systems, capable of transforming the early detection and handling of oral cancer cases.


Full Text:

PDF

References


Das, M., Dash, R., Mishra, S. K., & Dalai, A. K. (2024). An ensemble deep learning model for oral squamous cell carcinoma detection using histopathological image analysis. IEEE Access, 12, 31498–31512.

Devindi, G. A. I., Dissanayake, D. M. D. R., Liyanage, S. N., Francis, F. B. A. H., Pavithya, M. B. D., Piyarathne, N. S., Hettiarachchi, P. V. K. S., Rasnayaka, R. M. S. G. K., Jayasinghe, R. D., Ragel, R. G., & Nawinne, I. (2024). Multimodal deep convolutional neural network pipeline for AI-assisted early detection of oral cancer. IEEE Access, 12, 124375–124387.

Goswami, B., Bhuyan, M. K., Alfarhood, S., & Safran, M. (2024). Classification of oral cancer into pre-cancerous stages from white light images using LightGBM algorithm. IEEE Access, 12, 31626–31637.

Ramya, S., Minu, R. I., & Magesh, K. T. (2025). Xception spiking fractional neural network for oral squamous cell carcinoma classification based on histopathological edTrans: Intelligent Computing for Medical Diagnosis Using Multiscale Cross-

Attention Vision Transformer,” IEEE Access, vol. 12, pp. 146575–146589, 2024.

P. M. Conforti, G. Lazzini, P. Russo, and

M. D’Acunto, “Raman Spectroscopy and AI Applications in Cancer Grading: An Overview,” IEEE Access, vol. 12, pp. 54816–54829, 2024.

T. Reinhard, E. Z. Toth, and C. G. Johnson, “Color Normalization Techniques for Histopathological Image Analysis: A Comparative Study,” Journal of Pathology Informatics, vol. 12, pp. 145–152, 2023.

K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770–778.

K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large- Scale Image Recognition,” arXiv preprint arXiv:1409.1556, 2014.

A. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam, “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications,” arXiv preprint arXiv:1704.04861, 2017.

S. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, “Grad- CAM: Visual Explanations from Deep Networks via Gradient-Based Localization,” in Proc. IEEE Int. Conf. Computer Vision (ICCV), Venice, Italy, 2017, pp. 618–626.


Refbacks

  • There are currently no refbacks.