A Novel Feature Extraction Approach Using Deformable Adaptive Instance-Based U-Net Architecture for Segmentation and Classification of Oral Mucosal Lesion
Received: 3 April 2025 | Revised: 16 May 2025 and 7 June 2025 | Accepted: 9 June 2025 | Online: 2 August 2025
Corresponding author: S. M. Sagari
Abstract
Oral cancer is one of the six cancer types with high morbidity and mortality rates, especially among socioeconomically deprived groups, owing to their limited knowledge of oral hygiene. This study aimed to detect oral lesions in different areas of the oral cavity based on the visual features of suspicious regions. Localizing, detecting, and classifying regions of interest in digital images captured by cameras of differing resolution is a formidable challenge because of variations in illumination, image size, and noise. The proposed method employs image pre- and post-processing approaches to locate these regions effectively. A dataset of 2050 oral cavity images was used, comprising 1000 malignant, 700 benign, and 350 premalignant cases. The images are first preprocessed with Canny and local binary pattern feature extractors, and the region of interest is then segmented by a U-Net architecture that uses deformable convolution and instance normalization. The segmented regions are classified by combining the Bresenham circle and flood fill algorithms. In the experimental analysis, the proposed approach achieved precision, recall, and F1 scores of 93.85%, 97.37%, and 95.58% for noisy malignant images and 96.20%, 96.82%, and 96.51% for denoised malignant images. Similarly, the precision, recall, and F1 scores were 98.67%, 94.94%, and 96.77% for noisy benign lesion images and 96.95%, 96.36%, and 96.66% for denoised benign lesion images.
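The abstract names the Bresenham circle and flood fill algorithms but does not state how they are combined for classification. The following is only a minimal sketch of the two building blocks, assuming a small binary grid in place of a segmented lesion mask: a circle boundary is rasterized with Bresenham's integer decision variable, and its interior is then filled with a 4-connected flood fill. The function names (`bresenham_circle`, `flood_fill`, `filled_circle_mask`) and the label values 0, 1, and 2 are hypothetical and are not taken from the paper.

```python
from collections import deque


def bresenham_circle(cx, cy, r):
    """Return the set of integer (x, y) pixels on a circle of radius r >= 1."""
    points = set()
    x, y = 0, r
    d = 3 - 2 * r  # Bresenham decision variable
    while x <= y:
        # Mirror the first-octant point into all eight octants.
        for dx, dy in ((x, y), (y, x), (-x, y), (-y, x),
                       (x, -y), (y, -x), (-x, -y), (-y, -x)):
            points.add((cx + dx, cy + dy))
        if d < 0:
            d += 4 * x + 6
        else:
            d += 4 * (x - y) + 10
            y -= 1
        x += 1
    return points


def flood_fill(grid, start, new_value):
    """4-connected breadth-first fill; returns the number of pixels changed."""
    height, width = len(grid), len(grid[0])
    sx, sy = start
    old_value = grid[sy][sx]
    if old_value == new_value:
        return 0
    grid[sy][sx] = new_value
    queue = deque([(sx, sy)])
    changed = 1
    while queue:
        x, y = queue.popleft()
        for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            if 0 <= nx < width and 0 <= ny < height and grid[ny][nx] == old_value:
                grid[ny][nx] = new_value
                changed += 1
                queue.append((nx, ny))
    return changed


def filled_circle_mask(size, center, radius):
    """Hypothetical helper: draw a circle (label 1) and fill its interior (label 2)."""
    grid = [[0] * size for _ in range(size)]
    for x, y in bresenham_circle(center[0], center[1], radius):
        if 0 <= x < size and 0 <= y < size:  # skip pixels outside the grid
            grid[y][x] = 1
    area = flood_fill(grid, center, 2)
    return grid, area


if __name__ == "__main__":
    mask, area = filled_circle_mask(21, (10, 10), 6)
    print("interior pixels:", area)
```

The boundary produced by Bresenham's algorithm is 8-connected, so a 4-connected fill started at the center cannot leak through it. This is why the sketch pairs the two in that way. In practice the filled pixel count, or its ratio to the area of a segmented region, could serve as a shape feature, but that use is an assumption here rather than something the abstract specifies.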
Keywords:
oral cancer, mucosal lesion, U-Net
License
Copyright (c) 2025 S. M. Sagari, Vindhya P. Malagi, B. Chandrahas

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
