A Vision Transformer-Based Convolutional Neural Network for the Automated Diagnosis of Eye Diseases Using Self-Attention Mechanisms

Noor Ayesha

doi:10.48084/etasr.10649

Authors

Noor Ayesha, Center of Excellence in Cyber Security (CYBEX), Prince Sultan University, Riyadh, Saudi Arabia

Volume: 15 | Issue: 4 | Pages: 24493-24497 | August 2025 | https://doi.org/10.48084/etasr.10649

Received: 20 February 2025 | Revised: 12 April 2025, 27 April 2025, and 03 May 2025 | Accepted: 4 May 2025 | Online: 2 August 2025

Corresponding author: Noor Ayesha

Abstract

The eyes are among the most essential organs of the body, and daily life depends heavily on them. This study presents a Convolutional Neural Network (CNN) model based on a Vision Transformer (ViT) with a Self-Attention Mechanism (SAM) for classifying eye images into four categories: Normal, Diabetic Retinopathy, Cataracts, and Glaucoma. The dataset was first preprocessed through resizing and normalization to enhance image quality and facilitate feature extraction. On the test data, the proposed model achieved an accuracy of 94% and an average AUC of 98.82%. A GUI-based application was also developed and tested. It allows doctors to upload multiple images and view the predicted disease category for each, which supports interpretability and shows promise for clinical applications. The proposed model can assist ophthalmologists in detecting eye disorders, enabling timely treatment of patients and helping to prevent vision loss.
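The abstract names two technical steps, normalization of the input images and a self-attention mechanism over ViT-style image patches, without giving implementation details. The following is a minimal NumPy sketch of those two ideas only, not the authors' model. The image size (32x32), patch size (8), projection width (16), the random stand-in image, and the helper names `normalize`, `to_patches`, and `self_attention` are all assumptions chosen for illustration; the image is assumed to be already resized.

```python
import numpy as np


def normalize(image: np.ndarray) -> np.ndarray:
    """Scale 8-bit pixel values to the range [0, 1]."""
    return image.astype(np.float32) / 255.0


def to_patches(image: np.ndarray, patch: int) -> np.ndarray:
    """Split an (H, W, C) image into flattened, non-overlapping patches."""
    h, w, c = image.shape
    assert h % patch == 0 and w % patch == 0, "image must tile evenly"
    grid = image.reshape(h // patch, patch, w // patch, patch, c)
    grid = grid.transpose(0, 2, 1, 3, 4)  # (rows, cols, patch, patch, C)
    return grid.reshape(-1, patch * patch * c)


def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over patch tokens."""
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ v, weights


rng = np.random.default_rng(0)

# Hypothetical stand-in for an already-resized eye image (not real data).
image = rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)

tokens = to_patches(normalize(image), patch=8)  # 16 patches of 8*8*3 values

# Hypothetical random projection matrices; a trained model would learn these.
d_model = 16
w_q, w_k, w_v = (
    rng.normal(scale=0.1, size=(tokens.shape[1], d_model)) for _ in range(3)
)

attended, weights = self_attention(tokens, w_q, w_k, w_v)
print(tokens.shape, attended.shape, weights.shape)
```

Each row of `weights` describes how strongly one patch attends to every other patch, which is the property that lets an attention-based model relate distant regions of the same image.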

Keywords:

eye disease, deep learning, vision transformer, classification, health risks


