AI-based Retinal Image Classification for Early Detection of Eye Diseases
view PDF
view PDF

How to Cite

A., Ananthakumari, Muthu Menaka N., and Kaniska M. 2026. “AI-Based Retinal Image Classification for Early Detection of Eye Diseases”. Journal of Soft Computing Paradigm 8 (2): 143-57. https://doi.org/10.36548/jscp.2026.2.004.

Keywords

Eye Disease Detection
Retinal Image Classification
Vision Transformer (ViT)
Deep Learning
Medical Image Analysis
Streamlit

Abstract

Early detection of eye diseases are essential to prevent severe vision impairment and improve patient care. Eye diseases such as cataract, glaucoma, and diabetic retinopathy are among the leading causes of vision loss if not diagnosed at an early stage. The goal of this project is to use artificial intelligence to develop an automated classification system for retinal images, which uses retinal fundus images to classify the four most common retinal diseases (i.e., cataract, diabetic retinopathy, glaucoma, and normal). We designed an image classification system which uses the Vision Transformer (ViT) deep learning model to classify images into four categories: cataract, diabetic retinopathy, glaucoma, and normal. Before training occurred, all images were pre-processed by using various methods of resizing, normalizing, and applying data augmentations to enhance the model's performance and generalization. The performance metrics used to evaluate the overall performance of the trained model were accuracy, precision, recall, F1-score, confusion matrix, and ROC curve. Overall, the proposed model achieved an average overall accuracy of 93.48% on the test dataset.

References

  1. World Health Organization: WHO. (2026, February 10). Blindness and vision impairment. https://www.who.int/news-room/fact-sheets/detail/blindness-and-visual-impairment.
  2. LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. "Deep Learning." nature 2015, vol. 521, no. 7553: 436-444.
  3. Schmidhuber, Jürgen. "Deep Learning in Neural Networks: An overview." Neural networks 2015, vol. 61: 85-117.
  4. Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "ImageNet Classification with Deep Convolutional Neural Networks." Communications of the ACM 2017, vol. 60, no. 6: 84-90.
  5. Russakovsky, Olga, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang et al. "Imagenet Large Scale Visual Recognition Challenge." International journal of computer vision 2015, vol. 115, no. 3, 211-252.
  6. Litjens, Geert, Thijs Kooi, Babak Ehteshami Bejnordi, Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen Awm Van Der Laak, Bram Van Ginneken, and Clara I. Sánchez. "A Survey on Deep Learning in Medical Image Analysis." Medical image analysis 2017, vol. 42: 60-88.
  7. Gulshan, Varun, Lily Peng, Marc Coram, Martin C. Stumpe, Derek Wu, Arunachalam Narayanaswamy, Subhashini Venugopalan et al. "Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs." Jama 2016, vol. 316, no. 22, 2402-2410.
  8. Pratt, Harry, Frans Coenen, Deborah M. Broadbent, Simon P. Harding, and Yalin Zheng. "Convolutional Neural Networks for Diabetic Retinopathy." Procedia computer science 2016, vol. 90, 200-205.
  9. Isha, I., C, J. G. T., Anusha, A., S, A., & Harsha, H. "Glaucoma Detection Using Deep Learning and Image Processing." Interantional Journal of Scientific Research in Engineering and Management 2025, vol. 09, no. 12, 1–9.
  10. Porwal, Prasanna, Samiksha Pachade, Ravi Kamble, Manesh Kokare, Girish Deshmukh, Vivek Sahasrabuddhe, and Fabrice Meriaudeau. "Indian Diabetic Retinopathy Image Dataset (IDRiD): A Database for Diabetic Retinopathy Screening Research." Data 2018, vol. 3, no. 3: 25.
  11. Budai, Attila, Rüdiger Bock, Andreas Maier, Joachim Hornegger, and Georg Michelson. "Robust Vessel Segmentation in Fundus Images." International journal of biomedical imaging 2013, vol. 2013, no. 1: 154860.
  12. Jason Brownlee, “A Gentle Introduction to Object Recognition with Deep Learning,” Machine Learning Mastery 2021. [Online]. Available: https://machinelearningmastery.com/ [Accessed: Mar. 20, 2026].
  13. Albawi, Saad, Oguz Bayat, Saad Al-Azawi, and Osman N. Ucan. "Social Touch Gesture Recognition Using Convolutional Neural Network." Computational Intelligence and Neuroscience 2018, vol. 2018, no. 1: 6973103.
  14. Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. "Attention is All You Need." 31st Conference on Neural Information Processing Systems (NIPS 2017), 6000 – 6010.
  15. Pedregosa, Fabian, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel et al. "Scikit-Learn: Machine Learning in Python." the Journal of machine Learning research 2011, vol. 12: 2825-2830.
  16. Virtanen, Pauli, Ralf Gommers, Travis E. Oliphant, Matt Haberland, Tyler Reddy, David Cournapeau, Evgeni Burovski et al. "SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python." Nature methods 2020, vol. 17, no. 3: 261-272.
  17. Paszke, Adam, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen et al. "PyTorch: An Imperative Style, High-Performance Deep Learning Library." Advances in Neural Information Processing Systems: 8024-8035.
  18. Selvaraju, Ramprasaath R., Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. "Grad-Cam: Visual Explanations from Deep Networks via Gradient-Based Localization." In Proceedings of the IEEE international conference on computer vision 2017, 618-626.
  19. Abadi, Martín, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin et al. "TensorFlow: A System for Large-Scale Machine Learning." 12th USENIX symposium on operating systems design and implementation (OSDI 16) 2016, 265-283.
  20. Retinal Disease Classification (Dataset)- https://www.kaggle.com/datasets/andrewmvd/retinal-disease-classification.