Abstract
Lack of good pixel-level expert annotations has traditionally impaired the development of robust object detection models for medical diagnosis. This article proposes a weakly supervised approach that generates accurate bounding box labels with minimal user interaction through image-level classification. The weakly supervised nature of the proposed approach tackles the annotation bottleneck by converting cheaper and more available class-level labels into spatial annotations of high value. The proposed two-stage method first trains a classifier on diagnostic labels and then applies Class Activation Mapping (Grad-CAM) to generate high-quality pseudo-labels. These machine-generated annotations are then used to train a state-of-the-art YOLOv8s detector for the final diagnosis task. The system performed cataract detection from fundus images with a mean Average Precision (mAP@50) of 99% and a stricter mAP@50-95 of 96.9%. An important recall rate of 97.1% was achieved in the cataract class, making the possibility of a missed diagnosis almost negligible. These results hold competitive status when compared with fully supervised methods that require extensive manual annotation, reaffirming our method as data-efficient, highly scalable, and a robust collaborator in fast-tracking the development of medical AI tools.
References
Biswas, Ankur, and Rita Banik. "Cnn fusion: A promising technique for ophthalmic disorder diagnosis." Procedia Computer Science 233 (2024): 411-421.
Sharma, Vansh, Shubhangi Pandey, Divija Agrawal, and S. Thenmalar. "TheiaNet Pioneering Eye Disease Detection through Convolution Neural Networks." Available at SSRN 5091426 (2024).
Madduri, Vamsi Krishna, and Battula Srinivasa Rao. "Detection and diagnosis of diabetic eye diseases using two phase transfer learning approach." PeerJ Computer Science 10 (2024): e2135.
Shah, A. 2024. "Comparative Analysis of Cataract Eye Disease Detection Using YOLOv8 and YOLOv10." International Journal of Computer Trends and Technology 72 (10): 141–147. https://doi.org/10.14445/22312803/IJCTT-V72I10P121.
PL, Lahari, Ramesh Vaddi, Mahmoud O. Elish, Venkateswarlu Gonuguntla, and Siva Sankar Yellampalli. "CSDNet: a novel deep learning framework for improved cataract state detection." Diagnostics 14, no. 10 (2024): 983.
Rahman, Mushfiqur, Kazi Hasiba Ferdous Oushi, and Md Al Mamun. "EYE DISEASE CATARACT CLASSIFICATION USING DEEP LEARNING." DAFFODIL INTERNATIONAL UNIVERSITY JOURNAL OF SCIENCE AND TECHNOLOGY 19, no. 1 (2024).
Ren, Zeyu, Shuihua Wang, and Yudong Zhang. "Weakly supervised machine learning." CAAI Transactions on Intelligence Technology 8, no. 3 (2023): 549-580.
Lin, Jianghang, Yunhang Shen, Bingquan Wang, Shaohui Lin, Ke Li, and Liujuan Cao. "Weakly supervised open-vocabulary object detection." In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 4, 2024, 3404-3412.
Kadam, A., D. Mehta, D. Pande, A. Trivedi, K. Chotai, and A. Banu. 2025. "Ocular Disease Recognition System Using ResNet50 and InceptionV3 over DenseNet, Xception, VGG, and U-Net Architectures." Journal of Information Systems Engineering and Management 10 (46s).
Dash, Shreemat Kumar, Kante Satyanarayana, Santi Kumari Behera, Sudarson Jena, Ashoka Kumar Ratha, Prabira Kumar Sethy, and Aziz Nanthaamornphong. "Ocular Disease Detection Using Fundus Images: A Hybrid Approach of Grad‐CAM and Multiscale Retinex Preprocessing With VGG16 Deep Features and Fine KNN Classification." Applied Computational Intelligence and Soft Computing 2025, no. 1 (2025): 6653543.
Alhussein, Hanaa Hashim Imran, and Ali Abdulazeez Mohammedbaqer Qazzaz. "License Plate Detection and Recognition Using Faster RCNN." In International Conference on Cyber Intelligence and Information Retrieval, Singapore: Springer Nature Singapore, 2023. 173-186.
Koondhar, M. Y., Z. A. Maher, M. Memon, I. A. Memon, A. R. Rang, and M. H. Depar. 2023. "Human Eye Disease Detection and Classification of Retinal Imagery Using MobileNet CNN." Kurdish Studies 11 (3): 1003–1009.
Erdaş, Ç. B., and G. Arslan. 2024. "Efficient Detection of Multiclass Eye Diseases Using Deep Learning Models: A Comparative Study." In Proceedings of EnSci Dubai 2024 – International Conference on Engineering & Sciences, 6–16. STRA.
Shams, Sarmad, Mishkaat Jamil, Aqsa Faheem, Afnan Qureshi, Zona Khan, and Natasha Mukhtiar. "Ocular Disease Detection Using state of the art Machine Learning techniques based Clinical Decision Support System for Ophthalmologist." In Proceedings of the 4th International Conference on Key Enabling Technologies (KEYTECH 2024), vol. 35, p. 56. Springer Nature, 2024.
Li, Ning, Tao Li, Chunyu Hu, Kai Wang, and Hong Kang. "A benchmark of ocular disease intelligent recognition: One shot for multi-disease detection." In International symposium on benchmarking, measuring and optimization, Cham: Springer International Publishing, 2020, 177-193.
Ramakrishnan, Akshay Bhuvaneswari, Mukunth Madavan, R. Manikandan, and Amir H. Gandomi. "A Hybrid Deep Learning Paradigm for Robust Feature Extraction and Classification for Cataracts." Applied AI Letters 6, no. 2 (2025): e113.
Acevedo, Elena, Dinora Orantes, Marco Acevedo, and Ricardo Carreño. "Identification of Eye Diseases Through Deep Learning." Diagnostics 15, no. 7 (2025): 916.
Yu, Hongjie, and Xingbo Dong. "Ensemble-based eye disease detection system utilizing fundus and vascular structures." Scientific Reports 15, no. 1 (2025): 19298.
ul Hassan, Mahmood, Amin A. Al-Awady, Naeem Ahmed, Muhammad Saeed, Jarallah Alqahtani, Ali Mousa Mohamed Alahmari, and Muhammad Wasim Javed. "A transfer learning enabled approach for ocular disease detection and classification." Health Information Science and Systems 12, no. 1 (2024): 36.
Ismail, Walaa N., and Hessah A. Alsalamah. "A novel CatractNetDetect deep learning model for effective cataract classification through data fusion of fundus images." Discover Artificial Intelligence 4, no. 1 (2024): 54.
Yadav, H., and S. Mallick. 2024. "Early Detection of Cataract, Diabetic Retinopathy, and Glaucoma Using Deep Learning." International Journal of Creative Research Thoughts 12 (12): Article IJCRT2412184.
Fung, Daniel Kai Xiang, Di Wang, Hao Wang, Yongwei Wang, Pengcheng Wu, Yan Yee Hah, Chee Chew Yip et al. "Accurate and Explainable Cataract Detection Using Eye Images Taken by Hand-held Slit-lamp Cameras." In 2024 IEEE Conference on Artificial Intelligence (CAI), IEEE, 2024, 83-88.
