Abstract
Oculang is a communication system that uses computer vision and machine learning to let individuals with neurodegenerative disorders communicate through eye gestures. The application combines facial landmark detection, gaze estimation, and blink detection to analyze sequences of eye movements captured from video and translate them into messages. Dlib’s 68-point facial landmark shape predictor and OpenCV-based image processing are used to extract and process features of the eye region. A decision-making algorithm maps the detected gestures to predefined keywords to generate messages. The application supports both real-time video capture and video upload through a Django-based user interface. In experiments on recorded eye-movement datasets, the system reliably recognized blinks, winks, and gaze directions.
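The gesture-to-message pipeline described above can be illustrated with a minimal sketch. It assumes that eye closure is measured with the eye aspect ratio (EAR) over the six per-eye landmarks of the 68-point model, which the abstract does not state explicitly. The threshold value, the gesture labels, the message table, and all function names are hypothetical and chosen only for illustration; landmark extraction with Dlib and OpenCV is omitted, so the eye coordinates are taken as given.

```python
from math import dist

# Hypothetical threshold: an assumption for illustration, not a value from the paper.
EAR_THRESHOLD = 0.2


def eye_aspect_ratio(eye):
    """Compute the eye aspect ratio from six (x, y) landmarks.

    Points are assumed to be ordered as in the 68-point model:
    corner, two upper-lid points, opposite corner, two lower-lid points.
    """
    p1, p2, p3, p4, p5, p6 = eye
    vertical = dist(p2, p6) + dist(p3, p5)   # lid-to-lid distances
    horizontal = dist(p1, p4)                # corner-to-corner distance
    return vertical / (2.0 * horizontal)


def classify_frame(left_eye, right_eye, threshold=EAR_THRESHOLD):
    """Label a single frame as 'blink', 'left_wink', 'right_wink', or 'open'."""
    left_closed = eye_aspect_ratio(left_eye) < threshold
    right_closed = eye_aspect_ratio(right_eye) < threshold
    if left_closed and right_closed:
        return "blink"
    if left_closed:
        return "left_wink"
    if right_closed:
        return "right_wink"
    return "open"


# Hypothetical gesture-sequence-to-keyword table (not the paper's actual mapping).
MESSAGES = {
    ("blink", "blink"): "yes",
    ("left_wink", "right_wink"): "I need help",
}


def decode(gestures):
    """Map a sequence of gesture labels to a predefined message."""
    return MESSAGES.get(tuple(gestures), "unknown")


if __name__ == "__main__":
    # Synthetic landmark coordinates standing in for detector output.
    open_eye = [(0, 0), (1, 1), (2, 1), (3, 0), (2, -1), (1, -1)]
    closed_eye = [(0, 0), (1, 0.1), (2, 0.1), (3, 0), (2, -0.1), (1, -0.1)]
    print(classify_frame(closed_eye, closed_eye))
    print(decode(["blink", "blink"]))
```

This sketch classifies individual frames only. A practical system would likely require the eye to stay below the threshold for several consecutive frames before counting a blink or wink, and would tune the threshold per user and camera.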
