Diabetes Prediction using Machine Learning Techniques
PDF

Keywords

Logistic Regression
Random Forest
Decision Tree
K-Nearest Neighbor
Support Vector Machine
Naïve Bayes
Gradient Boosting
Machine Learning (ML)
Diabetes

How to Cite

Jithendra, V, R M Sai Mohit, M Madhusudhan, B Jagadeesh, and S Kusuma. 2023. “Diabetes Prediction Using Machine Learning Techniques”. Journal of Artificial Intelligence and Capsule Networks 5 (2): 190-206. https://doi.org/10.36548/jaicn.2023.2.008.

Abstract

Now a day due to hectic schedules and sedentary lifestyle people do not follow the proper diet. Poor diet may lead to diabetes, and which could result in various health issues such as heart attacks, strokes, renal failure, nerve damage, etc. When diabetes is accurately detected in its early stage , it can be effectively treated. By using Machine Learning methods, the problem can be easily detected and a solution could bearrived. Early diabetes detection and prediction can be greatly improved with machine learning (ML) approaches. When it is detected in an early stage, it can be resolved quickly. The objective of this research is to provide prediction using various supervised machine learning methods. Seven algorithms are compared with each other to figure out which is the best. The algorithms are Logistic Regression, Random Forest, Decision Tree, K-Nearest Neighbor, Support Vector Machine, Naïve Bayes, Gradient Boosting. The evaluation results stated that Logistic Regression is more accurate than other algorithms for the given data set with an accuracy of 82%. After selecting the ML model which is more accurate. A User Interface where users can enter the new data and get results was developed and the results to the user were forwarded through WhatsApp along with some suggestions and precautions.

PDF

References

Arwatki Chen L, Nurul Amin and Soumen Moulik’s “Diabetes Disease Prediction Using Machine Learning Algorithms” published in IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES) in 2020.

Le, T. M., Vo, T. M., Pham, T. N., & Dao, S. V.T. (2021). A Novel Wrapper Based Feature Selection for Early Diabetes Prediction Enhanced with a Metaheuristic. IEEE Access, 9,7869– 7884. doi:10.1109/access.2020.3047942.

P, Anirudh Hebbar; M V, Manoj Kumar; H A, Sanjay (2019).[IEEE 2019 1st International Conference on Advances in Information Technology (ICAIT) - Chikmagalur, India(2019.7.25- 2019.7.27)] 2019 1st International Conference on Advances in InformationTechnology (ICAIT) - DRAP: Decision Tree and Random Forest Based Classification Modelto Predict Diabetes. 271–276. doi:10.1109/icait47043.2019.8987277

Sivaranjani, S., Ananya, S., Aravinth, J., & Karthika, R. (2021). Diabetes Prediction using Machine Learning Algorithms with Feature Selection and Dimensionality Reduction. 2021 7th International Conference on Advanced Computing and Communication Systems (ICACCS). doi:10.1109/icaccs51430.2021.9441935

Barhate, Rahul; Kulkarni, Pradnya (2018).[IEEE 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA) - Pune, India (2018.8.16- 2018.8.18)] 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA) - Analysis of Classifiers for Prediction of Type II Diabetes Mellitus.1–6. doi:10.1109/ICCUBEA.2018.8697856

Chaudhuri, A. K., & Das, A. (2020). Variable Selection in Genetic Algorithm Model with Logistic Regression for Prediction of Progression to Diseases. 2020 IEEE International Conference for Innovation in Technology(INOCON) doi:10.1109/inocon50539.2020.9298372

Ahmed, Hager; Younis, Eman M.G.; Ali, Abdelmgeid A. (2020).[IEEE 2020 International Conference on Innovative Trends in Communication and Computer Engineering (ITCE) - Aswan, Egypt (2020.2.8-2020.2.9)] 2020 International Conference on Innovative Trends in Communication and Computer Engineering (ITCE) - Predicting Diabetes using Distributed Machine Learning based on ApacheSpark*,44–49. doi:10.1109/ITCE48509.2020.9047795

Debadri Dutta, Debpriyo Paul, Parthajeet Ghosh, "Analyzing Feature Importance’s for Diabetes Prediction using Machine Learning". IEEE, pp 942-928, 2018.

K.VijiyaKumar, B.Lavanya, I.Nirmala, S.Sofia Caroline, "Random Forest Algorithm for the Prediction of Diabetes ".Proceeding of International Conference on Systems Computation Automation and Networking, 2019.

Md. Faisal Faruque, Asaduzzaman, Iqbal H. Sarker, "Performance Analysis of Machine Learning Techniques to Predict Diabetes Mellitus". International Conference on Electrical, Computer and Communication Engineering (ECCE), 7-9 February, 2019.

Tejas N. Joshi, Prof. Pramila M. Chawan, "Diabetes Prediction Using Machine Learning Techniques”. Int. Journal of Engineering Research and Application, Vol. 8, Issue 1, (Part -II) January 2018, pp.-09-13

Nonso Nnamoko, Abir Hussain, David England, "Predicting Diabetes Onset: an Ensemble Supervised Learning Approach ". IEEE Congress on Evolutionary Computation (CEC), 2018.

Deeraj Shetty, Kishor Rit, Sohail Shaikh, Nikita Patil, "Diabetes Disease Prediction Using Data Mining ".International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS), 2017.

Nahla B., Andrew et al,"Intelligible support vector machines for diagnosis of diabetes mellitus. Information Technology in Biomedicine", IEEE Transactions. 14, (July. 2010), 1114-20.

A.K., Dewangan, and P., Agrawal, “Classification of Diabetes Mellitus Using Machine Learning Techniques,” International Journal of Engineering and Applied Sciences, vol. 2, 2015.