Diabetes Prediction System Using Machine Learning Techniques
Main Article Content
Abstract
Introduction: Diabetes mellitus is a chronic metabolic disorder caused by high blood glucose levels and insufficient insulin production. If left untreated, it may lead to severe complications such as cardiovascular disease, kidney failure, nerve damage, and vision loss. Early detection of diabetes plays a crucial role in preventing these complications and improving patient outcomes.
Objectives: The objective of this study is to develop a predictive system using machine learning techniques to identify diabetic patients at an early stage with higher accuracy.
Methods: The study uses the Pima Indians Diabetes Dataset obtained from the UCI Machine Learning Repository. Several supervised learning algorithms including Logistic Regression, Support Vector Machine, Decision Tree, K-Nearest Neighbor, Random Forest, and Gradient Boosting were applied. Data preprocessing techniques such as missing value removal, normalization, and dataset splitting were performed prior to model training.
Results: Experimental analysis shows that ensemble models outperform individual classifiers. Among all algorithms, the Random Forest classifier achieved the highest prediction accuracy, demonstrating its effectiveness for diabetes prediction.
Conclusions: Machine learning-based predictive models can significantly support healthcare professionals in early diabetes detection and clinical decision-making. The proposed system can be extended for real-time healthcare applications.
Article Details
References
Smith, J., & Lee, K. Machine Learning Approaches for Diabetes Prediction Using Clinical Data. Journal of Medical Systems, 2022.
Patel, R., Shah, M. Comparative Study of Classification Algorithms for Diabetes Diagnosis. International Journal of Computer Applications, 2018.
Sisodia, D., & Sisodia, D.S. Prediction of Diabetes Using Classification Algorithms. Procedia Computer Science, 2023.
Kavakiotis, I. et al. Machine Learning and Data Mining Methods in Diabetes Research. Computational and Structural Biotechnology Journal, 2021.
Perveen, S., Shahbaz, M. Performance Analysis of Data Mining Classification Techniques for Diabetes Prediction. IEEE Access, 2022.
Zou, Q., Qu, K., Luo, Y. Predicting Diabetes Mellitus Using Machine Learning Techniques. Artificial Intelligence in Medicine, 2018.
Ahmad, M., & Khan, S. Early Detection of Diabetes Using Data Mining Techniques. Journal of Healthcare Engineering, 2019.
Gupta, A., Kumar, P. Ensemble Learning Methods for Diabetes Prediction. International Journal of Advanced Computer Science, 2020.
Reddy, G., & Reddy, B. Evaluation of Machine Learning Algorithms for Diabetes Classification. International Journal of Engineering Research, 2017.
Kaur, H., & Kumari, V. Predictive Modelling for Diabetes Using Logistic Regression and Random Forest. International Journal of Data Science, 2019.
Choi, B.G., & Lee, S. Application of Support Vector Machine in Diabetes Risk Prediction. Biomedical Engineering Letters, 2018.
Aljumah, A.A., Siddiqui, M.K. Data Mining Applications in Diabetes Healthcare. Journal of King Saud University – Computer and Information Sciences, 2013.
Wu, H., Yang, S. Machine Learning Based Decision Support System for Diabetes Diagnosis. Expert Systems with Applications, 2020.
Islam, M.M., et al. Prediction of Diabetes Using Machine Learning Algorithms: A Comparative Study. Health Informatics Journal, 2019.
Rani, A.S., & Jyothi, S. Performance Analysis of Classification Algorithms on Healthcare Datasets. IEEE Conference on Computing for Sustainable Global Development, 2016.
Chen, M., Hao, Y. Disease Prediction Using Big Data and Machine Learning Techniques. IEEE Network, 2017.
Han, J., Pei, J., & Kamber, M. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2012.
Breiman, L. Random Forests. Machine Learning Journal, 2001.
Cortes, C., & Vapnik, V. Support Vector Networks. Machine Learning Journal, 1995.
Pedregosa, F., et al. Scikit-Learn: Machine Learning in Python. Journal of Machine Learning Research, 2011.