Classification of Diabetes Using Machine Learning Technics

Abstract

Diabetes is one of the most widespread chronic diseases worldwide. More than 400 million people in the world suffer from it, and their number is increasing significantly every year. The continents of the world, especially Asia and Africa, are experiencing a high rate of infection due to a number of factors and environmental factors. Related to diet and healthy habits used in many developing countries. Diabetes is considered one of the most serious chronic diseases that affect the human body due to impaired secretion of the hormone insulin, because the pancreas is unable to secrete a sufficient amount of insulin, or when the cells of the human body do not accept insulin and don't use it. Effective, causing a sudden drop or rise in blood sugar levels. It has a huge impact on human health and type 2 diabetes is the most common form in humans as many factors contribute to its spread, such as genetic factors, poor diet and lack of regular exercise. In this paper we used the machine learning technical to classification of diabetes such as logistic regression, random forest, decision tree, vector classification, KNN and Naïve bayes and the accuracy was 77.66%, 76.33%, 98.66%, 75.33%, 80.66% and 80% The rest of the general standards used in classification are mentioned in the results of this paper.

Country : Iraq

1 Ihab Tareq Elias2 Muna M. Taher Jawhar

  1. Student, Department of Software, College of Computer Science and Mathematics, University of Mosul, Iraq
  2. Teacher, Department of Software, College of Computer Science and Mathematics, University of Mosul, Iraq

IRJIET, Volume 8, Issue 1, January 2024 pp. 113-118

doi.org/10.47001/IRJIET/2024.801015

References

  1. Williams, R., Karuranga, S., Malanda, B., Saeedi, P., Basit, A., Besançon, S., Bommer, C., Esteghamati, A., Ogurtsova, K., & Zhang, P. (2020). Global and regional estimates and projections of diabetes-related health expenditure: Results from the International Diabetes Federation Diabetes Atlas. Diabetes Research and Clinical Practice, 162, 108072.
  2. Lonappan, A., Bindu, G., Thomas, V., Jacob, J., Rajasekaran, C., & Mathew, K. T. (2007). Diagnosis of diabetes mellitus using microwaves. Journal of Electromagnetic Waves and Applications, 21(10), 1393–1401.
  3. Lebovitz, H. E. (1999). Type 2 diabetes: an overview. Clinical Chemistry, 45(8), 1339–1345.
  4. Mokdad, A. H., Ford, E. S., Bowman, B. A., Nelson, D. E., Engelgau, M. M., Vinicor, F., & Marks, J. S. (2000). Diabetes trends in the US: 1990-1998. Diabetes Care, 23(9), 1278–1283.
  5. Cardozo, G., Pintarelli, G. B., Andreis, G. R., Lopes, A. C. W., & Marques, J. L. B. (2022). Use of Machine Learning and Routine Laboratory Tests for Diabetes Mellitus Screening. BioMed Research International, 2022.
  6. Sisodia, D., & Sisodia, D. S. (2018). Prediction of diabetes using classification algorithms. Procedia Computer Science, 132, 1578–1585.
  7. Wu, H., Yang, S., Huang, Z., He, J., & Wang, X. (2018). Type 2 diabetes mellitus prediction model based on data mining. Informatics in Medicine Unlocked, 10, 100–107.
  8. Wu, Y.-T., Zhang, C.-J., Mol, B. W., Kawai, A., Li, C., Chen, L., Wang, Y., Sheng, J.-Z., Fan, J.-X., & Shi, Y. (2021). Early prediction of gestational diabetes mellitus in the Chinese population via advanced machine learning. The Journal of Clinical Endocrinology & Metabolism, 106(3), e1191–e1205.
  9. Alehegn, M., Joshi, R. R., & Mulay, P. (2019). Diabetes analysis and prediction using random forest,
  10. KNN, Naïve Bayes, and J48: An ensemble approach. International Journal of Scientific and Technology Research, 8(9), 1346–1354.
  11. Thomas, J., Joseph, A., Johnson, I., & Thomas, J. (2019). Machine learning approach for diabetes prediction. International Journal of Information, 8(2).
  12. Zou, Q., Qu, K., Luo, Y., Yin, D., Ju, Y., & Tang, H. (2018). Predicting Diabetes Mellitus with Machine Learning Techniques. Frontiers in Genetics, 9(November),1–10.https://doi.org/10.3389/fgene.2018.00515
  13. Vapnik, V. (1998). Statistical learning theory. https://api.semanticscholar.org/CorpusID:61112307
  14. James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning (Vol. 112). Springer.
  15. Kotu, V., & Deshpande, B. (2018). Data science: concepts and practice. Morgan Kaufmann.
  16. Alzubi, J., Nayyar, A., & Kumar, A. (2018). Machine learning from theory to algorithms: an overview. Journal of Physics: Conference Series, 1142, 12012.
  17. Jadhav, S. D., & Channe, H. P. (2016). Comparative study of K-NN, naive Bayes and decision tree classification techniques. International Journal of Science and Research (IJSR), 5(1), 1842–1845.
  18. Polat, K. (2019). A hybrid approach to Parkinson disease classification using speech signal: the combination of smote and random forests. 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), 1–3.