A Decision-Making Model for Selecting Criterion in a Decision Tree Algorithm

Authors

Keywords:

Machine Learning, Supervised Learning, Decision Tree, Classification, Entropy, Gini Index, Data Preprocessing

Abstract

The decision tree algorithm is widely used for classification, which is a part of supervised learning. It classifies the node based on criterions for information gain which are termed as Entropy and Gini index. A comparative study of these techniques is required to determine the most feasible criterion for the specific dataset. As of now, machine learning and healthcare go hand in hand. Diabetes is a common disease that occurs when blood glucose levels exceed the normal range. It is expected to affectmillions of people, with half of the population remaining undiagnosed. The medical field generates a large amount of data, which is required to be analyzed further using machine learning. This paper focuses on machine learning, specifically the Supervised Learning’s algorithm decision tree, which is used for classification, to predict whether a patient is diabetic or not. When using a classification technique, the criterion plays an important role. Here, two popular criterions are discussed and compared for the accuracy of both. Ultimately, the prediction is based on a comparative study of both the Gini Index and the entropy to determine the best way to gain more accuracy. The algorithm is used on dataset, which is fed with different parameter values to get the result of the prediction.

Downloads

Published

2024-09-01

How to Cite

Ashish P. Joshi, Umeshkumar Tank, Hasamukhbhai B. Patel, & Vivek Vyas. (2024). A Decision-Making Model for Selecting Criterion in a Decision Tree Algorithm. Journal of Computational Analysis and Applications (JoCAAA), 33(2), 106–111. Retrieved from http://eudoxuspress.com/index.php/pub/article/view/282

Issue

Section

Articles

Similar Articles

1 2 3 > >> 

You may also start an advanced similarity search for this article.