000 02378nam a22002417a 4500
003 ft8900
005 20251218092135.0
008 251218b ||||| |||| 00| 0 eng d
041 _aengtag
050 _aQA76.9 A43 A43 2025
082 _a.
100 1 _aAlcaide, Robin Bryan R.; Inciong, Richard Aaron R.; Masigla, Jemuel Frian D
245 _aModified iterative dichotomiser 3 (ID3) algorithm applied in diabetes risk detection
264 1 _a.
_b.
_cc2025
300 _bUndergraduate Thesis: (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2025
336 _2text
_atext
_btext
337 _2unmediated
_aunmediated
_bunmediated
338 _2volume
_avolume
_bvolume
505 _aABSTRACT: This study presents a modified Iterative Dichotomiser 3 (ID3) decision tree algorithm, designed to address multi-value bias, equally important attribute problem, and overfitting where these challenges impact the tree’s classification accuracy. This aim to enhance the algorithm by improving its attribute selection, handling mechanism for tied attributes, and applying a regularization technique for better generalization. The modified ID3 utilized mutual information-based information gain for attribute selection, incorporated purity calculation for tie-breaking situations, and introduced the concept of dropout regularization to mitigate overfitting. Testing was conducted on a diabetes dataset containing 520 instances with 16 features of categorical values. The model was evaluated using standard performance measures (accuracy, precision, recall, F1 score) and compared to the traditional ID3 and other modified versions. The modified algorithm achieved an average accuracy of 97% which surpassed the traditional ID3 algorithm and other modified ID3 algorithms. These findings reveal an average accuracy improvement of approximately 1% to 3% across training holdouts of 50% to 90% and various dropout rates of 0.1 to 0.4. Additionally, the modified ID3 algorithm produced less average number of nodes compared to the traditional ID3 algorithm, ranging from a difference of 6 to 22 nodes. Overall, the modifications that were implemented further improved the traditional ID3 algorithm’s classification performance, producing a more accurate and reliable decision tree.
526 _aF
655 _aacademic writing
942 _2lcc
_cMS
999 _c37371
_d37371