| 000 -LEADER |
| fixed length control field |
02378nam a22002417a 4500 |
| 003 - CONTROL NUMBER IDENTIFIER |
| control field |
ft8900 |
| 005 - DATE AND TIME OF LATEST TRANSACTION |
| control field |
20251218092135.0 |
| 008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
| fixed length control field |
251218b ||||| |||| 00| 0 eng d |
| 041 ## - LANGUAGE CODE |
| Language code of text/sound track or separate title |
engtag |
| 050 ## - LIBRARY OF CONGRESS CALL NUMBER |
| Classification number |
QA76.9 A43 A43 2025 |
| 082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER |
| Classification number |
. |
| 100 1# - MAIN ENTRY--PERSONAL NAME |
| Personal name |
Alcaide, Robin Bryan R.; Inciong, Richard Aaron R.; Masigla, Jemuel Frian D |
| 245 ## - TITLE STATEMENT |
| Title |
Modified iterative dichotomiser 3 (ID3) algorithm applied in diabetes risk detection |
| 264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE |
| Place of production, publication, distribution, manufacture |
. |
| Name of producer, publisher, distributor, manufacturer |
. |
| Date of production, publication, distribution, manufacture, or copyright notice |
c2025 |
| 300 ## - PHYSICAL DESCRIPTION |
| Other physical details |
Undergraduate Thesis: (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2025 |
| 336 ## - CONTENT TYPE |
| Source |
text |
| Content type term |
text |
| Content type code |
text |
| 337 ## - MEDIA TYPE |
| Source |
unmediated |
| Media type term |
unmediated |
| Media type code |
unmediated |
| 338 ## - CARRIER TYPE |
| Source |
volume |
| Carrier type term |
volume |
| Carrier type code |
volume |
| 505 ## - FORMATTED CONTENTS NOTE |
| Formatted contents note |
ABSTRACT: This study presents a modified Iterative Dichotomiser 3 (ID3) decision tree algorithm, designed to address multi-value bias, equally important attribute problem, and overfitting where these challenges impact the tree’s classification accuracy. This aim to enhance the algorithm by improving its attribute selection, handling mechanism for tied attributes, and applying a regularization technique for better generalization. The modified ID3 utilized mutual information-based information gain for attribute selection, incorporated purity calculation for tie-breaking situations, and introduced the concept of dropout regularization to mitigate overfitting. Testing was conducted on a diabetes dataset containing 520 instances with 16 features of categorical values. The model was evaluated using standard performance measures (accuracy, precision, recall, F1 score) and compared to the traditional ID3 and other modified versions. The modified algorithm achieved an average accuracy of 97% which surpassed the traditional ID3 algorithm and other modified ID3 algorithms. These findings reveal an average accuracy improvement of approximately 1% to 3% across training holdouts of 50% to 90% and various dropout rates of 0.1 to 0.4. Additionally, the modified ID3 algorithm produced less average number of nodes compared to the traditional ID3 algorithm, ranging from a difference of 6 to 22 nodes. Overall, the modifications that were implemented further improved the traditional ID3 algorithm’s classification performance, producing a more accurate and reliable decision tree. |
| 526 ## - STUDY PROGRAM INFORMATION NOTE |
| Classification |
Filipiniana |
| 655 ## - INDEX TERM--GENRE/FORM |
| Genre/form data or focus term |
academic writing |
| 942 ## - ADDED ENTRY ELEMENTS |
| Source of classification or shelving scheme |
|
| Item type |
Thesis/Dissertation |