| 000 | 02164nam a22001817a 4500 | ||
|---|---|---|---|
| 003 | FT8928 | ||
| 005 | 20260113095558.0 | ||
| 050 | _aQA76.9 A43 L59 2025 | ||
| 100 | 1 | _aLiwag, Justin E.; Balaoro, Clarisse Anne D. | |
| 245 | _aEnhancement of random forest algorithm applied to SMS fraud detection | ||
| 264 | 1 | _cc2025 | |
| 300 | _bUndergraduate Thesis: (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2025 | ||
| 336 |
_2text _atext _btext |
||
| 337 |
_2unmediated _aunmediated _bunmediated |
||
| 338 |
_2volume _avolume _bvolume |
||
| 505 | _aABSTRACT: Random Forest is a powerful machine learning algorithm that builds multiple decision trees from randomly selected subsets of features and data. However, its performance declines when dealing with imbalanced datasets, which reduces the accuracy. Selecting features randomly slows down model training and contributes to a lack of interpretability. This study entitled Enhancement of Random Forest Algorithm Applied to SMS Fraud Detection aims to enhance the algorithm’s ability to manage imbalanced datasets and minimize false negatives in classifying fraudulent messages. Spectral Co-Clustering, Reduced Error Pruning, and Contextual Feature Contribution Network (CFCN) were incorporated to improve algorithm’s accuracy, training time, and transparency. The enhanced algorithm was evaluated using the SMS Spam Collection Dataset, with performance metrics compared against the existing Random Forest. The results show an increased accuracy by 1% (from 97% to 98%), a recall for spam detection by 5% (from 78% to 83%), and an F1-socre by (2% (from 93% to 95%). Reduced Error Pruning reduced time b y 65.8%, enhancing computational efficiency. The CFCN provided transparent insights into feature contributions, addressing traditional models :black-box” nature. These enhancements strengthen the model’s ability to detect SMS fraud while maintaining robustness against imbalanced data. The study contributes fraud detection systems by offering a more accurate, efficient, and interpretable machine learning framework for safeguarding digital communication. | ||
| 942 |
_2lcc _cMS _01 |
||
| 999 |
_c37430 _d37430 |
||