000 02164nam a22001817a 4500
003 FT8928
005 20260113095558.0
050 _aQA76.9 A43 L59 2025
100 1 _aLiwag, Justin E.; Balaoro, Clarisse Anne D.
245 _aEnhancement of random forest algorithm applied to SMS fraud detection
264 1 _cc2025
300 _bUndergraduate Thesis: (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2025
336 _2text
_atext
_btext
337 _2unmediated
_aunmediated
_bunmediated
338 _2volume
_avolume
_bvolume
505 _aABSTRACT: Random Forest is a powerful machine learning algorithm that builds multiple decision trees from randomly selected subsets of features and data. However, its performance declines when dealing with imbalanced datasets, which reduces the accuracy. Selecting features randomly slows down model training and contributes to a lack of interpretability. This study entitled Enhancement of Random Forest Algorithm Applied to SMS Fraud Detection aims to enhance the algorithm’s ability to manage imbalanced datasets and minimize false negatives in classifying fraudulent messages. Spectral Co-Clustering, Reduced Error Pruning, and Contextual Feature Contribution Network (CFCN) were incorporated to improve algorithm’s accuracy, training time, and transparency. The enhanced algorithm was evaluated using the SMS Spam Collection Dataset, with performance metrics compared against the existing Random Forest. The results show an increased accuracy by 1% (from 97% to 98%), a recall for spam detection by 5% (from 78% to 83%), and an F1-socre by (2% (from 93% to 95%). Reduced Error Pruning reduced time b y 65.8%, enhancing computational efficiency. The CFCN provided transparent insights into feature contributions, addressing traditional models :black-box” nature. These enhancements strengthen the model’s ability to detect SMS fraud while maintaining robustness against imbalanced data. The study contributes fraud detection systems by offering a more accurate, efficient, and interpretable machine learning framework for safeguarding digital communication.
942 _2lcc
_cMS
_01
999 _c37430
_d37430