| 000 -LEADER |
| fixed length control field |
02438nam a22002417a 4500 |
| 003 - CONTROL NUMBER IDENTIFIER |
| control field |
ft8908 |
| 005 - DATE AND TIME OF LATEST TRANSACTION |
| control field |
20251219130707.0 |
| 008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
| fixed length control field |
251219b ||||| |||| 00| 0 eng d |
| 041 ## - LANGUAGE CODE |
| Language code of text/sound track or separate title |
engtag |
| 050 ## - LIBRARY OF CONGRESS CALL NUMBER |
| Classification number |
QA76.9 A43 N38 2025 |
| 082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER |
| Classification number |
. |
| 100 1# - MAIN ENTRY--PERSONAL NAME |
| Personal name |
Navalta, Ashley Benette G.; Puzon, Trisha Gaile V.; Tagufa, Justin Adolf J. |
| 245 ## - TITLE STATEMENT |
| Title |
Enhancement of multinomial naïve bayes algorithm applied to sentiment analysis on reddit posts |
| 264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE |
| Place of production, publication, distribution, manufacture |
. |
| Name of producer, publisher, distributor, manufacturer |
. |
| Date of production, publication, distribution, manufacture, or copyright notice |
c2025 |
| 300 ## - PHYSICAL DESCRIPTION |
| Other physical details |
Undergraduate Thesis: (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2025 |
| 336 ## - CONTENT TYPE |
| Source |
text |
| Content type term |
text |
| Content type code |
text |
| 337 ## - MEDIA TYPE |
| Source |
unmediated |
| Media type term |
unmediated |
| Media type code |
unmediated |
| 338 ## - CARRIER TYPE |
| Source |
volume |
| Carrier type term |
volume |
| Carrier type code |
volume |
| 505 ## - FORMATTED CONTENTS NOTE |
| Formatted contents note |
ABSTRACT: Sentiment classification plays a crucial role in analyzing textual data, but traditional Multinomial Naïve Bayes (MNB) often struggles with class imbalance, high dimensionality, and zero probability issues. This study indented to enhance the performance of MNB by addressing these limitations through random oversampling, chi-square feature selection, and Laplace smoothing. The research focused on sentiment analysis of Reddit posts and conducted experiments on nine datasets, with five datasets containing 1000 data points each and four datasets containing 6000 datapoints each. The enhanced algorithm demonstrated significant improvements in classification accuracy and execution time. For datasets with 1000 data points, the enhanced MNB achieved an average accuracy improvement of 69.14% compared to the traditional approach, while datasets with 6000 data points showed an average accuracy improvement of 50.29%. Additionally, chi-square features, resulting in an average execution time improvement of 35.31% for the smaller datasets and 23.00% for the larger datasets. Laplace smoothing successfully addressed the zero-probability issue, ensuring more robust probability estimates in classification. The findings of this study confirm that addressing these algorithmic limitations significantly enhances the effectiveness and efficiency of sentiment classification using MNB. Future research may explore the application of the enhanced algorithm in other domains, such as customer feedback analysis, political sentiment detection, and social media monitoring, to further validate its adaptability and performance |
| 526 ## - STUDY PROGRAM INFORMATION NOTE |
| Classification |
Filipiniana |
| 655 ## - INDEX TERM--GENRE/FORM |
| Genre/form data or focus term |
academic writing |
| 942 ## - ADDED ENTRY ELEMENTS |
| Source of classification or shelving scheme |
|
| Item type |
Thesis/Dissertation |