Alter C. Orbino Jr. Seth Gabriel F. Payumo. 4 0

An Enhancement of Glove algorithm for extractive text summarization of news articles. 6 6 - - - - - - - - . - . - 0 . - . - 0 .

Undergraduate Thesis : (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2024.





5



ABSTRACT This research study addresses the limitations of the GloVe (Global Vectors for Word Representation) algorithm in the context of extractive text summarization, focusing on mitigating sparsity issues and enhancing contextual representation. The study begins by identifying three key challenges faced by GloVe: sparsity issues impacting representation accuracy, high computational demands, and difficulties in capturing contextual semantics. Firstly, the study introduces methods to mitigate sparsity issues in GloVe embeddings. By focusing on handling out-of-vocabulary words and improving the representation of rare terms, the algorithm aims to provide a more comprehensive representation of the entire corpus. Techniques such as subword embeddings and adaptive learning rates employed to address this challenge effectively. Secondly, to address the computational intensity and memory requirements of GloVe, the thesis incorporates dimensionality reduction techniques, specifically t-distributed Stochastic Neighbor Embedding (t-SNE). By reducing the dimensionality of the word embeddings, the algorithm achieves significant improvements in processing speed and memory efficiency without sacrificing summarization quality. Finally, the study enhances the contextual representation learned by GloVe for text summarization purposes. By weighing rare words and giving them better representation, the algorithm improves its ability to capture contextual semantics, thereby enhancing the quality of extractive text summarization. Through a series of experiments and evaluations using real-world news article datasets, the proposed enhancements are thoroughly validated. The results demonstrate notable improvements in the summarization performance of the enhanced GloVe algorithm compared to traditional GloVe implementations. The findings of this thesis contribute to advancing the state-of-the-art in extractive text summarization techniques and provice valuable insights for future research in natural language processing and machine learning.













5







2 = =









2




2 --0------


6 --0-- 2 --------



0 2 --


--20------





--------20--


--------20--


----2

/ 2

/ 2

/

/