Arpon, Jasmia C.; Japson Denise H.

Enhancement of K-means algorithm applied to movie recommendation system - Undergraduate Thesis: (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2025

ABSTRACT: This study aims to enhance the traditional K-Means clustering algorithm, which is known for its sensitivity to outliers, reliance on manually selected cluster numbers, and difficulty in clustering data with varying sizes and densities. To address these issues, the enhanced algorithm integrated three key enhancements: optimal cluster selection using the Calinski-Harabasz Index (CHI), outlier detection though Local Outlier Factor (LOF), and the use of Cosine Similarity for distance metric. The CHI determined that only 2 clusters were optimal, compared to the 5 clusters used in the original method, simplifying interpretation and automating the selection of k clusters. To address the algorithm’s challenges in clustering data of varying size and density, the enhanced method utilized Cosine Similarity, allowing it to handle clusters with irregular shapes and varying densities more effectively than Euclidean distance. This resulted in clearer boundaries and reduced overlap between user groups. Lastly, to address the algorithm’s sensitivity to outliers, LOF was implemented which effectively identified and removed 51 outliers from the original 610-user dataset. This resulted in tighter, less noisy clusters. These enhancements led to an improved silhouette score from 0.01012 to 0.1359, demonstrating greater intra-cluster cohesion and inter-cluster separation. The results, visualized through comparative plots, highlight the performance advantage of the enhanced algorithm in generating cleaner and more meaningful clusters. Overall, the enhanced K-Means method more effective in capturing user preferences by generating accurate and robust clusters, making it a valuable tool for recommendation systems and user behavior analysis.




academic writing

QA76.9 A43 B39 2025

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.