An enhancement of term-frequency-inverse document frequency algorithm applied in document retrieval system / Jeanne Marie Valle and Regine Victoria. 6

By: Jeanne Marie Valle and Regine Victoria. 4 0 16, [, ] | [, ] |
Contributor(s): 5 6 [] |
Language: Unknown language code Summary language: Unknown language code Original language: Unknown language code Series: ; 201846Edition: Description: 28 cm. 257 ppContent type: text Media type: unmediated Carrier type: volumeISBN: ISSN: 2Other title: 6 []Uniform titles: | | Related works: 1 40 6 []Subject(s): -- 2 -- 0 -- -- | -- 2 -- 0 -- 6 -- | 2 0 -- | -- -- 20 -- | | -- -- -- -- 20 -- | -- -- -- 20 -- --Genre/Form: -- 2 -- Additional physical formats: DDC classification: | LOC classification: | | 2Other classification:
Contents:
Action note: In: Summary: ABSTRACT: This study entitled An Enhancement of Term Frequency - Inverse Document Frequency Algorithm Applied in Document Retrieval System is an exploration of any improvements available for the algorithm. The main objective of this study is to enhance certain processes of the algorithm to improve its performance in retrieving relevant documents. Term Frequency-Inverse Document Frequency has been one of the most highly used document retrieval method for many years. The researchers found that there could be a big potential for improvement. Recent advances in computer and technology resulted into ever increasing set of documents. The term weighting strategy plays an essential role in document retrieval. Term weighting is proposed to give equal opportunities to retrieve both lengthy documents and shorter ones. The first goal is to include retrieving documents when the query contains pure stop words. Stop words are very common in a document that could have a higher TF-IDF score since they occur more frequent although they do not mean that much. The researchers wanted to retrieve documents if the query contains pure stop words for it might mean so much in the documents. Another objective is to treat multiple terms as one token for retrieval. The last one is retrieving of documents that includes the root word of the query. Fortunately, the researchers made the solution for these objectives become possible by their proposed new and additional methods to bring enhancement to the algorithm. The proponents suggest using the enhanced algorithm either in an offline document search engine or online because of its versatility. It is also recommended to use by an administrator-user basis because it has log-in and log-out feature. Other editions:
Tags from this library: No tags from this library for this title. Log in to add tags.
    Average rating: 0.0 (0 votes)

Thesis: (BSCS major in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2018. 56

5

ABSTRACT: This study entitled An Enhancement of Term Frequency - Inverse Document Frequency Algorithm Applied in Document Retrieval System is an exploration of any improvements available for the algorithm. The main objective of this study is to enhance certain processes of the algorithm to improve its performance in retrieving relevant documents. Term Frequency-Inverse Document Frequency has been one of the most highly used document retrieval method for many years. The researchers found that there could be a big potential for improvement. Recent advances in computer and technology resulted into ever increasing set of documents. The term weighting strategy plays an essential role in document retrieval. Term weighting is proposed to give equal opportunities to retrieve both lengthy documents and shorter ones. The first goal is to include retrieving documents when the query contains pure stop words. Stop words are very common in a document that could have a higher TF-IDF score since they occur more frequent although they do not mean that much. The researchers wanted to retrieve documents if the query contains pure stop words for it might mean so much in the documents. Another objective is to treat multiple terms as one token for retrieval. The last one is retrieving of documents that includes the root word of the query. Fortunately, the researchers made the solution for these objectives become possible by their proposed new and additional methods to bring enhancement to the algorithm. The proponents suggest using the enhanced algorithm either in an offline document search engine or online because of its versatility. It is also recommended to use by an administrator-user basis because it has log-in and log-out feature.

5

There are no comments for this item.

to post a comment.

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.