An Enhancement of tagalog Stemming Algorithm (TAGSA ) applied in Tagalog Dictionary Searching (Record no. 37210)

000 -LEADER
fixed length control field 03149nam a22002417a 4500
003 - CONTROL NUMBER IDENTIFIER
control field ft6103
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20251126142559.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 251126b ||||| |||| 00| 0 eng d
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title engtag
050 ## - LIBRARY OF CONGRESS CALL NUMBER
Classification number QA76.9 B84 2015
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number .
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Matthew Paulo B. Buena; Mary Ruth J. Datu and Abegail C. Estobanez.
245 ## - TITLE STATEMENT
Title An Enhancement of tagalog Stemming Algorithm (TAGSA ) applied in Tagalog Dictionary Searching
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture .
Name of producer, publisher, distributor, manufacturer .
Date of production, publication, distribution, manufacture, or copyright notice c2015
300 ## - PHYSICAL DESCRIPTION
Other physical details Undergraduate Thesis(BSCS major in Computer Science) - Pamantasan ng Lungsod ng Maynila. 2015.
336 ## - CONTENT TYPE
Source text
Content type term text
Content type code text
337 ## - MEDIA TYPE
Source unmediated
Media type term unmediated
Media type code unmediated
338 ## - CARRIER TYPE
Source volume
Carrier type term volume
Carrier type code volume
505 ## - FORMATTED CONTENTS NOTE
Formatted contents note ABSTRACT: Tagalog Stemming Algorithm or TagSa is an algorithm developed for all forms of Tagalog words as input. It basically used in information retrieval systems to improve performance. In this study it is used as a morphological analyser that extract the root words from Filipino words conjugated in different forms as inputs and produces affixes used and the tenses of the original input word. Studying and analysing the original and existing algorithm, the researchers found three problems that exist in the original algorithm in terms of deriving the root word. The researchers formulated three specific objectives to solve the observed problems. 1. To solve the understemming and overstemming error of the existing algorithm. 2. To provide rules in the Partial Reduplication Routine for words that starts with consonant blends. 3. To include sub-steps in the Context Sensitive Attribute for words that should end with “o”. This study applied descriptive research method and used surveying by creating questionnaire/as an instrument in order to gather data from the target population. The collected data is processed according to the requirement of the study. This method helped the researchers to improve the existing algorithm. The researchers conducted intensive research method in order to provide enhanced algorithm to solve the observed problems in the original algorithm and accomplish the objectives. For the first problem and objective, the researchers improved the stemming methods of Tagalog words to minimize the understemming and overstemming errors. For the second problem and objective, the researchers added sub-steps in the Partial Reduplication Routine that cater the first syllable with a cluster of consonants to produce the correct stem. There can be two ways to solve this depending on the input word, either it reduplicates the first consonant and the first vowel of the stem, or it reduplicates the cluster of consonants including the succeeding vowel of the stem. For the third problem and objective, the enhanced algorithm provided steps to check if the stemmed word should originally end with “o” and if the end of the stemmed word should be change with “u”. The existing and enhanced algorithm was compared using the simulator and a Tagalog Dictionary Searching application was developed by using the enhanced algorithm.
526 ## - STUDY PROGRAM INFORMATION NOTE
Classification Filipiniana
655 ## - INDEX TERM--GENRE/FORM
Genre/form data or focus term academic writing
942 ## - ADDED ENTRY ELEMENTS
Source of classification or shelving scheme
Item type Archival materials
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Damaged status Not for loan Collection code Permanent Location Current Location Shelving location Date acquired Total Checkouts Full call number Barcode Date last seen Price effective from Item type
          Filipiniana-Thesis PLM PLM Archives 2025-11-26   QA76.9 B84 2015 FT6103 2025-11-26 2025-11-26 Archival materials

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.