Enhanced named entity recognition algorithm for Filipino cultural and heritage texts (Record no. 37355)

000 -LEADER
fixed length control field 02202nam a22002417a 4500
003 - CONTROL NUMBER IDENTIFIER
control field FT8874
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20251216131015.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 251216b ||||| |||| 00| 0 eng d
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title engtag
050 ## - LIBRARY OF CONGRESS CALL NUMBER
Classification number QA76.9 A43 R63 2025
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number .
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Robantes, Jhan Lou P.; Serrano, Andreo A.
245 ## - TITLE STATEMENT
Title Enhanced named entity recognition algorithm for Filipino cultural and heritage texts
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture .
Name of producer, publisher, distributor, manufacturer .
Date of production, publication, distribution, manufacture, or copyright notice c2025
300 ## - PHYSICAL DESCRIPTION
Other physical details Undergraduate Thesis: (Bachelor of Science in Computer Science) - Pamantasan ng Lungsod ng Maynila, 2025
336 ## - CONTENT TYPE
Source text
Content type term text
Content type code text
337 ## - MEDIA TYPE
Source unmediated
Media type term unmediated
Media type code unmediated
338 ## - CARRIER TYPE
Source volume
Carrier type term volume
Carrier type code volume
505 ## - FORMATTED CONTENTS NOTE
Formatted contents note ABSTRACT: Named Entity Recognition (NER) is a crucial natural language processing task that extracts and classifies named entities from unstructured text into predefined categories. While existing NER methods have shown success in general domains, they often face significant challenges when applied to specialized context like Filipino cultural and historical texts. These challenges stem from unique linguistics features, and diverse naming conventions. This research introduces an enhanced rule-based NER approach that specifically addresses these challenges. At its core, the system utilizes curated Corpus of Historical Filipino and Philippine English (COHFIE), which serves as both training and evaluation data. This research an enhanced rule-based approach for NER using a Corpus of Historical Filipino and Philippine English (COHFIE) building on pattern-learning methods, incorporating character and token features, and by using positive and negative example sets. To enrich the classification process, we used the International Committee for Documentation – Conceptual Reference Model (CIDOC-CRM), a cultural heritage framework, to provide a more nuanced categorization of entities based on their historical and cultural significance. Tested across existing Filipino based models (calamanCy and RoBERTa), the enhanced model shows improvement on identifying entities related to Filipino culture (CUL) and history terms (PER, ORG, LOC).
526 ## - STUDY PROGRAM INFORMATION NOTE
Classification Filipiniana
655 ## - INDEX TERM--GENRE/FORM
Genre/form data or focus term academic writing
942 ## - ADDED ENTRY ELEMENTS
Source of classification or shelving scheme
Item type Thesis/Dissertation
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Damaged status Not for loan Collection code Permanent Location Current Location Shelving location Date acquired Total Checkouts Full call number Barcode Date last seen Price effective from Item type
          Filipiniana-Thesis PLM PLM Filipiniana Section 2025-10-24   QA76.9 A43 R63 2025 FT8874 2025-12-16 2025-12-16 Thesis/Dissertation

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.