UNIQ: Utilization of NLP techniques in plagiarism detection system through semantic analysis using WORD2VEC and BERT / Cabalsi, Glaidelyn M.; Latina, Jeffrey V.; Sanchez, Joervy R.; Vallejo, Elnard Don M. 6

By: Cabalsi, Glaidelyn M.; Latina, Jeffrey V.; Sanchez, Joervy R.; Vallejo, Elnard Don M. 4 0 16, [, ] | [, ] |
Contributor(s): 5 6 [] |
Language: Unknown language code Summary language: Unknown language code Original language: Unknown language code Series: ; January 2024.46Edition: Description: ix, 118 ppContent type: text Media type: unmediated Carrier type: volumeISBN: ISSN: 2Other title: 6 []Uniform titles: | | Related works: 1 40 6 []Subject(s): -- 2 -- 0 -- -- | -- 2 -- 0 -- 6 -- | 2 0 -- | -- -- 20 -- | | -- -- -- -- 20 -- | -- -- -- 20 -- --Genre/Form: -- 2 -- Additional physical formats: DDC classification: | LOC classification: | | 2Other classification:
Contents:
Action note: In: Summary: ABSTRACT: Detecting strongly paraphrased and translated texts presents a challenge for existing detection tools, which mainly rely on traditional word searching and matching methods. While automated systems are crucial for spotting plagiarism, they mostly focus on finding identical strings in both suspicious and source documents. As a result, detecting obfuscated plagiarism remains difficult because current tools are limited to identifying straightforward copy-and-paste cases, despite advancements in plagiarism detection technology. Thus, this arises a necessity for a plagiarism detection system tailored to identify paraphrased passages. In this study, it presents a system that detects plagiarism in paraphrased texts using NLP Technique through Word Embedding Techniques, specifically Word2Vec and Bidirectional Encoder Representation from Transformers (BERT). Additionally, a recommendation system is integrated to suggest scholarly materials based on the documents submitted by the users. This feature uses a Content-based filtering technique to offer multiple sources and references based on the calculation of the similarity between the features of the input document and each potential source in the corpus. By combining different approaches and techniques for checking the similarities, the results showed that the hybrid model to the system, the evaluation using the ISO 25010 accumulated excellent results of Functional Suitability, Performance Efficiency, Usability, and Reliability with means of 4.29, 4.29, 4.59, and 4.34, respectively. These results underscore the system's exceptional quality and outstanding performance in detecting plagiarism in paraphrased texts. Other editions:
Tags from this library: No tags from this library for this title. Log in to add tags.
    Average rating: 0.0 (0 votes)
Item type Current location Home library Collection Call number Status Date due Barcode Item holds
Book PLM
PLM
Filipiniana Section
Filipiniana-Thesis T T58.5 .C33 2024 (Browse shelf) Available FT7817
Total holds: 0

Undergraduate Thesis: (Bachelor of Science in Information Technology) - Pamantasan ng Lungsod ng Maynila, 2024. 56

5

ABSTRACT: Detecting strongly paraphrased and translated texts presents a challenge for existing detection tools, which mainly rely on traditional word searching and matching methods. While automated systems are crucial for spotting plagiarism, they mostly focus on finding identical strings in both suspicious and source documents. As a result, detecting obfuscated plagiarism remains difficult because current tools are limited to identifying straightforward copy-and-paste cases, despite advancements in plagiarism detection technology. Thus, this arises a necessity for a plagiarism detection system tailored to identify paraphrased passages. In this study, it presents a system that detects plagiarism in paraphrased texts using NLP Technique through Word Embedding Techniques, specifically Word2Vec and Bidirectional Encoder Representation from Transformers (BERT). Additionally, a recommendation system is integrated to suggest scholarly materials based on the documents submitted by the users. This feature uses a Content-based filtering technique to offer multiple sources and references based on the calculation of the similarity between the features of the input document and each potential source in the corpus. By combining different approaches and techniques for checking the similarities, the results showed that the hybrid model to the system, the evaluation using the ISO 25010 accumulated excellent results of Functional Suitability, Performance Efficiency, Usability, and Reliability with means of 4.29, 4.29, 4.59, and 4.34, respectively. These results underscore the system's exceptional quality and outstanding performance in detecting plagiarism in paraphrased texts.

5

There are no comments for this item.

to post a comment.

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.