An enhancement of random forest algorithm applied in credit card fraud detection system

By: Paola Allan Atgenal and Michael Tulagan
Language: English Publisher: . . c2017Description: Undergraudate Thesis: (BSCS major in Computer Science)- Pamantasan ng Lungsod ng Maynila, 2017Content type: text Media type: unmediated Carrier type: volumeGenre/Form: academic writingDDC classification: . LOC classification: QA76.9 Ar4 2017
Contents:
ABSTRACT One of the most popular frameworks used by data scientists is the random forest algorithm. It is one of the most accurate learning algorithms available. For many data sets, it produces a highly accurate classifier. The random forest algorithm is one of the best among classification algorithms able to classify large amounts of data with accuracy. This study aims to improve the algorithms accuracy by applying our solutions to the problems that always occur in the algorithm. The results should make the algorithms accuracy more accurate in its predictive performance in finding fraudulent transactions inside an e-commerce website single decision trees often have high variance or high bias. Random forest attempts to mitigate the problem of high variance and high bias by engaging to find a natural balance between the attributes that have been used. We have used sampling technique to cut out one third of unnecessary data sets to produce a reliable prediction to our data sets. The results of learning are incomprehensible. Compared to a single decision tree, or to a set of rules, they don't give a lot of insight. Researchers should also improve the tree structure instead of just improving the accuracy itself. Instead of having a big tree structure researchers should also focus on pre-building the tree to select the right attributes on building the tree.
Tags from this library: No tags from this library for this title. Log in to add tags.
    Average rating: 0.0 (0 votes)
Item type Current location Home library Collection Call number Status Date due Barcode Item holds
Archival materials PLM
PLM
Archives
Filipiniana-Thesis QA76.9 Ar4 2017 (Browse shelf) Available FT6035
Total holds: 0

ABSTRACT One of the most popular frameworks used by data scientists is the random forest algorithm. It is one of the most accurate learning algorithms available. For many data sets, it produces a highly accurate classifier. The random forest algorithm is one of the best among classification algorithms able to classify large amounts of data with accuracy. This study aims to improve the algorithms accuracy by applying our solutions to the problems that always occur in the algorithm. The results should make the algorithms accuracy more accurate in its predictive performance in finding fraudulent transactions inside an e-commerce website single decision trees often have high variance or high bias. Random forest attempts to mitigate the problem of high variance and high bias by engaging to find a natural balance between the attributes that have been used. We have used sampling technique to cut out one third of unnecessary data sets to produce a reliable prediction to our data sets. The results of learning are incomprehensible. Compared to a single decision tree, or to a set of rules, they don't give a lot of insight. Researchers should also improve the tree structure instead of just improving the accuracy itself. Instead of having a big tree structure researchers should also focus on pre-building the tree to select the right attributes on building the tree.

Filipiniana

There are no comments for this item.

to post a comment.

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.