Cando, Philip P. and Gayola, Brent Julian S. 4 0

An enhancement of mespotine run-length encoding applied in columnar databases in a big data environment / 6 6 Philip P. Cando, and Brent Julian S.Gayola. - - - 187 pp. 28 cm. - - - - - . - . - 0 . - . - 0 .

Thesis: (BSCS major in Computer Science)- Pamantasan ng Lungsod ng Maynila, 2018.





5



ABSTRACT: With the fast-growing economy of businesses, data storing schemes have been implemented to reduce the cost of maintaining information. One of these schemes is data compression, which can be either lossy or lossless. Mespotine Run-Length Encoding Algorithm, sometimes referred to this study as MRLE, is a variant of Run-Length Encoding formulated by Meo Mespotine in 2015. RLE algorithm work by counting the repetitive consecutive characters or data and storing it in a run-value, run-length format. While the MRLE is proven to be the best RLE algorithm implemented on most of the input data, it is observed that the algorithm still has a room for improvement : (1) Low compression rate against hybrid input data, (2) Slow compression speed of the algorithm and (3) More runs of the same run value in the encoded data means slower fetching of aggregate queries. To address the indicated problems, the researchers fulfilled three objectives , (1) Improving compression rate of the algorithm against hybrid data by at least 15%, (2) Implementing the algorithm in an improved time-complexity and (3) Manipulating the input to lessen the occurrences of runs with the same run value. The first objective was achieved by compressing between runs instead of considering the occurrences of the character in the whole input data. The time-complexity of the algorithm was improved by implementing it in a linear fashion on the first pass over the input data. Lastly, the manipulation of the input data was done by sorting the records in the database before actual compression. The results indicate that the Enhanced MRLE algorithm does not only address the three problems stated but also showed high compression rates in data stored in columnar-database.













5







2 = =









2




2 --0------


6 --0-- 2 --------



0 2 --


--20------





--------20--


--------20--


----2

/ 2

/ 2

/

/

© Copyright 2024 Phoenix Library Management System - Pinnacle Technologies, Inc. All Rights Reserved.