Dynamic Unbounded Integer Compression

dc.contributor.advisorTamir, Dan
dc.contributor.authorMulpuri, Naga Sai Gowtham
dc.contributor.committeeMemberGao, Byron
dc.contributor.committeeMemberBurtscher, Martin
dc.date.accessioned2018-09-20T19:52:41Z
dc.date.available2018-09-20T19:52:41Z
dc.date.issued2016-08
dc.description.abstractOne of the important applications of data compression is inverted indexes compression, which is a lossless compression. Presently, most of the internet search engines such as Google and Wikipedia process a large amount of documents and employ information retrieval policies through inverted indexes for fast search or query. This might necessitate compressing the data in order to store it in physical memory and enable computationally effective retrieval of information. Nevertheless, their currently used compression methods implement static compression procedures and do not fully exploit dynamic integer compression capabilities. In this research, we have exploited dynamic Huffman compression method referred to as δ-Huffman coding for compressing integers and improved the attainable compression ratios. To achieve this, we have explored a combination of static and dynamic compression algorithms. To the best of our knowledge this is the first work that combines unbounded integer compression methods with Huffman coding and comparatively evaluates their performance. We have applied compression algorithms on the data-sets drawn from Geometric probability distribution function, Poisson distributions, realistic inverted lists, and page-rank lists obtained from Wikipedia. Our results show that the bit-rate of δ-Huffman coding is the best of all of the tested coding methods including Comma code, Elias Gamma code, Elias Delta code, Elias Omega code, Fibonacci code, and Golomb code. Additionally, it is very close to the estimated entropy of the test sequences.
dc.description.departmentComputer Science
dc.formatText
dc.format.extent110 pages
dc.format.medium1 file (.pdf)
dc.identifier.citationMulpuri, N. S. G. (2016). <i>Dynamic unbounded integer compression</i> (Unpublished thesis). Texas State University, San Marcos, Texas.
dc.identifier.urihttps://hdl.handle.net/10877/7743
dc.language.isoen
dc.subjectInteger
dc.subjectCompression
dc.titleDynamic Unbounded Integer Compression
dc.typeThesis
thesis.degree.departmentComputer Science
thesis.degree.disciplineComputer Scienceen_US
thesis.degree.grantorTexas State Universityen_US
thesis.degree.levelMasters
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MULPURI-THESIS-2016.pdf
Size:
1.63 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.53 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.13 KB
Format:
Plain Text
Description: