Article

Compression of small text files.

Advanced Engineering Informatics 01/2008; 22:410-417.
Source: DBLP
  • Article: Compression: a key for next-generation text retrieval systems
    ABSTRACT: The continually growing Web challenges information retrieval systems to deliver data quickly. The authors' technique combines several data compression features to provide economical storage, faster indexing, and accelerated searches.
    Computer 12/2000; 33(11):37-44. · 1.47 Impact Factor
  • Article: The Second Text Retrieval Conference (TREC-2).
    Inf. Process. Manage. 01/1995; 31:269-270.
  • Conference Proceeding: Compression of small text files using syllables
    ABSTRACT: Summary form only given. We adapted the well-known adaptive Huffman coding and LZW algorithms to use syllables and words instead of characters for text compression. We tested the algorithms on collections of small or medium-sized files. Using syllable-based compression algorithms on English documents gives the expected results: they outperform the character-based versions and are outperformed by the word-based versions of the same algorithm. According to our tests, both syllable- and word-based compression methods are sensitive to the initial setting of their dictionaries. The decomposition of words into syllables is not trivial and is language dependent. An open issue is the applicability of syllable-based compression to other languages (such as German, Russian, or Hungarian) and its use in conjunction with other algorithms such as block-sorting lossless compression.
    Data Compression Conference, 2006. DCC 2006. Proceedings; 04/2006
