Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Arroyuelo et al., 2006 - Google Patents
[go: Go Back, main page]

Arroyuelo et al., 2006 - Google Patents

Reducing the space requirement of LZ-index

Arroyuelo et al., 2006

View PDF
Document ID
5907190644828799347
Author
Arroyuelo D
Navarro G
Sadakane K
Publication year
Publication venue
Annual Symposium on Combinatorial Pattern Matching

External Links

Snippet

The LZ-index is a compressed full-text self-index able to represent a text P 1... m, over an alphabet of size and with k-th order empirical entropy H k (T), using 4 uH k (T)+ o (u log σ) bits for any k= o (log σ u). It can report all the occ occurrences of a pattern P 1... m in T in O …
Continue reading at users.dcc.uchile.cl (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30613Indexing
    • G06F17/30619Indexing indexing structures
    • G06F17/30625Trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30943Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
    • G06F17/30964Querying
    • G06F17/30979Query processing
    • G06F17/30985Query processing by using string matching techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30943Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
    • G06F17/30946Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type indexing structures
    • G06F17/30961Trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30312Storage and indexing structures; Management thereof
    • G06F17/30321Indexing structures
    • G06F17/30324Vectors, bitmaps or matrices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30908Information retrieval; Database structures therefor; File system structures therefor of semistructured data, the undelying structure being taken into account, e.g. mark-up language structure data
    • G06F17/30914Mapping or conversion
    • HELECTRICITY
    • H03BASIC ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same information or similar information or a subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • H03M7/3086Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing a sliding window, e.g. LZ77
    • HELECTRICITY
    • H03BASIC ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same information or similar information or a subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/42Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/22Manipulating or registering by use of codes, e.g. in sequence of text characters
    • G06F17/2217Character encodings

Similar Documents

Publication Publication Date Title
Claude et al. Improved grammar-based compressed indexes
Arroyuelo et al. Reducing the space requirement of LZ-index
Nishimoto et al. Optimal-time queries on BWT-runs compressed indexes
Jansson et al. Ultra-succinct representation of ordered trees with applications
Jansson et al. Ultra-succinct representation of ordered trees
Munro et al. Space efficient suffix trees
Gagie et al. Colored range queries and document retrieval
Bille et al. Random access to grammar-compressed strings
Mäkinen et al. Implicit compression boosting with applications to self-indexing
Kreft et al. Self-indexing based on LZ77
Golynski et al. On the size of succinct indices
Arroyuelo et al. Stronger Lempel-Ziv based compressed text indexing
Belazzougui Succinct dictionary matching with no slowdown
Prezza Optimal rank and select queries on dictionary-compressed text
Díaz-Domínguez et al. An LMS-based grammar self-index with local consistency properties
Claude et al. Self-indexed text compression using straight-line programs
Jansson et al. Linked dynamic tries with applications to LZ-compression in sublinear time and space
Prezza On locating paths in compressed tries
Golynski et al. On the redundancy of succinct data structures
Tsuruta et al. Grammar-compressed self-index with Lyndon words
Navarro et al. Faster top-k document retrieval in optimal space
Välimäki et al. Engineering a compressed suffix tree implementation
Gupta et al. A framework for dynamizing succinct data structures
Arroyuelo et al. Space-efficient construction of Lempel–Ziv compressed text indexes
Navarro Indexing text using the Ziv-Lempel trie