Patent · US Expired

Method for compressing full text indexes with document identifiers and location offsets

US5649183A · kind A · utility

9Cited by
9References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 8, 1992
Grant dateJul 15, 1997
Priority date
Expiry dateDec 8, 2012

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99936
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method is disclosed for recording a text index wherein the text index comprises a plurality of data key fields. Each data key field includes a data key identifier, document identifier data, and an offset field. The document identifier data is provided to identify each document in which the data key identifier appears. The offset field includes a plurality of offset sequences wherein each offset sequence is associated with a respective document identified by the document identifier data and wherein each offset sequence identifies the location of each data key within its associated document by identifying the offset of the data key from the preceding data key. In accordance with the subject invention, the document identifier data and the offset data field are compressed by disclosed methods.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.