Patent · US Active

Method of generating a distributed text index for parallel query processing

US7966332B2 · kind B2 · utility

Cited by: 6
References: 2
Claims: 19
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 8, 2007
Grant date: Jun 21, 2011
Priority date
Expiry date: Sep 24, 2029

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99933
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The present invention relates to a method of generating a distributed text index for parallel query processing by a number of nodes. A set of node indices is generated for text indexing a set of documents, each node text index covering a subset of the documents. For each node text index, a local frequency measure for each term of the node text index is calculated on the basis of a frequency of documents containing the term in the subset of the documents of the node. A global frequency measure for each term is calculated on the basis of a frequency of documents containing the term in the set of documents. A quality measure for each node text index is calculated on the basis of the local frequency measures of the terms of the node and the global frequency measure of the terms of the node.
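The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how the local and global measures are combined, so the quality formula below (mean absolute gap between local and global document frequency) is an assumption, as are the function names, the whitespace tokenizer, and the toy documents.

```python
from collections import Counter


def document_frequencies(docs):
    """Return, for each term, the fraction of documents in `docs` containing it."""
    counts = Counter()
    for text in docs:
        # Count each term once per document (document frequency, not term frequency).
        counts.update(set(text.lower().split()))
    return {term: n / len(docs) for term, n in counts.items()}


def node_quality(local_df, global_df):
    """Hypothetical quality measure (an assumption, not taken from the patent):
    mean absolute gap between local and global document frequency over the
    node's terms. 0.0 means the node's subset mirrors the whole collection."""
    gaps = [abs(freq - global_df[term]) for term, freq in local_df.items()]
    return sum(gaps) / len(gaps)


# Toy partition of a four-document collection across two nodes.
node_docs = {
    "node_a": ["apple banana", "apple cherry"],
    "node_b": ["banana cherry", "banana date"],
}

# Global measure: document frequency over the full set of documents.
all_docs = [doc for docs in node_docs.values() for doc in docs]
global_df = document_frequencies(all_docs)

# Local measure: document frequency within each node's subset.
local_dfs = {node: document_frequencies(docs) for node, docs in node_docs.items()}

# One quality value per node text index.
quality = {node: node_quality(df, global_df) for node, df in local_dfs.items()}
```

Under this assumed formula a lower value means the node's subset is more representative of the whole collection. In the toy data, node_b scores lower than node_a because its term frequencies sit closer to the global ones.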

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.