Method of generating a distributed text index for parallel query processing
US7966332B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Oct 8, 2007 |
| Grant date | Jun 21, 2011 |
| Priority date | — |
| Expiry date | Sep 24, 2029 |
Classification
- Technology area (CPC Y): Emerging Cross-Sectional Technologies
- CPC primary: Y10S707/99933
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
The present invention relates to a method of generating a distributed text index for parallel query processing by a number of nodes. A set of node indices is generated for text indexing a set of documents, each node text index covering a subset of the documents. For each node text index, a local frequency measure for each term of the node text index is calculated on the basis of a frequency of documents containing the term in the subset of the documents of the node. A global frequency measure for each term is calculated on the basis of a frequency of documents containing the term in the set of documents. A quality measure for each node text index is calculated on the basis of the local frequency measures of the terms of the node and the global frequency measure of the terms of the node.
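The steps in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not state the formula for the quality measure, so the mean absolute gap used here is an assumption, and the names `document_frequency`, `node_quality`, and the toy node contents are hypothetical. Frequencies are taken as the fraction of documents containing a term, once per node subset (local) and once over the whole set (global).

```python
from collections import Counter


def document_frequency(docs):
    """Return {term: fraction of docs that contain the term}."""
    counts = Counter()
    for text in docs:
        # A set ensures each term is counted once per document.
        counts.update(set(text.lower().split()))
    return {term: n / len(docs) for term, n in counts.items()}


def node_quality(local_df, global_df):
    """Assumed quality measure (not taken from the patent): mean absolute gap
    between local and global document frequency over the node's terms.
    0.0 means the node's statistics match the global ones exactly."""
    gaps = [abs(freq - global_df[term]) for term, freq in local_df.items()]
    return sum(gaps) / len(gaps)


# Toy partition: each node text index covers a subset of the documents.
nodes = {
    "node_a": ["apple banana", "apple cherry"],
    "node_b": ["banana cherry", "apple banana"],
}

# Global frequency measure over the full document set.
all_docs = [doc for subset in nodes.values() for doc in subset]
global_df = document_frequency(all_docs)

# Local frequency measure and quality measure per node.
quality = {}
for name, subset in nodes.items():
    local_df = document_frequency(subset)
    quality[name] = node_quality(local_df, global_df)
```

Under this assumed measure, a lower value suggests that a node's local term statistics approximate the global ones more closely. Any other comparison of local and global frequencies could be substituted in `node_quality` without changing the rest of the sketch.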
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.