Analyzing the ability to find textual content
US7792830B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 1, 2006 |
| Grant date | Sep 7, 2010 |
| Priority date | — |
| Expiry date | Jan 8, 2027 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/334
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method and system for analyzing a document set (202, 420) are provided. The method includes determining, from the terms of the document set, a set of terms (312) that minimizes a distance measurement (405) from the given set of documents (420). The method uses a greedy algorithm to build the set of terms incrementally, at each stage finding the single word that brings the set closest to the document set (202, 420). The set of terms is evaluated to assess the ability to find the document set (202, 420), and is compared with expected terms for the same purpose. A measure of the ability to find a document set (202, 420) is provided by computing a distance measure (403) between the document set and the entire collection.
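The greedy selection and the findability measure described in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the distance measure, so Jensen-Shannon divergence between unigram distributions is used here as a stand-in, the selected terms are modeled as a uniform distribution, and the function names (`unigram`, `js_divergence`, `greedy_terms`, `findability`) are hypothetical rather than taken from the patent.

```python
import math
from collections import Counter


def unigram(docs):
    """Maximum-likelihood unigram distribution over whitespace tokens."""
    counts = Counter(tok for doc in docs for tok in doc.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two sparse distributions.

    ASSUMPTION: the abstract only says "distance measure"; JSD is a stand-in.
    """
    def kl_to_mixture(a, b):
        # KL(a || (a + b) / 2); words with zero mass in `a` contribute nothing.
        return sum(pa * math.log2(pa / (0.5 * (pa + b.get(w, 0.0))))
                   for w, pa in a.items() if pa > 0)
    return 0.5 * kl_to_mixture(p, q) + 0.5 * kl_to_mixture(q, p)


def greedy_terms(docs, k):
    """Build a term set one word at a time, each step adding the word that
    minimizes the distance between the term set and the document set.

    ASSUMPTION: the term set is modeled as a uniform distribution over its words.
    """
    target = unigram(docs)
    chosen = []
    for _ in range(min(k, len(target))):
        best_word, best_dist = None, float("inf")
        for w in sorted(target):  # sorted for deterministic tie-breaking
            if w in chosen:
                continue
            trial = chosen + [w]
            dist = js_divergence({t: 1.0 / len(trial) for t in trial}, target)
            if dist < best_dist:
                best_word, best_dist = w, dist
        chosen.append(best_word)
    return chosen


def findability(docs, collection):
    """Distance between a document set and the entire collection.

    A larger value means the set's vocabulary stands out more from the
    collection, which this sketch reads as "easier to find".
    """
    return js_divergence(unigram(docs), unigram(collection))
```

A possible use, with made-up documents: call `greedy_terms(["solar panel efficiency", "solar cell output"], 2)` to get the two terms that best characterize the set, compare them with the terms a user would be expected to search for, and call `findability(doc_set, whole_collection)` to score how distinguishable the set is from the rest of the collection.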
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.