Rule induction for summarizing documents in a classified document collection
US7162413B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 9, 1999 |
| Grant date | Jan 9, 2007 |
| Priority date | — |
| Expiry date | Jul 9, 2019 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/345
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and apparatus for providing summaries of documents belonging to a class of documents in a classified document collection. A sample set of documents belonging to one or more classes is processed via a machine learning system in order to induce a set of rules associated with the sample set of documents. The vocabulary in the rules are extracted and compared to words, terms or phrases of an incoming document. Any matches between the extracted rules and the words, terms or phrases of the incoming document are used as a summary for the incoming document. By using the method and apparatus, each document does not have to be processed to find most important words and the like in order to provide a summary for that document and then repeating the same process for additional documents.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.