System and method of machine-aided information extraction rule development
US8165987B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Mar 27, 2009 |
| Grant date | Apr 24, 2012 |
| Priority date | — |
| Expiry date | Jun 23, 2030 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N5/022
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An automatic rule generation system generates rules for fact extraction. A rule generation module receives a sample and generates a rule from the sample. A rule relaxation module generates a relaxed rule from the rule. A rule testing module generates a reverse index from a corpus, applies the relaxed rule to the reverse index, and generates text segments. An information extraction module generates modified text segments from the relaxed rule and the text segments. A candidate suggestion module performs a candidate generation process: if the candidate generation process generates no candidates, the candidate suggestion module signals the rule relaxation module to generate a further relaxed rule to use as the relaxed rule. A user evaluates a candidate and provides the candidate as an additional sample for the automatic rule generation system to generate another rule to use as the rule. As a result of performing these actions iteratively, the rule is eventually generated and relaxed to result in an appropriate rule to use for fact extraction.
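The loop described in the abstract can be illustrated with a toy sketch. It is not the patented implementation. It assumes a rule is a sequence of token slots, that relaxing a rule means turning its rarest literal token into a wildcard, and that a candidate is any matched text segment that is not already a known sample. All names (`generate_rule`, `relax_rule`, `apply_rule`, `suggest_candidates`) are hypothetical. The information extraction step that produces modified text segments is omitted, as is the user's evaluation of candidates.

```python
from collections import defaultdict

WILDCARD = None  # assumed rule slot that matches any single token


def tokenize(text):
    return text.lower().split()


def build_reverse_index(corpus):
    """Map each token to the ids of the corpus segments that contain it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(corpus):
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def generate_rule(sample):
    """Most specific rule: every token of the sample must match literally."""
    return tuple(tokenize(sample))


def relax_rule(rule, index):
    """Wildcard the rarest remaining literal (assumed heuristic).

    Returns None when the rule has no literal left to relax.
    """
    literals = [i for i, slot in enumerate(rule) if slot is not WILDCARD]
    if not literals:
        return None
    rarest = min(literals, key=lambda i: (len(index.get(rule[i], ())), i))
    return rule[:rarest] + (WILDCARD,) + rule[rarest + 1:]


def matches(rule, tokens):
    """True if some window of tokens satisfies every slot of the rule."""
    n = len(rule)
    return any(
        all(slot is WILDCARD or slot == tok
            for slot, tok in zip(rule, tokens[start:start + n]))
        for start in range(len(tokens) - n + 1)
    )


def apply_rule(rule, index, corpus):
    """Shortlist segments via the reverse index, then verify each one."""
    doc_ids = set(range(len(corpus)))
    for slot in rule:
        if slot is not WILDCARD:
            doc_ids &= index.get(slot, set())
    return [corpus[i] for i in sorted(doc_ids)
            if matches(rule, tokenize(corpus[i]))]


def suggest_candidates(sample, corpus, known):
    """Relax the sample's rule until it matches segments not yet known."""
    index = build_reverse_index(corpus)
    rule = relax_rule(generate_rule(sample), index)
    while rule is not None:
        candidates = [seg for seg in apply_rule(rule, index, corpus)
                      if seg not in known]
        if candidates:
            return rule, candidates
        rule = relax_rule(rule, index)  # no candidates: relax further
    return None, []


corpus = [
    "Acme acquired Globex",
    "Initech acquired Hooli",
    "Hooli hired Alice",
]
rule, candidates = suggest_candidates(corpus[0], corpus, known={corpus[0]})
print(rule)        # (None, 'acquired', None)
print(candidates)  # ['Initech acquired Hooli']
```

In this example the first relaxation still matches only the original sample, so the rule is relaxed a second time before a new candidate appears. In the described system a user would then review the candidate and, if it is accepted, pass it back as an additional sample for the next round of rule generation.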
Source: USPTO / EPO open patent data (bibliographic data and citation counts).