Method and system for generating structured data from semi-structured data sources
US6782505B1 · kind B1 · utility
Inventors
Key dates
| Filing date | Apr 19, 1999 |
| Grant date | Aug 24, 2004 |
| Priority date | — |
| Expiry date | Apr 19, 2019 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99943
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method for generating structured data outputs from a semi-structured data source. The steps of this method include generating an example output from an example generator. The example output is generated in response to the acquisition of a sequence of annotated strings. The annotated strings are generated in response to the acquisition and modification of at least one data example and corresponding coarse structure from a predetermined input source. Also, a second sequence of annotated strings is generated from input from a semi-structured data source. Both the example output and second sequence of annotated strings are input to an acquisition engine that implements a grammar layer incorporating a top-down parsing method and a comparison layer. The structured data outputs are generated through the cooperation of the comparison layer and the grammar layer.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.