Patent · US Expired

Method and system for generating structured data from semi-structured data sources

US6782505B1 · kind B1 · utility

11Cited by
7References
7Claims
0Family size

Inventors

Key dates

Filing dateApr 19, 1999
Grant dateAug 24, 2004
Priority date
Expiry dateApr 19, 2019

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99943
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method for generating structured data outputs from a semi-structured data source. The steps of this method include generating an example output from an example generator. The example output is generated in response to the acquisition of a sequence of annotated strings. The annotated strings are generated in response to the acquisition and modification of at least one data example and corresponding coarse structure from a predetermined input source. Also, a second sequence of annotated strings is generated from input from a semi-structured data source. Both the example output and second sequence of annotated strings are input to an acquisition engine that implements a grammar layer incorporating a top-down parsing method and a comparison layer. The structured data outputs are generated through the cooperation of the comparison layer and the grammar layer.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.