Patent · US Active

Form-based ontology creation and information harvesting

US8103962B2 · kind B2 · utility

311Cited by
4References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 4, 2009
Grant dateJan 24, 2012
Priority date
Expiry dateJul 20, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/367
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Extracting data from web pages. User input is received defining a tabular form. User input is received correlating portions of the form with user selected data items contained in one or more first web pages. The user input is correlated to create an ontology defining relationships between the user selected data items based on the definition of the tabular form. One or more other web pages are accessed, and based on a context of the one or more data items in the first web page being similar to a context of the selected data items in the one or more first web pages, one or more similar data items are extracted from the one or more other web pages. The extracted data items are correlated to each other in accordance with the ontology defining relationships between the user selected data items and are output as a user searchable data structure.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.