Patent · US Active

Automatically extracting data from semi-structured documents

US8041695B2 · kind B2 · utility

7 Cited by
1 References
20 Claims
0 Family size

Assignee

Inventor

Key dates

Filing date: Apr 18, 2008
Grant date: Oct 18, 2011
Priority date
Expiry date: Mar 23, 2029

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/154
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

This description provides tools and techniques for automatically extracting data from semi-structured documents. A computer-readable storage medium may contain computer-executable instructions that, when executed by a computer, cause the computer to receive a request for data representing an inferred structure of an input document. For the request, the computer may determine whether a repository containing mined information includes the requested data. If the repository contains the requested data, the computer may return the data representing the inferred structure of the input document in response to the request.
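
The request flow in the abstract (receive a request, check the repository of mined information, return the inferred structure on a hit) can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `MinedRepository` and `handle_structure_request`, the document-ID key, and the dictionary shape of the inferred structure are all hypothetical. The abstract also does not say what happens when the repository lacks the data, so returning `None` on a miss is an assumption.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class MinedRepository:
    """Hypothetical in-memory store of mined information, keyed by document ID."""
    structures: Dict[str, dict] = field(default_factory=dict)

    def contains(self, doc_id: str) -> bool:
        # True when an inferred structure has already been mined for this document.
        return doc_id in self.structures

    def get(self, doc_id: str) -> dict:
        return self.structures[doc_id]


def handle_structure_request(repo: MinedRepository, doc_id: str) -> Optional[dict]:
    """Return the inferred structure for doc_id if the repository contains it.

    Returning None on a miss is an assumption; the abstract only
    describes the case where the repository contains the data.
    """
    if repo.contains(doc_id):
        return repo.get(doc_id)
    return None


# Illustrative data only: one document with a made-up inferred structure.
repo = MinedRepository({"invoice-001": {"fields": ["vendor", "total"]}})
hit = handle_structure_request(repo, "invoice-001")
miss = handle_structure_request(repo, "invoice-002")
```

Here `hit` holds the stored structure for `invoice-001`, while `miss` is `None` because nothing has been mined for `invoice-002`.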

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.