Patent · US Expired

Structure extraction on electronic documents

US6298357A · kind A · utility

Cited by: 90
References: 13
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 3, 1997
Grant date: Oct 2, 2001
Priority date
Expiry date: Jun 3, 2017

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/258
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and apparatus for extracting structure information from an unstructured electronic document is described. The method includes the step of identifying a structural type for each instance in the electronic document by examining presentation attributes associated with each instance. Examples of presentation attributes which are examined include numbering formats, indentations, and font sizes and weights.
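
The idea in the abstract, assigning a structural type to each instance from its presentation attributes, can be illustrated with a minimal sketch. This is not the patented method: the `Instance` record, the `NUMBERING` pattern, the `classify` function, and the three structural types are all hypothetical names and heuristics chosen for illustration, and a real system would likely need more attributes and a more careful decision procedure.

```python
import re
from dataclasses import dataclass


# Hypothetical record for one text instance (e.g. a paragraph) together with
# its presentation attributes. Field names are illustrative assumptions.
@dataclass
class Instance:
    text: str
    indent: float = 0.0      # left indentation, in points
    font_size: float = 12.0  # font size, in points
    bold: bool = False       # font weight


# Assumed numbering formats: "1.", "2.3", "1)", "a)", "(iv)", or a bullet glyph.
NUMBERING = re.compile(
    r"^\s*(?:\d+\.(?:\d+\.?)*|\d+\)|\(?(?:[a-zA-Z]|[ivxIVX]{1,4})[.)]|[•*-])\s+"
)


def classify(inst: Instance, body_font_size: float = 12.0) -> str:
    """Guess a structural type from presentation attributes alone."""
    numbered = bool(NUMBERING.match(inst.text))
    emphasized = inst.bold or inst.font_size > body_font_size
    # Larger or heavier type at the left margin is treated as a heading.
    if emphasized and inst.indent == 0:
        return "heading"
    # A numbering prefix or any indentation is treated as a list item.
    if numbered or inst.indent > 0:
        return "list_item"
    return "paragraph"
```

For example, under these assumed rules `classify(Instance("1. Introduction", font_size=16, bold=True))` is treated as a heading, while `classify(Instance("a) first option", indent=18))` is treated as a list item.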

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.