Structure extraction on electronic documents
US6298357A · kind A · utility
Cited by: 90
References: 13
Claims: 20
Family size: 0
Assignee
Inventors
Key dates
| Filing date | Jun 3, 1997 |
| Grant date | Oct 2, 2001 |
| Priority date | — |
| Expiry date | Jun 3, 2017 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F40/258
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method and apparatus for extracting structure information from an unstructured electronic document is described. The method includes the step of identifying a structural type for each instance in the electronic document by examining presentation attributes associated with each instance. Examples of presentation attributes which are examined include numbering formats, indentations, and font sizes and weights.
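The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the `Instance` record, the `classify` function, the numbering regular expression, the rule order, and the structural type names are all hypothetical, chosen only to show how numbering format, indentation, font size, and font weight might each contribute to assigning a structural type.

```python
import re
from dataclasses import dataclass


@dataclass
class Instance:
    """One block of text plus its presentation attributes (hypothetical model)."""
    text: str
    indent: int = 0          # leading indentation, in arbitrary units
    font_size: float = 12.0  # points
    bold: bool = False       # font weight, reduced to a flag


# Assumed numbering formats: "1.", "2.3)", "a)", "(iv)". The abstract names
# numbering formats as an attribute but does not list specific patterns.
NUMBERING = re.compile(
    r"^\s*(?:\d+(?:\.\d+)*[.)]|[a-z][.)]|\([a-z0-9]+\))\s+",
    re.IGNORECASE,
)


def classify(inst: Instance, body_size: float = 12.0) -> str:
    """Guess a structural type from presentation attributes alone."""
    # Larger or heavier type than body text is treated as a heading.
    if inst.font_size > body_size or inst.bold:
        return "heading"
    # A recognised numbering prefix marks a list item.
    if NUMBERING.match(inst.text):
        return "list_item"
    # Indentation without numbering is treated as an indented block.
    if inst.indent > 0:
        return "indented_block"
    return "paragraph"


if __name__ == "__main__":
    samples = [
        Instance("1. Introduction", font_size=16, bold=True),
        Instance("a) first item", indent=2),
        Instance("Quoted material", indent=4),
        Instance("Plain body text."),
    ]
    for s in samples:
        print(f"{classify(s):15} {s.text}")
```

The rules are checked in a fixed order, so a bold numbered line is reported as a heading rather than a list item. A real system would likely weigh the attributes against each other instead of short-circuiting on the first match.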
Source: USPTO / EPO open patent data (bibliographic data and citation counts).