Patent · US Active

Grammatical parsing of document visual structures

US8249344B2 · kind B2 · utility

Cited by: 32
References: 22
Claims: 19
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 1, 2005
Grant date: Aug 21, 2012
Priority date
Expiry date: Feb 20, 2027

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A two-dimensional representation of a document is leveraged to extract a hierarchical structure that facilitates recognition of the document. The visual structure is grammatically parsed using two-dimensional adaptations of statistical parsing algorithms. This allows recognition of layout structures (e.g., columns, authors, titles, and footnotes) so that the structural components of the document can be accurately interpreted. Additional techniques can also be employed to facilitate document layout recognition, for example grammatical parsing techniques that utilize machine learning, parse scoring based on image representations, boosting techniques, and/or “fast features”.
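
As a rough illustration of what parsing a page layout into a hierarchy can look like, the following minimal Python sketch builds a tree of "Row" and "Col" nodes over text-block bounding boxes by dynamic programming. It is not the method claimed in the patent: the `Box` type, the two production labels, and the gap-based cost are all assumptions made for this example, and the hand-written cost stands in for the learned parse scoring the abstract mentions.

```python
from dataclasses import dataclass
from functools import lru_cache


@dataclass(frozen=True)
class Box:
    """Hypothetical terminal: a named text block with a bounding box (y grows downward)."""
    name: str
    x0: float
    y0: float
    x1: float
    y1: float


def candidate_splits(boxes):
    """Yield (label, first, second, gap) for every clean straight cut through the boxes.

    "Row" places two regions side by side; "Col" stacks them top to bottom.
    """
    for label, lo, hi in (("Row", "x0", "x1"), ("Col", "y0", "y1")):
        ordered = sorted(boxes, key=lambda b: getattr(b, lo))
        for k in range(1, len(ordered)):
            first, second = ordered[:k], ordered[k:]
            gap = min(getattr(b, lo) for b in second) - max(getattr(b, hi) for b in first)
            if gap >= 0:  # the two groups do not overlap along this axis
                yield label, frozenset(first), frozenset(second), gap


@lru_cache(maxsize=None)
def best_parse(boxes):
    """Return (cost, tree) for the cheapest binary parse of a frozenset of boxes.

    Assumed cost: each production pays 1 / (1 + gap), so wide whitespace is cheap.
    Returns (inf, None) when no clean cut exists.
    """
    if len(boxes) == 1:
        (only,) = boxes
        return 0.0, only.name
    best = (float("inf"), None)
    for label, first, second, gap in candidate_splits(boxes):
        cost_a, tree_a = best_parse(first)
        cost_b, tree_b = best_parse(second)
        cost = cost_a + cost_b + 1.0 / (1.0 + gap)
        if cost < best[0]:
            best = (cost, (label, tree_a, tree_b))
    return best


# A title spanning the page, with two text columns underneath.
page = frozenset({
    Box("title", 0, 0, 100, 10),
    Box("col_left", 0, 20, 45, 100),
    Box("col_right", 55, 20, 100, 100),
})
cost, tree = best_parse(page)
print(tree)  # ('Col', 'title', ('Row', 'col_left', 'col_right'))
```

The memoized recursion plays the role of a parse chart: each subset of boxes is scored once and reused, which is the same idea that chart parsers apply to spans of a sentence. A learned scoring function could replace the gap-based cost without changing the search.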

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.