Parsing an image of a visually structured document
US9606897B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 16, 2011 |
| Grant date | Mar 28, 2017 |
| Priority date | — |
| Expiry date | Nov 21, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/205
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for automated semantic parsing of an image of a structured document includes acquiring the image of the structured document. The image of the structured document is lexed so as to associate each image element of a plurality of image elements of the image with a predefined token. A user defined template of expected semantically significant elements of the structured document is input into a parser, the expected elements being defined in a visibly pushdown language (VPL) format. The tokens are parsed into the expected elements. A computer readable medium containing executable instructions and a system are also described.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.