Ground truth generation from scanned documents
US11017498B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 14, 2019 |
| Grant date | May 25, 2021 |
| Priority date | — |
| Expiry date | Nov 22, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V10/245
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A plurality of electronic documents comprising one or more document pages are received. First position markers, second position markers and page identifiers are inserted to the pages. The plurality of electronic documents are printed, thereby generating a printed corpus comprising a plurality of printed documents. The plurality of printed documents are scanned, thereby generating a scanned corpus comprising a plurality of scanned images. Scanning frame positions of the first and the second position markers are detected and the detected scanning frame positions and the page positions are used to define affine transformations between the plurality of scanned images and the corresponding document pages. The affine transformations are applied to the plurality of scanned images to align the plurality of scanned images with the corresponding document pages of the plurality of electronic documents.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.