Systems, methods, and computer readable media for extracting data from portable document format (PDF) files
US9418315B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 14, 2016 |
| Grant date | Aug 16, 2016 |
| Priority date | — |
| Expiry date | Mar 14, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V10/225
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
According to one method, the method occurs at a data file analyzer. The method includes identifying at least one document identifier associated with a first document in a portable document format (PDF) file. The method further includes determining, using the at least one document identifier, a reference point identifier for identifying a reference point in the first document, an offset value for indicating a location of a first detection area in the first document, and size information for indicating a size of the first detection area in the first document. The method also includes identifying, using a reference point identifier, the reference point in the first document. The method further includes identifying, using the offset value and the size information, the first detection area in the first document and extracting, by processing binary data of the PDF file, data within the first detection area of the first document.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.