Patent · US Active

Systems, methods, and computer readable media for extracting data from portable document format (PDF) files

US9418315B1 · kind B1 · utility

9Cited by
3References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 14, 2016
Grant dateAug 16, 2016
Priority date
Expiry dateMar 14, 2036

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V10/225
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

According to one method, the method occurs at a data file analyzer. The method includes identifying at least one document identifier associated with a first document in a portable document format (PDF) file. The method further includes determining, using the at least one document identifier, a reference point identifier for identifying a reference point in the first document, an offset value for indicating a location of a first detection area in the first document, and size information for indicating a size of the first detection area in the first document. The method also includes identifying, using a reference point identifier, the reference point in the first document. The method further includes identifying, using the offset value and the size information, the first detection area in the first document and extracting, by processing binary data of the PDF file, data within the first detection area of the first document.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.