Patent · US Active

Extract data from a true PDF page

US12307801B2 · kind B2 · utility

Cited by: 0
References: 92
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 21, 2022
Grant date: May 20, 2025
Priority date
Expiry date: Sep 25, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/279
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The system may perform a method comprising analyzing metadata of a text layer of a page of a first PDF document to determine that the PDF document is a first true PDF document; receiving the first true PDF document, in response to the first PDF document being the first true PDF document; receiving a selection of a field including first data to be extracted from the first true PDF document; displaying the first data; creating a template including the coordinates corresponding to the selected field and the first data of the first true PDF document; and extracting from an accessible text layer of a second true PDF document, second data based on the template from the first true PDF document.
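
The flow in the abstract (check for a usable text layer, record the coordinates of a selected field as a template, then reuse the template on a second document) can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name here (`Word`, `Box`, `is_true_pdf_page`, `make_template`, `extract`) is hypothetical, the text layer is modeled as an in-memory list of words with bounding boxes rather than read from a real PDF, and the "true PDF" check is a simple stand-in for the metadata analysis the abstract mentions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One word from a page's text layer with its bounding box (hypothetical model)."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


@dataclass(frozen=True)
class Box:
    """Rectangle on the page selected by the user for one field."""
    x0: float
    y0: float
    x1: float
    y1: float

    def contains(self, w: Word) -> bool:
        # A word belongs to the field if its center lies inside the box.
        cx = (w.x0 + w.x1) / 2
        cy = (w.y0 + w.y1) / 2
        return self.x0 <= cx <= self.x1 and self.y0 <= cy <= self.y1


def is_true_pdf_page(words: list[Word]) -> bool:
    # Assumption: a page counts as "true" if its text layer holds at least
    # one non-blank word. A scanned image page would have an empty layer.
    return any(w.text.strip() for w in words)


def make_template(selections: dict[str, Box]) -> dict[str, Box]:
    # The template is just the field names mapped to their coordinates.
    return dict(selections)


def extract(template: dict[str, Box], words: list[Word]) -> dict[str, str]:
    # For each field, collect the words inside its box in reading order
    # (top to bottom, then left to right) and join them with spaces.
    result = {}
    for name, box in template.items():
        hits = sorted((w for w in words if box.contains(w)),
                      key=lambda w: (w.y0, w.x0))
        result[name] = " ".join(w.text for w in hits)
    return result


# First document: the user selects the region holding the invoice number.
first_page = [
    Word("Invoice", 50, 40, 100, 52),
    Word("No:", 105, 40, 125, 52),
    Word("A-1001", 130, 40, 180, 52),
]
# Second document with the same layout but different data.
second_page = [
    Word("Invoice", 50, 40, 100, 52),
    Word("No:", 105, 40, 125, 52),
    Word("B-2002", 130, 40, 180, 52),
]

template = make_template({"invoice_number": Box(128, 38, 185, 54)})
first_data = extract(template, first_page) if is_true_pdf_page(first_page) else {}
second_data = extract(template, second_page) if is_true_pdf_page(second_page) else {}
```

In this sketch the template stores only coordinates, so it works for the second document only when both documents share the same page layout. Matching words by their center point is one design choice among several; overlap-based matching would be an alternative.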

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.