Patent · US Active

Systems and methods for generating document numerical representations

US12033415B2 · kind B2 · utility

0Cited by
1References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 16, 2023
Grant dateJul 9, 2024
Priority date
Expiry dateFeb 16, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/51
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Described embodiments relate to a method comprising: determining a candidate document comprising image data and character data and extracting the image data and the character data from the candidate document. The method comprises providing, to an image-based numerical representation generation model, the image data, and generating, by the image-based numerical representation generation model, an image-based numerical representation of the image data. The method comprises providing, to a character-based numerical representation generation model, the character data; and generating, by the character-based numerical representation generation model, a character-based numerical representation of the character data. The method comprises providing, to a consolidated image-character based numerical representation generation model, the image-based numerical representation and the character-based numerical representation; and generating, by the consolidated image-character based numerical representation generation model, a combined image-character based numerical representation of the candidate document.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.