Systems and methods for generating document numerical representations
US12033415B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 16, 2023 |
| Grant date | Jul 9, 2024 |
| Priority date | — |
| Expiry date | Feb 16, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/51
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Described embodiments relate to a method comprising: determining a candidate document comprising image data and character data and extracting the image data and the character data from the candidate document. The method comprises providing, to an image-based numerical representation generation model, the image data, and generating, by the image-based numerical representation generation model, an image-based numerical representation of the image data. The method comprises providing, to a character-based numerical representation generation model, the character data; and generating, by the character-based numerical representation generation model, a character-based numerical representation of the character data. The method comprises providing, to a consolidated image-character based numerical representation generation model, the image-based numerical representation and the character-based numerical representation; and generating, by the consolidated image-character based numerical representation generation model, a combined image-character based numerical representation of the candidate document.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.