Systems and methods for generating document numerical representations
US11694463B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 20, 2022 |
| Grant date | Jul 4, 2023 |
| Priority date | — |
| Expiry date | Jul 20, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/51
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Described embodiments relate to a method comprising: determining a candidate document comprising image data and character data; and extracting the image data and the character data from the candidate document. The method comprises providing, to an image-based numerical representation generation model, the image data; and generating, by the image-based numerical representation generation model, an image-based numerical representation of the image data. The method comprises providing, to a character-based numerical representation generation model, the character data; and generating, by the character-based numerical representation generation model, a character-based numerical representation of the character data. The method comprises providing, to a consolidated image-character based numerical representation generation model, the image-based numerical representation and the character-based numerical representation; and generating, by the consolidated image-character based numerical representation generation model, a combined image-character based numerical representation of the candidate document.
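The data flow recited in the abstract (two modality-specific models feeding a consolidated model) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `CandidateDocument` container, the intensity-histogram stand-in for the image-based model, the hashed character-bigram stand-in for the character-based model, and concatenation with L2 normalization as the stand-in for the consolidated model. The abstract does not say how any of the three models is built, and in practice they would likely be trained models.

```python
import hashlib
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class CandidateDocument:
    """Hypothetical container: grayscale pixels (0-255) plus extracted text."""
    pixels: Sequence[int]
    text: str


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length; an all-zero vector stays all-zero."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else [0.0] * len(vec)


def image_model(pixels: Sequence[int], bins: int = 8) -> List[float]:
    """Stand-in image-based model (assumption): normalized intensity histogram."""
    hist = [0.0] * bins
    for p in pixels:
        hist[min(p * bins // 256, bins - 1)] += 1.0
    return l2_normalize(hist)


def character_model(text: str, dims: int = 16) -> List[float]:
    """Stand-in character-based model (assumption): hashed bigram counts."""
    vec = [0.0] * dims
    lowered = text.lower()
    for i in range(len(lowered) - 1):
        digest = hashlib.sha256(lowered[i:i + 2].encode("utf-8")).digest()
        vec[digest[0] % dims] += 1.0  # bucket each bigram by its hash
    return l2_normalize(vec)


def consolidated_model(image_vec: Sequence[float],
                       char_vec: Sequence[float]) -> List[float]:
    """Stand-in consolidated model (assumption): concatenate, then renormalize."""
    return l2_normalize(list(image_vec) + list(char_vec))


def embed_document(doc: CandidateDocument) -> List[float]:
    """Run the three stages in the order the abstract lists them."""
    image_vec = image_model(doc.pixels)      # image-based representation
    char_vec = character_model(doc.text)     # character-based representation
    return consolidated_model(image_vec, char_vec)  # combined representation
```

Under these assumptions, calling `embed_document(CandidateDocument(pixels=[0, 64, 128, 255], text="Invoice 42"))` returns a single 24-element vector (8 image dimensions followed by 16 character dimensions). Keeping the two modality-specific stages separate means either stand-in can be replaced without changing the interface of the consolidation step.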
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.