Separating documents based on machine learning models
US11568664B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Dec 1, 2020 |
| Grant date | Jan 31, 2023 |
| Priority date | — |
| Expiry date | Sep 21, 2041 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY02D10/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Some embodiments provide a non-transitory machine-readable medium that stores a program executable by a device. The program receives a request to process a file. The file includes a set of images of text. The program further converts the text in each image in the set of images into a set of machine-readable text. The program also uses a machine learning model to predict, based on the set of machine-readable text, whether the set of images of the file are images of pages that belong to a single document or images of pages that belong to different documents.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.