Patent · US Active

Multiple channels of rasterized content for page decomposition using machine learning

US11386685B2 · kind B2 · utility

Cited by: 1
References: 3
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 17, 2019
Grant date: Jul 12, 2022
Priority date
Expiry date: Aug 31, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/414
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques are provided for identifying structural elements of a document. One methodology includes generating a first channel of rasterized content by rasterizing a full page of the document and generating one or more additional channels of rasterized content from the page of the document by rasterizing one or more corresponding content types from the page of the document. Each of the one or more additional channels includes a specific type of content that is different from each of the other one or more additional channels. The methodology further includes inputting the first channel of rasterized content and the one or more additional channels of rasterized content into a machine learning (ML) model. The methodology continues with determining location and classification for each of a plurality of structural elements on the page of the document using the ML model.
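The channel construction described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `PageItem` type, the content-type names ("text", "image", "vector"), and the binary occupancy rasterizer are all assumptions made for illustration, and the ML model itself is omitted. The sketch only shows how a full-page channel and one channel per content type could be stacked into a single multi-channel input.

```python
from dataclasses import dataclass


@dataclass
class PageItem:
    """Hypothetical page object: a content type plus a pixel bounding box."""
    kind: str  # assumed content type, e.g. "text", "image", "vector"
    x0: int
    y0: int
    x1: int    # exclusive
    y1: int    # exclusive


def rasterize(items, width, height):
    """Return a height x width grid with 1 where any item covers the pixel.

    A real rasterizer would render glyphs and images; binary occupancy
    is a simplification used here to keep the sketch short.
    """
    grid = [[0] * width for _ in range(height)]
    for it in items:
        for y in range(max(it.y0, 0), min(it.y1, height)):
            for x in range(max(it.x0, 0), min(it.x1, width)):
                grid[y][x] = 1
    return grid


def build_channels(items, width, height, kinds=("text", "image", "vector")):
    """First channel: the full page. Additional channels: one per content type."""
    channels = [rasterize(items, width, height)]
    for kind in kinds:
        only_kind = [it for it in items if it.kind == kind]
        channels.append(rasterize(only_kind, width, height))
    return channels


# Toy 4x4 page with one text line and one image.
page = [PageItem("text", 0, 0, 4, 1), PageItem("image", 1, 2, 3, 4)]
channels = build_channels(page, width=4, height=4)
```

In this sketch, `channels` is a list of four grids (full page, text only, image only, vector only). The stacked grids would then serve as the multi-channel input to a model that predicts a location and class for each structural element.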

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.