Patent · US Active

Graph-based document layout detection

US12387517B1 · kind B1 · utility

0Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 13, 2022
Grant dateAug 12, 2025
Priority date
Expiry dateOct 9, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/416
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A document layout system for determining a layout of a document. The document layout system is configured to apply an OCR technique to identify the text in the document. The document layout system is further configured to generate a graph representation of the document, wherein the graph representation comprises a plurality of nodes and a plurality of edges that connect different ones of the plurality of nodes, wherein individual ones of the nodes correspond to different portions of the text. The document layout system is also configured to apply a graph cluster network machine learning model to the graph representation to identify a layout of different sections of the document according to respective merge inferences determined for individual ones of the plurality of edges. The document layout system is also configured to provide the layout of different sections of the document.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.