Patent · US Active

Searchable data structure for electronic documents

US12032605B2 · kind B2 · utility

0Cited by
3References
20Claims
0Family size

Assignee

Inventor

Key dates

Filing dateNov 11, 2022
Grant dateJul 9, 2024
Priority date
Expiry dateNov 11, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/414
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method includes obtaining, at a device, a hierarchical structure representing a graphical layout of content items of an electronic document, the content items including at least text. The method also includes generating a word embedding representing a word of the electronic document. The method further includes determining position information of a location of the word in the electronic document. The method also includes determining a descriptor that indicates a relationship of the location to the hierarchical structure. The method further includes providing input data to a machine learning model to generate a semantic region category label of a semantic region of the electronic document. The semantic region includes the word. The input data includes the word embedding, the position information, and the descriptor.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.