Patent · US Active

Multimodal table encoding for information retrieval systems

US11687514B2 · kind B2 · utility

Cited by: 0
References: 7
Claims: 18
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 15, 2020
Grant date: Jun 27, 2023
Priority date:
Expiry date: Jan 13, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/08
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Multimodal table encoding, including receiving an electronic document that contains a table. The table includes multiple rows, multiple columns, and a schema comprising column labels or row labels. The electronic document also includes a description of the table, located outside the table itself. Separate machine learning encoders are then operated to separately encode the description, the schema, each of the rows, and each of the columns of the table. The schema, the rows, and the columns are encoded together with end-of-column tokens and end-of-row tokens that mark the end of each column and each row, respectively. A machine learning gating mechanism is then applied to the encoded description, encoded schema, encoded rows, and encoded columns to produce a fused encoding of the table, wherein the fused encoding is representative of both the structure and the content of the table.
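
The two mechanical steps in the abstract, marking row and column ends with special tokens and fusing four separate encodings through a gate, can be illustrated with a minimal sketch. Everything named here is hypothetical and is not taken from the patent: the token strings `[EOR]` and `[EOC]`, the helper functions, and in particular the softmax-weighted gate, which is only one simple way such a gating mechanism might be built. The learned encoders are omitted; short hand-written vectors stand in for their outputs.

```python
import math

# Hypothetical marker strings; the abstract only says such tokens exist.
EOR = "[EOR]"  # end-of-row token
EOC = "[EOC]"  # end-of-column token


def serialize_rows(rows):
    """Return one token sequence per row, each closed by an end-of-row token."""
    return [[str(cell) for cell in row] + [EOR] for row in rows]


def serialize_columns(rows):
    """Return one token sequence per column, each closed by an end-of-column token."""
    return [[str(cell) for cell in col] + [EOC] for col in zip(*rows)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def gated_fusion(encodings, gate_weights):
    """Fuse same-sized vectors with a softmax gate (an assumed design).

    encodings:    dict mapping a modality name to its encoded vector
    gate_weights: dict mapping a modality name to a gate weight vector
    Each modality is scored by a dot product with its gate weights, the
    scores are normalized with softmax, and the fused vector is the
    gate-weighted sum of the encodings.
    """
    names = list(encodings)
    scores = [
        sum(w * x for w, x in zip(gate_weights[n], encodings[n])) for n in names
    ]
    gates = softmax(scores)
    dim = len(encodings[names[0]])
    fused = [
        sum(g * encodings[n][i] for g, n in zip(gates, names)) for i in range(dim)
    ]
    return fused, dict(zip(names, gates))


# Stand-in table and stand-in encoder outputs (not real model encodings).
table = [["Paris", 2.1], ["Rome", 2.8]]
row_tokens = serialize_rows(table)
column_tokens = serialize_columns(table)

encodings = {
    "description": [1.0, 0.0],
    "schema": [0.0, 1.0],
    "rows": [1.0, 1.0],
    "columns": [0.0, 0.0],
}
# Untrained (all-zero) gate weights give every modality the same gate value.
gate_weights = {name: [0.0, 0.0] for name in encodings}
fused, gates = gated_fusion(encodings, gate_weights)
```

With all-zero gate weights the gate is uniform, so the fused vector is simply the mean of the four inputs. In a trained system the gate weights would be learned so that the fusion can favor whichever of the description, schema, rows, or columns is most informative for a given table.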

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.