Semantic representation of text in document
US12374141B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 12, 2020 |
| Grant date | Jul 29, 2025 |
| Priority date | — |
| Expiry date | Mar 24, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/412
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
There is provided a solution for semantic representation of text in a document. In this solution, textual information comprising a sequence of text elements (220) and layout information (230) of the text element are determined from a document. The layout information (230) indicates a spatial arrangement of the plurality of text elements (220) presented within the document. Based at least in part on the plurality of text elements (220) and the layout information (230), respective semantic feature representations (180) of the plurality of text elements (220) are generated. By jointly using both the textual information and the layout information (230), rich semantics of the text elements (220) in the document can be effectively captured in the feature representations.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.