System and method for language-independent contextual embedding
US11170169B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Mar 29, 2019 |
| Grant date | Nov 9, 2021 |
| Priority date | — |
| Expiry date | Aug 5, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N5/022
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Disclosed is a system for language-independent contextual embedding of entities in a document that includes sentences. The system has a database and a processing arrangement. The processing arrangement has a tokenizer module for tokenizing sentences to obtain tokens, an encoder module for determining character coordinate corresponding to the tokens, wherein the character coordinates corresponding to the tokens occur in a multi-dimensional hierarchical space. The system has a transmutation module for processing the character coordinates to generate contextual embeddings thereof in the multi-dimensional hierarchical space and a prediction module for memorizing sequential information pertaining to the contextual embeddings of the character coordinates.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.