Implicit discourse relation classification with contextualized word representation
US11526676B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 13, 2020 |
| Grant date | Dec 13, 2022 |
| Priority date | — |
| Expiry date | Oct 16, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method includes: initializing a list of token embeddings, each of the token embeddings corresponding to a tokenized word from text in a corpus of text; generating a graph for a group of consecutive words s from the text, said graph including binary relations between pairs of tokenized words of the group of consecutive words; selecting the token embeddings representing the words of the group of consecutive words from the list of token embeddings; computing a tensor of binary relations as the product between a matrix of the selected token embeddings and a tensor representing discourse relations, the computed tensor representing the binary relations between the pairs of tokenized words; computing a loss using the computed tensor; optimizing the list of token embeddings using the computed loss. The above may be repeated until the computed loss is within a predetermined range.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.