Probabilistic word embeddings for text classification
US11120223B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 18, 2019 |
| Grant date | Sep 14, 2021 |
| Priority date | — |
| Expiry date | Sep 29, 2039 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H04L51/046
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed are systems, methods, and non-transitory computer-readable media for probabilistic word embeddings for text classification. A text classification system receives a message including a keyword and determines an embedding probability distribution representing the keyword. The text classification system then determines an embedding value for the keyword based on the embedding probability distribution. The text classification system uses the embedding value as input into a set of mathematical functions, yielding a first set of coefficient values for the keyword. Each respective mathematical function from the set corresponds to a respective classification label from a set of classification labels and defines a continuous surface. Each respective mathematical function is determined from embedding values for a set of known keywords, distribution variance values for the set of known keywords, and a subset of coefficient values for the set of known keywords that corresponds to the respective classification label.
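The flow described in the abstract can be illustrated with a minimal sketch. It rests on assumptions that do not come from the patent text: each embedding probability distribution is an isotropic Gaussian (a mean vector plus a single variance), the embedding value is taken to be the mean, and each per-label continuous surface is a variance-aware kernel smoother over the known keywords. The names `KNOWN`, `embedding_value`, `make_label_function`, and `coefficients_for`, the `bandwidth` parameter, and all numeric values are hypothetical and chosen only for illustration.

```python
import math
from typing import Callable, Dict, Sequence, Tuple

Vector = Tuple[float, ...]
# keyword -> (embedding mean, distribution variance, per-label coefficient values)
KnownKeywords = Dict[str, Tuple[Vector, float, Dict[str, float]]]

# Hypothetical known keywords; every number here is made up for illustration.
KNOWN: KnownKeywords = {
    "refund":  ((0.9, 0.1), 0.05, {"billing": 1.0, "spam": 0.0}),
    "invoice": ((0.8, 0.2), 0.10, {"billing": 0.9, "spam": 0.1}),
    "lottery": ((0.1, 0.9), 0.05, {"billing": 0.0, "spam": 1.0}),
}


def embedding_value(mean: Vector, variance: float) -> Vector:
    """Reduce the distribution to one point. Assumption: use the mean."""
    return tuple(mean)


def make_label_function(
    label: str, known: KnownKeywords, bandwidth: float = 0.3
) -> Callable[[Vector], float]:
    """Build one continuous surface for one classification label.

    Assumption: a Gaussian kernel smoother in which a known keyword with a
    larger variance gets a wider, flatter kernel. This is a stand-in, not
    the function family the patent claims.
    """

    def surface(x: Vector) -> float:
        num = 0.0
        den = 0.0
        for mean, var, coeffs in known.values():
            dist_sq = sum((a - b) ** 2 for a, b in zip(x, mean))
            weight = math.exp(-dist_sq / (2.0 * (bandwidth ** 2 + var)))
            num += weight * coeffs[label]
            den += weight
        return num / den

    return surface


def coefficients_for(
    mean: Vector, variance: float, known: KnownKeywords, labels: Sequence[str]
) -> Dict[str, float]:
    """Evaluate every label's surface at the new keyword's embedding value."""
    x = embedding_value(mean, variance)
    return {label: make_label_function(label, known)(x) for label in labels}


# A new keyword whose (hypothetical) distribution sits near "refund"/"invoice".
coeffs = coefficients_for((0.85, 0.15), 0.08, KNOWN, ["billing", "spam"])
```

In this sketch the new keyword's coefficient for "billing" comes out larger than its coefficient for "spam", because its embedding value lies close to the two billing-related known keywords. A production system would more likely fit the surfaces once and reuse them rather than rebuilding them per call.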
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.