Event clustering and classification with document embedding
US10762439B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 26, 2016 |
| Grant date | Sep 1, 2020 |
| Priority date | — |
| Expiry date | Nov 12, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embedding representation for a document is generated based on clustering words in the document. Representative clusters are selected and a weighted sum of the embeddings of the words in the selected clusters is determined as a document embedding. Documents are labeled based on document embeddings. A machine learning algorithm is trained using the documents. The machine learning algorithm predicts a label of a given document based on the given document's document embedding.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.