Classification and moderation of text
US11698922B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Nov 2, 2018 |
| Grant date | Jul 11, 2023 |
| Priority date | — |
| Expiry date | May 12, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06Q10/107
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Disclosed herein are techniques and systems for classifying and moderating text using a machine learning approach that is based on a word embedding process. For instance, word embedding vectors may be used to determine clusters of associated text (e.g., similar words) from a corpus of comments maintained by a remote computing system. The remote computing system may then identify, within the corpus of comments, a subset of comments that include text from a given cluster that was determined, from human labeling input, to include a particular type of word or speech. Using this information, the corpus of comments may be labeled with one of multiple class labels. A machine learning model(s) may be trained to classify text as one of the multiple class labels using a sampled set of labeled comments as training data. At runtime, text can be moderated based on its class label.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.