Patent · US Active

Method and apparatus for labeling data

US11386463B2 · kind B2 · utility

0Cited by
1References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 5, 2020
Grant dateJul 12, 2022
Priority date
Expiry dateMay 5, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06Q30/0275
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Aspects of the subject disclosure may include, for example, determining classes from a corpus based on topic modeling, data clustering and unsupervised learning. Labels are determined for each of the classes and trained models are generated for each of the classes by assignment of a plurality of textual documents to labels based on a highest number of matches. A raw textual document can be tokenized and stop words removed. A corresponding one of the trained models can be selected according to a class that is applicable to subject matter of the raw textual document. The processed document can be assigned to a target label based on a highest number of matches of words. Other embodiments are disclosed.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.