Patent · US Active

Automatically labeling data using conceptual descriptions

US11720748B2 · kind B2 · utility

0Cited by
6References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 27, 2020
Grant dateAug 8, 2023
Priority date
Expiry dateAug 3, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system for automatically labeling data using conceptual descriptions. In one example, the system includes an electronic processor configured to generate unlabeled training data examples from one or more natural language documents and, for each of a plurality of categories, determine one or more concepts associated with a conceptual description of the category and generate a weak annotator for each of the one or more concepts. The electronic processor is also configured to apply each weak annotator to each training data example and, when a training data example satisfies a weak annotator, output a category associated with the weak annotator. For each training data example, the electronic processor determines a probabilistic distribution of the plurality of categories. For each training data example, the electronic processor labels the training data example with a category having the highest value in the probabilistic distribution determined for the training data example.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.