Patent · US Active

Labeling of data for machine learning

US10902352B2 · kind B2 · utility

0Cited by
7References
7Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 6, 2020
Grant dateJan 26, 2021
Priority date
Expiry dateJan 6, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N5/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer generates labels for machine learning algorithms by retrieving, from a data storage circuit, multiple label sets that contain labels that each classify data points in a corpus of data. A graph is generated that includes a plurality of edges, each edge between two respective labels from different label sets of the multiple label sets. Weights are determined for the plurality of edges based upon a consistency between data points classified by two labels connected by the edges. An algorithm is applied that groups labels from the multiple label sets based upon the weights for the plurality of edges. Data points are identified from the corpus of data that represent conflicts within the grouped labels. An electronic message is transmitted in order to present the identified data points to entities for further classification. A new label set is generated using the further classification received from the entities.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.