Patent · US Active

Labeling of data for machine learning

US9747555B2 · kind B2 · utility

2Cited by
4References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 5, 2014
Grant dateAug 29, 2017
Priority date
Expiry dateOct 4, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N5/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer generates labels for machine learning algorithms by retrieving, from a data storage circuit, multiple label sets that contain labels that each classify data points in a corpus of data. A graph is generated that includes a plurality of edges, each edge between two respective labels from different label sets of the multiple label sets. Weights are determined for the plurality of edges based upon a consistency between data points classified by two labels connected by the edges. An algorithm is applied that groups labels from the multiple label sets based upon the weights for the plurality of edges. Data points are identified from the corpus of data that represent conflicts within the grouped labels. An electronic message is transmitted in order to present the identified data points to entities for further classification. A new label set is generated using the further classification received from the entities.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.