Patent · US Active

Systems and methods for the distributed categorization of source data

US10157217B2 · kind B2 · utility

Cited by: 5
References: 21
Claims: 18
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 27, 2016
Grant date: Dec 18, 2018
Priority date
Expiry date: May 27, 2036

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/24573
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Systems and methods for the crowdsourced clustering of data items in accordance with embodiments of the invention are disclosed. In one embodiment of the invention, a method for determining categories for a set of source data includes obtaining a set of source data, determining a plurality of subsets of the source data, where a subset of the source data includes a plurality of pieces of source data in the set of source data, generating a set of pairwise annotations for the pieces of source data in each subset of source data, clustering the set of source data into related subsets of source data based on the sets of pairwise annotations for each subset of source data, and identifying a category for each related subset of source data based on the clusterings of source data and the source data metadata for the pieces of source data in the group of source data.
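
The steps named in the abstract (subsets, pairwise annotations, clustering, category identification) can be illustrated with a minimal Python sketch. The abstract does not specify a clustering algorithm, so the majority vote with union-find merging used here is a stand-in, not the patented method. All function names (`make_subsets`, `pairwise_annotations`, `cluster`, `identify_category`) and the `same_category` callback, which simulates an annotator's judgment, are hypothetical.

```python
import random
from collections import Counter
from itertools import combinations


def make_subsets(items, subset_size, num_subsets, seed=0):
    """Draw overlapping subsets of the source data (assumed: random sampling)."""
    rng = random.Random(seed)
    return [rng.sample(items, subset_size) for _ in range(num_subsets)]


def pairwise_annotations(subset, same_category):
    """Label every pair in a subset as same-category (True) or not (False).

    `same_category` stands in for a human annotator's judgment.
    """
    return {(a, b): same_category(a, b) for a, b in combinations(subset, 2)}


def cluster(items, annotation_sets):
    """Merge items whose pairwise labels are 'same' by strict majority vote."""
    votes = {}
    for annotations in annotation_sets:
        for (a, b), same in annotations.items():
            key = tuple(sorted((a, b)))
            yes, total = votes.get(key, (0, 0))
            votes[key] = (yes + int(same), total + 1)

    parent = {item: item for item in items}

    def find(x):
        # Follow parent links to the root, compressing the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), (yes, total) in votes.items():
        if yes * 2 > total:
            parent[find(a)] = find(b)

    groups = {}
    for item in items:
        groups.setdefault(find(item), []).append(item)
    return list(groups.values())


def identify_category(group, metadata):
    """Pick the most common metadata tag among a cluster's members."""
    return Counter(metadata[item] for item in group).most_common(1)[0][0]
```

A usage example under the same assumptions: with `metadata = {"cat1": "cat", "dog1": "dog", ...}` and `same_category = lambda a, b: metadata[a] == metadata[b]`, annotate each subset, pass the resulting list of annotation dictionaries to `cluster`, and call `identify_category` on each returned group. The subsets must overlap for clusters to connect across them, which is why a real system would choose them more carefully than uniform sampling.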

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.