Patent · US Active

Systems and methods for the distributed categorization of source data

US9355167B2 · kind B2 · utility

7Cited by
7References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 17, 2013
Grant dateMay 31, 2016
Priority date
Expiry dateOct 13, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/24573
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods for the crowdsourced clustering of data items in accordance embodiments of the invention are disclosed. In one embodiment of the invention, a method for determining categories for a set of source data includes obtaining a set of source data, determining a plurality of subsets of the source data, where a subset of the source data includes a plurality of pieces of source data in the set of source data, generating a set of pairwise annotations for the pieces of source data in each subset of source data, clustering the set of source data into related subsets of source data based on the sets of pairwise labels for each subset of source data, and identifying a category for each related subset of source data based on the clusterings of source data and the source data metadata for the pieces of source data in the group of source data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.