Systems and methods for the distributed categorization of source data
US9355167B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 17, 2013 |
| Grant date | May 31, 2016 |
| Priority date | — |
| Expiry date | Oct 13, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/24573
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and methods for the crowdsourced clustering of data items in accordance embodiments of the invention are disclosed. In one embodiment of the invention, a method for determining categories for a set of source data includes obtaining a set of source data, determining a plurality of subsets of the source data, where a subset of the source data includes a plurality of pieces of source data in the set of source data, generating a set of pairwise annotations for the pieces of source data in each subset of source data, clustering the set of source data into related subsets of source data based on the sets of pairwise labels for each subset of source data, and identifying a category for each related subset of source data based on the clusterings of source data and the source data metadata for the pieces of source data in the group of source data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.