Techniques for classifying and labeling data
US10037378B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 11, 2015 |
| Grant date | Jul 31, 2018 |
| Priority date | — |
| Expiry date | Nov 21, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/35
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques for classifying and labeling data are disclosed. In one embodiment, the techniques may be realized as a system for classifying and labeling data comprising one or more processors. The one or more processors may be configured to distribute training data across a plurality of hosts. Each of the hosts may be assigned a random subset of the training data, and configured to cluster its own subset independently. The one or more processors may be further configured to label each cluster of the training data. The one or more processors may be further configured to receive new data, associate the new data with a plurality of the clusters of the training data, and assign the new data a label. The label may be chosen from labels of the plurality of the clusters. And the label may have a maximum associative factor of the new data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.