Patent · US Active

Distributed predictive analytics data set

US11416713B1 · kind B1 · utility

1Cited by
74References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 18, 2019
Grant dateAug 16, 2022
Priority date
Expiry dateJun 16, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A novel distributed method for machine learning is described, where the algorithm operates on a plurality of data silos, such that the privacy of the data in each silo is maintained. In some embodiments, the attributes of the data and the features themselves are kept private within the data silos. The method includes a distributed learning algorithm whereby a plurality of data spaces are co-populated with artificial, evenly distributed data, and then the data spaces are carved into smaller portions whereupon the number of real and artificial data points are compared. Through an iterative process, clusters having less than evenly distributed real data are discarded. A plurality of final quality control measurements are used to merge clusters that are too similar to be meaningful. These distributed quality control measures are then combined from each of the data silos to derive an overall quality control metric.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.