Patent · US Active

Consistent sort-based record-level shuffling of machine learning data

US10713589B1 · kind B1 · utility

34Cited by
12References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 3, 2016
Grant dateJul 14, 2020
Priority date
Expiry dateMay 24, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/08
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A determination that a machine learning data set is to be shuffled is made. Tokens corresponding to the individual observation records are generated based on respective identifiers of the records' storage objects and record key values. Respective representative values are derived from the tokens. The observation records are rearranged based on a result of sorting the representative values and provided to a shuffle result destination.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.