Consistent sort-based record-level shuffling of machine learning data
US10713589B1 · kind B1 · utility
34Cited by
12References
21Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Mar 3, 2016 |
| Grant date | Jul 14, 2020 |
| Priority date | — |
| Expiry date | May 24, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N3/08
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A determination that a machine learning data set is to be shuffled is made. Tokens corresponding to the individual observation records are generated based on respective identifiers of the records' storage objects and record key values. Respective representative values are derived from the tokens. The observation records are rearranged based on a result of sorting the representative values and provided to a shuffle result destination.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.