Unique sampling of datasets
US12259865B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 14, 2022 |
| Grant date | Mar 25, 2025 |
| Priority date | — |
| Expiry date | Feb 27, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
One embodiment of the present invention sets forth a technique for sampling from a dataset. The technique includes determining a plurality of embeddings for a plurality of data points included in the dataset. The technique also includes populating a tree structure with the plurality of embeddings by generating a first node that stores a first set of embeddings included in the plurality of embeddings and generating a first plurality of nodes as children of the first node, where each node in the first plurality of nodes stores a different subset of embeddings in the first set of embeddings. The technique further includes sampling a subset of embeddings from the plurality of embeddings via a traversal of the tree structure, and generating a sampled dataset that includes a subset of data points corresponding to the subset of embeddings.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.