Patent · US Active

Unique sampling of datasets

US12259865B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 19
Family size: 0

Assignee

Inventors

Key dates

Filing date: Dec 14, 2022
Grant date: Mar 25, 2025
Priority date
Expiry date: Feb 27, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N20/00
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

One embodiment of the present invention sets forth a technique for sampling from a dataset. The technique includes determining a plurality of embeddings for a plurality of data points included in the dataset. The technique also includes populating a tree structure with the plurality of embeddings by generating a first node that stores a first set of embeddings included in the plurality of embeddings and generating a first plurality of nodes as children of the first node, where each node in the first plurality of nodes stores a different subset of embeddings in the first set of embeddings. The technique further includes sampling a subset of embeddings from the plurality of embeddings via a traversal of the tree structure, and generating a sampled dataset that includes a subset of data points corresponding to the subset of embeddings.
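
The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how the embeddings are computed, how a node's embeddings are divided among its children, or how the traversal selects nodes. The sketch assumes a caller-supplied `embed` function (a stand-in for a learned model), a two-way split at the median of the widest dimension, and a traversal that picks a child uniformly at random at each level. The names `Node`, `build_tree`, `sample_indices`, and `sample_dataset` are hypothetical.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Node:
    # Indices of the embeddings stored at this node.
    indices: list
    # Child nodes; each stores a different subset of this node's embeddings.
    children: list = field(default_factory=list)


def build_tree(embeddings, indices=None, leaf_size=2):
    """Populate a tree: the root stores every embedding, children store disjoint subsets."""
    if indices is None:
        indices = list(range(len(embeddings)))
    node = Node(indices=list(indices))
    if len(indices) <= max(leaf_size, 1):
        return node  # small enough to be a leaf

    # Assumption: split at the median of the dimension with the largest spread.
    def spread(d):
        values = [embeddings[i][d] for i in indices]
        return max(values) - min(values)

    dim = max(range(len(embeddings[0])), key=spread)
    ordered = sorted(indices, key=lambda i: embeddings[i][dim])
    mid = len(ordered) // 2
    node.children = [
        build_tree(embeddings, ordered[:mid], leaf_size),
        build_tree(embeddings, ordered[mid:], leaf_size),
    ]
    return node


def sample_indices(root, k, rng):
    """Sample k distinct embedding indices by repeated root-to-leaf traversals."""
    chosen = set()
    k = min(k, len(root.indices))
    while len(chosen) < k:
        node = root
        while node.children:
            # Only descend into children that still hold an unsampled embedding.
            open_children = [
                c for c in node.children if any(i not in chosen for i in c.indices)
            ]
            node = rng.choice(open_children)
        remaining = [i for i in node.indices if i not in chosen]
        chosen.add(rng.choice(remaining))
    return sorted(chosen)


def sample_dataset(data_points, embed, k, seed=None):
    """Embed the data points, build the tree, and return the sampled data points."""
    embeddings = [embed(p) for p in data_points]
    root = build_tree(embeddings)
    picked = sample_indices(root, k, random.Random(seed))
    return [data_points[i] for i in picked]


# Usage with a toy embedding (string length and vowel count), purely illustrative.
words = ["tree", "node", "leaf", "embedding", "sample", "dataset", "traversal", "root"]
toy_embed = lambda w: (float(len(w)), float(sum(ch in "aeiou" for ch in w)))
print(sample_dataset(words, toy_embed, k=3, seed=0))
```

Because each traversal chooses among subtrees rather than among individual points, a sparsely populated region of the embedding space is visited about as often as a dense one at the same depth. That is one plausible reading of how a tree traversal could yield a more varied sample than uniform random selection.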

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.