Patent · US Active

Data file clustering with KD-classifier trees

US12405920B2 · kind B2 · utility

0Cited by
3References
23Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 5, 2023
Grant dateSep 2, 2025
Priority date
Expiry dateFeb 21, 2044

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/285
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A data processing service generates a data classifier tree for managing data files of a data table. The data classifier tree may be configured as a KD-classifier tree and includes a plurality of nodes and edges. A node of the data classifier tree may represent a splitting condition with respect to key-values for a respective key. A node of the data classifier tree may be associated with one or more data files assigned to the node. The data files assigned to the node each include a subset of records having key-values that satisfy the conditions represented by the node and parent nodes of the node. The data processing service may efficiently cluster the data in the data table while reducing the number of data files that are rewritten when data is modified or added to the data table.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.