Patent · US Active

Data file clustering with KD-epsilon trees

US12332862B2 · kind B2 · utility

0Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 6, 2023
Grant dateJun 17, 2025
Priority date
Expiry dateDec 6, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/285
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.