Patent · US Active

Techniques for a deterministic distributed cache to accelerate SQL queries

US12111832B2 · kind B2 · utility

Cited by: 0
References: 3
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 16, 2021
Grant date: Oct 8, 2024
Priority date
Expiry date: Jan 26, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/24552
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques for providing improved distributed caching are disclosed. A distributed computing system can be implemented with a cluster including a plurality of worker nodes configured to host one or more executors for processing data related to a query. The worker nodes can host a cache accessible to the executors. The data can be processed as a plurality of data segments. The worker nodes can be uniformly assigned a plurality of token bounds defining a range of integer token values. A hashing algorithm can be used to compute a token for each data segment associated with the query. Tasks can be launched on the executors preferentially, such that the task for processing a data segment is launched on the preferred executor, namely the one whose associated token bounds contain the token of that data segment. Executors can be instructed to review the associated cache to identify outlier data segments and inform other nodes in the cluster.
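
The token scheme described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the 32-bit token space, the SHA-256 hash, the equal-width split, the string segment identifiers, and all function names are assumptions made for illustration, and treating an "outlier" as a cached segment whose token falls outside the local worker's bounds is one plausible reading of the abstract.

```python
import hashlib

TOKEN_SPACE = 2 ** 32  # assumed size of the integer token range


def assign_token_bounds(workers):
    """Split [0, TOKEN_SPACE) into contiguous, near-equal ranges, one per worker.

    Returns {worker: (low, high)} with an inclusive low and exclusive high bound.
    """
    n = len(workers)
    return {
        worker: (i * TOKEN_SPACE // n, (i + 1) * TOKEN_SPACE // n)
        for i, worker in enumerate(workers)
    }


def segment_token(segment_id):
    """Hash a segment identifier to a stable integer token (hash choice is assumed)."""
    digest = hashlib.sha256(segment_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % TOKEN_SPACE


def preferred_worker(bounds, token):
    """Return the worker whose token bounds contain the given token."""
    for worker, (low, high) in bounds.items():
        if low <= token < high:
            return worker
    raise ValueError(f"token {token} is outside the token space")


def find_outliers(worker, cached_segment_ids, bounds):
    """List cached segments whose token is outside this worker's bounds."""
    low, high = bounds[worker]
    return [
        seg for seg in cached_segment_ids
        if not (low <= segment_token(seg) < high)
    ]


if __name__ == "__main__":
    bounds = assign_token_bounds(["worker-0", "worker-1", "worker-2", "worker-3"])
    for seg in ["part-0001.parquet", "part-0002.parquet"]:  # hypothetical segments
        print(seg, "->", preferred_worker(bounds, segment_token(seg)))
```

Because the token depends only on the segment identifier, every scheduler that uses the same bounds picks the same preferred worker for a given segment, which is what lets a repeated query find its data already in that worker's cache.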

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.