Techniques for a deterministic distributed cache to accelerate SQL queries
US12111832B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 16, 2021 |
| Grant date | Oct 8, 2024 |
| Priority date | — |
| Expiry date | Jan 26, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/24552
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques for providing improved distributed caching are disclosed. A distributed computing system can be implemented with a cluster including a plurality of worker nodes configured to host one or more executors for processing data related to a query. The worker nodes can host a cache accessible to the executors. The data can be processed as a plurality of data segments. The worker nodes can be uniformly assigned a plurality of token bounds defining a range of integer token values. A hashing algorithm can be used to compute a token for each data segment associated with the query. Tasks can be launched on the executors preferentially, such that the task for processing a data segment is launched on a preferred executor, namely one whose associated token bounds contain the token of that data segment. Executors can be instructed to review the associated cache to identify outlier data segments and inform other nodes in the cluster.
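The token scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not name the hashing algorithm, the size of the token space, or any function names, so `TOKEN_SPACE`, the use of SHA-256, and the helpers `token_bounds`, `segment_token`, `preferred_worker`, and `find_outliers` are all assumptions made for illustration. The sketch also treats an "outlier" as a cached segment whose token falls outside the bounds of the worker holding it, which is one plausible reading of the abstract.

```python
import hashlib

# Assumed size of the integer token space; the abstract does not specify one.
TOKEN_SPACE = 2**32


def token_bounds(workers):
    """Uniformly split [0, TOKEN_SPACE) into one contiguous range per worker.

    Returns a dict mapping worker -> (low, high), with low inclusive and
    high exclusive. The last worker absorbs any remainder.
    """
    n = len(workers)
    step = TOKEN_SPACE // n
    bounds = {}
    for i, worker in enumerate(workers):
        low = i * step
        high = TOKEN_SPACE if i == n - 1 else (i + 1) * step
        bounds[worker] = (low, high)
    return bounds


def segment_token(segment_id):
    """Deterministically hash a data segment identifier to an integer token.

    SHA-256 is an assumption; any stable hash would serve the same purpose.
    """
    digest = hashlib.sha256(segment_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % TOKEN_SPACE


def preferred_worker(segment_id, bounds):
    """Return the worker whose token bounds contain the segment's token."""
    token = segment_token(segment_id)
    for worker, (low, high) in bounds.items():
        if low <= token < high:
            return worker
    raise ValueError("token bounds do not cover the token space")


def find_outliers(worker, cached_segments, bounds):
    """List cached segments whose tokens fall outside this worker's bounds."""
    low, high = bounds[worker]
    return [s for s in cached_segments if not low <= segment_token(s) < high]


if __name__ == "__main__":
    bounds = token_bounds(["worker-a", "worker-b", "worker-c", "worker-d"])
    for seg in ["part-0001", "part-0002", "part-0003"]:
        print(seg, "->", preferred_worker(seg, bounds))
```

Because the token depends only on the segment identifier, repeated queries over the same segment map to the same preferred worker, which is what lets that worker's cache be reused. How outliers are reported to other nodes is not covered by this sketch.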
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.