Patent · US Active

Index generation using lazy reassembling of semi-structured data

US11816107B2 · kind B2 · utility

0Cited by
67References
30Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 27, 2022
Grant dateNov 14, 2023
Priority date
Expiry dateDec 27, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F17/18
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A pruning index is generated for a source table organized into a set of batch units. The source table comprises a column of semi-structured data. The pruning index comprises a set of filters that index distinct values in each column of the source table. Rather than reassembling an entire tree structure of the semi-structured data prior to indexing, the generating of the pruning index comprises traversing a reassembly hook object that represents a first portion of the semi-structured data that is subcolumnarized and traversing a residual object that represents a second portion of the semi-structured data that is not subcolumnarized. The reassembly hook object is traversed to identify values corresponding to the first portion of the semi-structured data and the residual object is traversed to identify values corresponding to the second portion. The pruning index is stored with an association with the source table.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.