Patent · US Active

Lazy reassembling of semi-structured data

US11567939B2 · kind B2 · utility

Cited by: 2
References: 44
Claims: 30
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 21, 2022
Grant date: Jan 31, 2023
Priority date
Expiry date: Jul 21, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F17/18
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A pruning index is generated for a source table organized into a set of batch units. The source table comprises a column of semi-structured data. The pruning index comprises a set of filters that index distinct values in each column of the source table. Rather than reassembling an entire tree structure of the semi-structured data prior to indexing, the generating of the pruning index comprises traversing a reassembly hook object that represents a first portion of the semi-structured data that is subcolumnarized and traversing a residual object that represents a second portion of the semi-structured data that is not subcolumnarized. The reassembly hook object is traversed to identify values corresponding to the first portion of the semi-structured data and the residual object is traversed to identify values corresponding to the second portion. The pruning index is stored with an association with the source table.
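
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name and data layout in it is hypothetical. It assumes the reassembly hook object can be modeled as a mapping from a path to that path's extracted column of values, and the residual object as a per-row nested structure holding whatever was not subcolumnarized. A plain Python set stands in for each per-batch-unit filter; a real system would more likely use a compact probabilistic filter.

```python
from typing import Any, Dict, Iterator, List, Set, Tuple

Entry = Tuple[str, Any]  # (path, scalar value)


def iter_hook_values(hook: Dict[str, List[Any]]) -> Iterator[Entry]:
    """Walk the (hypothetical) hook: path -> extracted column of values."""
    for path, column in hook.items():
        for value in column:
            if value is not None:
                yield (path, value)


def iter_residual_values(node: Any, prefix: str = "") -> Iterator[Entry]:
    """Walk one row's residual, i.e. the part that was not subcolumnarized."""
    if isinstance(node, dict):
        for key, child in node.items():
            child_path = f"{prefix}.{key}" if prefix else key
            yield from iter_residual_values(child, child_path)
    elif isinstance(node, list):
        for child in node:
            yield from iter_residual_values(child, prefix + "[]")
    elif node is not None:
        yield (prefix, node)


def build_pruning_index(batch_units: List[Dict[str, Any]]) -> List[Set[Entry]]:
    """Build one filter per batch unit without reassembling full documents."""
    index: List[Set[Entry]] = []
    for unit in batch_units:
        seen: Set[Entry] = set(iter_hook_values(unit["hook"]))
        for row_residual in unit["residuals"]:
            seen.update(iter_residual_values(row_residual))
        index.append(seen)
    return index


def candidate_units(index: List[Set[Entry]], path: str, value: Any) -> List[int]:
    """Return the batch units that may contain path == value; others are pruned."""
    return [i for i, flt in enumerate(index) if (path, value) in flt]


# Hypothetical sample data: two batch units of one semi-structured column.
batch_units = [
    {
        "hook": {"user.id": [1, 2], "user.name": ["ann", "bob"]},
        "residuals": [{"tags": ["x"]}, {"extra": {"k": 5}}],
    },
    {
        "hook": {"user.id": [3], "user.name": ["cy"]},
        "residuals": [{"tags": ["y", "z"]}],
    },
]

index = build_pruning_index(batch_units)
print(candidate_units(index, "user.id", 3))   # [1]
print(candidate_units(index, "tags[]", "x"))  # [0]
```

In this sketch the two traversals feed the same filter, so a lookup does not need to know whether a path was subcolumnarized. The nested document for a row is never rebuilt.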

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.