Patent · US Active

Record deduplication in database systems

US11321288B2 · kind B2 · utility

0Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 5, 2020
Grant dateMay 3, 2022
Priority date
Expiry dateDec 29, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/2379
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for execution by a record processing and storage system includes receiving a plurality of records and corresponding row numbers. Pages are generated from the received records. Page metadata is generated for each page that includes row number span data based on row numbers of the records included in each page. Pairs of pages are identified in the plurality of pages based on having row number span data in their page metadata that include a row number span overlap. For each pair of pages, row number span data is updated for a first page in the pair by removing the row number span overlap with a second page in the pair. Reads of pages are performed based on their row number span data. Only records of each first page of each pair of pages having row numbers that are within the updated row number span data are read.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.