Patent · US Active

Duplicate table identification in enterprise database systems for data storage optimization

US11422993B2 · kind B2 · utility

0Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 25, 2020
Grant dateAug 23, 2022
Priority date
Expiry dateAug 13, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/285
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

There are provided systems and methods for duplicate table identification in enterprise database systems for data storage optimization. A service provider, such as an electronic transaction processor for digital transactions, may determine data duplication in database tables so that database storage resources may be optimized. In order to determine data duplication, within database tables, a data collector daemon operation and/or application may collect metadata for tables within a domain. Using the metadata, a master table and derived tables may be determined for a group of the tables. Further, a duplication factor may be determined based on matching columns in the tables, a usage factor may be determined using processing hits to the tables, and a size factor may be determine based on table size. This allows for determination of a relevance score of the group, which provides a measure of duplication of data within those tables.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.