Duplicate table identification in enterprise database systems for data storage optimization
US11422993B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 25, 2020 |
| Grant date | Aug 23, 2022 |
| Priority date | — |
| Expiry date | Aug 13, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/285
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
There are provided systems and methods for duplicate table identification in enterprise database systems for data storage optimization. A service provider, such as an electronic transaction processor for digital transactions, may determine data duplication in database tables so that database storage resources may be optimized. In order to determine data duplication, within database tables, a data collector daemon operation and/or application may collect metadata for tables within a domain. Using the metadata, a master table and derived tables may be determined for a group of the tables. Further, a duplication factor may be determined based on matching columns in the tables, a usage factor may be determined using processing hits to the tables, and a size factor may be determine based on table size. This allows for determination of a relevance score of the group, which provides a measure of duplication of data within those tables.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.