Storage configuration in data warehouses
US9563687B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 13, 2014 |
| Grant date | Feb 7, 2017 |
| Priority date | — |
| Expiry date | Jul 25, 2035 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/217
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques are described for employing a graph-based analysis to determine a configuration of datasets to be stored on data storage systems in a data warehouse environment. Associations between datasets may be determined based on the parsing of join statements or other types of statements in jobs that are executed on the data storage systems. A graph may be generated that describes the associations among datasets. A greedy breadth-first traversal of the graph may be performed to determine sets of associated datasets. A utilization metric describing a weight of storing the datasets may be determined and employed to identify a data storage system on which to store a set of associated datasets, given the storage and processing capacity of the data storage system.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.