Patent · US Active

Storage configuration in data warehouses

US9563687B1 · kind B1 · utility

Cited by: 1
References: 7
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Nov 13, 2014
Grant date: Feb 7, 2017
Priority date
Expiry date: Jul 25, 2035

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/217
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques are described for employing a graph-based analysis to determine a configuration of datasets to be stored on data storage systems in a data warehouse environment. Associations between datasets may be determined based on the parsing of join statements or other types of statements in jobs that are executed on the data storage systems. A graph may be generated that describes the associations among datasets. A greedy breadth-first traversal of the graph may be performed to determine sets of associated datasets. A utilization metric describing a weight of storing the datasets may be determined and employed to identify a data storage system on which to store a set of associated datasets, given the storage and processing capacity of the data storage system.
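The pipeline the abstract outlines (parse joins, build a graph, traverse it greedily, place each set by utilization) can be illustrated with a minimal Python sketch. This is not the patented method. Every name here is hypothetical, the regex-based join parsing handles only simple SQL, and the utilization metric is assumed to be the larger of the storage and processing fractions a set would consume on a system.

```python
import re
from collections import defaultdict, deque

# Hypothetical, simplified patterns: real job parsing would need a SQL parser.
FROM_RE = re.compile(r"\bFROM\s+(\w+)", re.IGNORECASE)
JOIN_RE = re.compile(r"\bJOIN\s+(\w+)", re.IGNORECASE)

def build_graph(jobs):
    """Edge weight = number of join statements that associate two datasets."""
    graph = defaultdict(lambda: defaultdict(int))
    for sql in jobs:
        base = FROM_RE.search(sql)
        if base is None:
            continue
        for other in JOIN_RE.findall(sql):
            if other != base.group(1):
                graph[base.group(1)][other] += 1
                graph[other][base.group(1)] += 1
    return graph

def associated_sets(graph, datasets):
    """Breadth-first traversal that greedily visits the heaviest edges first."""
    seen, groups = set(), []
    for start in sorted(datasets):
        if start in seen:
            continue
        seen.add(start)
        group, queue = [], deque([start])
        while queue:
            node = queue.popleft()
            group.append(node)
            neighbors = graph.get(node, {})
            for nxt in sorted(neighbors, key=lambda n: (-neighbors[n], n)):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        groups.append(group)
    return groups

def place(groups, demand, systems):
    """Assign each set to the system where the assumed utilization is lowest.

    demand:  dataset -> (storage, processing)
    systems: system  -> [free_storage, free_processing] (mutated as sets land)
    """
    placement = {}
    for group in groups:
        need_s = sum(demand[d][0] for d in group)
        need_p = sum(demand[d][1] for d in group)
        best, best_util = None, None
        for name, (free_s, free_p) in systems.items():
            if free_s <= 0 or free_p <= 0 or need_s > free_s or need_p > free_p:
                continue  # the set does not fit on this system
            util = max(need_s / free_s, need_p / free_p)  # assumed metric
            if best is None or util < best_util:
                best, best_util = name, util
        if best is not None:
            systems[best][0] -= need_s
            systems[best][1] -= need_p
        placement[tuple(group)] = best  # None means no system had capacity
    return placement

# Illustrative data only; dataset names, sizes, and capacities are made up.
jobs = [
    "SELECT * FROM orders JOIN customers ON orders.cid = customers.id",
    "SELECT * FROM orders JOIN items ON orders.id = items.oid",
    "SELECT * FROM logs JOIN sessions ON logs.sid = sessions.id",
]
demand = {"orders": (40, 4), "customers": (10, 1), "items": (30, 2),
          "logs": (50, 3), "sessions": (5, 1)}
systems = {"sys_a": [100, 10], "sys_b": [60, 5]}

groups = associated_sets(build_graph(jobs), demand)
placement = place(groups, demand, systems)
```

In this sketch, datasets that are joined together end up in one set and are stored on the same system, so jobs that join them need not move data between systems. Choosing the system with the lowest utilization is one possible policy; a tighter-packing policy would pick the highest feasible utilization instead.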

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.