Optimizing sparse schema-less data in data stores
US9715560B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 27, 2013 |
| Grant date | Jul 25, 2017 |
| Priority date | — |
| Expiry date | May 26, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/211
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Various embodiments of the invention relate to optimizing storage of schema-less data. At least one of a schema-less dataset including a plurality of resources one or more query workloads associated with the plurality of resources is received. Each resource is associated with at least a plurality of properties. At least one set of co-occurring properties from the plurality of properties is identified. A graph including a plurality of nodes is generated. Each of the nodes represents a unique property in the set of co-occurring properties. The graph further includes an edge connecting each node representing a pair of co-occurring properties. A schema is generated based on the graph that assigns a column identifier from a table to each unique property represented by one of the nodes in the graph.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.