Data record compression with progressive and/or selective decomposition
US9025892B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 2, 2014 |
| Grant date | May 5, 2015 |
| Priority date | — |
| Expiry date | Dec 2, 2034 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H03M7/6088
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed herein are systems and methods for compressing structured or semi-structured data in a horizontal manner while achieving compression ratios similar to those of vertical compression. Collections that include structured or semi-structured data include a number of fields and are described using a schema. Fields include information having semantic similarity and are compressed using methods suitable for the type of data. Data of a collection is compressed after fragmentation or may be normalized prior to compression. Data with semantic similarity is compressed using token tables and/or n-gram tables, where values with higher weight, defined as the product of frequency and length, may be stored in the lower-numbered indices of the table. Records include record descriptor bytes, field descriptor bytes, zero or more array descriptor bytes, zero or more object descriptor bytes, or bytes representing the data associated with the record. Data is indexed or compressed by a suitable module.
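The weighted ordering of a token table described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function names (`build_token_table`, `encode`, `decode`), the sample field values, and the tie-breaking rule are all hypothetical and chosen only for illustration. The one element taken from the abstract is the weight itself, the product of a value's frequency and its length, with heavier values placed at lower indices.

```python
from collections import Counter


def build_token_table(values):
    """Order distinct values by weight = frequency * length, heaviest first.

    The weight definition follows the abstract; the alphabetical
    tie-break is an assumption added to make the order deterministic.
    """
    freq = Counter(values)
    return sorted(freq, key=lambda tok: (-freq[tok] * len(tok), tok))


def encode(values, table):
    """Replace each value with its index in the token table."""
    index = {tok: i for i, tok in enumerate(table)}
    return [index[v] for v in values]


def decode(indices, table):
    """Recover the original values from their table indices."""
    return [table[i] for i in indices]


# Hypothetical values of a single field across several records.
field_values = ["San Francisco", "NY", "San Francisco", "NY", "NY", "Boston", "NY"]

table = build_token_table(field_values)
codes = encode(field_values, table)

print(table)  # ['San Francisco', 'NY', 'Boston']
print(codes)  # [0, 1, 0, 1, 1, 2, 1]
```

In this example "San Francisco" has weight 2 × 13 = 26, "NY" has 4 × 2 = 8, and "Boston" has 1 × 6 = 6, so the longest frequently repeated value receives index 0 even though "NY" occurs more often.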
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.