Out-of-core similarity matching
US8914338B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 22, 2011 |
| Grant date | Dec 16, 2014 |
| Priority date | — |
| Expiry date | May 1, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/24556
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for storing data in a data storage system by partitioning the data into a plurality of data chunks and generating representative data for each of the plurality of chunks by applying a predetermined algorithm to each chunk of the plurality of chunks. Subsequently, the representative data is compared and sorted. Representative data for base data chunks and representative data for other data chunks that can be stored relative to the base data chunks are identified by evaluating the sorted set of representative data. Finally, each of the other data chunks identified as those that can be stored relative to a base data chunk are stored in the data storage system as the difference between the data chunk and a base data chunk.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.