Synchronization of metadata in a distributed storage system
US10740300B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 7, 2017 |
| Grant date | Aug 11, 2020 |
| Priority date | — |
| Expiry date | May 21, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/275
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A client machine writes to and reads from a virtual disk on a remote storage platform using a storage protocol. Metadata is generated and is stored in replicas on different metadata nodes of the storage platform. A modified log-structured merge tree is used to store and compact string-sorted tables of metadata. During file storage and compaction, a consistent file identification scheme is used across all metadata nodes. A fingerprint file is calculated for each SST file on disk that includes hash values corresponding to regions of the SST file. To synchronize, the fingerprint files of two SST files are compared, and if any hash values are missing from a fingerprint file then the key-value-timestamp triples corresponding to these missing hash values are sent to the SST file that is missing them in the SST file is compacted with the missing triples to create a new version of the SST file. The other fingerprint file is then analyzed the same way.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.