Method for managing complex genomic data workflows
US10331848B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 26, 2017 |
| Grant date | Jun 25, 2019 |
| Priority date | — |
| Expiry date | Dec 7, 2037 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F17/18
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method for efficiently processing and storing large data sets associated with a multi-stage bioinformatics analysis of genomic data is disclosed. The method increases the efficiency of electronic storage of these large data sets by automatically deleting or compressing intermediate data or a portion of the output data, and by compressing input data, where both deletion and compression are based on predetermined characteristics of the data. When necessary, such data can be recovered using generated metadata associated with the data. Doing so not only improves the storage efficiency of massively large genomic datasets, but also allows output data to be reproduced consistently by re-processing intermediate data based on information stored in the metadata.
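The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Artifact` class, the `role` and `producer` fields, the size threshold, and the rule in `decide` are all hypothetical choices made here to stand in for the "predetermined characteristics" and "generated metadata" that the abstract mentions without specifying.

```python
import gzip
import hashlib
from dataclasses import dataclass
from pathlib import Path


@dataclass
class Artifact:
    path: Path
    role: str      # assumed labels: "input", "intermediate", or "output"
    producer: str  # hypothetical: the step that could regenerate this file


def sha256(path: Path) -> str:
    """Checksum used to confirm that recovered data matches the original."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def decide(artifact: Artifact, size_limit: int) -> str:
    """Assumed rule: pick an action from the file's role and size."""
    if artifact.role == "input":
        return "compress"
    if artifact.role == "intermediate":
        too_big = artifact.path.stat().st_size > size_limit
        return "delete" if too_big else "compress"
    return "keep"


def apply_policy(artifact: Artifact, size_limit: int = 1_000_000) -> dict:
    """Apply the action and return metadata describing how to recover the file."""
    action = decide(artifact, size_limit)
    meta = {
        "path": str(artifact.path),
        "role": artifact.role,
        "action": action,
        "sha256": sha256(artifact.path),
        "producer": artifact.producer,
    }
    if action == "compress":
        gz = artifact.path.with_name(artifact.path.name + ".gz")
        gz.write_bytes(gzip.compress(artifact.path.read_bytes()))
        artifact.path.unlink()
        meta["stored_as"] = str(gz)
    elif action == "delete":
        # Only the metadata survives; the producer step must be re-run to recover.
        artifact.path.unlink()
    return meta


def restore_compressed(meta: dict) -> Path:
    """Recover a compressed file and verify it against the recorded checksum."""
    target = Path(meta["path"])
    target.write_bytes(gzip.decompress(Path(meta["stored_as"]).read_bytes()))
    if sha256(target) != meta["sha256"]:
        raise ValueError(f"checksum mismatch for {target}")
    return target
```

In this sketch a deleted intermediate file is represented only by its metadata record, so recovering it would mean re-running the recorded `producer` step and comparing the result against the stored checksum. That comparison is one plausible way to check the consistent reproduction the abstract describes.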
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.