Optimizing k-mer databases by k-mer subtraction
US11809498B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 7, 2019 |
| Grant date | Nov 7, 2023 |
| Priority date | — |
| Expiry date | Apr 2, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG16B50/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods are disclosed for reducing the size of a k-mer reference database used for queries and/or taxonomic classifications when available computer storage and/or memory are inadequate. The k-mers of the reference database have been previously classified to a taxonomy, preferably based on genetic distances. In one method, the k-mers are separated into one or more groups followed by removing k-mers common to the groups. In another method, k-mers are removed based on a selected taxonomic threshold level. A third method combines the features of the previous two methods. The methods are adaptable to machine learning.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.