Methods and systems for assembly of protein sequences
US10309968B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 18, 2017 |
| Grant date | Jun 4, 2019 |
| Priority date | — |
| Expiry date | Nov 17, 2037 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG01N2560/00
- WIPO fieldMeasurement
- WIPO sectorInstruments
Abstract
Methods and systems for determining amino acid sequence of a polypeptide or protein from mass spectrometry data is provided, using a weighted de Bruijn graph. Extracted and purified protein is cleaved into a mixture of peptide and then analyzed using mass spectrometry. A list of peptide sequences is derived from mass spectrometry fragment data by de novo sequencing, and amino acid confidence scores are determined from peak fragment ion intensity. A weighted de Bruijn graph is constructed for the list of peptide sequences having node weights defined by k−1 mer confidence scores. At least one contig is assembled from the de Bruijn graph by identifying node weights having the highest k−1 mer confidence scores.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.