Patent · US Active

Methods and systems for assembly of protein sequences

US10309968B2 · kind B2 · utility

1Cited by
0References
31Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 18, 2017
Grant dateJun 4, 2019
Priority date
Expiry dateNov 17, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG01N2560/00
  • WIPO fieldMeasurement
  • WIPO sectorInstruments

Abstract

Methods and systems for determining amino acid sequence of a polypeptide or protein from mass spectrometry data is provided, using a weighted de Bruijn graph. Extracted and purified protein is cleaved into a mixture of peptide and then analyzed using mass spectrometry. A list of peptide sequences is derived from mass spectrometry fragment data by de novo sequencing, and amino acid confidence scores are determined from peak fragment ion intensity. A weighted de Bruijn graph is constructed for the list of peptide sequences having node weights defined by k−1 mer confidence scores. At least one contig is assembled from the de Bruijn graph by identifying node weights having the highest k−1 mer confidence scores.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.