David Tabb
Faculty Member
Last active: 6/26/2014

MS2Grouper: group assessment and synthetic replacement of duplicate proteomic tandem mass spectra.

Tabb DL, Thompson MR, Khalsa-Moyers G, VerBerkmoes NC, McDonald WH
J Am Soc Mass Spectrom. 2005 16 (8): 1250-61

PMID: 15979332 · DOI:10.1016/j.jasms.2005.04.010

Shotgun proteomics experiments require the collection of thousands of tandem mass spectra; these sets of data will continue to grow as new instruments become available that can scan at even higher rates. Such data contain substantial amounts of redundancy with spectra from a particular peptide being acquired many times during a single LC-MS/MS experiment. In this article, we present MS2Grouper, an algorithm that detects spectral duplication, assesses groups of related spectra, and replaces these groups with synthetic representative spectra. Errors in detecting spectral similarity are corrected using a paraclique criterion-spectra are only assessed as groups if they are part of a clique of at least three completely interrelated spectra or are subsequently added to such cliques by being similar to all but one of the clique members. A greedy algorithm constructs a representative spectrum for each group by iteratively removing the tallest peaks from the spectral collection and matching to peaks in the other spectra. This strategy is shown to be effective in reducing spectral counts by up to 20% in LC-MS/MS datasets from protein standard mixtures and proteomes, reducing database search times without a concomitant reduction in identified peptides.

MeSH Terms (8)

Algorithms Animals Databases, Protein Mass Spectrometry Peptides Proteome Proteomics Software

Connections (2)

This publication is referenced by other Labnodes entities: