Loading…
A Simulated MS/MS Library for Spectrum-to-spectrum Searching in Large Scale Identification of Proteins
Identifying peptides from mass spectrometric fragmentation data (MS/MS spectra) using search strategies that map protein sequences to spectra is computationally expensive. An alternative strategy uses direct spectrum-to-spectrum matching against a reference library of previously observed MS/MS that...
Saved in:
Published in: | Molecular & cellular proteomics 2009-04, Vol.8 (4), p.857-869 |
---|---|
Main Authors: | , , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Identifying peptides from mass spectrometric fragmentation data (MS/MS spectra) using search strategies that map protein sequences
to spectra is computationally expensive. An alternative strategy uses direct spectrum-to-spectrum matching against a reference
library of previously observed MS/MS that has the advantage of evaluating matches using fragment ion intensities and other
ion types than the simple set normally used. However, this approach is limited by the small sizes of the available peptide
MS/MS libraries and the inability to evaluate the rate of false assignments. In this study, we observed good performance of
simulated spectra generated by the kinetic model implemented in MassAnalyzer (Zhang, Z. (2004) Prediction of low-energy collision-induced
dissociation spectra of peptides. Anal. Chem. 76, 3908â3922; Zhang, Z. (2005) Prediction of low-energy collision-induced dissociation spectra of peptides with three or
more charges. Anal. Chem. 77, 6364â6373) as a substitute for the reference libraries used by the spectrum-to-spectrum search programs X!Hunter and
BiblioSpec and similar results in comparison with the spectrum-to-sequence program Mascot. We also demonstrate the use of
simulated spectra for searching against decoy sequences to estimate false discovery rates. Although we found lower score discrimination
with spectrum-to-spectrum searches than with Mascot, particularly for higher charge forms, comparable peptide assignments
with low false discovery rate were achieved by examining consensus between X!Hunter and Mascot, filtering results by mass
accuracy, and ignoring score thresholds. Protein identification results are comparable to those achieved when evaluating consensus
between Sequest and Mascot. Run times with large scale data sets using X!Hunter with the simulated spectral library are 7
times faster than Mascot and 80 times faster than Sequest with the human International Protein Index (IPI) database. We conclude
that simulated spectral libraries greatly expand the search space available for spectrum-to-spectrum searching while enabling
principled analyses and that the approach can be used in consensus strategies for large scale studies while reducing search
times. |
---|---|
ISSN: | 1535-9476 1535-9484 |
DOI: | 10.1074/mcp.M800384-MCP200 |