posted on 2012-07-06, 00:00authored byJeffrey
A. Milloy, Brendan K. Faherty, Scott A. Gerber
Modern mass spectrometers are now capable of producing
hundreds of thousands of tandem (MS/MS) spectra per experiment, making
the translation of these fragmentation spectra into peptide matches
a common bottleneck in proteomics research. When coupled with experimental
designs that enrich for post-translational modifications such as phosphorylation
and/or include isotopically labeled amino acids for quantification,
additional burdens are placed on this computational infrastructure
by shotgun sequencing. To address this issue, we have developed a
new database searching program that utilizes the massively parallel
compute capabilities of a graphical processing unit (GPU) to produce
peptide spectral matches in a very high throughput fashion. Our program,
named Tempest, combines efficient database digestion and MS/MS spectral
indexing on a CPU with fast similarity scoring on a GPU. In our implementation,
the entire similarity score, including the generation of full theoretical
peptide candidate fragmentation spectra and its comparison to experimental
spectra, is conducted on the GPU. Although Tempest uses the classical
SEQUEST XCorr score as a primary metric for evaluating similarity
for spectra collected at unit resolution, we have developed a new “Accelerated
Score” for MS/MS spectra collected at high resolution that
is based on a computationally inexpensive dot product but exhibits
scoring accuracy similar to that of the classical XCorr. In our experience,
Tempest provides compute-cluster level performance in an affordable
desktop computer.