posted on 2024-06-26, 05:14authored byChris Bielow, Nils Hoffmann, David Jimenez-Morales, Tim Van Den Bossche, Juan Antonio Vizcaíno, David L. Tabb, Wout Bittremieux, Mathias Walzer
Mass
spectrometry is a powerful technique for analyzing
molecules
in complex biological samples. However, inter- and intralaboratory
variability and bias can affect the data due to various factors, including
sample handling and preparation, instrument calibration and performance,
and data acquisition and processing. To address this issue, the Quality
Control (QC) working group of the Human Proteome Organization’s
Proteomics Standards Initiative has established the standard mzQC
file format for reporting and exchanging information relating to data
quality. mzQC is based on the JavaScript Object Notation (JSON) format
and provides a lightweight yet versatile file format that can be easily
implemented in software. Here, we present open-source software libraries
to process mzQC data in three programming languages: Python, using
pymzqc; R, using rmzqc; and Java, using jmzqc. The libraries follow
a common data model and provide shared functionalities, including
the (de)serialization and validation of mzQC files. We demonstrate
use of the software libraries in a workflow for extracting, analyzing,
and visualizing QC metrics from different sources. Additionally, we
show how these libraries can be integrated with each other, with existing
software tools, and in automated workflows for the QC of mass spectrometry
data. All software libraries are available as open source under the
MS-Quality-Hub organization on GitHub (https://github.com/MS-Quality-Hub).