posted on 2024-10-19, 14:10authored byHamid Hachemi, Jean Armengaud, Lucia Grenga, Olivier Pible
Metaproteomics is
a powerful tool to characterize how
microbiota
function by analyzing their proteic content by tandem mass spectrometry.
Given the complexity of these samples, accurately assessing their
taxonomical composition without prior information based solely on
peptide sequences remains a challenge. Here, we present LineageFilter,
a new python-based AI software for refined proteotyping of complex
samples using metaproteomics interpreted data and machine learning.
Given a tentative list of taxa, their abundances, and the scores associated
with their identified peptides, LineageFilter computes a comprehensive
set of features for each identified taxon at all taxonomical ranks.
Its machine-learning model then assesses the likelihood of each taxon’s
presence based on these features, enabling improved proteotyping and
sample-specific database construction.