posted on 2020-11-15, 15:43authored byCheloor
Kovilakam Sruthi, Hemalatha Balaram, Meher K. Prakash
Protein structure
and function can be severely altered
by even a single amino acid mutation. Predictions of mutational effects
using extensive artificial intelligence (AI)-based models, although
accurate, remain as enigmatic as the experimental observations in
terms of improving intuitions about the contributions of various factors.
Inspired by Lipinski’s rules for drug-likeness, we devise simple
thresholding criteria on five different descriptors such as conservation,
which have so far been limited to qualitative interpretations such
as high conservation implies high mutational effect. We analyze systematic
deep mutational scanning data of all possible single amino acid substitutions
on seven proteins (25153 mutations) to first define these thresholds
and then to evaluate the scope and limits of the predictions. At this
stage, the approach allows us to comment easily and with a low error
rate on the subset of mutations classified as neutral or deleterious
by all of the descriptors. We hope that complementary to the accurate
AI predictions, these thresholding rules or their subsequent modifications
will serve the purpose of codifying the knowledge about the effects
of mutations.