posted on 2025-09-03, 23:43authored byValerio Piomponi, Alberto Cazzaniga, Francesca Cuturello
Investigating structural variability is essential for
understanding
protein biological functions. Although AlphaFold2 accurately predicts
static structures, it fails to capture the full spectrum of functional
states. Recent methods have used AlphaFold2 to generate diverse structural
ensembles, but they offer limited interpretability and overlook the
evolutionary signals underlying the predictions. In this work, we
enhance the generation of conformational ensembles and identify sequence
patterns that influence the alternative fold predictions for several
protein families. Building on prior research that clustered multiple
sequence alignments to predict fold-switching states, we introduce
a refined clustering strategy that integrates protein language model
representations with hierarchical clustering, overcoming limitations
of density-based methods. Our strategy effectively identifies high-confidence
alternative conformations and generates abundant sequence ensembles,
providing a robust framework for applying direct coupling analysis
(DCA). Through DCA, we uncover key coevolutionary signals within the
clustered alignments, leveraging them to design mutations that stabilize
specific conformations, which we validate using alchemical free energy
calculations from molecular dynamics. Notably, our method extends
beyond fold-switching, effectively capturing a variety of conformational
changes.