Full text
2,112 characters
· extracted from
oa-doi-fallback
· click to expand
Abstract
The evolutionary co-occurrence of amino acid changes between protein residues underlies key structural and functional properties of protein families. Building on these coevolution patterns, methods have been developed to identify groups of residues associated with enzyme functionalities, such as Statistical Coupling Analysis (SCA) or Specificity-Determining Position (SDP) methods. These methods and their variations differ in the metrics used to quantify coevolution, residues weighting schemes, and corrections introduced to mitigate noise and phylogenetic biases. Yet, systematic comparisons across methods are rarely performed, and the evolutionary origins of the coevolutionary patterns highlighted by each approach are seldom addressed, limiting our ability to disentangle functional from phylogenetic contributions.
To address these issues, we introduce COCOA-Tree, a Python library for SCA-like dimensionality-reduction analyses. COCOA-Tree supports custom metrics and enables visualization of coevolutionary patterns on phylogenetic trees. We also provide guidance to map results onto 3D structures in PyMOL. Using COCOA-Tree, we reanalyze published datasets and uncover previously unnoticed evolutionary properties of groups of coevolving residues detected by SCA, known as sectors. In particular, in the well-studied S1A serine protease family, we show that two of the three known sectors exhibit qualitatively distinct levels of sequence conservation depending on the enzymatic functions and on the phylogenetic clades to which the proteins belong. We further show that different coevolution metrics often identify qualitatively distinct groups of coevolving residues, although they yield consistent results for mildly conserved residues. Overall, we expect COCOA-Tree to help identify residues that control protein function and thereby improve our capacity for functional engineering and our understanding of the principles governing protein evolution.
COCOA-Tree website: https://tree-timc.github.io/cocoatree
Competing Interest Statement
The authors have declared no competing interest.
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.