COCOA-Tree: Phylogenetic visualization and comparative analysis of coevolving residues

preprint OA: closed
Full text JSON View at publisher
Full text 2,112 characters · extracted from oa-doi-fallback · click to expand
Abstract The evolutionary co-occurrence of amino acid changes between protein residues underlies key structural and functional properties of protein families. Building on these coevolution patterns, methods have been developed to identify groups of residues associated with enzyme functionalities, such as Statistical Coupling Analysis (SCA) or Specificity-Determining Position (SDP) methods. These methods and their variations differ in the metrics used to quantify coevolution, residues weighting schemes, and corrections introduced to mitigate noise and phylogenetic biases. Yet, systematic comparisons across methods are rarely performed, and the evolutionary origins of the coevolutionary patterns highlighted by each approach are seldom addressed, limiting our ability to disentangle functional from phylogenetic contributions. To address these issues, we introduce COCOA-Tree, a Python library for SCA-like dimensionality-reduction analyses. COCOA-Tree supports custom metrics and enables visualization of coevolutionary patterns on phylogenetic trees. We also provide guidance to map results onto 3D structures in PyMOL. Using COCOA-Tree, we reanalyze published datasets and uncover previously unnoticed evolutionary properties of groups of coevolving residues detected by SCA, known as sectors. In particular, in the well-studied S1A serine protease family, we show that two of the three known sectors exhibit qualitatively distinct levels of sequence conservation depending on the enzymatic functions and on the phylogenetic clades to which the proteins belong. We further show that different coevolution metrics often identify qualitatively distinct groups of coevolving residues, although they yield consistent results for mildly conserved residues. Overall, we expect COCOA-Tree to help identify residues that control protein function and thereby improve our capacity for functional engineering and our understanding of the principles governing protein evolution. COCOA-Tree website: https://tree-timc.github.io/cocoatree Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00