GaugeFixer: overcoming parameter non-identifiability in models of sequence-function relationships

preprint OA: closed CC-BY-4.0
Full text 1,962 characters · extracted from oa-doi-fallback · 3 sections · click to expand

Abstract

Background Mathematical models that describe sequence-function relationships are widely used in computational biology. A key challenge when interpreting these models is that their parameters are not uniquely determined, i.e., many different parameter choices can encode the same sequence-function landscape. These ambiguities, which are known as “gauge freedoms,” must be removed before parameter values can be meaningfully interpreted. Doing this requires imposing additional mathematical constraints on parameter values, a procedure called “fixing the gauge.” We recently developed mathematical methods for fixing the gauge of a large class of commonly used models, but the direct computational implementation of these methods is often impractical due to the need for a projection matrix whose size scales quadratically with the number of parameters.

Results

Here we introduce GaugeFixer, a Python package that exploits the specific mathematical structure of gauge-fixing projections to achieve linear scaling in both time and memory. This dramatically increases efficiency, enabling application to models with millions of parameters. As one application, we analyzed the local structure of peaks in an empirical fitness landscape for translation initiation. GaugeFixer reveals striking similarities, but also fine-scaled variation, in ribosome binding preferences at different positions relative to the start codon, thereby aiding the interpretation of an otherwise unwieldy fitness landscape.

Conclusions

GaugeFixer thus fills an unmet need in the computational tools available for the biological interpretation of sequence-function relationships. Competing Interest Statement The authors have declared no competing interest. Footnotes Text revised for clarity and completeness. Figure 1 was also split into 3 figures. List of Abbreviations - 5’UTR - 5’ untranslated region - DNA - deoxyribonucleic acid - RNA - ribonucleic acid - SD - Shine-Dalgarno

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-24T02:00:01.246996+00:00
License: CC-BY-4.0