Conserved statistical organization of cis-regulatory landscapes defines a fluctuation-structured regulatory phase

preprint OA: closed
Full text JSON View at publisher
Full text 2,227 characters · extracted from oa-doi-fallback · click to expand
Abstract Cis-regulatory elements undergo extensive evolutionary turnover, particularly among enhancers, yet gene expression programs remain remarkably conserved across species. This paradox suggests that regulatory conservation may reside not in individual DNA elements but in higher-order organizational principles of regulatory sequence space. Here, we analyze gene-centered cis-regulatory islands (GCICs) across rice and Arabidopsis genomes to characterize the macroscopic statistical organization of cis-regulatory landscapes. GCICs serve as gene-centered representations of combinatorial motif-family overlap that resolve intrinsic sequence organization rather than discrete functional elements. Across both genomes, combinatorial vocabulary diversity scales sublinearly with motif-family abundance, motif-family combinations exhibit conserved hierarchical rank–frequency organization, dominant combinatorial identities are extensively shared with near-proportional quantitative correspondence, and higher-order GCIC architectures composed of multiple islands per gene are stably maintained. Disruption of native sequence structure through randomization induces systematic collapse of higher-order GCIC architectures and drives motif-family combinatorial interactions toward independence, indicating that regulatory landscape organization emerges from intrinsic hierarchical sequence correlations. Together, these conserved scaling laws, hierarchical distributions, stable combinatorial identities, and characteristic responses to perturbation define a conserved statistical regulatory phase underlying cis-regulatory organization across plant genomes. This macroscopic regulatory architecture provides a probabilistic substrate for redundant regulatory potential, enabling functional cis-regulatory elements to emerge and stabilize despite extensive enhancer turnover. Our findings establish that cis-regulatory conservation operates at the level of conserved statistical landscape organization rather than individual regulatory sequences, offering a quantitative framework for understanding regulatory robustness and evolutionary flexibility. Competing Interest Statement The authors have declared no competing interest.

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00