Abstract
We describe Structured Random Binding (SRB), a minimal model of protein-protein interac-
tions rooted in the statistical physics of disordered systems. In this model, nonspecific binding is
a generic consequence of the interaction between random proteins, exhibiting a phase transition
from a high temperature state where nonspecific complexes are transient and lack well-defined
interaction interfaces, to a low temperature state where the complex structure is frozen and
a definite interaction interface is present. Numerically, weakly-bound nonspecific complexes
can evolve into tightly-bound, highly specific complexes, but only if the structural correlation
length along the peptide backbone is short; moreover, evolved tightly-bound homodimers favor
the same interface structure that is predominant in real protein homodimers.
In cells, protein-protein interactions are vital because proteins generally execute their activities in
complex with other proteins [23]. The textbook picture where proteins are depicted as rigid bodies
interacting via interlocking surfaces, and where the specificity of an interaction originates from
the specific geometry of its interface [1], collides uncomfortably with the fact pulldown experiments
targeting one specific protein will routinely co-precipitate hundreds of additional proteins. How can
any protein have sufficient surface structure to mediate this many interactions? Experimentally,
detectable but weak interactions are often dismissed ad hoc as “nonspecific”, but what distinguishes
nonspecific interactions from weak but specific interactions? Here, we describe Structured Random
Binding (SRB), a minimal model of protein-protein interactions where nonspecific binding is a
generic consequence of the interaction between random proteins. SRB exhibits a phase transition
from a high temperature state where nonspecific complexes are transient and lack well-defined
interaction interfaces, to a low temperature state where the complex structure is frozen and a
definite interface is present. In numerical simulations, weakly-bound nonspecific complexes can
evolve into tightly-bound, highly specific complexes, but only if the structural correlation length
along the peptide backbone is short. Furthermore, evolved tightly-bound homodimers favor the
same interface structure that is predominant in real protein homodimers. Our model thus captures
two salient features of protein-protein interactions — the ubiquity of nonspecific complexes, and
the structure of homodimers — that cut across the diversity of protein species, and root these
phenomena firmly in the statistical physics of disordered systems.
1
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
Figure 1: Structured Random Binding model of protein-protein interactions. Sliding ξ-tuples of the
primary sequence p maps onto structural motifs mi ∈ M, and endows p with a secondary structure
f(p). Paired motif-motif interactions, encoded by J(m, m′), between two secondary structures
(shown in parallel orientation) sum to yield the peptide-peptide binding energy u(p, b).
Defining the model
To begin, we first consider the interaction between a small peptide and a protein. We represent
peptide p = p1p2 . . . pl as a sequence of length l (the primary sequence), where pi are letters
drawn from a primary alphabet A (the 20 proteinogenic amino acids). We represent protein B
as a collection of nB peptide epitopes (also of length l), B = {bα, α = 1 , . . . , nB}, where bα =
bα
1 bα
2 . . . bα
l , bα
i ∈ A; since only surface exposed residues participate in protein-protein interactions,
nB is proportional to the protein surface area. The interaction between residues on the peptide,
and residues on the protein, depends on their respective structural contexts. We define a map
f : Aξ → M from ξ-tuples of A to structural motifs,
f(pipi+1 . . . pi+ξ−1) = mi, pi ∈ A, mi ∈ M, (1)
where M is the motif alphabet, andξ is the structural correlation length along the peptide backbone;
we assume f is bijective and ∥M ∥ = ∥A∥ξ. By acting on each sliding ξ-tuple of p, f endows p with
a secondary structure f(p) = m1m2 . . . ml−ξ+1, mi ∈ M (Fig. 1). We define the binding energy
u(p, b) between p and b ∈ B as the sum of l′ = l − ξ + 1 paired motif-motif interactions,
u↑ ↑(p, b) =
l′
X
i=1
J[f(p)i, f(b)i], (2)
u↑ ↓(p, b) =
l′
X
i=1
J[f(p)l′−i+1, f(b)i], (3)
depending on whether p, b are oriented in parallel or antiparallel. Here, J(m, m′), m, m′ ∈ M, is a
∥M ∥ × ∥M ∥ real symmetric matrix, whose elements are independently and identically distributed
random variables with mean ⟨J⟩ = 0, and finite variance σ2
J that sets the energy scale of motif-
motif interactions. In the low temperature limit T → 0, the binding energy U(P, B) of the binary
complex PB is given by
U(P, B) = min{u(p, b), p ∈ P, b ∈ B}. (4)
Binding energy of nonspecific complexes
These ingredients are sufficient to fix the binding energy distribution PNS(U) for nonspecific binary
complexes of random proteins. Since u(p, b) between a random peptide p, and an epitope b ∈ B
2
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
–15 –10 –5 0 5
0
0.2
0.4
0.6(a)
100 101 102 103
–12
–8
–4
0(b)
Figure 2: The binding energy U for nonspecific binary complexes of random proteins. (a) The
distribution PNS(U) for nonspecific complexes; n = 2nPnB is twice the product of epitope counts
for each protein. Filled histograms are empirical distributions from numerical simulations; solid
lines are Gumbel distributions given by Eq. 6, with µ, ϕ given by Eqs. 7, 8. (b) Mean nonspecific
binding energy ⟨U ⟩NS as a function of n, for epitope sizes l′ = 2, 6, 10, 14, 18.
on a random protein, is the sum of l′ = l − ξ + 1 random variables J(m, m′), for any choice of P(J)
with zero mean and finite variance σ2
J, in the limit l′ → ∞, u will be normally distributed
P(u) = 1q
2πl′σ2
J
exp
− u2
2l′σ2
J
. (5)
For a binary complex of two random proteins P, B, its binding energy U(P, B) will follow the
extreme value distribution for sample minima of n = 2nPnB point samples from P(u). In the limit
n → ∞, this is a Gumbel distribution [6, 18],
PNS(U) = 1
ϕ exp
− U − µ
ϕ + exp
− U − µ
ϕ
, (6)
with location µ and scale ϕ parameters given by
µ = −
q
2l′σ2
J erf −1 (1 − 2/n) , (7)
ϕ = −µ −
q
2l′σ2
J erf −1 (1 − 2/ne) . (8)
The average nonspecific binding energy over all SRB realizations (“disorder averaged”) is
⟨U ⟩NS = µ + γϕ, (9)
where γ is the Euler–Mascheroni constant. Note that neither P(u) nor PNS(U) depends on the size
of the primary alphabet ∥A∥ (assuming it is nontrivial). The weight in PNS(U) for U > 0 is negli-
gible (Fig. 2a); the average binding energy ⟨U ⟩NS is strictly negative and decreases monotonically
with increasing n (Fig. 2b). Thus in the low-T limit, nonspecific binding is a generic consequence
of the interaction between random proteins. We can understand this intuitively as follows: given
two random proteins, if their surfaces are large enough, we can always find a pair of epitopes, one
on each protein, that will bind each other, even if the average epitope-epitope interaction is zero.
3
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
100 101 102 103
0
1
2
3
4
(a)
0 1 2 3
0
0.2
0.4
0.6
0.8
1
(b)
0 1 2 3
0
0.2
0.4
0.6
0.8
1
-8.5401
-9.0920
-9.2464
-10.1038
0 1 2 3
0
1
2
3
4
–15 0 15
10–3
10–2
10–1
100
(c)
(d)
Figure 3: The condensation transition in SRB. (a) Critical temperature Tc as a function of n =
2nPnB, for epitope sizes l′ = 2, 6, 10, 14, 18. (b) The mean participation ratio ⟨Y (T )⟩ as a function
of T /Tc for n = 8, 72, 800, 7200, 80000. In the limit n → ∞, ⟨Y (T )⟩ → 1 − T /Tc for T < T c (dashed
line). (c) Y (T ) for 4 specific realization of random proteins P, B, and J (l = 10, ξ = 3, and
nP = nB = 20); the binding energy U(P, B) of the PB complex is indicated in the legend. Dashed
line is the average ⟨Y (T )⟩ for n = 800. (d) −dY (T )/dT for the same realizations; the location of
the peak is usually interpreted as the melting temperature of the protein complex. Inset: energy
level distributions P(u) for these realizations; dashed line is the asymptotic distribution (Eq. 5).
The condensation transition
These results describes the low- T limit of SRB. To obtain finite T behavior, we note that the
interaction between random proteins P, B describes a n = 2nPnB level system whose energy levels
are independently and identically distributed per P(u), with density of states given by
⟨ρ(u)⟩ = nP(u) = e
log(n)− u2
2l′σ2
J
q
2l′σ2
J
. (10)
So stated, the equilibrium statistical physics of SRB is analogous to that of the exactly solvable
Random Energy Model (REM) [7, 9]. Originally formulated as a minimal model of spin glasses,
REM has also proven influential in informing the statistical physics of protein folding [27, 24]. In the
thermodynamic limit, REM undergoes a phase transition, known as condensation, at temperatures
T < T c to a frozen phase where the Boltzmann measure condenses onto a smaller-than-exponential
4
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
set of configurations. In SRB, the corresponding Tc (in the limit n → ∞) is
Tc = σJ
s
l′
2 log(n) , (11)
and is slowly varying in both n and l′ (Fig. 3a). For T > T c, the complex between two random
proteins is dynamic and lacks a definite interaction interface; for T < T c, the complex freezes into
a fixed conformation and a definite interface is present. The participation ratio
Y (T ) =
P
{p∈P,b∈B} e−2u(p,b)/T
P
{p∈P,b∈B} e−u(p,b)/T
2 (12)
is the probability that two samples of the system return the same configuration; thus 1 /Y (T ) is
effectively the number of states that dominate the Boltzmann measure. In REM, ⟨Y (T )⟩ = 0 for
T > T c, ⟨Y (T )⟩ = 1 − T /Tc for T < T c [8, 21]; SRB exhibits the same behavior as n → ∞ (Fig. 3b).
For SRB realizations with finite-sized proteinsP, B, the condensation phase transition broadens
into crossover representing the binding/unbinding of thePB complex. Remarkably, individual SRB
realizations, founded on the same underlying P(J), can have drastically different Y (T ) curves and
apparent melting temperatures (where −dY /dT is maximal); only upon averaging do we recover the
REM-like transition (Figs. 3c, d). This chaotic behavior is a direct consequence of the condensation
transition, where the T < T c thermodynamics of SRB is dominated by a small number of low- u
configurations. Different SRB realizations, despite their statistically similar energy spectra, have
distinct thermodynamics that arise entirely from subtle differences in the low-u tail (Fig. 3d, inset).
Evolution of tightly-bound complexes
In SRB, nonspecific complexes have a well-defined energy scale ⟨U ⟩NS. How can we obtain tightly-
bound complexes with U ≪ ⟨U ⟩NS? Computationally, this is an optimization problem: given J
and peptide b, find peptide p such that u(p, b) is minimal. In the limit ξ → 1, single residue
changes in p results in single motif changes in f(p), and u(p, b) can be minimized term by term.
In the opposite limit ξ → l, single residue changes in p completely alters f(p), and there can be no
correlation between binding energy and primary sequence; minimizing u(p, b) requires exhaustive
search through all possible primary sequences. In the regime 1< ξ < l , fixing f(p) at motif f(p)i, by
fixing p at residues pi, ..., pi+ξ−1, also restricts the possible values of motifsf(p)i−ξ+1, ..., f(p)i+ξ−1.
This means while every primary sequence p can be mapped to a secondary structure f(p), arbitrary
motif sequences m = m1m2 . . . ml′, mi ∈ M, are generally noninvertible, i.e. p = f −1(m) does
not exist. Minimizing u, which is a function of motif sequences, must be restricted to invertible
secondary structures, which forms a small (fraction =∥A∥l−(l−ξ+1)ξ) and sparse (single motif change
on an invertible secondary structure renders it noninvertible) subset in the space of motif sequences.
As is often the case with constrained optimization [22], we conjecture there is no fast algorithm for
solving SRB in this regime.
Biologically, tightly-bound protein complexes are products of evolution. We simulated affinity
evolution in a binary complex of two coevolving random proteins P, B; for tractable numerics,
we restrict ourselves to a small primary alphabet ∥A∥ = 4. We start from an uniform ensemble
{PαBα, α = 1, . . . , N}, PαBα = PβBβ for all α, β. At each generation t, we mutate the binary
complex (at rate ϵ = 0.01 per residue per generation), and apply selection with an affinity-dependent
survival probability
Θ(U; U0) = 1
e(U −U0)/σJ + 1. (13)
5
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
100 101 102 103
–8
–6
–4
–2
0
–30 –20 –10 0
0
0.2
0.4
0.6(a) (b)
Figure 4: Evolution of weakly-bound, nonspecific complexes of random proteins into tightly-bound,
highly specific complexes. (a) Binding energy evolution of the nonspecifc complex between random
proteins P, B (l = 10, nP = nP = 20) over 10 3 generations, for ξ = 2, 3, 4, 5, 6, 7; ⟨⟨U ⟩⟩ is doubly
averaged over the ensemble of N = 10 3 binary complex mutants, and over 10 2 SRB realizations.
(b) Distribution of binding energies P(U) for coevolved proteins P, B (ξ = 3), in binary complex
with each other (pink fill), and in binary complex with other random proteins (blue fill). Dashed
line is the binding energy distribution PNS(U) for nonspecific complexes of random proteins.
Here, the selection threshold U0(t) = ⟨U[P(t), B(t)]⟩ coevolves with the binary complex ensemble
to maintain selection pressure. As shown in Fig. 4a, the long time evolutionary dynamics is char-
acterized by a slow log( t) behavior similar to aging dynamics in glassy systems [5]; while steady
state could not be reached within the simulation window, it is clear that evolution is most effective
in enhancing binding affinity when the structural correlation length ξ is short. Where PB evolved
into tightly-bound complexes, the resulting interactions are highly specific: while the coevolved
proteins bind each other with U ≪ ⟨U ⟩NS, their binding energies with other random proteins is
indistinguishable from that between any two random proteins (Fig. 4b). This result suggests we can
make a physically principled distinction between specific and nonspecific protein complexes: specific
complexes are those whose binding energy U is unlikely to be the result of random protein-protein
interactions. It also suggests the evolution of tightly-bound, highly specific protein complexes is
not ex nihlio, but emerges from a basis of ubiquitous nonspecific interactions.
Structural bias in homodimeric complexes
Compared to heterodimeric interactions, homodimeric interactions in SRB exhibit a subtle but
significant distinction. For homodimers PP with dimer interface p, p′ ∈ P, if p ̸= p′ (heterologus),
or if p = p′ (isologus) are parallel, then u(p, p′) and u↑ ↑(p, p) will be the sum of l′ random variables
J(m, m′). However, when p = p′ are antiparallel, u↑ ↓(p, p) is twice the sum of l′/2 random
variables. From the arithmetic of random variables, P[u↑ ↓(p, p)] will have twice the variance of
P[u↑ ↑(p, p)] or P[u(p, p′)], p ̸= p′ (Fig. 5a). This means random homodimers with antiparallel
isologus interfaces are on average more tightly-bound than homodimers in other configurations
(Fig. 5b). As weakly-bound nonspecific homodimers evolve into tightly-bound homodimers, they
tend to retain antiparallel isologus interfaces (if present in the ancestral complex), or adopt it de
novo, such that antiparallel isologus interfaces are predominant in tightly-bound homodimers. For
l′ = 8, ξ = 3, nP = 20, random homodimers as generated were 33% antiparallel isologus, 3.3%
parallel isologus, and 63.7% heterologus; after 1000 generations of affinity evolution, the resulting
6
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
–15 –10 –5 0 5 10 15
10–3
10–2
10–1
100
–20 –15 –10 –5 0
10–3
10–2
10–1
100(a) (b)(a) (b)
Figure 5: Structural bias in homodimeric complexes. (a) Binding energy distribution P[u(p, p)]
for antiparallel (pink fill) and parallel (blue fill) isologus peptide-peptide interactions, for random
peptides (l = 10, ξ = 3). Dashed line indicates P(u) for heterlogous peptide-peptide interactions.
(b) Empirical binding energy distributions P(U) for random homodimers PP (l = 10, ξ = 3,
nP = 20) with antiparallel isologus (pink fill), parallel isologus (blue fill), and heterlogus (black
line) dimer interfaces.
tightly-bound homodimers are 98% antiparallel isologus, 1% parallel isologus, and 1% heterologus.
Real protein homodimers predominantly have antiparallel isologus interfaces [11, 3]; our results
supports the view that this phenomenon results from an intrinsic bias in homodimeric protein-
protein interactions [19, 20, 2]. In the case of SRB, this bias is entirely statistical in origin.
Discussion
SRB is a vastly simplified model of protein-protein interactions. Actual proteins are often some-
what conformationally flexible, and can undergo conformational changes upon complexation [17];
this has entropic effects which SRB neglects. Many proteins have interaction surfaces made up of
physically proximal, but sequentially distal residues, and the structural context of a surface-exposed
residue depends on more than just nearby residues along the peptide backbone. Nevertheless, there
is evidence the core physical insight of SRB, that the interaction between two proteins describes a
multi-level system with a random or weakly-correlated energy spectrum, holds for real protein com-
plexes [15, 4]. If SRB does indeed capture the statistical physics of real protein-protein interactions
(at least qualitatively), several implications are immediately evident:
Suppose the energy scale σJ ∼ kB × 300K, then at physiological temperature, a significant
fraction proteins inside a cell will be bound into nonspecific complexes that have no functional
significance. If true, then we must reassess the capabilities and limitations of protein pulldown,
proximity labelling, and two-hybrid experiments in identifying functional protein complexes [10,
25, 29]. While nonspecific protein complexes will be weakly-bound, they are ubiquitous due to the
generic nature of nonspecific binding, and are likely to dominate any experimental signal. Here,
the physically principled distinction between specific and nonspecific complexes offered by SRB can
inform new experiments and analyses seeking to identify functional protein-protein interactions [13].
Proteins in the cytoplasm, and in dense protein solutions, undergo anomalous diffusion where
⟨∆r2⟩ ∝ tβ, β < 1, and their rotational diffusion are more impeded in dense protein solutions
than in simple solvents of similar viscosity [30, 26, 14, 12]. SRB suggests we should not expect the
free diffusion of proteins in dense protein solutions, as they will be mostly bound into nonspecific
7
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
complexes. This will have an especially pronounced effect for rotational diffusion: while the trans-
lational diffusion constant Dt ∝ a−1, where a is the hydrodynamics radius, the rotational diffusion
constant Dr ∝ a−3. Additionally, due to the condensation transition, anomalous transport in dense
protein solutions should be even more pronounced at low temperatures, above and beyond effects
expected from temperature-dependent changes in solvent viscosity.
Finally, the sensitivity of protein complex thermodynamics to the exact disposition of low
energy states suggests the accurate prediction of protein complex binding free energies, which is a
long standing challenge in computational biology [16, 28], may be fundamentally limited. Here, it is
possible that different choices of necessary approximations, e.g. for molecular geometry, force fields,
etc., leading to subtle differences in low lying states, can nevertheless result in large differences in
computed thermodynamic properties.
Molecular investigations of protein complexes are singularly focused on the individuality of
the proteins involved. Our work demonstrates that statistical physics, encapsulated in a minimal
model, can illuminate features of protein-protein interactions that cut across the diversity of protein
species. SRB suggests nonspecific protein-protein binding is an ineluctable physical phenomenon,
and that the proteins inside a cell are ever participants in a plethora of specific and nonspecific
complexes. While these ubiquitous interactions challenge scientists who seek to decipher specific
functions of detected interactions, they may be biologically significant in a broader sense. We have
already suggested nonspecific interactions could be the source from which highly specific complexes
evolved; we speculate nonspecific interactions can also be a mechanism that cells can exploit, to
enable subtle modes of subcellular organization based on physical principles.
Acknowledgements. We are grateful to Stephen Benkovic for his advice and support. We also
benefited from discussions with Tom Witten and Nathan Keim. This work was supported by the
U.S. National Institutes of Health under award R01GM024129.
References
[1] I. M.A. Nooren and Janet M Thornton. Diversity of protein-protein interactions. The EMBO
Journal, 22(14):3486–3492, July 2003.
[2] Bruce Alberts, Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts, and Peter Wal-
ter. Molecular Biology of the Cell . Garland Science, 4th edition, 2002.
[3] Stuart Coles. An Introduction to Statistical Modeling of Extreme Values. Springer Series in
Statistics. Springer London, London, 2001.
[4] Samuel Kotz and Saralees Nadarajah. Extreme Value Distributions: Theory and Applications.
Imperial College Press, London, reprinted edition, 2015.
[5] B. Derrida. Random-Energy Model: Limit of a Family of Disordered Models. Physical Review
Letters, 45(2):79–82, July 1980.
[6] Bernard Derrida. Random-energy model: An exactly solvable model of disordered systems.
Physical Review B, 24(5):2613–2626, September 1981.
[7] E. I. Shakhnovich and A. M. Gutin. Implications of thermodynamics of protein folding for
evolution of primary sequences. Nature, 346(6286):773–775, August 1990.
8
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
[8] Jos´ e Onuchic, Zaida Luthey Schulten, and Peter Wolynes. Theory of Protein Folding: The
Energy Landscape Perspective. Annual review of physical chemistry, 48:545–600, February
1997.
[9] B. Derrida and G. Toulouse. Sample to sample fluctuations in the random energy model.
Journal de Physique Lettres , 46(6):223–228, 1985.
[10] M. M´ ezard, G. Parisi, and M.A. Virasoro. Random free energies in spin glasses. Journal de
Physique Lettres, 46(6):217–222, 1985.
[11] Marc M´ ezard and Andrea Montanari. Information, Physics, and Computation . Oxford Uni-
versity Press, January 2009.
[12] Ludovic Berthier and Giulio Biroli. Theoretical perspective on the glass transition and amor-
phous materials. Reviews of Modern Physics, 83(2):587–645, June 2011.
[13] David S. Goodsell and Arthur J. Olson. Structural Symmetry and Protein Function. Annual
Review of Biophysics and Biomolecular Structure , 29(1):105–153, June 2000.
[14] Ranjit Prasad Bahadur, Pinak Chakrabarti, Francis Rodier, and Jo¨ el Janin. Dissecting sub-
unit interfaces in homodimeric proteins. Proteins: Structure, Function, and Bioinformatics,
53(3):708–719, November 2003.
[15] D. B. Lukatsky, K. B. Zeldovich, and E. I. Shakhnovich. Statistically Enhanced Self-Attraction
of Random Patterns. Physical Review Letters, 97(17):178101, October 2006.
[16] D.B. Lukatsky, B.E. Shakhnovich, J. Mintseris, and E.I. Shakhnovich. Structural Similarity
Enhances Interaction Propensity of Proteins. Journal of Molecular Biology, 365(5):1596–1606,
February 2007.
[17] Ingemar Andr´ e, Charlie E. M. Strauss, David B. Kaplan, Philip Bradley, and David Baker.
Emergence of symmetry in homooligomeric biological assemblies. Proceedings of the National
Academy of Sciences, 105(42):16148–16152, October 2008.
[18] Ozlem Keskin, Attila Gursoy, Buyong Ma, and Ruth Nussinov. Principles of Protein-Protein
Interactions: What are the Preferred Ways For Proteins To Interact? Chemical Reviews,
108(4):1225–1244, April 2008.
[19] Jo¨ el Janin. Quantifying biological specificity: The statistical mechanics of molecular recogni-
tion. Proteins: Structure, Function, and Bioinformatics, 25(4):438–445, 1996.
[20] Julie Bernauer, Anne Poupon, J´ erˆ ome Az´ e, and Jo¨ el Janin. A docking analysis of the statistical
physics of protein–protein recognition. Physical Biology, 2(2):S17–S23, May 2005.
[21] Anne-Claude Gingras, Matthias Gstaiger, Brian Raught, and Ruedi Aebersold. Analysis of
protein complexes using mass spectrometry. Nature Reviews Molecular Cell Biology, 8(8):645–
654, August 2007.
[22] Wei Qin, Kelvin F. Cho, Peter E. Cavanagh, and Alice Y. Ting. Deciphering molecular
interactions by proximity labeling. Nature Methods, 18(2):133–143, February 2021.
[23] Bernhard Suter and Erich E. Wanker, editors. Two Hybrid Technologies: Methods and Proto-
cols, volume 812 of Methods in Molecular Biology. Humana Press, Totowa, NJ, 2012.
9
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint
[24] Jingxuan He, Ling-Nan Zou, Vidhi Pareek, and Stephen J. Benkovic. Multiplex mapping of
protein-protein interaction interfaces, January 2025.
[25] Yaqiang Wang, Conggang Li, and Gary J. Pielak. Effects of Proteins on Protein Diffusion.
Journal of the American Chemical Society , 132(27):9392–9397, July 2010.
[26] Matthias Roos, Maria Ott, Marius Hofmann, Susanne Link, Ernst R¨ ossler, Jochen Balbach,
Alexey Krushelnitsky, and Kay Saalw¨ achter. Coupling and Decoupling of Rotational and
Translational Diffusion of Proteins under Crowding Conditions. Journal of the American
Chemical Society, 138(32):10365–10372, August 2016.
[27] Felix H¨ ofling and Thomas Franosch. Anomalous transport in the crowded world of biological
cells. Reports on Progress in Physics , 76(4):046602, March 2013.
[28] Marco Grimaldo, Felix Roosen-Runge, Fajun Zhang, Frank Schreiber, and Tilo Seydel. Dy-
namics of proteins in solution. Quarterly Reviews of Biophysics, 52:e7, 2019.
[29] Panagiotis L. Kastritis and Alexandre M. J. J. Bonvin. Are Scoring Functions in Protein-
Protein Docking Ready To Predict Interactomes? Clues from a Novel Binding Affinity Bench-
mark. Journal of Proteome Research, 9(5):2216–2225, May 2010.
[30] Till Siebenmorgen and Martin Zacharias. Computational prediction of protein–protein binding
affinities. WIREs Computational Molecular Science, 10(3):e1448, 2020.
10
.CC-BY 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted March 29, 2025. ; https://doi.org/10.1101/2025.03.26.645477doi: bioRxiv preprint