References
Abe, K., Takamatsu, T., Sato, T. (2017) Mechanism of bacterial gene rearrangement:
SprA-catalyzed precise DNA recombination and its directionality control by SprB ensure
the gene rearrangement and stable expression of spsM during sporulation in Bacillus
subtilis. Nucleic Acids Res. 45, 6669-6683.
Abe,K., Takahashi,T. and Sato,T. (2021) Extreme C-terminal element of SprA serine
integrase is a potential component of the ‘molecular toggle switch’ which controls the
recombination and its directionality. Mol Microbiol, 115, 1110–1121.
Abioye, J., Lawson-Williams, M., Lecanda, A., Calhoon, B., McQue, A.L., Colloms, S.D.,
Stark, W.M., Olorunniji, F.J. (2023) High fidelity one-pot DNA assembly using orthogonal
serine integrases. Biotechnol J. 18(3):e2200411.
Adams, V., Lucet, I.S., Lyras, D., Rood, J.I. (2004) DNA binding properties of TnpX
indicate that different synapses are formed in the excision and integration of the Tn4451
family. Mol Microbiol. 53, 1195-1207.
Bibb, L.A., Hancox, M.I., and Hatfull, G.F. (2005) Integration and excision by the large
serine recombinase φRv1 integrase. Mol Microbiol 55: 1896–1910.
Blanch-Asensio, A., Grandela, C., Brandão, K.O., de Korte, T., Mei, H., Ariyurek, Y.,
Yiangou, L., Mol, M.P.H., van Meer, B.J., Kloet, S.L., Mummery, C.L., Davis, R.P. (2022)
STRAIGHT-IN enables high-throughput targeting of large DNA payloads in human
pluripotent stem cells. Cell Rep Methods. 2(10):100300.
Breüner, A., Brøndsted, L., Hammer, K. (1999) Novel organization of genes involved in
prophage excision identified in the temperate lactococcal bacteriophage TP901-1. J
Bacteriol. 181, 7291-7297.
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
11
Colloms, S. D., Merrick, C. A., Olorunniji, F. J., Stark, W. M., Smith, M., Osbourn, A.,
Keasling, J. D. & Rosser, S. J. (2014). Rapid metabolic pathway assembly and
modification using serine integrase site-specific recombination. Nucleic Acids Res., 42,
e23-e23.
Du, D,, Wang, L., Tian, Y., Liu, H., Tan, H., Niu, G. (2015) Genome engineering and direct
cloning of antibiotic gene clusters via phage ϕBT1 integrase-mediated site-specific
recombination in Streptomyces. Sci Rep. 5:8740.
Duportet, X., Wroblewska, L., Guye, P., Li, Y., Eyquem, J., Rieders, J., Rimchala, T., Batt,
G., & Weiss, R. (2014). A platform for rapid prototyping of synthetic gene networks in
mammalian cells. Nucleic Acids Res., 42, 13440–13451.
Evans, R., O’Neill, M., Pritzel, A., Antropova, N., Senior, A., Green, T., Zidek, A., Bates,
R., Blackwell, S., Yim, J., Ronneberger, O., Bodenstein, S., Zelinski, M., Bridgland, A.,
Potapenko, A., Cowie, A., Tunyasuvunakool, K. Jainn, R., Clancy, E., Kohli, P., Jumper, J.,
Hassabis, D. (2021) Protein complex prediction with AlphaFold-Multimer. biorxiv doi:
10.1101/2021.10.04.463034v1
Fogg, P.C.M., Haley, J.A., Stark, W.M., Smith, M.C.M. (2017) Genome Integration and
Excision by a New Streptomyces Bacteriophage, ϕJoe. Appl. Environ. Microbiol. 83,
e02767-16.
Fogg, P.C.M., Colloms, S.D., Rosser, S.J., Stark, W.M. & Smith, M.C.M. (2014). New
applications for phage integrases. J. Mol. Biol., 426, 2703-2716.
Fogg, P.C.M., Younger, E., Fernando, B.D., Khaleel, T., Stark, W.M., Smith, M.C.M.
(2018) Recombination directionality factor gp3 binds ϕC31 integrase via the zinc domain,
potentially affecting the trajectory of the coiled-coil motif. Nucleic Acids Res., 46, 1308–
1320.
Ghosh, P., Pannunzio, N.R., Hatfull, G.F. (2005) Synapsis in phage Bxb1 integration:
selection mechanism for the correct pair of recombination sites. J Mol Biol. 349, 331-348.
Ghosh, P., Wasil, L.R., and Hatfull, G.F. (2006) Control of phage Bxb1 excision by a novel
recombination directionality factor. PLoS Biol 4: e186.
Ginsburg, D.S., Calos, M.P. (2005) Site-specific integration with phiC31 integrase for
prolonged expression of therapeutic genes. Adv Genet. 54, 179-187.
Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH, Ferrin TE. (2018)
UCSF ChimeraX: Meeting modern challenges in visualization and analysis. Protein Sci.
27, 14-25.
Gregory, M.A., Till, R., Smith, M.C.M. (2003) Integration site for Streptomyces phage
φBT1 and development of site-specific integrating vectors. J Bacteriol 185, 5320–5323.
Herisse, M., Porter, J.L., Guerillot, R., Tomita, T., Goncalves Da Silva, A., Seemann, T.,
Howden, B.P., Stinear, T.P., Pidot, S.J. (2018) The ΦBT1 large serine recombinase
catalyzes DNA integration at pseudo-attB sites in the genus Nocardia. Peer J. 6:e4784.
Jumper et al. (2021) Highly accurate protein structure prediction with AlphaFold. Nature
596, 583-589.
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
12
Jusiak, B., Jagtap, K., Gaidukov, L., Duportet, X., Bandara, K., Chu, J., Zhang, L., Weiss,
R., & Liu, T.K. (2019) Comparison of Integrases Identifies Bxb1-GA Mutant as the Most
Efficient Site-Specific Integrase System in Mammalian Cells. ACS Synth. Biol., 8, 16−24
Khaleel, T., Younger, E., McEwan, A.R., Varghese, A.S., Smith, M.C. (2011) A phage
protein that binds φC31 integrase to switch its directionality. Mol Microbiol. 80, 1450-1463.
Mandali, S., Dhar, G., Avliyakulov, N.K., Haykinson, M.J., Johnson, R.C. (2013) The site-
specific integration reaction of Listeria phage A118 integrase, a serine recombinase. Mob.
DNA. 4, 2.
Mandali, S., Gupta, K., Dawson, A.R., Van Duyne, G.D., Johnson, R.C. (2017) Control of
Recombination Directionality by the Listeria Phage A118 Protein Gp44 and the Coiled-Coil
Motif of Its Serine Integrase. J. Bacteriol. 199, e00019-17.
McEwan, A.R., Rowley, P.A., Smith, M.C. (2009) DNA binding and synapsis by the large
C-terminal domain of phiC31 integrase. Nucleic Acids Res. 37, 4764-4773.
Mirdita M, Schütze K, Moriwaki Y, Heo L, Ovchinnikov S and Steinegger M. (2022)
ColabFold: Making protein folding accessible to all. Nature Methods 19, 679-682
Morita, K., Yamamoto, T., Fusada, N., Komatsu, M., Ikeda, H., Hirano, N., and Takahashi,
H. (2009a). The site-specific recombination system of actinophage TG1. FEMS Microbiol.
Lett., 297, 234–240.
Morita, K., Yamamoto, T., Fusada, N., Komatsu, M., Ikeda, H., Hirano, N., and Takahashi,
H. (2009b). In vitro characterization of the site-specific recombination system based on
actinophage TG1 integrase. Mol. Genet. Genomics 282, 607–616.
Muroi, T., Kokuzawa, T., Kihara, Y., Kobayashi, R., Hirano, N., Takahashi, H., Haruki, M.
(2013) TG1 integrase-based system for site-specific gene integration into bacterial
genomes. Appl. Microbiol. Biotechnol. 97, 4039-4048.
Olorunniji, F. J., Rosser, S. J. & Stark, W. M. (2016). Site-specific recombinases:
molecular machines for the Genetic Revolution. Biochem. J., 473, 673-684.
Olorunniji, F.J., Lawson-Williams, M., McPherson, A.L., Paget, J.E., Stark, W.M., &
Rosser, S.J. (2019) Control of ϕC31 integrase-mediated site-specific recombination by
protein trans-splicing. Nucleic Acids Res., 47, 1142-11460.
Olorunniji, F.J., McPherson, A.L., Rosser, S.J., Smith, M.C.M., Colloms, S.D. and Stark,
W.M. (2017). Control of serine integrase recombination directionality by fusion with the
directionality factor. Nucleic Acids Res., 45, 8635-8645.
Rice P. (2015). Serine resolvases. Microbiol. Spectrum 3(2): mdna3-0045-2014.
Russell, J, Chang, D., Tretiakova, A., & Padidam, M. (2006). Phage Bxb1 integrase
mediates highly efficient site-specific recombination in mammalian cells. BioTechniques,
40, 460–464.
Rutherford, K., Yuan, P., Perry, K., Sharp, R. & Van Duyne, G.D. (2013). Attachment site
recognition and regulation of directionality by the serine integrases. Nucleic Acids Res.,
41, 8341-8356.
Serrano, M., Kint, N., Pereira, F.C., Saujet, L., Boudry, P., Dupuy, B., Henriques, A.O.,
Martin-Verstraete, I. (2016) A Recombination Directionality Factor Controls the Cell Type-
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
13
Specific Activation of σK and the Fidelity of Spore Development in Clostridium difficile.
PLoS Genet. 12, e1006312.
Sievers, F., Wilm, A., Dineen, D.G., Gibson, T.J., Karplus, K., Li, W., Lopez, R., McWilliam,
H., Remmert, M., Söding, J., Thompson, J.D., Higgins, D.G. (2011) Fast, scalable
generation of high-quality protein multiple sequence alignments using Clustal Omega D
Mol. Syst. Biol. 7, 539.
Smith, M.C.M. (2015). Phage-encoded serine integrases and other large serine
recombinases. In Mobile DNA III (ed. Craig, N.L., Chandler, M., Gellert, M., Lambowitz, A.,
Rice, P.A. and Sandmeyer, S.B., ASM Press, Washington DC, USA), pp. 253-272.
Stark, W.M. (2017). Making integrases work for us. Current Opin. Microbiol., 38, 130-136.
Thorpe, H.M., Smith, M.C. (1998) In vitro site-specific integration of bacteriophage DNA
catalyzed by a recombinase of the resolvase/invertase family. Proc Natl Acad Sci U S A.
95, 5505-5510.
Xu, Z., Thomas, L., Davies, B., Chalmers, R., Smith, M., Brown, W. (2013). Accuracy and
efficiency define Bxb1 integrase as the best of fifteen candidate serine recombinases for
the integration of DNA into the human genome. BMC Biotechnol., 13, e87.
Yarnall, M.T.N., Ioannidi, EI.., Schmitt-Ulms, C., Krajeski, R.N., Lim, J., Villiger, L., Zhou,
W., Jiang, K., Garushyants, S.K., Roberts, N., Zhang, L., Vakulskas, C.A., Walker. J.A.
2nd, Kadina, A.P., Zepeda, A.E., Holden, K., Ma, H., Xie, J., Gao, G., Foquet, L., Bial, G.,
Donnelly, S.K., Miyata, Y., Radiloff, D.R., Henderson. J.M., Ujita. A., Abudayyeh. O.O.,
Gootenberg, J.S. (2023) Drag-and-drop genome insertion of large sequences without
double-strand DNA cleavage using CRISPR-directed integrases. Nat. Biotechnol. 41, 500-
512.
Zhang, L., Ou, X.J., Zhao, G.P., Ding, X.M. (2008) Highly efficient in vitro site-specific
recombination system based on Streptomyces phage ϕBT1 integrase. J. Bacteriol. 190,
6392–6397.
Zhang, L., Zhu, B., Dai, R., Zhao, G., Ding, X. (2013) Control of directionality in
Streptomyces phage ϕBT1 integrase-mediated site-specific recombination. PLoS One. 8,
e80434.
Zhang, M., Yang, C., Tasan, I., Zhao, H. (2021) Expanding the Potential of Mammalian
Genome Engineering via Targeted DNA Integration. ACS Synth Biol. 10, 429-446.
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
Figure 1: Integrative and excisive recombination reactions catalysed by large serine
integrases (LSIs). LSIs integrate DNA bearing an attP site into a genomic location that
harbours an attB site. In the reverse excision reaction, the recombination directionality factor
(RDF) binds to the LSI and modify its specificity to catalyse attR x attL recombination. In
both reactions, synapsis of att sites and catalysis of DNA strand exchange happen within a
synaptic complex involving a tetramer of the recombinase.
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
TG1_int MS----VKV--EGM-VILAGGYDRQSAERENSSTASPATQRAANRGKAEALAKEYARDGV 53
BT1_int MSPFIAPDVPEHLLDTVRVFLYARQSKGRSDGSDVSTEAQLAAGRAL---VASRNAQGGA 57
C31_int -------------M-DTYAGAYDRQSRERENSSAASPATQRSANEDKAADLQREVERDGG 46
: . * *** *.:.* .* :* :*.. : . :.*
TG1_int EVKWLGHFSEAPGTSAFTGVDRPEFNRILDMCRNREMNMIIVHYISRLSREEPLDIIPVV 113
BT1_int RWVVAGEFVDVGRSGWDPNVTRADFERMMGEVRAGEGDVVVVNELSRLTRKGAHDALEID 117
C31_int RFRFVGHFSEAPGTSAFGTAERPEFERILNECRAGRLNMIIVYDVSRFSRLKVMDAIPIV 106
. *.* :. :. . * :*:*::. * . ::::* :**::* * : :
TG1_int TELLRLGVTIVSVNEGTFRPGEMMDLIHLIMRLQASHDESKNKSVAVSNAKELAKRLGGH 173
BT1_int NELKKHGVRFMSVLEPFLDTSTPIGVAIFALIAALAKQDSDLKAERLKGAKDEIAALGGV 177
C31_int SELLALGVTIVSTQEGVFRQGNVMDLIHLIMRLDASHKESSLKSAKILDTKNLQRELGGY 166
.** ** ::*. * : . :.: : : ::.:*. *: : .:*: ***
TG1_int T-GSTPYGFDTVEEMVPNPEDGGKL-VAIRRLVPSAHTWEGAHGSEGAVIRWAWQEIKTH 231
BT1_int HSSSAPFGMRAVRKKVDN--------LVISVLEPDED--------NPDHVELV----ERM 217
C31_int VGGKAPYGFELVSETKEITRNGRMVNVVINKLAHSTTPLTGPFEFEPDVIRWWWREIKTH 226
..:*:*: * : :.* * . : :. :
TG1_int RDTPFKGGGAGSFHPGSLNGLCERLYRDKVPTRG----------------TLVGKKRAGS 275
BT1_int AKMSFEGVS--------DNAIATTFEKEKIPSPGMAERRATEKRLASVKARRLNGAEKPI 269
C31_int KHLPFKPGSQAAIHPGSITGLCKRMDADAVPTRG----------------ETIGKKTASS 270
. *: . ..:. : : :*: * :.
TG1_int DWDPGVLKRVLSDPRIAGYQADIAYKVRADGSR-GGFSHYKIRRDPVTMEPLTLPGFEPY 334
BT1_int MWRAQTVRWILNHPAIGGFAFERVKHGK--------AHINVIRRDPGGK-P--LTPHTGI 318
C31_int AWDPATVMRILRDPRIAGFAAEVIYKKKPDGTPTTKIEGYRIQRDPITLRPVELD-CGPI 329
* .: :* .* *.*: : : : *:*** * *
TG1_int IPPAEWWELQEWLQGRGR---GKGQYRGQSLLSAMDVLYCY-GSGQLDPETGYSNGSTMA 390
BT1_int LSGSKWLELQEKRSGKNLSDRKPGAEVEPTLLSGWRFLGCRICGGSMGQS--------QG 370
C31_int IEPAEWYELQAWLDGRGR---GKGLSRGQAILSAMDKLYCE-CGAVMTSK---------- 375
: ::* *** .*:. * ::**. * * .. : .
TG1_int GNVREGDQAHKSSYACKCPRRV------HDGSSCSITMHNLDPYIVGAIFARITAFDPAD 444
BT1_int GRK-RNGDLAEGNYMCANPKG---------HGGLSVKRSELDEFVASKVWARLRTADMED 420
C31_int ----RGEESIKDSYRCRRRKVVDPSAPGQHEGTCNVSMAALDKFVAERIFNKIRHAE--- 428
.. : :..* * : . .:. ** ::. :: :: :
TG1_int PDDLEGDTAALMYEAARRWGATHERPELKGQRSELMAQRADAVKALEELYEDKRNGGYRS 504
BT1_int EH-----DQAWIAAAAERFALQHDLAGVADERREQQAHLDNVRRSIKDLQADRKPGLYVG 475
C31_int ---GDEETLALLWEAARRFGKLTEAPEKSGERANLVAERADALNALEELYEDRAAGAYDG 485
* : **.*:. : .:* : *. :. .::::* *: * * .
TG1_int AMGRRAFLEEEAALTLRMEGAEERLRQLDAAD-SPVLPIGEWLGD-RGSDPTGPGSWWAL 562
BT1_int REELETWRSTVLQYRSYEAECTTRLAELDEKMNGSTRVPSEWFS---GEDPTAEGGIWAS 532
C31_int PVGRKHFRKQQAALTLRQQGAEERLAELEAAE-APKLPLDQWFPEDADADPTGPKSWWGR 544
. : . . ** :*: . .:*: . ***. . *.
TG1_int APLEDRRAFVRLFVDRIEVIKLPKGVQRPGRVPPIADRVRIHWAKPKVEEETEPETLNGF 622
BT1_int WDVYERREFLSFFLDSVMVDRGRHP--ETKKYIPLKDRVTLKWAELLKEEDEASEATERE 590
C31_int ASVDDKRVFVGLFVDKIVVTKSTTG---RGQGTPIEKRASITWAKPPTDDDEDDAQ-DGT 600
: ::* *: :*:* : * : : *: .*. : **: ::: :
TG1_int TAAA- 626
BT1_int LAAL- 594
C31_int EDVAA 605
TG1_int BT1_int C31_int
1: TG1_int 100.00 24.42 50.17
2: BT1_int 24.42 100.00 25.09
3: C31_int 50.17 25.09 100.00
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
Figure 2: Sequence alignments of TG1, ϕBT1, and ϕC31 integrases. Key amino acid
residues in the active site including the nucleophilic serine are highlighted in bold. Multiple
sequence alignment was generated using Clustal Omega (Sievers et al., 2011). A matrix of
pairwise percent identities is shown below. (https://www.ebi.ac.uk/jdispatcher/msa).
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
TG1_RDF MAGFNLWATDPENKPKPRQTFSDDAVGRLHGGYMDTTGR-KPKPVALSEWRFSTGDKVVA 59
BT1_RDF MAKRSIWAGDEDNKPKKRETYADDTVGRFHSGFSELNERGKLVPVALDKWRISTGEKTVA 60
C31_RDF MAKRSIWAGDEDNKPKKRETYADDTVGRFHSGYSETNERGKVVPVALDKWRISTGEQSVA 60
** .:** * :**** *:*::**:***:*.*: : . * * ****.:**:***:: **
TG1_RDF DALAQLFGGTPAENDDTDSEHFIDVFTEAKRVPVIVEPDGIEADMKQWVNGKLIHHCDGV 119
BT1_RDF DAVAQLFGGEPIENEDSTSENFIDVFTEKAKIPVIIAADGIHWDMKQWVNGKLKHHCDGY 120
C31_RDF DAVAQLFGGTPVENEESTSENFIDVFTDRPKVPVIIEADGIHWDMKLWLNGKLKHHCDGF 120
**:****** * **::: **:******: ::***: ***. *** *:**** *****
TG1_RDF KFLSPD--DKRGDLCGCPSLFAERKAAAKDYQGPNPSIKVTFRLADDPELGKFTYSTGSW 177
BT1_RDF DFVSHNDDDMVGQPCGCPTLFDERKADAKNDDGPNPAITVTFQLADDPELGRFKFQTGSW 180
C31_RDF DFVSHADEEMIGQPCGCPKLFDERKAAAKEYDAPNPAITVTFTLADDPELGRFKFQTGSW 180
.*:* : *: ****.** **** **: :.***:*.*** ********:*.:.****
TG1_RDF TLAAVLHEALNDLDRIGE--TAYGYVELEYVEFIPSRGPMRGKTVSYTKPVLKFVKPYND 235
BT1_RDF TLFKVLHEAEDDVERIGKGGPVFGYVELELVEYTPKKGPMRNKLVSYYKPVINVVKSYND 240
C31_RDF TLFKVLHEAEDDVERVGKGGAVLANLELELVEYTPKRGPMRNKLVSYYKPTITVLKSYND 240
** ***** :*::*:*: . . :*** **: *.:****.* *** **.:..:* ***
TG1_RDF AIAD--- 239
BT1_RDF AIADDAE 247
C31_RDF AIAD--- 244
****
TG1_RDF BT1_RDF C31_RDF
1: TG1_RDF 100.00 63.18 61.09
2: BT1_RDF 63.18 100.00 85.25
3: C31_RDF 61.09 85.25 100.00
Figure 3: Sequence alignments of TG1, ϕBT1, and ϕC31 Recombination Directionality
Factors (RDFs). Multiple sequence alignment was generated using Clustal Omega (Sievers
et al., 2011). A matrix of pairwise percent identities is shown below.
(https://www.ebi.ac.uk/jdispatcher/msa).
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
Figure 4: AlphaFold2-multimer structures of serine integrases and their RDFs. (A)
AlphaFold2-predicted models for the N-terminal catalytic domains and the first DNA-binding
domains (DBD1) of ϕC31 (blue), ϕBT1 (magenta), and TG1 (brown) integrases. These
models were calculated using full-length protein sequences. Because the flexible linkers
between domains made overall superpositions visually confusing, the models were then
mapped domain-wise (except for the coiled coil) onto the individual domains of 4kis.pdb,
which is a complex of the DNA binding domains of A118 integrase with half of attP. The DNA
from 4kis is shown to guide the eye. (B-D). The second DNA-binding domains (DBD2) of
TG1 (B), ϕBT1 (C), and ϕC31 (D) and their RDFs, taken from the same predicted structures
as in part (A) but shown separately for clarity. Structures were generated using AlphaFold
Multimer as described in Methods section. Flexible C-terminal extensions of integrases
were removed for clarity.
A B
C D
DBD1
Catalytic
domains
Coiled
coil
RDF
Coiled
coil
RDF RDF
Coiled
coil
DBD2 DBD2
DBD2
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
Figure 5: In vivo recombination reactions of ϕC31, ϕBT1, and TG1 integrases. (A)
Scheme illustrating the in vivo intramolecular recombination (inversion) assay. In its default
state, the promoter constitutively drives the expression of a red fluorescent protein (rfp)
gene (pink arrow). A terminator sequence upstream of the promoter inhibits transcriptional
read-through to the green fluorescent protein (gfp) gene (green arrow). Upon integrase-
catalysed site-specific inversion reaction, the orientation of the promoter is flipped to allow
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
the expression of GFP, and block RFP production. (B) Summary of the protocol for in vivo
recombination reactions using constitutive integrase and RDF expression vectors. (C)
Recombination activities of ϕC31 integrase in the presence of ϕC31-RDF, ϕBT1-RDF, and
TG1-RDF; and as integrase-RDF fusions. In attP/attB reactions, cells start expressing RFP
and produce GFP upon recombination. The extent of recombination is indicated as
percentage of cells expressing GFP as outlined in B above. The reverse applies in reactions
where the starting substrates are attR x attL. (D) Recombination activities of ϕBT1 integrase
in the presence of ϕC31-RDF, ϕBT1-RDF, and TG1-RDF; and as integrase-RDF fusions.
(E) Recombination activities of ϕTG1 integrase in the presence of ϕC31-RDF, ϕBT1-RDF,
and TG1-RDF; and as integrase-RDF fusions. In panels C, D, and E, integrases are
depicted as long ovals, and RDFs as short ovals. Each integrase and its cognate RDF are
colour-coded to highlight expected orthogonal interactions.
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
Figure 6: In vitro recombination reactions of ϕC31, ϕBT1, and TG1 integrases.
(A) Scheme illustrating the in vitro intramolecular recombination assay (substrate plasmid
pϕC31PBX for ϕC31 integrase is illustrated; the substrates for ϕBT1 and TG1 integrases are
of the same design). The plasmid substrates are named after the LSI (ϕC31) and the att
sites recombining (PBX; attP X attB resolution reaction). Upon recombination, the plasmid
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
substrate gives two circular products in which the attR and attL sites are separated. For the
reverse reaction, the starting substrate plasmid has attP and attB sites replaced by attR X
attL sites, respectively, with recombination giving attP and attB sites on separate circular
plasmid products. (B) Recombination activities of ϕC31 integrase in the presence of ϕC31-
RDF, ϕBT1-RDF, and TG1-RDF. Reactions were incubated for 2 hours in the reaction buffer
described in Materials and Methods. Reaction products were digested with the restriction
endonuclease NruI prior to 1.2% agarose gel electrophoresis. In reactions where the
integrase and RDF are added as separate proteins, the final concentration of both proteins
were 200 nM. When the reactions were carried out using integrase-RDF fusion, the final
concentration was 200 nM. The bands on the gel are labeled nr (non-recombinant, i.e.
substrate), rec (recombination product). The mean extent of recombination and standard
deviation (%) from quantitation of triplicate experiments are given below each lane. (C)
Recombination activities of ϕBT1 integrase in the presence of ϕC31-RDF, ϕBT1-RDF, and
TG1-RDF. Reaction conditions, gel electrophoresis, data acquisition and analyses areas
described above in (A). (D) Recombination activities of TG1 integrase in the presence of
ϕC31-RDF, ϕBT1-RDF, and TG1-RDF. Reaction conditions, gel electrophoresis, data
acquisition and analyses are as described in (A).
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
Figure 7: Correlation of extent of recombination in attP x attB and attR x attL
reactions in the presence of the recombination directionality factor. (A) In vivo
recombination, showing data taken from Figure 5. (B) In vitro recombination, showing data
taken from Figure 6. The colour scheme is the same as in Figures 5 and 6: blue, ϕC31
integrase; pink, ϕBT1 integrase, gold, TG1 integrase. Triangles denote a cognate RDF
fused to the integrase; square, a cognate RDF as a separate protein; circle, a non-cognate
RDF; and X, no RDF.
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint
Figure 8: AlphaFold2-predicted structures of the second DNA-binding domains
(DBD2) of ϕC31, ϕBT1, and TG1 integrases and their RDFs.These models were
mapped domain-wise onto the individual domains of 4kis.pdb as described in Figure 4. TG1
Integrase is shown with conservation mapped to semi-transparent surface (on RDF on left;
int on right). Yellow: none; green: identical in 2; blue: identical in all 3; brown: insertions in
TG1 integrase relative to other two. Note that the binding surface on RDF is much more
conserved than binding surface on Integrase.
~90° ~-90°
A B RDF
DBD2
Coiled
coil
RDF
DBD2
Coiled
coil
.CC-BY-NC-ND 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 3, 2024. ; https://doi.org/10.1101/2024.04.03.587898doi: bioRxiv preprint