Results
An inducible CRISPR-interference iPSC system for the repression of hominoid-specific
TEs
To investigate the role of human-specific transposable elements (TEs) ( SVAs and LTR5Hs;
Fig. 1a) in human cranial neural crest cell (CNCC) development, we designed a stable human
iPSC line with an inducible CRISPR-interference (CRISPRi) system with single-guide RNAs
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
(sgRNAs) targeting ~80% of all SVAs and LTR5Hs annotated in the human genome (Fig. 1b).
The sgRNAs were originally designed and validated in a study by the Trono group (Pontis et
al., 2019) and were further used in subsequent studies (Barnada et al., 2022; Patoori et al.,
2022). Briefly, we cloned a stable iPSC line with a permanently integrated TET-inducible,
catalytically dead, Cas9 fused to a repressive KRAB domain (dCas9 -KRAB), along with the
two gRNAs targeting the LTR5Hs and SVAs (hereafter +gRNA line) . The KRAB domain
recruits the transcriptional machinery necessary to deposit repressive histone methylation
(H3K9me3) to the regions targeted by dCas9. To account for potential off-target effects caused
by exposure to doxycycline (TET-ON) or by Cas9 expression, we cloned the same iPSC line
with an identical dCas9-KRAB construct but without any gRNA s (hereafter -gRNAs line).
Treating the cells with doxycycline for 24 hours was sufficient to activate dCas9 in both
CRISPRi lines (i.e. with and without gRNAs; Fig. 1c,d).
Next, we generated CNCCs from our CRISPRi-iPSC lines using an established 5-day protocol
(Fig. 1e -f; Leung et al., 2016) . Since both SVAs and LTR5Hs have been shown to have
important roles in human embryonic stem cells and iPSCs (Barnada et al., 2022; Fuentes et
al., 2018; Pontis et al., 2019) , doxycycline was only introduced 24h after differentiation (Fig.
1c). By day 5 of differentiation, both cell lines expressed markers typical of CNCC identi ty,
both at gene (SOX9, SOX10, TWIST1, TFAP2A) and protein (SOX9, AP2 a) level. We
generally did not observe significant differences in CNCC marker expression between the two
lines, apart from TFAP2A (AP2a) at the gene level (Fig. 1e,f), indicating that expression of the
main genes essential for CNCC identity was largely unaffected by the CRISPRi.
Hundreds of SVAs and LTR5Hs are accessible in human CNCCs
We set out to determine whether any LTR5Hs and human-specific SVA exhibited chromatin
accessibility in human CNCCs , and whether these elements could be repressed using our
CRISPRi system. To this end, we performed ATAC-seq (paired-end, 150 bp reads) in hiPSC-
derived CNCCs generated with our CRISPRi-iPSC lines (+ and -gRNAs). K-mer clustering of
the ATAC-seq data identified a total of 256 accessible LTR5Hs and human-specific SVAs (Fig.
2a; Supplementary File S1). Notably, 77% of these 256 TEs displayed decreased accessibility
in the +gRNA CRISPRi line relative to the -gRNA control (cluster 2, Fig. 2a). This is consistent
with the assumption that these gRNAs can target ~80% of all the human LR5Hs and SVAs
(Pontis et al. 2019) . LTR5Hs were significantly overrepresented among the 256 accessible
human-specific TEs (observed 87%, expected 29%; Fisher’s Exact Test p < 2.2 x 10 -16; Fig.
2b), while the SVAs were significantly underrepresented (13% observed vs 71% expected;
Fisher’s Exact Test p < 2.2 x 10-16; Fig. 2b), highlighting a potential primary role for LTR5Hs in
CNCCs.
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Furthermore, we wanted to confirm that the loss in chromatin accessibility observed in the
+gRNA lines was a direct consequence of the CRISPRi-mediated repression. To this end, we
performed ChIP-seq for H3K9me3 (paired-end 150 bp reads), which showed accumulation of
this repressive histone mark at approximately 75% of the 256 accessible LTR5Hs and human-
specific SVAs in the +gRNA samples (Fig. 2c). Notably, there was a remarkable overlap (92%)
between the TEs that lost chromatin accessibility (Fig. 2a) and those gaining H3K9me3 (Fig.
2c).
We further investigated the genomic locations of these accessible human -specific TEs.
Overall, 91% of the 256 accessible LTR5Hs and human -specific SVAs were located > 1 kb
from the nearest transcription start site ( TSS), and the median distance was 8.9 kb. This
suggests that these accessible mobile elements could be putative CNCC enhancers. To
confirm this, we leveraged publicly available H3K27ac ChIP-seq data previously generated by
our group in hiPSC-derived CNCCs (Barnada et al., 2024) . This analysis revealed that the
vast majority of the 256 accessible human -specific TEs are also decorated by the H3K27ac,
an established active enhancer mark, supporting their role as bona-fide human-specific CNCC
enhancers (Supplementary Figure S1a).
To ensure that our findings were not biased by a specific differentiation protocol, we used an
alternative iPSC-to-CNCC differentiation method (Bajpai et al., 2010) and performed ATAC-
seq on the differentiated cells. With this protocol, migratory CNCCs are obtained in 2-3 weeks
(Bajpai et al., 2010; Barnada et al., 2024; Mitchell et al., 2025; Pagliaroli et al., 2021) . Using
this approach, we similarly identified 374 LTR5Hs and human-specific SVA elements that are
accessible in CNCCs, the majority of which exhibited reduced accessibility in the +gRNA line
compared to the -gRNA control (Supplementary Fig. S1 b). Moreover, comparable to our
findings obtained with the 5-day protocol, LTR5Hs were significantly overrepresented in the
set of accessible TEs (83% observed vs 29% expected; Fisher’s Exact Test p < 2.2 x 10 -16),
while SVAs were significantly underrepresented (17% observed vs 71% expected; Fisher’s
Exact Test p < 2.2 x 10 -16). Importantly, 91% of the 256 human -specific TEs identified as
accessible in the 5 -day protocol were also found accessible in CNCCs derived using the
alternative protocol. This consistency suggests that our findings are robust and independent
of the CNCC differentiation method used.
Finally, since our gRNAs also target non -human specific SVAs (i.e. SVA -A, -B, -C, -D), we
examined the ATAC-seq signal across all existing SVA subfamilies. In total, 184 non-human-
specific SVAs (A –D subfamilies) were accessible in CNCCs and 3922 were inaccessible
(Supplementary Fig. S1c). Notably, even when considering all SVA groups, SVAs remained
significantly underrepresented among accessible TEs (46% observed vs. 89% expected;
Fisher’s Exact Test, p < 2.2 × 10⁻¹⁶).
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
In summary, these experiments identified approximately 550 hominoid -specific TEs, half of
which human -specific, that exhibit signatures of active enhancers in human iPSC -derived
CNCCs, with LTR5H s elements playing a prominent role. Notably, our CRISPR -based
approach enabled the inducible repression of ~75% of these putative TE -derived human-
specific CNCC enhancers.
Co-opted human-specific TEs are enriched with the CNCC coordinator motif
Next, we explored the genomic features driving the co -option of the LTR5Hs and SVAs as
CNCC enhancers. We performed computational DNA motif analysis on the set of 256
accessible human-specific TEs using the non-accessible LTR5Hs and SVAs as background
control for differential enrichment. Recent studies have identified a specific DNA motif, known
as coordinator, which is enriched at enhancers critical for the regulation of CNCC identity (Kim
et al., 2024; Prescott et al., 2015). Coordinator is a composite motif which consists of a fusion
between a generic AT-rich homeobox motif (TTAATTA) and the binding motif of the CNCC
master regulator TWIST1, typically joined by a stretch of A nucleotides (Kim et al., 2024;
Prescott et al., 2015). Importantly, we found both components of the coordinator motif as highly
enriched in the set of accessible LTR5Hs and SVAs (Fig. 2d ; Supplementary File S 2).
Specifically, the TWIST1 motif and the AT-rich homeobox motif were found in 83% and 62%
of accessible human-specific TEs, respectively (Fig. 2d; Supplementary File S2 ). Moreover,
we found that 29% of the accessible human -specific SVAs and LTR5Hs harbour the full
coordinator motif sequence, as opposed to only 14% of the non -accessible TEs. This
suggests that the human -specific TEs accessible in CNCCs are significantly more enriched
for the coordinator motif relative to the non-accessible ones (Fisher’s Exact Test p < 0.00001).
In addition, the motifs for other CNCC signature transcription factors, such as AP2a, SOX9/10,
SLUG and FOXD3 were also found as significantly enriched (Fig. 2d; Supplementary File S2).
Given the prevalence of the coordinator motif in accessible LTR5Hs and SVAs, we investigated
whether these TEs are directly bound by TWIST1 in human CNCCs. To address this, we
leveraged publicly available TWIST1 ChIP-seq data generated in iPSC-derived CNCCs (Kim
et al., 2024), which revealed that over half of the accessible LTR5H s and SVA elements are
bound by TWIST1 in human CNCCs (Fig. 2e).
Overall, these findings suggest that DNA sequence and transcription factor binding are a major
driver for TE co-option as active CNCC enhancers.
Repression of LTR5Hs and SVAs impairs expression of CNCC-migration genes
Our experiments so far identified ~550 LTR5Hs and SVAs (half of which human -specific)
displaying active enhancer signature in human CNCCs. Since our CRISPRi system enables
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
the simultaneous repression of ~75% of these elements, we next investigated whether their
repression had significant effects on the CNCC transcriptome . To this end, we differentiated
our doxycycline-treated CRISPRi lines (+ and -gRNAs) into CNCCs and performed RNA-seq.
First, we analyzed gene expression in relation to the nearest transcription start sites (TSS) of
the 256 accessible human -specific SVAs and LTR5H s. In total, we identified 107 genes
located near these elements that were actively expressed in CNCCs (median TPM >1 across
all -gRNA replicates). Overall, the expression of these genes was significantly decreased upon
TE repression (Wilcoxon’s Rank Sum test p < 0.037; Supplementary Fig. S 2a), with 83/107
(=77.5%) displaying lower expression in the +gRNA sample relative to the -gRNA counterpart
(Supplementary File S3).
We leveraged publicly available RNA-seq data from CNCCs derived from chimpanzee iPSCs
(Prescott et al., 2015), which we reanalysed using our pipeline, to compare the expression of
these 107 genes between the two closely related ape species. This analysis revealed that
these genes are typically expressed at significantly higher levels in humans than in
chimpanzees (Fig. 3a; Supplementary File S3). However, repressing the human-specific SVAs
and LTR5Hs eliminate d this expression difference between the two species (Fig. 3a),
demonstrating a direct role for these TEs in species -specific CNCC gene regulation .
Consistent with this , repressing the human -specific TEs increased the correlation between
human and chimpanzee expression levels for these genes (Supplementary Fig. S2b).
Next, we examined transcriptome-wide effects. Differential gene expression analysis identified
795 genes that were significantly differentially expressed between +gRNA and -gRNA CNCCs
(FDR < 0.05; FC 1.5; Fig. 3b; Supplementary File S4). Of the se, 501 genes were
downregulated, while 294 genes were upregulated in the +gRNA samples (Fig. 3b). Gene
ontology enrichment analysis of the downregulated genes revealed a significant enrichment
for cell migration -related processes, suggesting that many of these genes play key roles in
CNCC migration (Fig. 3c; Supplementary File S5). Conversely, upregulated genes were
primarily associated with mitochondrial and cell division processes (Supplementary File S6).
Notably, only 3 of the differentially expressed genes had an intronic SVA or LTR5Hs,
suggesting that the high number of differentially expressed genes is not an artifact of CRISPRi
mediated repression at gene bodies.
Silencing human-specific TEs functionally affects CNCC migratory potential in vitro
Since cell migration was the predominant signature enriched in the downregulated genes, we
investigated whether repressing human -specific SVAs and LTR5Hs could functionally affect
CNCC migration. To test this, we performed a transwell migration assay , in which hiPSC -
CNCCs were seeded on geltrex-coated transwell membranes overnight and subsequently
exposed to medium supplemented with the general chemoattractant fetal bovine serum (FBS)
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
in the lower chamber (Fig. 4a). After 24 hours of incubation, comparison of the –gRNA and
+gRNA lines revealed a significant reduction in CNCC migration across the membrane upon
repression of human-specific TEs, suggesting impaired migratory capacity (Fig. 4b,c). These
findings were consistent with the RNA -seq data and suggest that human-specific SVAs and
LTR5Hs have been co-opted to regulate human CNCC migration.
The LTR5Hs and SVAs active in CNCCs are silent in most human cell-types
Finally, we investigated whether the 256 human -specific SVAs and LTR5H s with active
enhancer signature in CNCCs were also co-opted as cis-regulatory elements in other human
cell types. To address this, we leveraged publicly available H3K27ac ChIP-seq data from 14
cell types generated by the Roadmap Epigenomics Consortium , including distinct brain
regions, lung, aorta, adipose tissue, pancreas and spleen (Kundaje et al., 2015). This analysis
revealed that , in addition to CNCCs, these 256 TE exhibit active enhancer signature
exclusively in endomesodermal cells, with a subset also active in iPSCs (Fig. 5). In contrast,
they remain completely silenced in all other cell types (Fig. 5). The activation of these elements
in endomesodermal cells is u nsurprising, as a previous study from our lab ha s shown that
human-specific SVAs are highly enriched for the motif of the key mesodermal regulator
EOMES ( = TBR2; Patoori et al., 2022 ). Similarly, the co -option of SVAs and LTR5H s as
regulatory elements in human iPSCs and ESCs has been suggested by previous research
(Barnada et al., 2022; Fuentes et al., 2018; Pontis et al., 2019).
In summary, these findings support the notion that TE co -option as functional cis -regulatory
elements is largely a cell -type-specific process, with distinct subsets of human -specific TEs
contributing to regulatory networks in different developmental contexts.
References
Ahlstrom, J. D., & Erickson, C. A. (2009). The neural crest epithelial-mesenchymal transition in
4D: A ‘tail’ of multiple non-obligatory cellular mechanisms. Development, 136(11), 1801–
1812. https://doi.org/10.1242/dev.034785
Bajpai, R., Chen, D. A., Rada-Iglesias, A., Zhang, J., Xiong, Y., Helms, J., Chang, C. P., Zhao, Y.,
Swigut, T., & Wysocka, J. (2010). CHD7 cooperates with PBAF to control multipotent neural
crest formation. Nature, 463(7283), 958–962. https://doi.org/10.1038/nature08733
Barnada, S. M., Giner de Gracia, A., Morenilla-Palao, C., López-Cascales, M. T., Scopa, C.,
Waltrich, F. J., Mikkers, H. M. M., Cicardi, M. E., Karlin, J., Trotti, D., Peterson, K. A.,
Brugmann, S. A., Santen, G. W. E., McMahon, S. B., Herrera, E., & Trizzino, M. (2024).
ARID1A-BAF coordinates ZIC2 genomic occupancy for epithelial-to-mesenchymal transition
in cranial neural crest specification. The American Journal of Human Genetics.
https://doi.org/10.1016/j.ajhg.2024.07.022
Barnada, S. M., Isopi, A., Tejada-Martinez, D., Goubert, C., Patoori, S., Pagliaroli, L., Tracewell,
M., & Trizzino, M. (2022). Genomic features underlie the co-option of SVA transposons as
cis-regulatory elements in human pluripotent stem cells. PLoS Genetics, 18(6), 1–19.
https://doi.org/10.1371/journal.pgen.1010225
Bejerano, G., Lowe, C. B., Ahituv, N., King, B., Siepel, A., Salama, S. R., Rubin, E. M., Kent, W.
J., & Haussler, D. (2006). A distal enhancer and an ultraconserved exon are derived from a
novel retroposon. Nature, 441(1), 87–90. https://doi.org/10.1038/nature04696
Bourque, G., Leong, B., Vega, V. B., Chen, X., Yen, L. L., Srinivasan, K. G., Chew, J. L., Ruan, Y.,
Wei, C. L., Huck, H. N., & Liu, E. T. (2008). Evolution of the mammalian transcription factor
binding repertoire via transposable elements. Genome Research, 18(11), 1752–1762.
https://doi.org/10.1101/gr.080663.108
Bray, N. L., Pimentel, H., Melsted, P., & Pachter, L. (2016). Near-optimal probabilistic RNA-seq
quantification. Nature Biotechnology, 34(5), 525–527. https://doi.org/10.1038/nbt.3519
Bronner, M. E., & LeDouarin, N. M. (2012). Development and evolution of the neural crest: An
overview. In Developmental Biology (Vol. 366, Issue 1, pp. 2–9). Academic Press Inc.
https://doi.org/10.1016/j.ydbio.2011.12.042
Capra, J. A., Hubisz, M. J., Kostka, D., Pollard, K. S., & Siepel, A. (2013). A Model-Based
Analysis of GC-Biased Gene Conversion in the Human and Chimpanzee Genomes. PLoS
Genetics, 9(8). https://doi.org/10.1371/journal.pgen.1003684
Carroll, S. B. (2005). Evolution at two levels: On genes and form. In PLoS Biology (Vol. 3, Issue
7, pp. 1159–1166). Public Library of Science. https://doi.org/10.1371/journal.pbio.0030245
Chuong, E. B., Elde, N. C., & Feschotte, C. (2016). Regulatory evolution of innate immunity
through co-option of endogenous retroviruses. Science, 351(6277), 1083–1087.
https://doi.org/10.1126/science.aad5497
Chuong, E. B., Elde, N. C., & Feschotte, C. (2017). Regulatory activities of transposable
elements: From conflicts to benefits. In Nature Reviews Genetics (Vol. 18, Issue 2, pp. 71–
86). Nature Publishing Group. https://doi.org/10.1038/nrg.2016.139
Chuong, E. B., Rumi, M. A. K., Soares, M. J., & Baker, J. C. (2013). Endogenous retroviruses
function as species-specific enhancer elements in the placenta. Nature Genetics, 45(3),
325–329. https://doi.org/10.1038/ng.2553
Cordero, D. R., Brugmann, S., Chu, Y., Bajpai, R., Jame, M., & Helms, J. A. (2011). Cranial neural
crest cells on the move: Their roles in craniofacial development. American Journal of
Medical Genetics, Part A, 155(2), 270–279. https://doi.org/10.1002/ajmg.a.33702
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Cosby, R. L., Judd, J., Zhang, R., Zhong, A., Garry, N., Pritham, E. J., & Feschotte, C. (2021).
Recurrent evolution of vertebrate transcription factors by transposase capture. Science,
371(6531). https://doi.org/10.1126/science.abc6405
Feschotte, C. (2008). Transposable elements and the evolution of regulatory networks. Nature
Genetics, 9, 397–405.
Frost, J. M., Amante, S. M., Okae, H., Jones, E. M., Ashley, B., Lewis, R. M., Cleal, J. K., Caley,
M. P., Arima, T., Maffucci, T., & Branco, M. R. (2023). Regulation of human trophoblast gene
expression by endogenous retroviruses. Nature Structural and Molecular Biology, 30(4),
527–538. https://doi.org/10.1038/s41594-023-00960-6
Fuentes, D., Swigut, T., & Wysocka, J. (2018). Systematic perturbation of retroviral LTRs reveals
widespread long-range effects on human gene regulation. ELife.
https://doi.org/10.7554/eLife.35989.001
Fueyo, R., Judd, J., Feschotte, C., & Wysocka, J. (2022). Roles of transposable elements in the
regulation of mammalian transcription. Nature Reviews Molecular Cell Biology, 23(7), 481–
497. https://doi.org/10.1038/s41580-022-00457-y
Gans, C., & Northcutt, R. G. (1983). Neural Crest and the Origin of Vertebrates: A New Head.
Science, 220(4594), 268–273. https://doi.org/10.1126/science.220.4594.268
Gokhman, D., Agoglia, R. M., Kinnebrew, M., Gordon, W., Sun, D., Bajpai, V. K., Naqvi, S., Chen,
C., Chan, A., Chen, C., Petrov, D. A., Ahituv, N., Zhang, H., Mishina, Y., Wysocka, J.,
Rohatgi, R., & Fraser, H. B. (2021). Human–chimpanzee fused cells reveal cis-regulatory
divergence underlying skeletal evolution. Nature Genetics, 53(4), 467–476.
https://doi.org/10.1038/s41588-021-00804-3
Goubert, C., Zevallos, N. A., & Feschotte, C. (2020). Contribution of unfixed transposable element
insertions to human regulatory variation. Philosophical Transactions of the Royal Society B:
Biological Sciences, 375(1795). https://doi.org/10.1098/rstb.2019.0331
Heinz, S., Benner, C., Spann, N., Bertolino, E., Lin, Y. C., Laslo, P., Cheng, J. X., Murre, C.,
Singh, H., & Glass, C. K. (2010). Simple Combinations of Lineage-Determining Transcription
Factors Prime cis-Regulatory Elements Required for Macrophage and B Cell Identities.
Molecular Cell, 38(4), 576–589. https://doi.org/10.1016/j.molcel.2010.05.004
Imbeault, M., Helleboid, P. Y., & Trono, D. (2017). KRAB zinc-finger proteins contribute to the
evolution of gene regulatory networks. Nature, 543(7646), 550–554.
https://doi.org/10.1038/nature21683
Jacques, P. É., Jeyakani, J., & Bourque, G. (2013). The Majority of Primate-Specific Regulatory
Sequences Are Derived from Transposable Elements. PLoS Genetics, 9(5).
https://doi.org/10.1371/journal.pgen.1003504
Jheon, A. H., & Schneider, R. A. (2009). The cells that fill the bill: Neural crest and the evolution of
craniofacial development. In Journal of Dental Research (Vol. 88, Issue 1, pp. 12–21).
https://doi.org/10.1177/0022034508327757
Khouri-Farah, N., Winchester, E. W., Schilder, B. M., Robinson, K., Curtis, S. W., Skene, N. G.,
Leslie-Clarkson, E. J., & Cotney, J. (2025). Gene expression patterns of the developing
human face at single cell resolution reveal cell type contributions to normal facial variation
and disease risk. https://doi.org/10.1101/2025.01.18.633396
Kim, S., Morgunova, E., Naqvi, S., Goovaerts, S., Bader, M., Koska, M., Popov, A., Luong, C.,
Pogson, A., Swigut, T., Claes, P., Taipale, J., & Wysocka, J. (2024). DNA-guided transcription
factor cooperativity shapes face and limb mesenchyme. Cell.
https://doi.org/10.1016/j.cell.2023.12.032
King, M.-C., & Wilson, A. C. (1975). Evolution at Two Levels in Humans and Chimpanzees.
Science, 188, 107–116. https://www.science.org
Kunarso, G., Chia, N. Y., Jeyakani, J., Hwang, C., Lu, X., Chan, Y. S., Ng, H. H., & Bourque, G.
(2010). Transposable elements have rewired the core regulatory network of human
embryonic stem cells. Nature Genetics, 42(7), 631–634. https://doi.org/10.1038/ng.600
Le Douarin, N. M., & Dupin, E. (2018). The “beginnings” of the neural crest. In Developmental
Biology (Vol. 444, pp. S3–S13). Elsevier Inc. https://doi.org/10.1016/j.ydbio.2018.07.019
Leung, A. W., Leung, A. W., Murdoch, B., Salem, A. F., Prasad, M. S., Gomez, G. A., & García-
Castro, M. I. (2016). WNT/β-catenin signaling mediates human neural crest induction via a
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
pre-neural border intermediate. Development (Cambridge), 143(3), 398–410.
https://doi.org/10.1242/dev.130849
Li, H., & Durbin, R. (2009). Fast and accurate short read alignment with Burrows-Wheeler
transform. Bioinformatics, 25(14), 1754–1760. https://doi.org/10.1093/bioinformatics/btp324
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., &
Durbin, R. (2009). The Sequence Alignment/Map format and SAMtools. Bioinformatics,
25(16), 2078–2079. https://doi.org/10.1093/bioinformatics/btp352
Liao, Y., Wang, J., Jaehnig, E. J., Shi, Z., & Zhang, B. (2019). WebGestalt 2019: gene set
analysis toolkit with revamped UIs and APIs. Nucleic Acids Research, 47(W1), W199–W205.
https://doi.org/10.1093/nar/gkz401
Lieberman, D. E. (1998). Sphenoid shortening and the evolution of modern human cranial shape.
Nature, 393(6681), 158–162. https://doi.org/10.1038/30227
Love, M. I., Huber, W., & Anders, S. (2014). Moderated estimation of fold change and dispersion
for RNA-seq data with DESeq2. Genome Biology, 15(12). https://doi.org/10.1186/s13059-
014-0550-8
Lynch, V. J., Leclerc, R. D., May, G., & Wagner, G. P. (2011). Transposon-mediated rewiring of
gene regulatory networks contributed to the evolution of pregnancy in mammals. Nature
Genetics, 43(11), 1154–1159. https://doi.org/10.1038/ng.917
Martik, M. L., & Bronner, M. E. (2021). Riding the crest to get a head: neural crest evolution in
vertebrates. In Nature Reviews Neuroscience (Vol. 22, Issue 10, pp. 616–626). Nature
Research. https://doi.org/10.1038/s41583-021-00503-2
Minoux, M., & Rijli, F. M. (2010). Molecular mechanisms of cranial neural crest cell migration and
patterning in craniofacial development. In Development (Vol. 137, Issue 16, pp. 2605–2621).
https://doi.org/10.1242/dev.040048
Mitchell, Z. H., den Hoed, J., Claassen, W., Demurtas, M., Deelen, L., Campeau, P. M., Liu, K.,
Fisher, S. E., & Trizzino, M. (2025). CHD3 regulates BMP signalling response during cranial
neural crest cell specification. https://doi.org/10.1101/2025.02.14.638260
Ostertag, E. M., Goodier, J. L., Zhang, Y., & Kazazian, H. H. (2003). SVA Elements Are
Nonautonomous Retrotransposons that Cause Disease in Humans. In Am. J. Hum. Genet
(Vol. 73).
Pagliaroli, L., Porazzi, P., Curtis, A. T., Scopa, C., Mikkers, H. M. M., Freund, C., Daxinger, L.,
Deliard, S., Welsh, S. A., Offley, S., Ott, C. A., Calabretta, B., Brugmann, S. A., Santen, G.
W. E., & Trizzino, M. (2021). Inability to switch from ARID1A-BAF to ARID1B-BAF impairs
exit from pluripotency and commitment towards neural crest formation in ARID1B-related
neurodevelopmental disorders. Nature Communications, 12(1).
https://doi.org/10.1038/s41467-021-26810-x
Patoori, S., Barnada, S. M., Large, C., Murray, J. I., & Trizzino, M. (2022). Young transposable
elements rewired gene regulatory networks in human and chimpanzee hippocampal
intermediate progenitors. Development (Cambridge), 49(23).
https://doi.org/10.1242/dev.200413
Playfoot, C. J., Duc, J., Sheppard, S., Dind, S., Coudray, A., Planet, E., & Trono, D. (2021).
Transposable elements and their KZFP controllers are drivers of transcriptional innovation in
the developing human brain. Genome Research, 31(9), 1531–1545.
https://doi.org/10.1101/gr.275133.120
Pontis, J., Planet, E., Offner, S., Turelli, P., Duc, J., Coudray, A., Theunissen, T. W., Jaenisch, R.,
& Trono, D. (2019). Hominoid-Specific Transposable Elements and KZFPs Facilitate Human
Embryonic Genome Activation and Control Transcription in Naive Human ESCs. Cell Stem
Cell, 24(5), 724-735.e5. https://doi.org/10.1016/j.stem.2019.03.012
Prescott, S. L., Srinivasan, R., Marchetto, M. C., Grishina, I., Narvaiza, I., Selleri, L., Gage, F. H.,
Swigut, T., & Wysocka, J. (2015). Enhancer Divergence and cis-Regulatory Evolution in the
Human and Chimp Neural Crest. Cell, 163(1), 68–83.
https://doi.org/10.1016/j.cell.2015.08.036
Quinlan, A. R., & Hall, I. M. (2010). BEDTools: A flexible suite of utilities for comparing genomic
features. Bioinformatics, 26(6), 841–842. https://doi.org/10.1093/bioinformatics/btq033
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Quinn, J. P., & Bubb, V. J. (2014). SVA retrotransposons as modulators of gene expression.
Mobile Genetic Elements, 4(4), e32102. https://doi.org/10.4161/mge.32102
Ramírez, F., Dündar, F., Diehl, S., Grüning, B. A., & Manke, T. (2014). DeepTools: A flexible
platform for exploring deep-sequencing data. Nucleic Acids Research, 42(W1).
https://doi.org/10.1093/nar/gku365
Roadmap Epigenomics Consortium, Kundaje, A., Meuleman, W., Ernst, J., Bilenky, M., Yen, A.,
Heravi-Moussavi, A., Kheradpour, P., Zhang, Z., Wang, J., Ziller, M. J., Amin, V., Whitaker, J.
W., Schultz, M. D., Ward, L. D., Sarkar, A., Quon, G., Sandstrom, R. S., Eaton, M. L., …
Kellis, M. (2015). Integrative analysis of 111 reference human epigenomes. Nature,
518(7539), 317–329. https://doi.org/10.1038/nature14248
Sambataro, S., Fiorillo, L., Bocchieri, S., Stumpo, C., Cervino, G., Herford, A. S., & Cicciù, M.
(2022). Craniofacial Evolution: From Australopithecus to Modern Man. Journal of
Craniofacial Surgery, 33(1), 325–332. https://doi.org/10.1097/SCS.0000000000007841
Schmidt, D., Schwalie, P. C., Wilson, M. D., Ballester, B., Gonalves, Â., Kutter, C., Brown, G. D.,
Marshall, A., Flicek, P., & Odom, D. T. (2012). Waves of retrotransposon expansion remodel
genome organization and CTCF binding in multiple mammalian lineages. Cell, 148(1–2),
335–348. https://doi.org/10.1016/j.cell.2011.11.058
Simões-Costa, M., Stone, M., & Bronner, M. E. (2015). Axud1 Integrates Wnt Signaling and
Transcriptional Inputs to Drive Neural Crest Formation. Developmental Cell, 34(5), 544–554.
https://doi.org/10.1016/j.devcel.2015.06.024
Spoor, F., Wood, B., & Zonneveld, F. (1994). Implications of early hominid labyrinthine
morphology for evolution of human bipedal locomotion. Nature, 369(6482), 645–648.
https://doi.org/10.1038/369645a0
Sundaram, V., Cheng, Y., Ma, Z., Li, D., Xing, X., Edge, P., Snyder, M. P., & Wang, T. (2014).
Widespread contribution of transposable elements to the innovation of gene regulatory
networks. Genome Research, 24(12), 1963–1976. https://doi.org/10.1101/gr.168872.113
Sundaram, V., & Wysocka, J. (2020). Transposable elements as a potent source of diverse cis-
regulatory sequences in mammalian genomes. In Philosophical Transactions of the Royal
Society B: Biological Sciences (Vol. 375, Issue 1795). Royal Society Publishing.
https://doi.org/10.1098/rstb.2019.0347
Theveneau, E., & Mayor, R. (2012). Neural crest migration: interplay between chemorepellents,
chemoattractants, contact inhibition, epithelial–mesenchymal transition, and collective cell
migration. WIREs Developmental Biology, 1(3), 435–445. https://doi.org/10.1002/wdev.28
Trainor, P. A. (2010). Craniofacial birth defects: The role of neural crest cells in the etiology and
pathogenesis of Treacher Collins syndrome and the potential for prevention. In American
Journal of Medical Genetics, Part A: Vol. 152 A (Issue 12, pp. 2984–2994).
https://doi.org/10.1002/ajmg.a.33454
Trizzino, M., Kapusta, A., & Brown, C. D. (2018). Transposable elements generate regulatory
novelty in a tissue-specific fashion. BMC Genomics, 19(1). https://doi.org/10.1186/s12864-
018-4850-3
Trizzino, M., Park, Y. S., Holsbach-Beltrame, M., Aracena, K., Mika, K., Caliskan, M., Perry, G. H.,
Lynch, V. J., & Brown, C. D. (2017). Transposable elements are the primary source of
novelty in primate gene regulation. Genome Research, 27(10), 1623–1633.
https://doi.org/10.1101/gr.218149.116
Wang, H., Xing, J., Grover, D., Hedges Kyudong Han, D. J., Walker, J. A., & Batzer, M. A. (2005).
SVA elements: A hominid-specific retroposon family. Journal of Molecular Biology, 354(4),
994–1007. https://doi.org/10.1016/j.jmb.2005.09.085
Wang, J., Xie, G., Singh, M., Ghanbarian, A. T., Raskó, T., Szvetnik, A., Cai, H., Besser, D.,
Prigione, A., Fuchs, N. V., Schumann, G. G., Chen, W., Lorincz, M. C., Ivics, Z., Hurst, L. D.,
& Izsvák, Z. (2014). Primate-specific endogenous retrovirus-driven transcription defines
naive-like stem cells. Nature, 516(7531), 405–409. https://doi.org/10.1038/nature13804
Wray, G. A. (2007). The evolutionary significance of cis-regulatory mutations. In Nature Reviews
Genetics (Vol. 8, Issue 3, pp. 206–216). https://doi.org/10.1038/nrg2063
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Figure 1. An inducible CRISPRi tool to repress human -specific SVAs and LR5Hs in
iPSC-derived CNCCs – (a) Schematic overview of SVA and LTR5Hs transposable elements,
illustrating their structure and phylogenetic distribution across Old World monkeys, apes, and
humans. Human-specific SVA subfamilies (i.e., SVA_E/F/F1) and the LTR5Hs are highlighted,
along with the approximate number of elements in each category. (b) CRISPR-dCas9-KRAB
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
targeting strategy for SVAs and LTR5Hs. Two guide RNAs (gRNA1 and gRNA2) target ~80%
of all SVAs and LTR5Hs, directing dCas9-KRAB to deposit repressive epigenetic marks at the
TEs, thereby reducing their transcriptional activity. (c) Experimental timeline for doxycycline
(dox)-induced expression of dCas9-KRAB. Dox is added to the media 24 hours after the start
of differentiation into cranial neural crest cells, which spans 5 days. (d) Western blot confirming
dox-inducible dCas9 -KRAB expression. Lysates from cells cultured with or without TE -
targetting gRNAs were probed with antibodies against dCas9 (top band, ~160 kDa) and
GAPDH (bottom band, loading control). (e) Immunofluorescence displaying expression of
CNCC signature markers AP2a (red) and SOX9 (green) in CNCCs differentiated using
CRISPRi-iPSCs with out (left) and with gRNAs ( right). Signal intensity was quantified and
normalised to the cell number per frame . (f) RT-qPCR of CNCC markers SOX9, TWIST1,
SOX10 and TFAP2A. n=6. Statistical analyses were performed using unpaired t-tests. Error
bars indicate mean ± SEM. **, p<0.01.
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Figure 2 – Hundreds of human -specific SVAs and LTR5Hs function as enhancers in
human CNCCs – (a) ATAC-seq signal across human -specific SVAs and LTR5Hs. Heatmap
rows represent individual transposable elements (TEs) aligned at their start or end (±500 bp),
and the intensity reflects chromatin accessibility. Clusters were generated with deepTools
using k -mer algorithm. (b) Pie charts showing the observed versus expected fractions of
accessible TEs belonging to the LTR5Hs/SVA subfamilies. (c) H3K9me3 enrichment at the
accessible human-specific SVAs and LTR5Hs. The top panel displays the average H3K9me3
signal (average profile) aligned to the TE start or end (±3 kb), while the heatmap below shows
individual TEs ranked by signal intensity. (d) Transcription factor motif analysis of accessible
human-specific SVAs and LTR5Hs. Representative enriched motifs (TWIST1, HOMEOBOX,
SOX9/10, AP2a, SLUG, and FOXD3) in accessible human-specific TEs are shown with the
percentage of TEs containing each motif. (e) Heatmaps of TWIST1 ChIP-seq signal (left) and
corresponding input control (right) centred on the accessible human -specific SVAs and
LTR5Hs (±2 kb). Colo ur intensity indicates normalized enrichment, highlighting TWIST1
occupancy at these accessible elements.
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Figure 3. Human-specific SVAs and LTR5Hs regulate the expression of CNCC migration
genes – (a) Violin plots displaying expression (log2 TPM) of genes located near accessible
human-specific SVAs and LTR5Hs. Three conditions are shown: human CNCCs generated
using CRISPRi iPSC line without guide RNAs (−gRNA), human CNCCs generated using
CRISPRi iPSC l ine with guide RNAs (+gRNA), and chimpanzee CNCCs. The decreased
expression in the +gRNA condition indicates potential regulatory contributions of these human-
specific TEs. (b) Volcano plot illustrating differentially expressed genes between the −gRNA
and +gRNA conditions in human CNCCs. The x-axis shows the log2 fold change, while the y-
axis represents the −log10 P-value. Significantly upregulated (green) and downregulated (red)
genes are highlighted, with total numbers indicated. Dashed lines mark common significance
thresholds. (c) Functional enrichment analysis of downregulated genes. Bubble plots display
top enriched pathways, with the x -axis indicating enrichment ratio (blue) or the -log10 FDR
(red) and the y -axis listing pathway terms. The size of each bubble reflects statistical
significance. Pathways involved in cell migration and motility are among the most prominently
enriched.
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Figure 4. HiPSC-NCCs show inferior migration capability when human-specific TEs are
silenced. (a) Outline of the transwell migration assay. (b) 1 x 105 hiPSC-NCCs (+/- gRNAs)
were seeded overnight, followed by a media change in the upper chamber and incubation with
differentiation medium +10% FBS in the lower chamber for 24h. At 24h, non-migrated hiPSC-
NCCs on the top of the membrane were swabbed away and migrated hiPSC-CMs were fixed,
stained and imaged. DAPI staining was used to visualise nuclei. Scale bar: 50 μm. (c)
Quantification of transwell membranes. 10X images were taken from 4 different fields of view
and the average number of migrated cells was calculated . n=5. Statistical analyses were
performed using an unpaired t -test. Error bars indicate mean ± S EM. This experiment was
repeated twice in two independent rounds of differentiation, which yielded comparative results.
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint
Figure 5. Co -option of SVAs and LTR5Hs as active enhancers is cell -type specific –
Heatmaps of normalized H3K27ac ChIP -seq signal (Roadmap Epigenomics Consortium)
across 256 accessible human -specific SVAs and LTR5Hs in multiple tissues and cell types.
Each row corresponds to an individual transposable element (TE), aligned at its cent re (±3
kb), while columns represent different tissues/cell lines. The colour bar on the right indicates
the relative intensity of H3K27ac signal, with higher enrichment in red and lower enrichment
in blue.
.CC-BY 4.0 International licensemade available under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is
The copyright holder for this preprintthis version posted April 6, 2025. ; https://doi.org/10.1101/2025.04.04.647334doi: bioRxiv preprint