PIWI proximity proteome reveals Set1-mediated piRNA biogenesis for transposon silencing in telomere

preprint OA: closed CC-BY-4.0
📄 Open PDF Full text JSON View at publisher
Full text 114,356 characters · extracted from oa-pdf · 8 sections · click to expand

Abstract

16 Silencing complexes formed by PIWI -clade Argonaute (Ago) proteins and 17 PIWI-interacting RNAs (piRNAs) are essential guardians of genome integrity, 18 controlling the deleterious activities of transposable elements (TEs) in animal germline. 19 However, our understanding of PIWI-piRNA-directed TE silencing remains incomplete. 20 Here, we systemically characterize the proximity proteome of PIWI members, Piwi, 21 Aubergine (Aub), and Ago3 in the germline of Drosophila ovaries. Functional screening 22 identifies previously uncharacterized factors involved in TE silencing, including 23 H3K4me3 writer and transcriptional coactivator Set1. Transcriptome analysis reveals 24 that Set1 acts as an indispensable repressor for TEs , particularly those forming 25 telomeres. The involvement of Set1 in Piwi pathway is further supported by its critical 26 role in the production of antisense, TE-targeting piRNAs. Notably, catalytic activity of 27 Set1 is dispensable for TE silencing . Genome-wide chromatin binding analysis using 28 CUT&Tag demonstrates that Set1 preferentially associates with TE sequences and 29 localizes to subtelomeric piRNA cluster loci , suggesting a role in promoting piRNA 30 precursor transcription through direct binding . Collectively, these findings uncover a 31 noncanonical function of Set1 in Piwi-mediated TE silencing and telomere control in 32 germline nuclei. 33 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

Introduction

34 Transposable elements (TEs) are mobile DNA sequences constituting a substantial part 35 of genomes in most organisms (Wells & Feschotte, 2020). Although TEs act as drivers 36 of genomic evolution and species diversification, the harmful potential as mutagens 37 requires the hosts to establish the control mechanisms. The control is particularly 38 important in germline cells where activated TEs accumulate damages on the inheritable 39 genomes and impair the production of functional gametes. In animals, TE control in 40 germline cells relies on silencing mediated by complexes of PIWI clade Argonaute 41 (Ago) family proteins and the sequence-specific guide, PIWI -interacting (pi)RNAs 42 (Wang et al, 2023b). 43 PIWI-piRNA-directed TE silencing has been studied in different animals 44 including the invertebrate model, Drosophila melanogaster. D. melanogaster possesses 45 three PIWI members including Piwi, Aubergine (Aub), and Ago3 , all of which are 46 expressed in germline cells. Piwi differs from the other two in that it functions in the 47 nucleus. Guided by piRNAs, Piwi complexes recognize the nascent transcripts from 48 target loci, mediating the (co -)transcriptional silencing and heterochromatin formation 49 (Le Thomas et al, 2013; Yu et al, 2015b; Sienski et al, 2015a; Mugat et al, 2020; Ariura et 50 al, 2024). In contrast, Aub and Ago3 are cytoplasmic proteins, localizing to nuage, a 51 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint perinuclear membraneless organelle formed in germline cells (Suyama & Kai, 2025; 52 Kawaguchi et al, 2025). In nuage, Aub and Ago3 mediate the reciprocal cleavage of 53 targets containing sense and antisense TE sequences. 3’ cleavage products generated by 54 either of the two proteins are passed to the other (i.e., from Aub to Ago3, or from Ago3 55 to Aub), then processed into a piRNA. Newly generated piRNAs direct the next round 56 of target cleavage and piRNA production. These cycles are called ping -pong, coupling 57 TE degradation and piRNA amplification in germline cells. Because target cleavage 58 occurs between the nucleotides complementary to 10th and 11 th nucleotides of guide 59 piRNA, and new piRNA is produced from the 5’ end of cleavage product, germline 60 piRNAs exhibit 10-nt overlap signature (ping-pong signature). In addition to ping-pong, 61 precursor fragments incorporated to Aub can be processed into multiple piRNAs on 62 mitochondrial outer membrane. These phasing/trailer piRNAs are mainly loaded onto 63 Piwi, translocating the complexes to the nucleus (Ge et al, 2019). Ping-pong cycles 64 process diverse transcripts including intact TE mRNAs and long non -coding RNAs 65 derived by noncanonical transcription of heterochromatinized, large intergenic regions 66 accumulating truncated TE sequences, defined as piRNA cluster loci (Brennecke et al, 67 2007a). 68 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint piRNA-directed TE silencing is achieved through interactions between PIWI 69 proteins and other components . These include nuclear factors involved in 70 Piwi-piRNA-directed target heterochromatinization (e.g., Panx, Nxf2, SetDB1/Eggless) 71 (Zhao et al, 2019; Murano et al, 2019; Fabry et al, 2019; Batki et al, 2019; Yu et al, 72 2015b; Sienski et al , 2015a). In cytoplasm, Tudor domain proteins (Krimper, Tejas, 73 Tapas, Qin/Kumo) and RNA helicases (Spn-E, Vas) are enriched in nuage and support 74 the ping-pong cycles mediated by Aub and Ago3 (Webster et al, 2015; Lin et al, 2023; 75 Handler et al, 2011; Zhang et al, 2011; Anand & Kai, 2012; Qi et al, 2011; Saito et al, 76 2010; Lim & Kai, 2007; Kawaguchi et al , 2025) . Nonetheless, characterizing the 77 interactions between PIWI members and other factors are often challenging because of 78 their dynamics in piRNA biogenesis and TE silencing. Hence, the repertoire of piRNA 79 pathway components is heretofore unelucidated, and ou r understanding of 80 piRNA-directed TE silencing remains incomplete. 81 Proximity-dependent biotin labelling (BioID/TurboID) is a powerful tool for 82 characterizing the physical interactions and proximity relationships of proteins in vivo 83 (Roux et al, 2012; Branon et al, 2018; Choi‐Rhee et al, 2004). These techniques rely on 84 the promiscuous biotin ligase activity exhibited by BirA derived from Escherichia coli. 85 TurboID utilizes an efficient and compact BirA variant called mini(m)Turbo established 86 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint through directed evolution (Branon et al, 2018), which we previously introduced in male 87 germline cells to study testis -specific factors (Iki et al, 2023; Kai et al, 2025; Iki et al, 88 2020). In order to deepen the understanding of piRNA-directed TE silencing, this study 89 characterized the proteins in the physical proximity of individual PIWI proteins in 90 female germline cells where TE silencing has been extensively studied . The obtained 91 proximity proteome of germline PIWI members not only confirmed reported 92 interactions but also revealed the unrecognized links. Moreover, PIWI proximity 93 proteome contained a list of factors which have not been characterized in piRNA 94 pathway thus far. Of those, this study further characterizes Set1, a histone 95 methyltransferase and transcriptional coactivator conserved across eukaryotes. Our data 96 highlight the hitherto unappreciated importance of Set1 in Piwi-piRNA-directed 97 silencing of TEs forming telomeres in Drosophila genome. 98 99 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

Results

100 Labelling of PIWI proximity factors and purification from germline cells 101 To characterize the proximity proteome of PIWI proteins using TurboID, we generated a 102 series of transgene s encoding mTurbo -GFP-fused Piwi, Aub, or Ago3 (Figure 1A). 103 These transgenes were expressed in ovarian germline cells under the control of nos 104 promoter activity (UASp/Gal4 binary system using NGT 40;nos-Gal4 (NN) driver) 105 (Rørth, 1998; Grieder et al , 2000) . The biotinylation activities of individual fusion 106 proteins were confirmed by detecting the streptavidin-HRP-dependent signals (Figure 107 S1A). However, pulldown using streptavidin was inefficient and biotinylated proteins 108 largely remained in the flow through , possibly due to interfer ing factors accumulated 109 during differentiation. To circumvent this technical challenge, we depleted one of the 110 differentiation factor s, bag of marbles (bam) (McKearin & Spradling, 1990) , by 111 germline knockdown (GLKD) , and collected germline stem cell (GSC)-like cells 112 (GSCLCs) impaired in differentiation (Figure 1A, S1BC). Although TurboID in 113 GSCLCs showed weaker biotinylation signals in the input, the pulldown efficiency was 114 dramatically improved (Figure S1A). 115 In GSCLCs, mTurbo-GFP alone displayed a broad distribution across both the 116 nucleus and cytoplasm (Figure 1B). In contrast, mTurbo -GFP-PIWI proteins exhibited 117 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint subcellular localization patterns consistent with those of their endogenous counterparts; 118 Piwi accumulated in the nucleus, while Aub and Ago3 were enriched in the perinuclear 119 nuage. The concordance suggests that N -terminal mTurbo -GFP tagging does not 120 substantially disrupt the interaction networks of individual PIWI proteins. Supporting 121 this notion, N-terminal mTurbo-FLAG fusion preserved Aub functions in male germline 122 (Iki et al, 2023). Notably, the steady-state levels of fusion proteins were below those of 123 endogenous counterparts (Figure S1D) . mTurbo-GFP-Ago3 was the least stable and 124 barely detectable in the input (Figure 1C , anti -GFP), accounting for the weak 125 streptavidin-HRP signals in Ago3 -TurboID condition (Figure 1 C). Nevertheless, the 126 streptavidin pulldown displayed distinct signal patterns, suggesting 127 biotinylation-dependent enrichment of specific factors in each PIWI-TurboID condition. 128 129 Characterization of germline PIWI proximity proteomes 130 Mass spectrometry analysis and the label-free quantification (LFQ) of streptavidin 131 pulldown samples identified the proximity factors of individual PIWI proteins (average 132 abundance ratios >1 compared to the control GFP-TurboID conditions in biological 133 duplicates) (Table S1). These included 37, 33, and 49 factors for Piwi, Aub, and Ago3, 134 respectively (Figure 2A and 2B, S2). Notably, the identified proximity factors contained 135 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint known piRNA pathway components (Figure 2A, highlighted in yellow, Table S2), and 136 the significant enrichment was confirmed by Gene Ontology (GO) analysis (Figure 2C). 137 These results suggest that the proteome reflects the physical and functional interactions 138 between PIWI members and other factors in germline cells. 139 140 Screening of PIWI proximity factors involved in piRNA biogenesis and TE 141 silencing 142 Most factors identified in the proximity of PIWI members have not been functionally 143 characterized in piRNA pathway or TE silencing. Hence, w e investigated the possible 144 involvement of 54 factors, for which short hairpin-based RNAi lines were available, by 145 germline knockdown (GLKD) screening using NN driver. Given that impaired piRNA 146 biogenesis can be associated with nuage collapse and mislocalization of its components, 147 we examined the localization of Krimp, a key scaffold of nuage (Lim & Kai, 2007; Patil 148 & Kai, 2010). In parallel, we observed the accumulation of Gag protein encoded by a 149 non-LTR retroelement HeT-A, as a readout of TE derepression (Shpiz et al, 2011). We 150 confirmed that GLKD of aub leads to the accumulation of HeT-A Gag protein around 151 the oocytes of developing egg chambers (Figure 3A) , and the loss of Krimp from 152 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint perinuclear nuage and the formation of abnormal foci in the cytoplasm (Figure 3B , 153 S3B). 154 GLKD screening identified several factors including Hrb27C and Set1 , the 155 proximity factor of Aub and Piwi, respectively (Table S3). Both factors are conserved 156 across eukaryotes , and their molecular and physiological functions have been 157 characterized. Hrb27C (also known as Hrp48) is a n hnRNP family member required in 158 GSC for the maintenance (Yan et al, 2014; Ables et al, 2016; Finger et al, 2023). Set1 is 159 an enzyme acting inside COMPASS complex responsible for di- and tri-methylation of 160 histone H3 at lysine 4 (H3K4me2/3), a modification associated with transcription start 161 sites (TSSs) for RNA polymerase (pol)II activation (Ardehali et al, 2011; Mohan et al, 162 2011; Wang et al , 2023a) . We observed distinct phenotypes in hrb27C-GLKD and 163 set1-GLKD. In hrb27C-GLKD, abnormal Krimp foci appeared in the cytoplasm, similar 164 to aub-GLKD (Figure 3B). However, HeT-A Gag protein was under detectable level 165 (Figure 3A). In set1-GLKD, though abnormal Krimper foci formation was not clear, 166 HeT-A Gag protein accumulated around the oocyte. In both hrb27c- and set1-GLKD 167 conditions, perinuclear Krimp signals could be weaker but still observed (Figure 3B). 168 Consistently, Aub and Piwi did not show drastic changes in their localization (Figure 3C, 169 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint S3C). Of note, RT -qPCR confirmed target downregulation in individual GLKD 170 conditions (Figure S3D). 171 172 set1 and hrb27C functionally interact with piwi and aub, respectively, to control 173 TEs 174 To investigate the global effect of Hrb27C or Set1 depletion in the germline on TE 175 expression, we compared the transcriptome from hrb27C- or set1-GLKD ovaries with 176 that from gfp-GLKD control ovaries (Figure 4 A, S4 A). In parallel, aub- and 177 piwi-GLKD ovary transcriptomes were generated as references for TE derepression. 178 Our analysis included germline stem cell -like cell (GSCLC) condition in which PIWI 179 proximity proteome was characterized (Figure 4B, S4B). First, comparison of 180 gfp-GLKD transcriptomes between non-GSCLC and GSCLC contexts revealed that TE 181 transcript levels are generally elevated in GSCLCs (Figure S4C). This baseline increase 182 may account for the relatively modest TE derepression observed for aub- or 183 piwi-GLKD in GSCLC setting (Figure 4AB). 184 Hrb27C was identified as the proximity factor of Aub (Figure 2). H owever, 185 unlike aub-GLKD, hrb27C-GLKD did not result in clear TE derepression in 186 non-GSCLC ovaries (Figure 4A, S4A) . In contrast , TE derepression was evident in 187 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint GSCLC ovaries (Figure 4B, S4B) , and the patterns were positively correlated between 188 hrb27C- and aub-GLKD conditions (r = 0.57, Figure S4D). These suggest a functional 189 link between Hrb27C and Aub in GSCs. 190 For Set1 identified as the proximity factor of Piwi (Figure 2), its GLKD caused 191 remarkable derepression of a subset of TEs in non-GSCLC ovaries (Figure 4A, S4A). 192 These set1-sensitive TEs included HeT-A, consistent with its Gag protein accumulation 193 (Figure 3) , as well as TART family members , which together with HeT-A maintain 194 telomeres as HTT array (Villasante et al, 2007). TAHRE, another HTT component, was 195 not analyzed due to the lack of annotated insertions. In addition to HTT, several LTR 196 retrotransposons including HMS-Beagle, Max-element, diver, 3S18, and gypsy12 197 exhibited robust derepression (Figure 4A) . This result was corroborated by analyses 198 based on mapping reads to consensus sequences of individual TE families (Figure 4C). 199 Overall, the derepression profile of set1-GLKD showed a strong positive correlation 200 with that of piwi-GLKD (r = 0. 73), but not with aub-GLKD (r = 0.02, Figure 4 C). 201 Consistent with this, re-analysis of publicly available dataset (GSE103582) indicated 202 that HTT members are more sensitive to the loss of piwi than that of aub (Figure 4D, 203 S4E) (Teixeira et al , 2017a). Moreover, similar to set1-GLKD, HTT members were 204 strongly derepressed upon loss of Panoramix (Panx), an essential cofactor of Piwi 205 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint (GSE71374) (Sienski et al, 2015b; Yu et al, 2015a). In contrast to the pronounced effect 206 observed in germline, knockdown of set1 in ovarian follicle cells using traffic jam-Gal4 207 did not derepress zam and gypsy, which are silenced by piwi in this somatic lineage 208 (Figure S4F). Taken together, these results suggest that Set1 function is associated with 209 Piwi-mediated TE silencing in germline cells. 210 211 Catalytic activity-independent role of Set1 in TE silencing 212 The C-terminal SET domain of Set1 is responsible for H3K4me3 modification (Mohan 213 et al , 2011; Ardehali et al , 2011; Wilson et al , 2002) . To investigate whether this 214 methyltransferase activity is required for TE silencing , we performed rescue 215 experiments by expressing RNAi-resistant transgene encoding GFP -tagged wild -type 216 Set1 (Set1 WT) or a catalytically inactive variant (Set1E1613K) in germline cells depleted 217 of endogenous Set1 (set1-GLKD) (Figure 5A) (Vidaurre et al, 2024; Hallson et al, 2012). 218 Both GFP-Set1WT and GFP -Set1E1613K proteins accumulated in the germline nuclei 219 (Figure 5B). Weaker GFP signals observed for E1613K variant suggest reduced protein 220 stability. 221 Transcriptome analysis revealed that HTT and other TE families were fully 222 re-repressed by GFP -Set1WT introduced in the set1-GLKD background, indicating the 223 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint depletion of set1 is causal for their derepression (Figure 5C). Remarkably, although less 224 effective than GFP-Set1WT, expression of GFP-Set1E1613K led to substantial 225 re-repression of TEs. This effect was most pronounced for HTT, with TART-A1 as an 226 exception (Figure 5C). Together, these results suggest that Set1 plays a noncanonical, 227 catalytic activity-independent role in TE silencing. 228 To further support the above notion , we analyzed H3K4me3 signals in 229 set1-GLKD and the transgene-rescue conditions. Ovary immunotaining showed that 230 H3K4me3 signals were depleted from germline nuclei in set1-GLKD (Figure 5 B). 231 Germline H3K4me3 signals were fully recovered by GFP -Set1WT, while no recovery 232 was observed with GFP-Set1E1613K. We further employed CUT&Tag to genome-widely 233 characterize H3K4me3 signals. H3K4me3 peaks were identified around the 234 transcription start site (TSS) of genes including ago3 and aub (Figure 5D). Consistent 235 with the immunostaining data, H3K4me3 signals on ago3 and aub were markedly 236 reduced in set1-GLKD ovaries, and fully restored by GFP-Set1WT, but not by 237 GFP-Set1E1613K (Figure 5D). These results confirm the catalytic inactivity of E1613K 238 variant and further support the catalytic activity -independent role in TE silencing . 239 Notably, despite these changes in H3K4me3 signals, steady-state transcript levels of 240 piRNA pathway components were only modestly affected in set1-GLKD ovaries 241 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint (Figure S5 A). Moreover, RT -qPCR analysis showed that depletion of COMPASS 242 subunits led to milder derepression of HeT-A compared with that of set1 (Figure S5B). 243 Taken together, these results argue against the notion that Set1 functions solely as an 244 H3K4me3 writer and represses TEs simply by promoting the expression of piRNA 245 pathway components. 246 247 Loss of TE-targeting piRNAs in set1-depleted ovaries 248 TE derepression pattern suggests the functional link between Set1 and Piwi (Figure 4, 249 S4). Moreover, consistent with the ir close proximity (Figure 2), Set1 and Piwi interact 250 physically, as Piwi co-precipitated with GFP-Set1WT and GFP -Set1E1613K expressed in 251 germline cells (Figure 6A). Notably, the interaction appears transient or weak, given the 252 dependency on crosslinking (Figure S6A) . These physical and functional links 253 prompted us to examine whether piRNA expression is affected by set1 depletion. 254 Deep-sequencing of ovarian small RNAs showed that overall abundance of piRNAs 255 (23~29-nt fragments and those derived from TE sequences) was largely unchanged in 256 set1-GLKD (Figure 6B). However, analysis at the level of individual TE families 257 revealed a striking loss of piRNAs mapping to telomeric HTT members (Figure 6C). 258 Notably, antisense piRNAs, which directly target cognate TE transcripts, were reduced 259 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint more severely than sense piRNAs. A prominent example is HeT-A, for which sense 260 piRNAs were largely unaffected whereas antisense species were nearly lost (Figure 261 6CD). Supporting the link to Piwi pathway, re -analysis of published small RNA -seq 262 data (GSE71374) revealed a similar antisense-biased reduction of HTT-mapping 263 piRNAs in ovaries lacking Panx, a key cofactor of Piwi (Figure 6E) (Yu et al, 2015a; 264 Sienski et al, 2015b). An antisense-biased reduction of piRNAs was also observed for 265 other Set1-controlled TEs, including 3S18, gypsy12, Max-element, diver, and 266 HMS-Beagle (Figure 6C). Despite this loss, the ping-pong signature ( 10-nt overlap 267 frequency) of piRNAs was preserved (Figure S 6). This is consistent with the 268 maintenance of nuage structure in set1-GLKD ovaries (Figure 3B) . Together, t hese 269 findings indicate that Set1 is required to express TE-targeting functional piRNAs, 270 supporting its crucial role in Piwi-mediated TE silencing. 271 272 Set1 binds TE sequences and localizes to subtelomeric piRNA cluster loci 273 How does Set1 contribute to the expression of antisense piRNAs? To address this 274 question, we analyzed the chromatin binding by GFP-Set1 expressed in set1-depleted 275 germline cells, using CUT&Tag with anti-GFP antibodies. In parallel with GFP-Set1WT, 276 GFP-Set1E1613K was included in the analysis , given the functionality in TE silencing 277 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint (Figure 5) . Comparison between GFP -Set1 and GFP control conditions identified 278 Set1-binding peaks ( p<0.01, Figure S7A ). Notably, genes and TEs associated with 279 peaks from Set1WT or Set1 E1613K showed limited overlap (Figure 7A) . However, this 280 does not indicate their targets are distinct; rather, it reflects differential affinity between 281 Set1WT and Set1 E1613K to shared targets (r=0.85, Figure 7B). Consistent with its role as 282 H3K4me3 writer, targets identified from Set1WT peaks were predominantly endogenous 283 genes (541 of 548, Figure 7A). These bindnig peaks appeared around transcription start 284 site ( TSS) and coincided with set1-dependent H3K4me3 signals, as exemplified with 285 aub (Figure 7C). In contrast, targets identified from Set1E1613K peaks were enriched with 286 TEs (Figure 7A, 84 of 306) , including HeT-A, TART, diver, HMS-Beagle, and 287 Max-element, regulated by Set1 (Figure 4A, 5C). At the level of consensus sequences , 288 two major peaks were typically observed: one spanning the 5’UTR-ORF1 region and 289 another within the 3’UTR (Figure 7 D, S7CD). Collectively, these results revealed the 290 affinity of Set1 to TE sequences. 291 We mention that H3K4me3 signals on TEs behaved differently from those at 292 gene TSS: they increased upon set1 depletion and decreased upon expression of both 293 Set1WT and Set1 E1613K, indicating a lack of correlation with Set1’s catalytic activity 294 (Figure 7D, S7CD). Instead, TE-associated H3K4me3 signals were positively correlated 295 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint with the transcript abundance (Figure 5) , suggesting that these marks are deposited 296 either by residual Set1 in the set1-depleted germline cells or through Set1-independent 297 mechanisms (Ardehali et al, 2011). 298 3’UTRs of HTT members possess promoter and TSS for bidirectional 299 transctiption (Figure 7D) (Danilevskaya & Arkhipova, 1997; Radion et al , 2017; 300 Maxwell et al, 2006). Intriguingly, HTT 3’UTRs interacting with Set1 are maintained as 301 truncated insertions in subtelomeric piRNA cluster loci. HeT-A{}6278 in cluster 3 is one 302 of those 3’UTR fragments. Alignment to the consensus sequence suggests that 303 HeT-A{}6278 maintains promoter and antisense TSS but not sense TSS (Figure S7E). 304 piRNA hot spot starting from th is 3’UTR fragment implies the involvement in piRNA 305 precursor transcription (Figure 7E). Set1 showed affinity to other sites on cluster 3 , 306 which is contrastive to non-telomeric clusters lacking, if not any, Set1-binding signals 307 (Figure S7F). Insertions from set1-sensitive LTR retrotransposons are rare within major 308 piRNA clusters. Max{}2206 is one of exceptions found in 42AB cluster, but it did not 309 exhibit Set1 binding (Figure S7F). Hence, Set1 binding to cluster loci is biased toward 310 subtelomeric regions, whose piRNA production is extremely set1-dependent (Figure 311 7E). Set1 binding to LTR retrotransposons may instead reflect its affinity for potentially 312 mobile copies located outside piRNA clusters (Wang et al, 2018). Collectively, t hese 313 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

Results

support a model in which Set1 binding to TE sequences promotes precursor 314 transcription for piRNA biogenesis in germline cells (Figure 7F). 315 316 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

Discussion

317 To deepen the understanding of piRNA -directed TE silencing, this study explored the 318 proximity factors of individual PIWI proteins expressed in female germline cells. The 319 proteome provided additional clues supporting the known interactions, and, more 320 importantly, revealed otherwise unrecognized links between factors in piRNA 321 biogenesis. First, we point out that Zucchini (Zuc), an endonuclease on mitochondrial 322 outer membrane, was identified in the proximity of both Aub and Piwi . These three 323 proteins are together involved in phased piRNA production (Ge et al, 2019), but their 324 physical links have not been reported in ovaries. It should be noted, a study conducting 325 TurboID using Zuc as bait did not find Aub or Piwi in its proximity (Nguyen et al, 2023). 326 This might be due to the difference of drivers (matalfa -Gal4 vs nos -Gal4), stages of 327 germline cells (differentiating cells vs stem -cell like cells), or bait proteins (Zuc vs 328 PIWI proteins) . In addition to Zuc, our data also showed that Nxf3 and Bootlegger 329 (Boot) mediating the export of piRNA precursors are in the proximity of Ago3, but not 330 of Aub (ElMaghraby et al, 2019; Kneuss et al, 2019). The selective proximity supports 331 the model whereby precursor transcripts delivered by Nxf3 and Boot from nucleus to 332 nuage in cytoplasm are targeted by Ago3-piRNA complexes (Wang et al, 2023b). 333 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Hrb27C emerged as a functionally relevant partner of Aub (Figure 2-4). As a 334 member of hnRNP family, Hrb27C regulates the transport, localization, and translation 335 of interacting mRNAs during oogenesis (Yano et al, 2004; Goodrich et al, 2004; Huynh 336 et al, 2004). Both Hrb27C and Aub are intrinsic factors required for GSC maintenance 337 (Ma et al, 2017; Rojas‐Ríos et al, 2017; Rojas-Ríos et al, 2024; Finger et al, 2023). In this 338 study, we provided evidence supporting the involvement of Hrb27C in Aub -mediated 339 TE silencing in GSCLCs (GSC-like cells induced by depletion of differentiation factor, 340 bam) (Figure 4). The cooperation between Hrb27C and Aub in GSCs warrants further 341 detailed investigation. 342 H3K4 methyltransferase Set1 was identified and characterized as a key factor 343 in Piwi-mediated TE silencing in female germline (Figure 2-7). We initially as sumed 344 that, despite the proximity to Piwi, TE silencing by Set1 could be attributable to its 345 general coactivator function (Ardehali et al, 2011; Mohan et al, 2011). Set1 acting as 346 H3K4me3 writer inside COMPASS potentially underlies the expression of germline 347 genes including piRNA pathway components. However, our results argue against this 348 notion but instead shed light on its specialized role in Piwi pathway. First, set1 and piwi 349 but not aub showed similar TE derepression pattern upon depletion from germline cells 350 (Figure 4) . Second, set1 depletion caused severe loss of piRNAs derived from HTT, 351 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint which we found ocurring in panx-deficient ovaries disrupting Piwi pathway (Figure 6). 352 Moreover, w e demonstrated that Set1 can control TEs , HTT members in particular, 353 without its catalytic activity (Figure 5). Last, CUT&Tag revealed the affinity of Set1 to 354 TE sequences, which is associated with piRNA production (Figure 7). 355 How Set1 recognizes TE sequences has remained elusive. Set1 interacts with 356 phosphorylated C -terminal domain ( CTD) of RNA polII in elongation phase, which 357 allows targeted recruitment and H3K4me3 deposition on actively transcribed genes (Ng 358 et al, 2003). In o ur CUT&Tag analysis, H3K4me3 signal s within TE sequences were 359 largely overlap ped with Set1 -binding regions. Although our results did not support a 360 role for Set1 in depositing these TE-associated H3K4me3 (Figure 7D), RNA polII on 361 transcriptionally active TEs could contribute to Set1 recruitment. However, HeT-A 362 showed only mild derepression by downregulation of a COMPASS component, Wdr82 363 that is the orthologue of yeast Swd2 linking Set1 and RNA polII (Figure S5A) (Bae et al, 364 2020). Hence, RNA polII-dependent recruitment of Set1 as part of COMPASS may be 365 insufficient, and additional mechanisms may contribute. The e levated TE binding 366 observed for the catalytically inactive Set1 variant, compared with the wild-type protein, 367 support a possible alternative mode of recruitment (Figure 7B). Given the physical and 368 function links, involvement of Piwi machinery in recruiting Set1 to TEs is a plausible 369 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint possibility. Our proximity proteome did not identify SetDB1/Eggless, the H3K9me3 370 writer responsible for heterochromatin formation at Piwi-piRNA target loci (Rangan et 371 al, 2011; Osumi et al , 2019; Akkouche et al , 2017) . This absence suggests that our 372 dataset may be biased toward a specific stage at which Piwi cooperates with Set1 . 373 Further characterization of additional factors identified by our Piwi proxiome should 374 provide deeper insight into the mode of actions of Set1 in TE silencing. 375 Most piRNAs are derived from defined genomic regions called piRNA c luster 376 loci (Brennecke et al , 2007b) . The heterochromati c structure of these loci requires 377 noncanonical RNA polII recruitment for piRNA precursor transcription, mediated by 378 H3K9me3 reader Rhino and the cofactors (Klattenhoff et al, 2009; Mohn et al, 2014; 379 Andersen et al , 2017) . Recent studies uncovered cluster-specific molecular 380 underpinnings for piRNA precursor transcription. These include DNA-binding protein, 381 Kipferl, and enhancer of zeste, E(z), the writer of H3K27me3 , each required at distinct 382 cluster loci (Akkouche et al, 2025; Baumgartner et al, 2022). In gonadal soma, Traffic 383 jam (Tj), a fly orthologue of large Maf transcription factors , activates soma-restricted 384 cluster, flamenco (Rivera et al, 2025; Alizada et al, 2025). Subtelomeric piRNA cluster 385 loci involve Set1 (this study) and NSL complex (Iyer et al , 2023). Together, these 386 findings indicate that precursor transcription and piRNA production are governed by 387 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint diverse chromatin modifiers and transcriptional regulators . Such diversity may be 388 advantageous, enabling spaciotemporal control of the piRNA repert oire to support 389 multitasks of a common set of PIWI proteins. By localizing to subtelomeric cluster loci, 390 Set1 would exert its role in telomere control with Piwi . Characterization of telomere 391 phenotypes in set1-depleted germline cells will be an important direction for future 392 studies. Beyond piRNA clusters, Set1 also exhibited affinity for several TE families 393 including HMS-Beagle and Max-element, both of which maintains mobility in female 394 germline and transmits new copies to next generations through oocyte targeting (Yang et 395 al, 2023; Wang et al , 2018). The role of Set1 in silencing of such active TEs also 396 represent a future avenue. 397 Fly Set1 can silence TEs forming telomeres regardless of its methyltransferase 398 activity (Figure 5). This finding is reminiscent of molecular features reported for yeast 399 Set1, which was originally identified as a transcriptional repressor in the telomere and 400 silent mating loci (Nislow et al, 1997). Notably, its repressive functions on genes and 401 TEs are often associated with H3K4 methylation-independent mechanisms (Jezek et al, 402 2023; Lee et al, 2018; Lorenz et al, 2012). However, t he m echanisms underlying TE 403 silencing are different between the two organisms, as yeast lacks PIWI clade proteins 404 and does not produce piRNAs. Nonetheless, Set1-dependent antisense non-coding 405 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint RNAs have been characterized as repressors of retrotransposons in yeast (Berretta et al, 406 2008). The role of Set1 in generating antisense , silencing-competent transcripts for 407 genome defense may be conserved across broader contexts. 408 409 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

Materials and methods

410 Fly stocks and culture 411 All stocks and crosses were raised at 25 ˚C on a molasses/yeast medium [5% (w/v) dry 412 yeast, 5% (w/v) corn flower, 2% (w/v) rice bran, 10% (w/v) glucose, 0.7% (w/v) agar, 413 0.2% (v/v) propionic acid, and 0.05% (v/v) p -hydroxy butyl benzoic acid]. Fly stocks 414 used in this study are listed in Table S4. 415 416 Plasmid construction and generation of transgenic flies 417 All the primers used for plasmid construction are listed in Table S 5. To generate 418 UASp-mTurbo-GFP, UASp -mTurbo-GFP-Piwi, UASp -mTurbo-GFP-Aub and 419 UASp-mTurbo-GFP-Ago3, mTurbo fragment was amplified by PCR using 420 attB_miniTurbo_Fw_general and miniTurbo_linker_Rv_general as primers and 421 3xHA-miniTurboNLS_pCDNA3 (addgene #107172) as template. GFP fragment was 422 amplified by PCR using 2 -1vLinker>GFP5_F and Ti876 -GFP-G5-R as primers. Piwi, 423 Aub or Ago3 fragment was amplified by PCR using primer pairs of Linker_to_Piwi and 424 Piwi_to_Vector, Ti893-G5-Aub-CDS-F and Ti894 -Aub-UASp-R1, or Linker_to_Ago3 425 and Ago3 _to_Vector, respectively, with cDNA from yw ovaries as template . PCR 426 fragments were introduced into the XbaI site in pUASp-K10-attB vector (Koch et al, 427 2009) using In -Fusion HD Cloning (Takara). UASp-mTurbo-GFP and 428 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint UASp-mTurbo-GFP-Ago3 were integrated to attP40 , while UASp -mTurbo-GFP-Piwi 429 and UASp-mTurbo-GFP-Aub were integrated to attP2, and attP-3B sites, respectively. 430 431 Proximity-dependent biotin pulldown in ovarian germline cells 432 After eclosion, female s expressing mini(m)Turbo -GFP-PIWI proteins were reared at 433 25˚C for 2 -3 ( non-GSCLC ovaries) or 5 -7 days (GSCLC ovaries) in the modified 434 molasses/yeast medium supplemented with 100 μM biotin (Nacalai). For individual 435 conditions, ~ 300 ovaries from 150 progenies were collected . Ovaries were 436 homogenized in 150 μL of PI -lysis buffer [50 mM t ris-HCl (pH 7.5), 500 mM NaCl, 2 437 mM EDTA, 2 mM dithiothreitol (DTT), 0.4% (w/v) SDS, and cOmplete Protease 438 Inhibitor Cocktail Tablet (Roche)] , using Bioruptor (Diagenode) for 30s (power H) for 439 six times with 30s intervals. Triton X -100 was then added to sa mples at a final 440 concentration of 2% (v/v), and homogenization was further performed for 30s for three 441 times with 30s intervals. After centrifugation (20,000 g, 10 min, 4 ˚C), the supernatant 442 was diluted with equal amount of 50 mM Tris -HCl (pH 7.5) buffer and incubated with 443 pre-equilibrated 15-μL slurry volume of Dynabeads MyOne Streptavidin C1 (Thermo 444 Fisher Scientific), overnight at 4 ˚C with gentle rotation. Next day, beads were washed 445 twice with W1 buffer [50 mM tris -HCl (pH 7.5), 250 mM NaCl, 0.2% (w/v) SDS, 1 446 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint mM EDTA, and 1 mM DTT], twice with W2 buffer [50 mM H EPES-KOH (pH 7.4), 447 500 mM NaCl, 0.1% (w/v) deoxycholate (Nacalai), 1% (v/v) Triton X -100, and 1 mM 448 EDTA], twice with W3 buffer [10 mM tris -HCl (pH 8.0), 250 mM LiCl, 0.5% (w/v) 449 deoxycholate, 0.5% (v/v) NP-40 (Nacalai), and 1 mM EDTA], and four times with W4 450 buffer [50 mM tris -HCl (pH 7.5) and 50 mM NaCl]. The purified proteins bound to 451 beads were stored at -80˚C until analysis. 452 453 Immunoblotting and biotinylated protein detection 454 Protein s amples wer e denatured at 95°C for 5 min in 2× protein loading buffer [4% 455 (w/v) SDS, 200 mM DTT, 0.1% (v/v) bromophenol blue (BPB), and 20% (v/v) 456 glycerol] saturated with biotin, resolved by SDS –polyacrylamide gel electrophoresis 457 (SDS-PAGE) and transferred to 0.2 -μm polyvinylidene difluoride membrane (Wako) 458 using the semi-dry system (Trans-blot Turbo, Bio-Rad). The membrane was blocked in 459 4% (w/v) skim milk (Nacalai) in 1× phosphate -buffered saline (PBS) supplemented 460 with 0.1% (v/v) Tween 20 and further incubated with primary antibodies: rabbit 461 anti-GFP (1:1000, Clonetech), mouse anti -Piwi (1:100) (Saito et al, 2006), guinea pig 462 anti-Aub (1:1000) and rat anti -Ago3 (1:200 ) (Lim et al, 2022), mouse anti -Tubulin 463 (1:3000, Santa Cruz) . Secondary antibodies were anti-guinea pig 464 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint immunoglobulins-HRP (1:1000; Dako), anti -rabbit immunoglobulin G (IgG) –HRP 465 (1:3000; Bio-Rad), anti-mouse IgG-HRP (1:3000; Bio -Rad), and anti-rat IgG (1:1000; 466 DAKO). HRP-conjugated Streptavidin (Clonetech) was used for detecting biotinylated 467 Proteins. The chemiluminescent signals generated with Chemi-Lumi One (Nacalai) 468 were detected by Chemidoc MP Imaging system (Bio-Rad). The images were processed 469 with Fiji. 470 471 Proteomic Sample Preparation and Label -Free Quantification using Orbitrap 472 Eclipse Tribrid Mass Spectrometer 473 Biotinylated proteins bound to magnetic beads were reduced with 1 mM dithiothreitol 474 (DTT) and alkylated with 5.5 mM iodoacetamide. Proteins were digested overnight at 475 37°C with Trypsin Gold, Mass Spectrometry Grade (Promega, Madison, WI). The 476 resulting peptides were captured and desalted using ZipTip C18 (Millipore, Billerica, 477 MA). Shotgun proteomic analysis was performed using an Orbitrap Eclipse Tribrid 478 mass spectrometer equipped with a FAIMS Pro interface (Thermo Fisher Sci entific, 479 Waltham, MA), which was coupled to a Vanquish Neo UHPLC system (Thermo Fisher 480 Scientific, Waltham, MA). Peptides were separated on a reversed -phase column using a 481 linear gradient of 2 –24% acetonitrile in 0.1% formic acid at a flow rate of 300 nL/m in. 482 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Full MS scans were acquired in the Orbitrap at a resolution of 120,000, followed by 483 MS/MS scans in the ion trap using higher -energy collisional dissociation (HCD) with a 484 normalized collision energy of 35% and a maximum injection time of 10 ms. Label-free 485 quantification (LFQ) was performed using Proteome Discoverer version 2.5 (Thermo 486 Fisher Scientific, Waltham, MA) with the Sequest HT search engine. MS/MS spectra 487 were searched against the UniProt Drosophila melanogaster reference proteome 488 (UP000000803). Protein identifications were filtered at a false discovery rate (FDR) of 489 <1%. Peptide intensities were extracted using the Minora Feature Detector node and 490 used for protein quantification. GO analysis was conducted using MetaScape. (Zhou et 491 al, 2019) 492 493 Croslinking and immunoprecipitation 494 Ovaries were dissected from 50 females expressing GFP, GFP -Set1WT, or 495 GFP-Set1E1613K in phosphate -buffered saline (PBS) and immediately fixed with 0.1% 496 (w/v) paraformaldehyde (Electron Microscopy Sciences) for 10 min at room 497 temperature (RT). Fixation was quenched with 125 mM glycine. Samples were then 498 snap-frozen in liquid nitrogen and stored at -80°C until use . Ovary s amples were 499 homogenized with a pestle in lysis buffer [20 mM Tris -HCl (pH 7.5), 135 mM NaCl , 500 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint 1.5 mM MgCl 2, 10% (v/v) glycerol, 0.2% (v/v) TritonX-100] supplemented with 501 cOmplete Protease Inhibitor Cocktail (Roche). Pre-equilibrated a nti-GFP 502 antibody-conjugated magnetic beads (MBL, D153-11) were added to ovarian lysate and 503 incubated for 1h at 4°C. After three times of washing with lysis buffer, bead-bound 504 proteins were extracted by boiling in SDS-PAGE loading buffer [50 mM Tris–HCl (pH 505 6.8), 2% (w/v) SDS, 100 mM 1,4-dithiothreitol (DTT), 10% (v/v) glycerol, 0.05%(w/v) 506 bromophenol blue]. 507 508 Histochemistry and image acquisition 509 Ovaries were dissected from adult females in 1x PBS buffer supplemented with 0.4% 510 (w/v) bovine serum albumin (BSA; Wako) and fixed in 5.3% (v/v) paraformaldehyde 511 (Nacalai) in 0.67x PBS buffer for 10 min. To observe DNA, ovaries were incubated 512 with 1 μM 40,6-diamidino-2-phenylindole (DAPI) in PBX buffer [1x PBS containing 513 0.2% (v/v) Triton X -100]. For immunostaining, fixed ovaries were washed with PBX 514 and incubated with Image -iT™ FX Signal Enhancer (Invitrogen) for 30 min and PBX 515 containing 2% (w/v) BSA for 30 min for blocking. The primary antibody incubation 516 was performed overnight at 4 ˚C, and ovaries were washed with PBX at RT for 1h. The 517 secondary antibody incubation was then performed at RT for 2 h, and then ovaries were 518 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint washed with PBX at RT for 1 h. The antibodies used for immunostaining were rabbit 519 anti-HeT-A Gag (1:2000),(Lin et al, 2023) guinea pig anti-Krimp (1:2000),(Lim & Kai, 520 2007) mouse anti-Piwi (1:50, Santa Cruz, sc-390946), guinea pig anti-Aub (1:500),(Lim 521 et al, 2022) mouse anti-GFP (1:50, Invitrogen, A11120) . Secondary antibodies were as 522 follows: Alexa Fluor 488 - and 555 -conjugated goat antibodies at 1:500 (Molecular 523 Probes) and CF®633 goat antibodies at 1:500 (Biotium) . Antibodies were diluted in 524 0.4% (w/v) BSA containing PBX as the working solution. Images were taken by ZEISS 525 LSM 900 using C-Apochromat 40x/1.20 W Korr objective lens and processed with 526 ZEISS ZEN 3.0 and Fiji. 527 528 Reverse transcription and qPCR analysis 529 Total RNA was extracted from ovaries using TRIzol™ LS (Invitrogen) following the 530 manufacturer’s instructions. Using DNase I (NEB) –treated RNA, cDNA was 531 synthesized with 2.5 μM oligo(dT) adaptor using SuperS cript III reverse transcriptase 532 (Thermo Fisher Scientific). Quantitative reverse transcription PCR (qPCR) reaction was 533 performed using SYBR™ Green qPCR Master Mix (Thermo Fisher Scientific) and 534 gene-specific primers (Table S5) in QuantStudio 5 Real-Time PCR system (ABI). 535 536 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Transcriptome analysis 537 Total RNA was extracted from 100 ovaries (non-GSCLC ovaries or GSCLC ovaries) 538 using TRIzol™ LS (Invitrogen). To obtain GSCLC ovaries expressing shRNA for aub, 539 hrb27C, set1 or gfp, shRNA lines were combined ( bam shRNA; aub shRNA, bam 540 shRNA; hrb27C shRNA, bam shRNA; set1 shRNA, bam shRNA; gfp shRNA) and 541 crossed with NN. DNase I-treated RNA was sent to Rhelixa (Japan) for library 542 construction and deep -sequencing. Polyadenylated RNA selection was performed with 543 NEBNext Poly(A) mRNA Magnetic Isolation Module (NEB). Libraries were 544 constructed with the NEBNext Ultra II Directional RNA Library Prep Kit (NEB) and 545 sequenced by using NovaSeq 6000 (Illumina). Trimming was performed by Cutadapt 546 (v1.18, -j 12 -a AGATCGGAAGAGCA CACGTCTGAACTCCAGTCA -A 547 AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT, -m 20). Trimmed paired-end 548 reads were mapped to the genome (BDGP6.46, dm6) using STAR ( v2.7.10b, 549 --outFilterMultimapNmax 100 --outSAMmultNmax 1 --outMultimapperOrder Random). 550 Read counting was performed with featureCounts (-M -p --countReadPairs -t exon -g 551 gene_id) using Drosophila_melanogaster.BDGP6.46.112.gtf (ensembl.org) . Using the 552 sum of RPK ( read per kilobase) for individual genes and TEs, TPM ( transcript per 553 million) was calculated. Trimmed paired -end reads were also mapped to consensus 554 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint sequences of individual TE families (transposon_sequence_set.embl.v.9.41, flybase ) 555 with STAR using the same options . Mappers on individual TE families were counted 556 using pileup.sh (BBMap). TPM were calculated using the sum of RPK for genes and 557 TEs obtained in genome mapping . Published small RNA data ( GSE103582 and 558 GSE71374) were processed in the same way. 559 560 Small RNA sequencing and analysis 561 Biological duplicate dataset for Set1 -present condition was provided by gfp-GLKD 562 ovaries (control RNAi condition), and set1-GLKD ovaries expressing GFP -Set1WT 563 (rescue condition). Dataset for Set1 -depleted condition was provided by set1-GLKD 564 ovaries, and th e siblings of rescue conditions (CyO). T otal RNA was extracted using 565 TRIzol™ LS (Invitrogen) from 60 to 100 ovaries of ~3 days old adult females. After the 566 addition of chloroform and centrifugation (12,000g 15min, 4˚C), short (< ~200 nt) RNA 567 in the aqueous phase was purified using RNA Clean & Concentrator 5 (R1015 Zymo ). 568 To deplete 2S ribosomal RNA (rRNA) , 2 pmol of complementary oligo DNA 569 (5’-AGTCTTACAACCCTCAACCATATGTAGTCCAAGCAGCACT-3’) was added 570 per 1 μg of RNA. The mixture was heated (95 ˚C, 2min), gradually cooled down to form 571 DNA/RNA hybrid, and then treated with RNase H (NEB) at 37 ˚C for 30min. RNase H 572 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint was inactivated by heating at 65 ˚C for 20 min. The 2S rRNA -depleted RNA mixture 573 was loaded onto 8 M urea -polyacrylamide gel (12%) and size-separated in parallel with 574 visible RNA ladder (Dynamarker DM253, BioDynamics Laboratory Inc.) in 0.5x 575 tris-borate EDTA buffer. Gel within the range from 20 to 30nt was excised and passed 576 through a small hole opened by needle (23Gx1, TERUMO) at the bottom of 0.5 mL 577 DNA LoBind tubes (Eppendorf) by centrifugation (15,000g). The fine particles were 578 recovered in 2.0 mL DNA LoBind tubes, and RNAs were eluted overnight at 4 ˚C with 579 gentle rotation in the presence of 300 mM NaOAc (pH 5.2). After removing the 580 particles using cell ulose 0.22 µm membrane filter Spin -X (Costar, 8160), RNA was 581 precipitated in the presence of 80% (v/v) ethanol and glycogen (40 μg/mL) (Nacalai) for 582 overnight at -20˚C. After centrifugation (20,000g, 20min, 4˚C), RNA pellet rinsed twice 583 with 80% (v/v) ethanol was resuspended in RNase-free water. Small RNA libraries were 584 prepared using NEBNext® Multiplex Small RNA Library Prep Set for Illumina (NEB, 585 E7300S) following manufacture’s procedure. After 15 cycles of PCR amplification, the 586 libraries were purifi ed using MagMAX™ Pure Bind Beads (Applied Biosystems), 587 size-separated in 3% (w/v) low melting agarose (HydraGene) in the presence of SYBR 588 gold (Thermo Fisher). ~150 -bp library fragments containing 20~30 -bp inserts were 589 purified using QIAquick gel extractio n kit (Qiagen). Libraries were sequenced by 590 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint RIMD NGS core facility ( The University of Osaka ) using the NovaSeqX Plus platform 591 (Illumina). Adaptor sequences (AGATCGGAAGAGC) were removed by Trim Galore 592 (v0.6.2, --length 15) with quality filter cut off (Phred ≥20). Adaptor-trimmed reads were 593 mapped to rRNA, tRNA, snoRNA, and snRNA sequences using bowtie (v1.3.1), and 594 the unmappers were collected (samtools view -f 4). From the unmappers, 18~29 -nt 595 reads were selected as small RNAs using seqkit (v2.4.0 ) and mapped to miRNA 596 precursors using bowtie ( -v 0) to collect “miRNA” populations. The size -selected 597 (23~29-nt) unmappers were considered as “piRNA”. These piRNA fragments were 598 mapped to TE consensus sequences (transposon_sequence_set.embl.v.9.41, flybase ) 599 using bowtie allowing up to 3 mismatches and taking one from multi mappers (-v 3 -M 600 1 --best --strata). Fw and Rv mappers were counted for individual TEs using pileup.sh. 601 Read counts were normalized using the sum of 18~29 -nt reads and reads per million 602 (RPM) was obtained. For visualization, Fw and Rv mappers were separated using 603 splitsam.sh. Bedgraph was generated using bedtools (v2.26.0, genomecov -bga -split), 604 values normalized using scale factors given by the sum of 18~29-nt reads. Mean values 605 of biological replicates were obtained using bedtools unionbedg. Track view was 606 generated in IGV (v2.16.0). piRNA overlap scores (z) were measured using signature.py 607 (Antoniewski, 2014) . piRNA fragments were also mapped to Drosophila genome 608 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint (BDGP6.46, dm6) using bowtie without allowing mismatch (-v 0 -M 1 --best --strata). 609 Bedgraph generation, normalization, averaging, and visualization follow the processes 610 in the analysis using TE consensus sequences. Published small RNA data (GSE71374) 611 were processed in the same way. 612 613 CUT&Tag sequencing and analysis 614 We referred (Anderson et al, 2023) for performing whole ovary C UT&Tag. Twenty 615 ovaries from 10 adult females at ~ 3 days old were dissected in PBS buffer 616 supplemented with cOmplete Protease Inhibitor (Roche). Ovaries were permeabilized in 617 PBX buffer (PBX containing 0.2%(v/v) Triton-X100) for 30 min at RT. After removing 618 PBX, ovaries were washed once with Wash+ buffer ( 20mM HEPES [pH 7.4], 150 mM 619 NaCl, 0.5 mM spermidine[Nacalai], 2 mM EDTA, 1%[w/v] BSA, cOmplete Protease 620 Inhibitor) and incubated overnight at 4 ˚C with primary antibodies diluted by 1:50 with 621 Wash+ buffer. Anti-H3K4me3 (#61979, mouse, Active Motif), or anti-GFP (#598, rabbit, 622 MBL) antibodies were used . Next day, ovaries were washed three times with Wash+ 623 buffer, and incubated with secondary antibodies diluted by 1:50 with Wash+ buffer. 624 Anti-mouse IgG (#52885L, Cell Signalling), or anti -rabbit IgG (#35401S, Cell 625 Signalling) were used. After removing the secondary antibody solution, ovaries were 626 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint washed three times with 300Wash+ buffer (20 mM HEPES [pH 7.4], 300 mM NaCl, 0.5 627 mM spermidine, cOmplete Protease Inhibitor ) and incubated for 1h at RT with loaded 628 pAG-Tn5 ( #79561S, Cell Signalling) diluted by 1:25 with 300Wash+ buffer. After 629 removing the Tn5 solution, ovaries were washed three times with 300Wash+ buffer. 630 Tagmentation was performed by incubating ovaries in 300Wash+ buffer supplemented 631 with 10 mM MgCl2 for 1h at 37˚C. After removing the supernatant, ovaries were treated 632 with collagenase (2 mg/mL, Sigma C9407) in HEPESCA buffer (50 mM HEPES [pH 633 7.4], 360 µM CaCl 2) for 1h at 37 ˚C. Then, the reaction mixture was added with SDS, 634 EDTA, and proteinase K (Nacalai) (final concentration 0.2%[w/v], 16 mM, and 0.3 635 mg/mL, respectively), and further incubated for 1h at 58 ˚C. After the reaction, DNA 636 was purified by using Quick -DNA Microprep kit (D30 20, ZYMO). Using the purified 637 DNA as template, libraries were synthesized by PCR (72 ˚C 5min, 98 ˚C 30s, 14 cycles 638 of [98 ˚C 10s, 63 ˚C 15s], 65 ˚C 5min) using Ultra II Q5 Master Mix (NEB) and 639 dual-indexed primers (Table S 5). Generated libraries were purified using x1.3 volume 640 of AmpureXP (beckman). The concentration of libraries was measured using Qubit 641 dsDNA HS assay kit (Thermo). Libraries were sequenced with NovaSeqX (Illumina) 642 and 150+ 150bp paired -end reads were obtaine d (H3K4me3 libraries sequenced by 643 RIMD in The University of Osaka , GFP libraries sequenced by Rhelixa ). Adaptor 644 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint trimming was performed by Cutadapt ( -a 645 CTGTCTCTTATACACATCTCCGAGCCCACGAGAC -A 646 CTGTCTCTTATACACATCTGACGCTGCCGACGA, -m 20). Trimmed reads were 647 mapped to D. melanogaster genome (BDGP6.46, dm6) using Bowtie 2 -p 12 --no-unal 648 --end-to-end --very-sensitive --no-mixed --no-discordant -q --phred33 -I 10 -X 700 ). 649 Read counts were normalized using deeptools (bamCoverage --normalizeUsing CPM 650 -of bigwig --binSize=1). Average bigwig was generated from biological duplicate 651 bigwig data using wiggletools and ucsc-wigtobigwig (wigToBigWig). Track view was 652 generated in IGV (v2.16.0). Counting reads on gene TSS±500bp (H3K4me3) or exon 653 (GFP) was performed using featureCounts (v2. 1.1, -M -p --countReadPairs -O) with 654 Drosophila_melanogaster.BDGP6.46.112.gtf (Ensembl). RPM (read per million) was 655 calculated using the number of aligned reads in Bowtie2 mapping . Trimmed reads were 656 also mapped to TE consensus sequences (transposon_sequence_set.embl.v.9.41, 657 flybase). Bigwig was generated using deeptools (bamCoverage), normalizing the values 658 with scale factors given by the sum of aligned reads on the genome . Mean values of 659 biological replicates were obtained using wiggletools and wigToBigWig 660 (ucsc-wigtobigwig). Track view was generated in IGV (v2.16.0). 661 662 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

Acknowledgement

663 We thank Dr. Prof. Xin Chen for providing fly stocks. We thank Chisato Yanagisawa, 664 Minako Moriguchi for their helps with maintenance of fly lines. We are also grateful to 665 Bloomington Drosophila Stock Center for providing fly stocks. We also thank all 666 members in our lab for their insightful discussion and suggestions. This study is 667 supported by JSPS Grant -in-Aid for Scientific Research C (22K06081) for TI, Life 668 Science Foundation of Japan (J231503018) for TI, Takeda Foundation (J241503007) for 669 TI, Daiichi Sankyo Foundation of Life Science (J241503010) for TI, Naito Foundation 670 (J241503011) for TI, JSPS Grant -in-Aid for Scientific Research B (21H02401) for TK, 671 JSPS Grant -in-Aid for Transformative Research Areas A (21H05275) for TK, 672 Grant-in-Aid for JSPS Fellows (23KJ1521) for WI, and Grant from Open and 673 Transdisciplinary Research Initiatives (OTRI) RNA Frontier Science Division for TI. 674 The founders had no role in study design, data collection and analysis, decision to 675 publish, or preparation of the manuscript. 676 677 Author contributions 678 Conceptualization: TI and TK 679 Methodology: TI, TK, and WI 680 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Investigation: TI, WI, HK-H and MO 681 Supervisions: TI and TK 682 Writing-original draft: WI and TI 683 Writing-review and editing: TI, WI and TK 684 685 Competing interest statement 686 The authors declare no competing interests. 687 688 Data availability 689 Newly generated transcriptome data are available with BioProject accession ID: 690 PRJNA1345876; Drosophila ovary polyA transcriptome sequencing, PRJNA1345540; 691 Drosophila ovary small RNA sequencing, PRJNA1345531; Drosophila ovary 692 CUT&Tag sequencing 693 694 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Figure legends 695 Figure 1. PIWI proximity biotin labelling in germline stem cell-like cells (GSCLCs) 696 (A) Experimental design for PIWI proximity biotin labelling in female germline cells . 697 mTurbo-GFP-Piwi, -Aub, or -Ago3 is expressed in germline cells using UASp/Gal4 698 system with NGT40; nos-Gal4 (NN) drive r. mTurbo-GFP serves as negative contr ol. 699 Depletion of bam using short hairpin (sh)RNA enables TurboID in GSCLC s. (B) 700 Subcellular localization of mTurbo-GFP fusion proteins in GSCLC ovary. Upper panels; 701 fluorescent signals from GFP (green). Lower panels; immunostaining signals from 702 endogenous Piwi, Aub, or Ago3 (green). DAPI (blue) for nuclei. Scale bar = 50 μm or 5 703 μm in the insets. (C) Blotting images for p roteins extracted from GSCLC ovaries 704 expressing mTurbo fusion proteins (Input), and the streptavidin-bound fraction 705 (Pulldown). Upper panels; immunoblotting with anti-GFP antibody. Lower panels; 706 blotting with streptavidin-HRP. White and black arrowheads indicate mTurbo -GFP and 707 mTurbo-GFP-PIWI proteins, respectively. 708 709 Figure 2. Proximity proteome of PIWI proteins in germline cells 710 (A) Network diagram summarizing proximity factors of individual PIWI proteins (Piwi, 711 Aub, and Ago3) identified by mass spectrometry ( nanoLC-MS/MS) followed by 712 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint label-free quantification. Yellow nodes: factors known to be involved in piRNA pathway. 713 Size of c ircle reflects abundance ratio (mTurbo -GFP-PIWI/mTurbo-GFP) based on 714 biological duplicate TurboID data. (B) Venn diagram show s the number of proximity 715 proteins shared by or unique to individual PIWI members. (C) GO enrichment analys is 716 of PIWI proximity factors. Hierarchical clustering and heatmap show significantly 717 enriched GO terms. Color intensity represents the significance of enrichment ( -log10 718 p-value). 719 720 Figure 3. Germline knockdown (GLKD) screening of PIWI proximity factors 721 (A) Immunofluorescence signals from HeT-A Gag protein (top panels) in egg chambers 722 of control (y w) or GLKD of indicated genes. DNA stained with DAPI (bottom panels). 723 Arrowheads; Gag proteins accumulating around oocyte. Scale bars = 20 μm. (B) 724 Immunofluorescence signals of Krimp (green) in the egg chambers of control ( y w) or 725 of indicated GLKD conditions . DNA stained with DAPI (magenta). Fluorescence 726 signals along yellow arro ws are plotted in the bottom panels (signals normalized by 727 setting 1 for the highest value of DAPI). Scale bars = 10 μm. (C) Immunofluorescence 728 signals of Aub or Piwi (green) in the egg chambers of indicated conditions. 729 730 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Figure 4. Germline knockdown (GLKD) effect on ovary transcriptome 731 (A-B) Transcriptome analysis and comparison in non-GSCLC ovaries (A) or in GSCLC 732 ovaries (B). Transcript levels (TPM) of genes and TE insertions in set1-, hrb27C-, aub-, 733 or piwi-GLKD condition were compared with those in gfp-GLKD control. Dots in the 734 volcano plots; genes (gray) and TEs (gray with red outline) included in the differential 735 expression analysis (EdgeR). ( C) Scatter plot c omparing the effect of GLKD on the 736 transcript levels of i ndividual TE families (consensus sequence mapping). Left : 737 set1-GLKD compared with aub-GLKD. Right: set1-GLKD compared with piwi-GLKD. 738 Fold change from control gfp-GLKD (TPM/TPM) were shown. r = pearson correlation 739 coefficient. (D) Transcript levels (TPM) for HeT-A and TAHRE in different 740 transcriptome datasets. GSE103582 including aub and piwi knockouts, and GSE71374 741 including panx knockout were analyzed together(Yu et al, 2015a; Teixeira et al, 2017b). 742 Error bars indicate ±standard deviation (SD) of biological duplicate data. 743 744 Figure 5. Effect of set1 transgene expression on H3K4me3 modification and TE 745 expression in set1-GLKD ovaries 746 (A) Schemetic of GFP-Set1 transgenes (Vidaurre et al, 2024). Nucleotide substitution to 747 replace 1613 th glutamate ( E) of Set1 polypeptide to lysine ( K) abolishes catalytic 748 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint activity. Nonsense substitutions render resistance to siRNA targeting. Both placed 749 downstream of UASp and expressed by germline driver NN. (B) GFP fluorescence (top) 750 and immunofluorescence staining for H3K4me3 and Vas (bottom) in egg chambers. Vas 751 marks germline cells. Scale bar = 10μm. (C) Transcript levels (TPM) of TE families in 752 gfp-GLKD, set1-GLKD, and transgene expression conditions ( GFP-Set1WT or 753 GFP-Set1E1613K expressed in set1-GLKD). Error bars indicate ±SD from biological 754 duplicates. (D) CUT&Tag analysis of H3K4me3 in ovaries of indicated conditions. 755 Left: track view at the ago3 and aub gene loci. Bar graph on the right shows H3K4me3 756 signals (mean CPM) around TSS (±500bp) of ago3, aub and piwi . Error bars indicate 757 ±SD from biological duplicates. 758 759 Figure 6. Co-precipitation of Piwi with Set1, and e ffect of set1-GLKD on piRNA 760 accumulation in ovaries 761 (A) Immunoprecipitation of GFP or GFP -Set1 proteins from ovaries using anti -GFP 762 antibody (GFP-IP) after crosslinking treatment. Tubulin serves as loading control. (B) 763 Proportion (%) of miRNA and piRNA in ovarian small RNAs. miRNA: reads mapping 764 to miRNA precursor s. piRNA: 23~29-nt reads (excluding rRNA, tRNA, snoRNA, 765 snRNA, and miRNA mappers), or 23~29-nt reads mapping to TE consensus sequences. 766 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Control: mean values given by single replicate of gfp-GLKD and GFP-Set1WT-rescue 767 conditions. set1-GLKD: mean values given by biological duplicate of set1-GLKD 768 condition. Error bars indicate ±SD of biological duplicates. (C) Fold changes (FC) of 769 piRNA levels (RPM) in set1-GLKD compared to control condition. piRNA : 23~29-nt 770 mappers on individual TE families . TEs having few mappers ( RPM <5) are excluded. 771 Sense or antisense piRNA : mappers having TE sense strand or the reverse 772 complementary sequences. (D) Track view for 23~29-nt mappers on consensus 773 sequences of telomeric TE families. (E) Fold change (FC) of the levels (RPM) of 774 piRNA mapping to telomeric TE families . Small RNA libraries for panx KO and the 775 heterozygous control conditions (GSE71374) are included (Yu et al, 2015a). 776 777 Figure 7. Genome-wide Set1-binding analysis, and a model for Set1 function in TE 778 silencing 779 (A) Venn diagram for genes and TEs having GFP-Set1WT- or GFP-Set1E1613K-binding 780 peaks. Peak calling by MACS2 (p<0.01). (B) Scatter plot comparing fold differences of 781 CUT&Tag signals (GFP-Set1E1613K - GFP GFP -Set1WT - GFP) on genes and TEs 782 identified by peak calling . r = pearson correlation coefficient. (C, D) Track view for 783 CUT&Tag signals of GFP proteins (green) and H3K4me3 (black). aub (C) and HeT-A 784 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint (D) represent genes or TEs, respectively. TSS: transcription starting site. (E) Track view 785 for CUT&Tag signals of GFP proteins (green) and piRNA levels on subtelomeric 786 piRNA cluster 3. piRNAs having genomic plus strand sequence (Blue) or minus strand 787 sequences (Red) are separately shown . (F) A model for Set1 -mediated production of 788 antisense-biased piRNAs on subtelomeric cluster loci and on individual TE insertions 789 outside of cluster loci. Subtelomere regions maintain fragments of 3’UTR derived from 790 HTT, which are bound by Set1 to initiate antisense transcription. Catalytic activity of 791 Set1 is dispensable. Similar mechanism is applied for antisense piRNA production from 792 potentially mobile TE insertions outside of piRNA cluster loci . Antisense piRNAs are 793 loaded onto Piwi to reinforce transcriptional silencing. 794 795 Figure S1. Optimization of germline PIWI proximity labelling and purification 796 (A) Comparison of biotinylation efficiency and biotinylated protein purification 797 between non-GSCLC and GSCLC ovaries. Co omassie brilliant blue (CBB) staining 798 serving as protein loading control. Ovaries expressing mTurbo -GFP or 799 mTurbo-GFP-Aub were analyzed with negative control (without mTurbo protein 800 expression). Asterisks : non-specific signals. (B) Schematic illustration f or germline 801 differentiation and the function of Bam. Left; the germarium in wild -type, non-GSCLC 802 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint ovaries. A germline stem cell (GSC, blue) divides to produce a daughter cell (red) which 803 expresses Bam and starts the differentiation. Two-to-three GSCs are normally 804 maintained in a germarium. Right: by bam-GLKD, daughter cells cannot differentiate 805 and thus accumulate as GSCLCs. (C) A pair of wild -type ovaries (y w, left) and that of 806 bam-GLKD ovaries (right). Scale bar = 1 mm. (D) Immunoblotting images com paring 807 the expression level of mTurbo -GFP-tagged Piwi or Aub with that of endogenous Piwi 808 or Aub in GSCLC ovaries. 809 810 Figure S2. Quality assessment of PIWI proximity proteome data 811 V olcano plots for the proximity factors of Piwi, Aub, or Ago3 in individual biological 812 replicates. Red dots highlight the factors showing significant enrichment (abundance 813 ratio > 1, adjusted p < 0.05). Statistics relies on background-based t-test. 814 815 Figure S3. Germline knockdown (GLKD) efficiency and the effect on protein 816 subcellular localization 817 (A-C) Plotting the fluorescence signals from egg chambers of indicated genotypes, 818 related to Figure 3B. (A) Krimp (green) and DAPI (magenta) , (B) Aub (green) and 819 DAPI (magenta), (C) Piwi (green) and DAPI (magenta) . (D) Transcript levels of aub, 820 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint hrb27C and set1 in individual GLKD conditions (ΔΔCt method using rp49 as reference). 821 Error bars indicate standard deviation (SD) from three biological replicates. p values 822 from unpaired Student's t-test. 823 824 Figure S4. GLKD effect on ovary transcriptome, and somatic KD effect on TEs 825 (A-B) Transcriptome analysis and comparison in non-GSCLC ovaries (A) or in GSCLC 826 ovaries (B). Transcript levels (TPM) of genes and TE insertions in set1-, hrb27C-, aub-, 827 or piwi-GLKD conditions were compared with those in the control gfp-GLKD. Dots in 828 the MA plots; genes (gray) and TEs (gray with red outline) included in the differential 829 expression analysis (EdgeR). (C) Transcript levels (TPM) of i ndividual TE families in 830 gfp-GLKD (consensus sequence mapping) compared between non-GSCLC and GSCLC 831 ovaries. (D) Scatter plot c omparing the effect of GLKD on the transcript levels of 832 individual TE families (consensus sequence mapping). Left : hrb27C-GLKD compared 833 with aub-GLKD. Right : hrb27C-GLKD compared with piwi-GLKD. F old change s 834 (TPM/TPM) from control gfp-GLKD are shown. r = pearson correlation coefficient. (E) 835 Transcript levels (TPM) for TART-A1, TART-B, and TART-C, in different transcriptome 836 datasets. GSE103582 including aub and piwi knockout conditions, and GSE71374 837 including panx knockout condition were analyzed. Error bars indicate ± SD of 838 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint biological duplicates. (F) RT-qPCR measurement of TE transcript levels in ovaries upon 839 somatic KD of gfp, piwi or set1 using tj-Gal4 driver. ΔΔCt method using rp49 as 840 reference. Error bars indicate ±SD from three biological replicates. 841 842 Figure S5. Effect of COMPASS subunit GLKD on HeT-A expression, and effect of 843 set1-GLKD on the expression of piRNA pathway factors 844 (A) RT-qPCR measurement of HeT-A transcript levels in COMPASS subunit GLKD 845 ovaries. ΔΔCt method using rp49 as a reference. Error bars indicate ±SD from three 846 biological replicates. p value from unpaired Student's t -test is indicated. Asterisk; p < 847 0.05. (B) Bar graph shows the transcript levels (TPM) of piRNA pathway components 848 in gfp-GLKD and set1-GLKD ovaries. Error bar s indicate ± SD of biological 849 duplicates. 850 851 Figure S6. Immunoprecipitation of GFP-Set1 without crosslinking, and e ffect of 852 set1-GLKD on ping-pong signature of piRNAs 853 (A) Immunoprecipitation of GFP or GFP -Set1 proteins from ovaries using anti -GFP 854 antibody (GFP -IP) without crosslinking treatment. Tubulin serves as loading control. 855 (B) Ping-pong signature analysis on piRNAs mapping to telomeric TE families . Z10 856 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint scores (reflecting 10-nt overlap frequenc y within piRNA groups) were highlighted . 857 Error bars indicate ±SD of biological duplicates. 858 859 Figure S7. Genome-wide Set1-binding analysis 860 (A) Venn diagram shows overlap of genes and TEs between replicate 1 and 2 of 861 GFP-Set1WT CUT&Tag (peak calling using GFP control) . (B) Venn diagram shows 862 overlap of genes and TEs between replicate 1 and 2 of GFP -Set1E1613K CUT&Tag (peak 863 calling using GFP control) . (C) Track view for CUT&Tag signals of GFP proteins 864 (green) and H3K4me3 (black) on TAHRE (left) and TART-C (right). (D) Track view for 865 CUT&Tag signals of GFP proteins (green) and H3K4me3 (black) on HMS-Beagle (left) 866 and Max-element (right). (E) Alignment of HeT-A{}6278 to HeT-A consensus sequence. 867 (F) Track view for CUT&Tag signals of GFP proteins (green) and piRNA levels on 868 42AB locus. Blue and Red indicate piRNAs containing genomic plus and minus strand 869 sequences, respectively. 870 871 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

References

872 Ables ET, Hwang GH, Finger DS, Hinnant TD & Drummond-Barbosa D (2016) A Genetic Mosaic 873 Screen Reveals Ecdysone-Responsive Genes Regulating Drosophila Oogenesis. G3 874 Genes|Genomes|Genetics 6: 2629–2642 875 Akkouche A, Kneuss E, Bornelöv S, Renaud Y, Eastwood EL, van Lopik J, Gueguen N, Jiang 876 M, Creixell P, Maupetit-Mehouas S, et al (2025) Binding of heterochromatin 877 protein Rhino to a subset of piRNA clusters depends on a combination of two histone 878 marks. Nat Struct Mol Biol 32: 1517–1527 879 Akkouche A, Mugat B, Barckmann B, Varela-Chavez C, Li B, Raffel R, Pélisson A & 880 Chambeyron S (2017) Piwi Is Required during Drosophila Embryogenesis to License 881 Dual-Strand piRNA Clusters for Transposon Repression in Adult Ovaries. Mol Cell 882 66: 411-419.e4 883 Alizada A, Martins A, Mouniée N, Rodriguez Suarez J V., Bertin B, Gueguen N, Mirouse 884 V, Papameletiou A-M, Rivera AJ, Lau NC, et al (2025) The transcription factor 885 Traffic jam orchestrates the somatic piRNA pathway in Drosophila ovaries. Cell 886 Rep 44: 115453 887 Anand A & Kai T (2012) The tudor domain protein Kumo is required to assemble the nuage 888 and to generate germline piRNAs in Drosophila. EMBO J 31: 870–882 889 Andersen PR, Tirian L, Vunjak M & Brennecke J (2017) A heterochromatin-dependent 890 transcription machinery drives piRNA expression. Nature 549: 54–59 891 Anderson JT, Henikoff S & Ahmad K (2023) Chromosome-specific maturation of the 892 epigenome in the Drosophila male germline. Elife 12 893 Antoniewski C (2014) Computing siRNA and piRNA Overlap Signatures. In Methods in 894 Molecular Biology pp 135–146. Humana Press Inc. 895 Ardehali MB, Mei A, Zobeck KL, Caron M, Lis JT & Kusch T (2011) Drosophila Set1 is 896 the major histone H3 lysine 4 trimethyltransferase with role in transcription. 897 EMBO J 30: 2817–2828 898 Ariura M, Solberg T, Ishizu H, Takahashi H, Carninci P, Siomi H & Iwasaki YW (2024) 899 Drosophila Piwi distinguishes transposons from mRNAs by piRNA complementarity 900 and abundance. Cell Rep 43: 115020 901 Bae HJ, Dubarry M, Jeon J, Soares LM, Dargemont C, Kim J, Geli V & Buratowski S (2020) 902 The Set1 N-terminal domain and Swd2 interact with RNA polymerase II CTD to recruit 903 COMPASS. Nat Commun 11: 2181 904 Batki J, Schnabl J, Wang J, Handler D, Andreev VI, Stieger CE, Novatchkova M, 905 Lampersberger L, Kauneckaite K, Xie W, et al (2019) The nascent RNA binding 906 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint complex SFiNX licenses piRNA-guided heterochromatin formation. Nat Struct Mol 907 Biol 26: 720–731 908 Baumgartner L, Handler D, Platzer SW, Yu C, Duchek P & Brennecke J (2022) The Drosophila 909 ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters. Elife 11 910 Berretta J, Pinskaya M & Morillon A (2008) A cryptic unstable transcript mediates 911 transcriptional trans -silencing of the Ty1 retrotransposon in S. cerevisiae. 912 Genes Dev 22: 615–626 913 Branon TC, Bosch JA, Sanchez AD, Udeshi ND, Svinkina T, Carr SA, Feldman JL, Perrimon 914 N & Ting AY (2018) Efficient proximity labeling in living cells and organisms 915 with TurboID. Nat Biotechnol 36: 880–887 916 Brennecke J, Aravin AA, Stark A, Dus M, Kellis M, Sachidanandam R & Hannon GJ (2007a) 917 Discrete Small RNA-Generating Loci as Master Regulators of Transposon Activity 918 in Drosophila. Cell 128: 1089–1103 919 Brennecke J, Aravin AA, Stark A, Dus M, Kellis M, Sachidanandam R & Hannon GJ (2007b) 920 Discrete Small RNA-Generating Loci as Master Regulators of Transposon Activity 921 in Drosophila. Cell 128: 1089–1103 922 Choi‐Rhee E, Schulman H & Cronan JE (2004) Promiscuous protein biotinylation by 923 Escherichia coli biotin protein ligase. Protein Science 13: 3043–3050 924 Danilevskaya ON & Arkhipova IR (1997) Promoting in Tandem: The Promoter for Telomere 925 Transposon HeT-A and Implications for the Evolution of Retroviral LTRs 926 ElMaghraby MF, Andersen PR, Pühringer F, Hohmann U, Meixner K, Lendl T, Tirian L & 927 Brennecke J (2019) A Heterochromatin-Specific RNA Export Pathway Facilitates 928 piRNA Production. Cell 178: 964-979.e20 929 Fabry MH, Ciabrelli F, Munafò M, Eastwood EL, Kneuss E, Falciatori I, Falconio FA, 930 Hannon GJ & Czech B (2019) piRNA-guided co-transcriptional silencing coopts 931 nuclear export factors. Elife 8 932 Finger DS, Williams AE, Holt V V & Ables ET (2023) Novel roles for RNA binding proteins 933 squid , hephaesteus , and Hrb27C in Drosophila oogenesis. Developmental Dynamics 934 252: 415–428 935 Ge DT, Wang W, Tipping C, Gainetdinov I, Weng Z & Zamore PD (2019) The RNA-Binding 936 ATPase, Armitage, Couples piRNA Amplification in Nuage to Phased piRNA Production 937 on Mitochondria. Mol Cell 74: 982-995.e6 938 Goodrich JS, Clouse KN & Schüpbach T (2004) Hrb27C, Sqd and Otu cooperatively regulate 939 gurken RNA localization and mediate nurse cell chromosome dispersion in 940 Drosophila oogenesis. Development 131: 1949–1958 941 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Grieder NC, Cuevas M de & Spradling AC (2000) The fusome organizes the microtubule 942 network during oocyte differentiation in Drosophila. Development 127: 4253–4264 943 Hallson G, Hollebakken RE, Li T, Syrzycka M, Kim I, Cotsworth S, Fitzpatrick KA, 944 Sinclair DAR & Honda BM (2012) dSet1 Is the Main H3K4 Di- and 945 Tri-Methyltransferase Throughout Drosophila Development. Genetics 190: 91–100 946 Handler D, Olivieri D, Novatchkova M, Gruber FS, Meixner K, Mechtler K, Stark A, 947 Sachidanandam R & Brennecke J (2011) A systematic analysis of Drosophila TUDOR 948 domain-containing proteins identifies Vreteno and the Tdrd12 family as essential 949 primary piRNA pathway factors. EMBO J 30: 3977–3993 950 Huynh J-R, Munro TP, Smith-Litière K, Lepesant J-A & Johnston DS (2004) The Drosophila 951 hnRNPA/B Homolog, Hrp48, Is Specifically Required for a Distinct Step in osk mRNA 952 Localization. Dev Cell 6: 625–635 953 Iki T, Kawaguchi S & Kai T (2023) miRNA/siRNA-directed pathway to produce noncoding 954 piRNAs from endogenous protein-coding regions ensures Drosophila 955 spermatogenesis. Sci Adv 9 956 Iki T, Takami M & Kai T (2020) Modulation of Ago2 Loading by Cyclophilin 40 Endows 957 a Unique Repertoire of Functional miRNAs during Sperm Maturation in Drosophila. 958 Cell Rep 33: 108380 959 Iyer SS, Sun Y, Seyfferth J, Manjunath V, Samata M, Alexiadis A, Kulkarni T, Gutierrez 960 N, Georgiev P, Shvedunova M, et al (2023) The NSL complex is required for piRNA 961 production from telomeric clusters. Life Sci Alliance 6: e202302194 962 Jezek M, Sun W, Negesse MY, Smith ZM, Orosz A & Green EM (2023) Set1 regulates telomere 963 function via H3K4 methylation–dependent and -independent pathways and calibrates 964 the abundance of telomere maintenance factors. Mol Biol Cell 34 965 Kai T, Zheng J & Iki T (2025) Paralog‐Dependent Specialization of Paf1C 966 Subunit, Ctr9, for Sex Chromosome Gene Regulation and Male Germline 967 Differentiation in Drosophila. Genes to Cells 30: e70040 968 Kawaguchi S, Isshiki W & Kai T (2025) Factories without walls: The molecular 969 architecture and functions of non-membrane organelles in small RNA-guided genome 970 protection. Biochimica et Biophysica Acta (BBA) - General Subjects 1869: 130811 971 Klattenhoff C, Xi H, Li C, Lee S, Xu J, Khurana JS, Zhang F, Schultz N, Koppetsch 972 BS, Nowosielska A, et al (2009) The Drosophila HP1 Homolog Rhino Is Required for 973 Transposon Silencing and piRNA Production by Dual-Strand Clusters. Cell 138: 974 1137–1149 975 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Kneuss E, Munafò M, Eastwood EL, Deumer U-S, Preall JB, Hannon GJ & Czech B (2019) 976 Specialization of the Drosophila nuclear export family protein Nxf3 for piRNA 977 precursor export. Genes Dev 33: 1208–1220 978 Koch R, Ledermann R, Urwyler O, Heller M & Suter B (2009) Systematic Functional 979 Analysis of Bicaudal-D Serine Phosphorylation and Intragenic Suppression of a 980 Female Sterile Allele of BicD. PLoS One 4: e4552 981 Lee KY, Chen Z, Jiang R & Meneghini MD (2018) H3K4 Methylation Dependent and 982 Independent Chromatin Regulation by JHD2 and SET1 in Budding Yeast. G3 983 Genes|Genomes|Genetics 8: 1829–1839 984 Lim AK & Kai T (2007) Unique germ-line organelle, nuage, functions to repress selfish 985 genetic elements in Drosophila melanogaster. Proceedings of the National Academy 986 of Sciences 104: 6714–6719 987 Lim L-X, Isshiki W, Iki T, Kawaguchi S & Kai T (2022) The Tudor Domain-Containing 988 Protein, Kotsubu (CG9925), Localizes to the Nuage and Functions in piRNA 989 Biogenesis in D. melanogaster. Front Mol Biosci 9: 1–15 990 Lin Y, Suyama R, Kawaguchi S, Iki T & Kai T (2023) Tejas functions as a core component 991 in nuage assembly and precursor processing in Drosophila piRNA biogenesis. 992 Journal of Cell Biology 222 993 Lorenz DR, Mikheyeva I V., Johansen P, Meyer L, Berg A, Grewal SIS & Cam HP (2012) 994 CENP-B Cooperates with Set1 in Bidirectional Transcriptional Silencing and 995 Genome Organization of Retrotransposons. Mol Cell Biol 32: 4215–4225 996 Ma X, Zhu X, Han Y, Story B, Do T, Song X, Wang S, Zhang Y, Blanchette M, Gogol M, 997 et al (2017) Aubergine Controls Germline Stem Cell Self-Renewal and Progeny 998 Differentiation via Distinct Mechanisms. Dev Cell 41: 157-169.e5 999 Maxwell PH, Belote JM & Levis RW (2006) Identification of multiple transcription 1000 initiation, polyadenylation, and splice sites in the Drosophila melanogaster 1001 TART family of telomeric retrotransposons. Nucleic Acids Res 34: 5498–5507 1002 McKearin DM & Spradling AC (1990) bag-of-marbles: a Drosophila gene required to 1003 initiate both male and female gametogenesis. Genes Dev 4: 2242–2251 1004 Mohan M, Herz H-M, Smith ER, Zhang Y, Jackson J, Washburn MP, Florens L, Eissenberg 1005 JC & Shilatifard A (2011) The COMPASS Family of H3K4 Methylases in Drosophila. 1006 Mol Cell Biol 31: 4310–4318 1007 Mohn F, Sienski G, Handler D & Brennecke J (2014) The Rhino-Deadlock-Cutoff complex 1008 licenses noncanonical transcription of dual-strand piRNA clusters in Drosophila. 1009 Cell 157: 1364–1379 1010 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Mugat B, Nicot S, Varela-Chavez C, Jourdan C, Sato K, Basyuk E, Juge F, Siomi MC, 1011 Pélisson A & Chambeyron S (2020) The Mi-2 nucleosome remodeler and the Rpd3 1012 histone deacetylase are involved in piRNA-guided heterochromatin formation. Nat 1013 Commun 11: 2818 1014 Murano K, Iwasaki YW, Ishizu H, Mashiko A, Shibuya A, Kondo S, Adachi S, Suzuki S, 1015 Saito K, Natsume T, et al (2019) Nuclear RNA export factor variant initiates 1016 piRNA‐guided co‐transcriptional silencing. EMBO J 38 1017 Ng HH, Robert F, Young RA & Struhl K (2003) Targeted Recruitment of Set1 Histone 1018 Methylase by Elongating Pol II Provides a Localized Mark and Memory of Recent 1019 Transcriptional Activity. Mol Cell 11: 709–719 1020 Nguyen TTM, Munkhzul C, Kim J, Kyoung Y, Vianney M, Shin S, Ju S, Pham-Bui H-A, Kim 1021 J, Kim J-S, et al (2023) In vivo profiling of the Zucchini proximal proteome in 1022 the Drosophila ovary. Development 150 1023 Nislow C, Ray E & Pillus L (1997) SET1 , A Yeast Member of the Trithorax Family, 1024 Functions in Transcriptional Silencing and Diverse Cellular Processes. Mol Biol 1025 Cell 8: 2421–2436 1026 Osumi K, Sato K, Murano K, Siomi H & Siomi MC (2019) Essential roles of Windei and 1027 nuclear monoubiquitination of Eggless/ SETDB 1 in transposon 1028 silencing. EMBO Rep 20 1029 Patil VS & Kai T (2010) Repression of Retroelements in Drosophila Germline via piRNA 1030 Pathway by the Tudor Domain Protein Tejas. Current Biology 20: 724–730 1031 Qi H, Watanabe T, Ku H-Y, Liu N, Zhong M & Lin H (2011) The Yb Body, a Major Site 1032 for Piwi-associated RNA Biogenesis and a Gateway for Piwi Expression and 1033 Transport to the Nucleus in Somatic Cells. Journal of Biological Chemistry 286: 1034 3789–3797 1035 Radion E, Ryazansky S, Akulenko N, Rozovsky Y, Kwon D, Morgunova V, Olovnikov I & 1036 Kalmykova A (2017) Telomeric Retrotransposon HeT-A Contains a Bidirectional 1037 Promoter that Initiates Divergent Transcription of piRNA Precursors in 1038 Drosophila Germline. J Mol Biol 429: 3280–3289 1039 Rangan P, Malone CD, Navarro C, Newbold SP, Hayes PS, Sachidanandam R, Hannon GJ & 1040 Lehmann R (2011) piRNA Production Requires Heterochromatin Formation in 1041 Drosophila. Current Biology 21: 1373–1379 1042 Rivera AJ, Lee J-HR, Gupta S, Yang L, Goel RK, Zaia J & Lau NC (2025) Traffic Jam 1043 activates the Flamenco piRNA cluster locus and the Piwi pathway to ensure 1044 transposon silencing and Drosophila fertility. Cell Rep 44: 115354 1045 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Rojas-Ríos P, Chartier A, Enjolras C, Cremaschi J, Garret C, Boughlita A, Ramat A 1046 & Simonelig M (2024) piRNAs are regulators of metabolic reprogramming in stem 1047 cells. Nat Commun 15: 8405 1048 Rojas‐Ríos P, Chartier A, Pierson S & Simonelig M (2017) Aubergine and pi 1049 RNA s promote germline stem cell self‐renewal by repressing the 1050 proto‐oncogene Cbl. EMBO J 36: 3194–3211 1051 Rørth P (1998) Gal4 in the Drosophila female germline. Mech Dev 78: 113–118 1052 Roux KJ, Kim DI, Raida M & Burke B (2012) A promiscuous biotin ligase fusion protein 1053 identifies proximal and interacting proteins in mammalian cells. Journal of Cell 1054 Biology 196: 801–810 1055 Saito K, Ishizu H, Komai M, Kotani H, Kawamura Y, Nishida KM, Siomi H & Siomi MC (2010) 1056 Roles for the Yb body components Armitage and Yb in primary piRNA biogenesis in 1057 Drosophila. Genes Dev 24: 2493–2498 1058 Saito K, Nishida KM, Mori T, Kawamura Y, Miyoshi K, Nagami T, Siomi H & Siomi MC (2006) 1059 Specific association of Piwi with rasiRNAs derived from retrotransposon and 1060 heterochromatic regions in the Drosophila genome. Genes Dev 20: 2214–2222 1061 Shpiz S, Olovnikov I, Sergeeva A, Lavrov S, Abramov Y, Savitsky M & Kalmykova A (2011) 1062 Mechanism of the piRNA-mediated silencing of Drosophila telomeric 1063 retrotransposons. Nucleic Acids Res 39: 8703–8711 1064 Sienski G, Batki J, Senti K-A, Dönertas D, Tirian L, Meixner K & Brennecke J (2015a) 1065 Silencio/CG9754 connects the Piwi–piRNA complex to the cellular heterochromatin 1066 machinery. Genes Dev 29: 2258–2271 1067 Sienski G, Batki J, Senti KA, Dönertas D, Tirian L, Meixner K & Brennecke J (2015b) 1068 Silencio/CG9754 connects the piwi-piRNA complex to the cellular heterochromatin 1069 machinery. Genes Dev 29: 2258–2271 1070 Suyama R & Kai T (2025) pi RNA processing within non‐membrane structures 1071 is governed by constituent proteins and their functional motifs. FEBS J 292: 2715–1072 2736 1073 Teixeira FK, Okuniewska M, Malone CD, Coux R-X, Rio DC & Lehmann R (2017a) 1074 piRNA-mediated regulation of transposon alternative splicing in the soma and germ 1075 line. Nature 552: 268–272 1076 Teixeira FK, Okuniewska M, Malone CD, Coux RX, Rio DC & Lehmann R (2017b) 1077 PiRNA-mediated regulation of transposon alternative splicing in the soma and germ 1078 line. Nature 552: 268–272 1079 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Le Thomas A, Rogers AK, Webster A, Marinov GK, Liao SE, Perkins EM, Hur JK, Aravin 1080 AA & Tóth KF (2013) Piwi induces piRNA-guided transcriptional silencing and 1081 establishment of a repressive chromatin state. Genes Dev 27: 390–399 1082 Vidaurre V, Song A, Li T, Ku WL, Zhao K, Qian J & Chen X (2024) The Drosophila histone 1083 methyltransferase SET1 coordinates multiple signaling pathways in regulating 1084 male germline stem cell maintenance and differentiation. Development 151 1085 Villasante A, Abad JP, Planelló R, Méndez-Lago M, Celniker SE & de Pablos B (2007) 1086 Drosophila telomeric retrotransposons derived from an ancestral element that was 1087 recruited to replace telomerase. Genome Res 17: 1909–1918 1088 Wang H, Fan Z, Shliaha P V., Miele M, Hendrickson RC, Jiang X & Helin K (2023a) H3K4me3 1089 regulates RNA polymerase II promoter-proximal pause-release. Nature 615: 339–1090 348 1091 Wang L, Dou K, Moon S, Tan FJ & Zhang ZZ (2018) Hijacking Oogenesis Enables Massive 1092 Propagation of LINE and Retroviral Transposons. Cell 174: 1082-1094.e12 1093 Wang X, Ramat A, Simonelig M & Liu M-F (2023b) Emerging roles and functional mechanisms 1094 of PIWI-interacting RNAs. Nat Rev Mol Cell Biol 24: 123–141 1095 Webster A, Li S, Hur JK, Wachsmuth M, Bois JS, Perkins EM, Patel DJ & Aravin AA (2015) 1096 Aub and Ago3 Are Recruited to Nuage through Two Mechanisms to Form a Ping-Pong 1097 Complex Assembled by Krimper. Mol Cell 59: 564–575 1098 Wells JN & Feschotte C (2020) A Field Guide to Eukaryotic Transposable Elements. Annu 1099 Rev Genet 54: 539–561 1100 Wilson JR, Jing C, Walker PA, Martin SR, Howell SA, Blackburn GM, Gamblin SJ & Xiao 1101 B (2002) Crystal Structure and Functional Analysis of the Histone 1102 Methyltransferase SET7/9. Cell 111: 105–115 1103 Yan D, Neumüller RA, Buckner M, Ayers K, Li H, Hu Y, Yang-Zhou D, Pan L, Wang X, Kelley 1104 C, et al (2014) A regulatory network of Drosophila germline stem cell self-renewal. 1105 Dev Cell 28: 459–473 1106 Yang F, Su W, Chung OW, Tracy L, Wang L, Ramsden DA & Zhang ZZ (2023) Retrotransposons 1107 hijack alt-EJ for DNA replication and eccDNA biogenesis. Nature 620: 218–225 1108 Yano T, de Quinto SL, Matsui Y, Shevchenko A, Shevchenko A & Ephrussi A (2004) Hrp48, 1109 a Drosophila hnRNPA/B Homolog, Binds and Regulates Translation of oskar mRNA. 1110 Dev Cell 6: 637–648 1111 Yu Y, Gu J, Jin Y, Luo Y, Preall JB, Ma J, Czech B & Hannon GJ (2015a) Panoramix enforces 1112 piRNA-dependent cotranscriptional silencing. Science (1979) 350: 339–342 1113 Yu Y, Gu J, Jin Y, Luo Y, Preall JB, Ma J, Czech B & Hannon GJ (2015b) Panoramix enforces 1114 piRNA-dependent cotranscriptional silencing. Science (1979) 350: 339–342 1115 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Zhang Z, Xu J, Koppetsch BS, Wang J, Tipping C, Ma S, Weng Z, Theurkauf WE & Zamore 1116 PD (2011) Heterotypic piRNA Ping-Pong Requires Qin, a Protein with Both E3 Ligase 1117 and Tudor Domains. Mol Cell 44: 572–584 1118 Zhao K, Cheng S, Miao N, Xu P, Lu X, Zhang Y, Wang M, Ouyang X, Yuan X, Liu W, et 1119 al (2019) A Pandas complex adapted for piRNA-guided transcriptional silencing 1120 and heterochromatin formation. Nat Cell Biol 21: 1261–1272 1121 Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, Benner C & Chanda 1122 SK (2019) Metascape provides a biologist-oriented resource for the analysis of 1123 systems-level datasets. Nat Commun 10: 1523 1124 1125 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Figure 1 A mTurbo GFP PIWI UASp (Piwi, Aub or Ago3) Gal4 B GSCLC ovaries mTurbo-GFP-PiwimTurbo-GFP mTurbo-GFP-Aub mTurbo-GFP-Ago3 GFP DAPI GSCLC ovaries (bam-GLKD) expressing: anti-Piwi DAPI anti-Aub DAPI anti-Ago3 DAPIbam shRNAUASp Gal4 ♀ Gal4 nos promoter Input Streptavidin pulldown 180 140 100 75 60 anti-GFP 180 140 100 75 60 45 35 Streptavidin-HRP C (kDa) mTurbo-GFP mTurbo-GFP-Piwi mTurbo-GFP-Aub mTurbo-GFP-Ago3 mTurbo-GFP mTurbo-GFP-Piwi mTurbo-GFP-Aub mTurbo-GFP-Ago3 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Figure 2 A B C .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint 0 5 0 0. 5 1 0 5 Figure 3 aub-GLKDy w hrb27C-GLKD set1-GLKD Krimp HeT-A Gag A B aub-GLKDy w hrb27C-GLKD set1-GLKD 0 5 0 5 0 1 2 0 5 0 5 0 5 Normalized signal Distance (µm) 0 5 DAPIAubDAPI Normalized signal DAPI Normalized signal PiwiDAPI Distance (µm) Distance (µm) Distance (µm) 0 1 0 5 10 0 5 10 0 5 100 5 10 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint GSE71374 gfp-GLKD set1-GLKD aub-GLKD piwi-GLKD panx het panx KO 0 10 00 20 00 30 00 0 20 0 40 0 60 0 B 80 40 0 120 160 200 log2FC (hrb27C-GLKD / gfp-GLKD) log2FC (aub-GLKD / gfp-GLKD)log2FC (set1-GLKD / gfp-GLKD) HeT-A HMS-Beagle TART-B HeT-A gypsy12 log2FC (hrb27C-GLKD / gfp-GLKD) Figure 4 A C Non-GSCLC ovary (Genes and individual TE insertions) D GSCLC ovary (Genes and individual TE insertions) -log 10 FDR -log 10 FDR log2FC (set1-GLKD / gfp-GLKD) Transcript level (TPM) r = -0.02 log2FC (aub-GLKD / gfp-GLKD) log 2FC ( set1 -GL KD /gfp -GLKD ) TEs included in EdgeR output Genes and TEs included in EdgeR output TAHRE Non-GSCLC ovary (Individual TE families) log2FC (piwi-GLKD / gfp-GLKD) log2FC (piwi-GLKD / gfp-GLKD) log2FC (aub-GLKD / gfp-GLKD) 80 40 0 100 120 140 60 20 -10 -5 0 5 10 15 -10 -5 0 5 10 15 -10 -5 0 5 10 15 TEs included in EdgeR output Genes and TEs included in EdgeR output -10 -5 0 5 10 15 -10 -5 0 5 10 15 -10 -5 0 5 10 15 -10 -5 0 5 10 15 -10 -5 0 5 10 15 log2FC (piwi-GLKD / gfp-GLKD) mdg3 DM88 copia Circe mdg3HMS-Beagle flea Max-element diver flea HMS-Beagle diver 4 0 -4 6 8 10 2 -2 HeT-A gypsy12 TAHRE TART-A1 TART-B TART-C Max-element 40-4 6 8 102-2 diver HMS-Beagle flea mdg3 4 0 -4 6 8 10 2 -2 40-4 6 8 102-2 r = 0.73 HeT-A TAHRE gypsy12 TART-B TART-A1 TART-C Max-element HMS- Beagle diver HeT-A HeT-A mdg3 invader3 HMS-Beagle TART-A Max-element diver gypsy12 TART-C 3S18 3S18 3S18 TART-B TART-A rover aub het aub KO piwihet piwiKO This study GSE103582 3S18 3S18 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Figure 5 A B GFPGFP-Set1WT SET RNAi-resistant nonsense substitutions GFPGFP-Set1E1613K SET E1613K Transcript level (mean TPM) GLKD GFP set1 set1 set1 GFP-set1WT GFP-set1E1613K Transgene400 300 200 100 0 400 300 200 100 0 3000 2000 1000 0 HeT-A TAHRE TART-A1 TART-B TART-C diver HMS- Beagle 3S18 Max- element Telomeric TE families C D [0-50] [0-50] [0-50] [0-50] GLKD GFP set1 set1 set1 GFP-set1WT GFP-set1E1613K Transgene ago3 ago3 aub piwi 200 160 120 80 40 0 H3K4me3 signal (mean CPM) aub [0-15] [0-15] [0-15] [0-15] TSS RRM RRM GFPH3K4me3Vas set1-GLKDgfp-GLKD GFP-Set1WT in set1-GLKD GFP-Set1E1613K in set1-GLKD H3K4me3 (CUT&Tag) TSS .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint HMS-Beagle (0.51) diver (0.59) 0 60 120 Figure 6 D log2FC 23-29nt mapper (set1-GLKD / control) C B Control set1 GLKD 6,063 bp HeT-A 10,463bp TAHRE 13,424bp TART-A1 0 1 2 -1 -2 -3 0 60 120 Individual TE families ordered by FC value Proportion (%) Control set1-GLKD 60 50 40 30 20 10 0 0 60 120 Sense Antisense Sense+Antisense TART-C (0.16) TART-A1 (0.24) TART-B (0.35) TAHRE (0.49) HeT-A (1.10) TART-C (0.18) TART-A1 (0.19) TAHRE (0.20) HeT-A (0.23) TART-B (0.28) TART-C (0.16) TART-A1 (0.20) TART-B (0.29) TAHRE (0.31) HeT-A (0.51) Max-element (1.36) diver (1.60) HMS-Beagle (1.86) gypsy12 (0.72) gypsy12, 3S18 (0.52) Max-element (0.71) gypsy12 (0.66) Max-element (0.84) HMS-Beagle (0.99) diver (1.01) A E set1-GLKD / Control Sense Antisense log2FC 23-29nt mapper Sense Antisense HeT-A TAHRE TART-A1 TART-B TART-C 1 0 -1 -2 -3 1 0 -1 -2 -3 HeT-A TAHRE TART-A1 TART-B TART-C HeT-A TAHRE TART-A1 TART-B TART-C HeT-A TAHRE TART-A1 TART-B TART-C 3S18 (2.04) 3S18 (1.13) CPM [0-250] [0-250] CPM [0-250] [0-250] 10,176bp TART-B Sense Antisense Sense Antisense panx KO / Het (GSE71374) 11,124bp TART-C [0-250] [0-250] [0-250] [0-250] [0-50] [0-50] [0-50] [0-50] [0-50] [0-50] [0-50] [0-50] [0-50] [0-50] [0-50] [0-50] WT EKGFP GFP-Set1 Piw i GFP-Set1 GFP Tubulin Input WT EKGFP GFP-Set1 GFP-IP (Crosslinking) 245 35 100 60 piRNA .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint Subtelomeric HTT cluster set1-GLKD Figure 7 A B TE 84 (HeT-A, TART, diver, HMS-Beagle, Max, etc) Gene 222 TE 7 Gene 541 (aub, etc) TE 1 Gene 91 Peak call (p<0.01) GFP-Set1 (WT or E1613K) / GFP control -4 -2 0 2 4 6 8 10 12 -4 -2 0 2 4 6 8 10 12 log2 fold difference (GFPSet1WT - GFP) TEs with peaks Genes and TEs with peaks C D GFP GFP-Set1-WT in set1-GLKD H3K4me3 gfp-GLKD [0-15] [0-15] [0-15] [0-15] [0-50] [0-50] [0-50] ORF1 TSSHeT-A (consensus) TSS aub [0-500] [0-500] [0-500] [0-500] [0-50] [0-50] [0-50] 5’ 3’ GFP-Set1-EK in set1-GLKD GFP-Set1-WT in set1-GLKD GFP-Set1-EK in set1-GLKD GFP E CUT&Tag Cluster 3 (chr4:1,267,100 -1,348,250) TE Gene track GFP GFP-Set1-WT in set1-GLKD [0-15] [0-15] [0-15] GFP-Set1-EK in set1-GLKD GFP(CUT&Tag ) Control [0-10] plus st rand (antisense) [0-10] minus strand (sense) [0-10] plus st rand (antisense) [0-10] minus strand (sense) set1-GLKD piRNA 6083 F Precursors piRNA (Sense < Antisense) 3’UTR HTT Piwi TE Individual TE insertions Telomere Precursor Antisense piRNA Piwi Set1 log2 fold difference (GFPSet1E1613K - GFP) r = 0.85 Set1Set1 Set1 Set1 Set1 .CC-BY 4.0 International licenseperpetuity. It is made available under a preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in The copyright holder for thisthis version posted March 30, 2026. ; https://doi.org/10.64898/2026.03.30.715253doi: bioRxiv preprint

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-pdf

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00
unpaywall
last seen: 2026-05-23T02:00:01.238055+00:00
License: CC-BY-4.0