Abstract
MicroRNAs (miRNAs) are post- transcriptional regulators of gene expression whose
contributions to cell biology remain underexplored. Here, we used Cell Painting to quantify the
morphological effect of 2,565 human and 1,900 mouse miRNA mimics on 24 million cells
across five cell lines. To do so, we developed a novel single- cell morphological profiling
analysis framework, involving stringent batch correction, feature selection, and hit calling. With
this, we discovered that 9% of human and 15% of mouse miRNA mimics significantly alter cell
morphology in at least one cell line. Eighteen miRNAs caused significant changes in multiple
cell lines, including eight orthologous miRNAs that altered morphology in both human and
mouse cells. Among the replicating miRNAs were human and mouse miR -155-5p, which
affected morphology in at least one replicate of four cell lines. As expected, miRNAs with
identical seed sequences induced more similar morphological changes tha n miRNAs with
different seeds. Also, morphological changes were associated with cytotoxicity and annotation
confidence. This comprehensive single-cell morphological resource will help elucidate human
and mouse miRNA cellular function.
Introduction
MicroRNAs (miRNAs) are small (~22 nucleotide) non -coding RNAs that are important
modulators of post -transcriptional regulation.1 Bound to an Argonaute family member, they
form the RNA- induced silencing complex (RISC) that binds target mRNAs through partial
sequence complementarity to the miRNA seed region (nucleotides 2 –8).2,3 Canonically,
binding triggers degradation of the target mRNA or inhibits its translation, thereby reducing
the abundance of the protein encoded by the bound mRNA. 4 miRNA-mediated post -
transcriptional regulation may be widespread: approximately 60% of all human protein-coding
transcripts contain a conserved miRNA binding site. 5 Many of the 2,654 human miRNAs
discovered to date 6 play roles in processes such as cell proliferation 7, differentiation 1,
signaling8,9, metabolism 10,11, embryonic development 12, and disease. 13,14 Whilst addition of
miRNA mimics to cells often affects their proliferation and survival15, it remains unclear how
many and which miRNAs fulfil specific cellular roles.
Associating miRNAs to phenotypes presents three challenges. First, although most miRNAs
affect the abundance of individual mRNAs only modestly, they could substantially alter
phenotypes by co-ordinately modulating the abundance of hundreds of mRNA targets.14,16,17
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
2
Second, the miRNA literature is contaminated by paper mill publications and by loci incorrectly
annotated as miRNAs.2,17–21 Third, forward genetic studies of miRNA function are potentially
hampered by cell -type specificity of miRNA action. 23 Comprehensive data resources and
Methods
are therefore required to link miRNAs to phenotypes of diverse cell types.
Cell Painting is a morphological profiling assay based on high -content microscopy that can
compare cellular morphologies across thousands of perturbations. It uses six fluorescent dyes
to stain eight cellular components in five channels, which yield numerical morphological
descriptors (“profiles”) for each cell.
22,23 These profiles capture cell size and shape information
as well as fluorescent intensities of cell organelles and compartments, which permit detection
of changes in cell morphology. Singh et al. 24, for example, used Cell Painting and 315 short
hairpin RNAs targeting 41 genes to show that seed sequence and off-target effects determine
cell morphology. Insights gained from morphological profiling are, however, usually limited due
to cell profiles being averaged per well or treatment , rather than cells being analysed
individually. Averaging could mask subtle changes occurring in subpopulations of cells. This
is particularly relevant to experiments using transfection whose efficiency is imperfect.
25
Retention of single-cell information throughout an analysis can improve analysis accuracy,26,27
motivating a single-cell morphological analysis of miRNA activity.
Here, we used single-cell analysis with Cell Painting to test the effects of all available human
and mouse miRNA mimics in three human and two mouse cell lines. We found that 227 human
and 286 mouse miRNAs induce significant changes in cell morphology in at least one cell line,
including 18 miRNAs that induce changes in two cell lines. This is the first single-cell study to
comprehensively associate miRNAs with cellular morphology . It also highlights the steps
needed to analyse single-cell data from complex morphological profiling experiments.
Results
We measured the effects on single-cell morphology from transfecting each of 2,565 miRNA
mimics in three human cell lines , and 1,900 miRNA mimics in two mouse cell lines (Figure
1A). We set up 384- well plates such that each miRNA was administered four times per cell
line: in two wells of one plate, and in two wells of a second plate seeded with a different
passage of cells (termed “replicates” throughout), yielding a total of 134 plates (Figure 1B).
We assessed effects of individual miRNA mimics (hereafter miRNAs) on cell morphology using
the Cell Painting assay. This is a high-content image- based profiling assay that stains eight
components of the cell – DNA, endoplasmic reticulum (ER), cytoplasmic RNA, nucleoli, actin,
Golgi apparatus, plasma membrane and mitochondria – in five channels (Figure 1C). Then,
we obtained 205,824 fluorescent microscopy images of 24, 127,517 cells. From these, we
extracted 1,386 descriptors of cell morphology (“features”), such as intensity of stains and cell
size, which together form the morphological profile of a cell (Figure 1D).
Single-cell morphological profiling analysis framework
To understand which miRNA treatments caused quantifiable changes in cell morphology, we
needed to overcome three challenges. First, images had to be quality controlled to remove
artifacts like dust precipitation, which we addressed with a nearest -neighbour approach that
identified and removed affected images. Second, plate-positional and batch effects caused by
technical factors such as photobleaching, common for Cell Painting experiments,
25 needed to
be reduced. Left unaccounted for, these technical effects can overshadow treatment effects.28
We performed batch correction using a linear model and selected features that were not
significantly associated with plate position or batch (Methods and Supplementary Figure S1).
Third, we established methods that determine whether a miRNA treatment significantly altered
cell morphology in successfully transfected cells.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
3
To do so, we developed scmorph, a Python package for the single -cell analysis of
morphological profiles.29 This measures morphological heterogeneity in two cell populations:
(i) cells treated with a scramble miRNA, termed “negative controls”, and (ii) cells treated with
a miRNA of interest. More specifically, we first reduce the dimensionality of the morphological
profiles using principal component analysis (PCA), before computing the Mahalanobis
distance of each cell to the average location of negative control cells on the first 10 PCs
(“centroid”; Figure 1E). This yields a vector of distances for each treatment (“treatment
distances”) and the negative control (“control distances”). For each treatment we then
summarise its difference to negative control s by performing the Kolmogorov –Smirnov (KS)
test, comparing the treatment and control distances (Figure 1F). This test measures the
probability of the null hypothesis that treatment and control distances are drawn from the same
distribution, i.e. that there are no differences between treated and negative control cells.
Figure 1. Single-cell morphological profiling of miRNAs. A: Three human and two mouse cell lines were transfected
with 2,565 human or 1,900 mouse miRNA mimics, respectively. Two experimental replications per cell line were
used. B: Each miRNA was applied to two wells, requiring 134 plates in total. C: After Cell Painting, eight cellular
compartments were visualised with fluorescent microscopy, as exemplified with images of HEK293T cells. The
scale bar is 25 μM. D: Single-cell profiles were computed for 24.1M cells across 1,386 features. E: After image QC
and batch correction (Methods), a Mahalanobis distance for each single cell was calculated from its location on the
first 10 principal components (PC) to the average location of negative control cells (control centroid). These
distances were calculated in a space of PCs derived from 1,386 features. F: The Kolmogorov-Smirnov (KS) statistic
was used to compare the treated cells’ phenotypic distance from the control centroid to the distances of negative
control cells. G: To estimate the significance of these phenotypic shifts (“hit calling”), a background distribution was
built from pairwise KS tests between negative control wells across all plates of a replicate experiment. Treatments
were considered significantly different from negative controls when their p-value was below the 5% FDR threshold
determined by the background distribution.
We then tested whether a miRNA affected cell morphology (i.e., was a “hit”) by combining
cells from two technical controls (on the same plate) and comparing it to a negative control,
which required new methods to overcome the high sensitivity of morphological profiling to
minor technical variation .25 First, we decided which of three negative controls to use as a
Reference
DMSO only, DMSO with transfection reagent (DMSO+TR), or DMSO+TR with an
additional scrambled miRNA. The three negative controls yielded similar results when
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
4
compared to positive controls (Supplementary Figure S2). miRNAs could significantly change
morphology differently across the negative controls (Supplementary Figure S3). However,
96% of miRNAs yielded the same result (hit or no hit) using each of two biologically relevant
negative control s. These results indicate that choice of negative control can impact
downstream interpretation, potentially due to scrambled miRNAs competing with endogenous
miRNAs for RISC loading. W e opted to report results from wells treated with scrambled
miRNAs as negative control, because treatments differ only in miRNA sequence, but results
from DMSO plus transfection reagent can be found in Supplementary Tables 1-3.
Hit calling was performed using an empirically controlled false discovery rate (FDR). This
involved building a null distribution of pairwise tests of negative control wells, between which
we expect no morphological differences (see Methods). If a miRNA’s p-value was smaller than
the smallest 5% of p-values of the null distribution then it was considered significant (FDR <
5%, Figure 1G). Lastly, we defined miRNAs as hits only when they were significant in both
replicates of a cell line, which reduces false positives (Methods).
Cell death siRNAs impact morphology of surviving cells
To test our single-cell analysis methods, we first investigated whether treatment of HEK293T
cells with a cocktail of cell death-inducing siRNAs resulted in significant changes to cell counts
and cell morphology. This positive control treatment was applied to specified wells on every
plate; the treatment also served as a transfection control. As expected, 24 h after exposure to
the siRNA cocktail the average cell count was significantly reduced by 39 % relative to a
negative control of scrambled miRNAs, indicating successful transfection ( P<0.001, Figure
2A). To determine whether cell count reduction was accompanied by morphological changes,
we performed PCA of the single -cell profiles. We quantified effects of batch and treatment
using ANOVA, finding that batch effects outweighed treatment effects (η² of batch: 0.0505, η²
of treatment: 0.0073). Batch effects were corrected using a linear model that first measures
the mean deviation per feature and plate among negative control cells and subsequently
removes it (Figure 2B, Methods).30 This method does not remove all batch effects, but reduces
the variance they explain by an order of magnitude (η² of batch: 0.0059, η² of treatment:
0.0062). To further mitigate plate-specific effects, we then derived per-plate PCs.
Figure 2. Cell death siRNAs tend to reduce cell counts in HEK293T, compared to cells only treated with scrambled
miRNAs, confirming successful transfection. A: Cell counts per well and condition. Horizontal lines on each violin
plot indicate 25th, 50th and 75th percentiles. B: Left: At the single -cell profile level, batch effects explain more
variance than differences between treatments, as measured by variance explained by each variable. Bold lines
represent averages per replicate; thin lines are per-plate profiles. Right: Correcting batch effects with a linear model
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
5
reduces the differences between replicates and batches, and reduces variance explained by batch by an order of
magnitude. C: Single-cell Mahalanobis distances to the control centroid highlight the phenotypic differences of cells
treated with cell death siRNAs. D: Left: Background distribution of well-to-well KS tests of negative controls, with
the dashed line indicating the 95th percentile. Right: Cell death siRNA treatments induced a significantly different
morphology compared to control treatments in 26 of 30 plates tested. E: Per -channel images of HEK293T cells
treated with cell death siRNAs. Scale bar is 50 μM.
In this PC space, treated cells were consistently farther from the control centroid (Figure 2C).
This showed that the effect of cell death siRNAs on cell morphology was quantifiable and
significant at the single -cell level. This effect was reproducible because 26 of the 30 (87%)
independently tested plates showed significant differences in cell morphology between the
two groups (Figure 2D). Consistent with the computationally inferred change in cell
morphological profiles, images of these treated cells show cell rounding and detachment, both
features of apoptosis (Figure 2E). In other cell lines, the morphological effects of cell death
siRNAs varied, reaching significance in 5%, 10%, 20%, and 68% of experimental replicates
for N2a, HepG2, SH- SY5Y, and C2C12 cells, respectively (Supplementary Figure S4) and
leading to average cell count reductions of 4%, 27%, 28%, and 17%. This varied response is
likely due to the early time point assessed in this study (24 h after transfection) and due to
differences in morphological plasticity between cell lines.
31–33
We also tested an additional positive control that acts independent ly of transfection, the
insecticide rotenone, which impacts tubulin and mitochondria. 34,35 Rotenone yielded
substantial morphological changes in HEK293T (87% replicates significant) and HepG2 cells
(57%), but less change in other lines (0 -14%, Supplementary Figure S5). In summary, cell
death siRNAs and rotenone significantly and reproducibly altered single -cell morphology in
HEK293T cells, whereas their activity was variable in the other cell lines tested.
Following this confirmation of the sensitivity of our approach to significance testing, we tested
its specificity. For this, we investigated miRNAs that are not expected to alter cell morphology.
Among all tested miRNAs, 28 have been removed from the most recent version of miRBase
(v22.1) following new evidence that they are not encoded in human or mouse genomes, or
instead are rRNA or tRNA fragments, for example. We hypothesised that these retired
miRNAs would not alter cell morphology. Indeed, of 82 independent tests of these 28 retired
miRNAs, only 3 (4%) were significant at 5% FDR indicating, as expected, that most fail to
induce morphological change.
Hundreds of miRNAs impact cell morphology, with 18 affecting two
cell lines
We then tested all 2,565 human and 1 ,900 mouse miRNAs in the library for morphological
effects. Of these, 227 (9%) human and 286 (15%) mouse miRNAs were hits in both replicates
of one or more cell line (at 5% FDR, Table 1 and Supplementary Table 4). Ten of these human
miRNAs induced changes in two human cell lines (Figure 3A). Across organisms, 18 of 4,163
human or mouse sequence-unique miRNAs affected two cell lines, six of these are sequence-
identical orthologues, and an additional two were orthologues sharing an identical seed
sequence (Figure 3B). Using a different negative control as reference showed agreement with
these results (Figure S 6). We did not find any miRNAs acting in more than two tested cell
lines, suggesting that most miRNAs have detectable effects in single cell lines only.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
6
Cell line Hits
replicate 1
Hits
replicate 2
Hits
shared
Total sequence-
unique miRNA hits by
organism
Total unique
miRNA hits
HEK293T 463 262 114
227
507
HepG2 353 371 93
SH-SY5Y 262 361 30
C2C12 787 446 277 286 N2a 112 68 9
Table 1. 2,565 human and 1,900 mouse miRNAs, totalling 4 ,163 miRNAs with a unique sequence, were tested
across five cell lines using two experimental replicates. miRNAs that induced significant morphological changes in
both replicates were defined as hits. 227 human and 286 mouse miRNAs were a hit in at least one cell line, resulting
in 507 unique miRNAs that were hits in at least one cell line.
Figure 3. Most miRNA hits are cell line specific, except 18 notable miRNAs. A: Venn diagram showing number of
miRNAs being a hit in each cell line, including overlaps. Unlabelled segments indicate no shared miRNAs. B: 18
miRNAs significantly alter cell morphology in both experimental replicates for each of two cell lines, including six
orthologues with perfect sequence identity (“identical orth.”) and two orthologues with sequence identity in the seed
and up two 2 nucleotide differences (Levenshtein distance) outside the seed (“similar orth.”). NS = not significant,
NT = not tested.
Morphological effects of miR-155-5p may relate to seed sequence
Among replicating miRNAs wa s human and mouse miR-155-5p (Figure 3), which has been
well characterised in the contexts of inflammation and cancer drug resistance.36–39 This miRNA
significantly altered cell morphology in at least one replicate of four cell lines (Figure 4A). We
therefore investigated its impact on each of 142 morphological features relative to miRNAs
with seed sequences differing by up to one nucleotide . To do so, we performed t -tests per
feature, comparing treated to negative control cells. This resulted in 142 t-statistics per miRNA,
representing how much each feature altered following treatment, which we then clustered
(Methods). The cellular phenotypic effect conveyed by hsa- miR-155-5p in HepG2 and
HEK293T cells was most similar to effects of hsa-miR-9-3p and hsa-miR-137-3p (Figure 4B).
Subtle differences in miRNA effects were evident: miR-9-3p and miR- 155-5p increased
mitochondrial granularity, miR-9-3p and miR-137-3p decreased the nuclear perimeter, and the
outgroup, miR-4473, resembled negative control cells of HepG2 (Figure 4C). This fine-grained
view helps explain the clustering observed in Figure 4B. The activity of miR -155-5p, miR-9-
3p, and miR-137-3p may therefore depend on their seed sequence. Single nucleotide changes
in miRNA seed sequence (Figure 4A) are thus associated with similar (hsa-miR-9-3p and hsa-
miR-137-3p) and dissimilar (hsa -miR-4473) morphological changes in HepG2 cells. These
miRNA-induced changes in morphology were barely discernible by eye, underlining the need
for sensitive, quantitative analysis of morphological profiles (Figure 4D).
miRNA significance
NS
NT
5%
Replicating in
Human
Both (identical orth.)
Both (similar orth.)
miR−9−3p
miR−9−5p
miR−141−3p
miR−142−3p
miR−155−5p
miR−182−5p
miR−194−3p
miR−216b−5p
miR−548as−5p
miR−642b−5p
miR−708−5p
miR−1238−3p
miR−2113
miR−4422
miR−4720−5p
miR−5095
miR−5681a
miR−5693
HEK293T
HepG2
SH−SY5Y
C2C12
N2a
C2C12
N2a
HepG2
SH-SY5Y
HEK293T
269
9
102
85
22
4
3
1
3
5
2
A B
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
7
Figure 4. miR-155-5p-induced morphological changes are similar to those induced by seed sequence-similar
miRNAs. A: miR-155-5p and three closely related miRNAs affect cell morphology in both human and mouse cell
lines. Heatmap shows hits called for replicates 1 and 2 in each cell line. B: Three of the four miRNAs yield similar
morphological changes to control cells, resulting in their co- clustering in HEK293T and HepG2 lines. miR-4473,
however, does not yield comparable changes. C: In HepG2, miR-9-3p and miR -155-5p impact mitochondrial
granularity, and miR-9-3p and miR-137-3p reduce the nuclear perimeter. This view of cell morphological changes
helps to explain the clustering observed in B. D: Differences in HepG2 morphology are not immediately discernible
by eye, except for differences in vacuole or droplet content. Scale bar is 50 μM.
To connect these cellular changes with molecular phenomena, we next investigated whether
these miRNAs – which induce similar cellular changes – share similar mRNA target ontologies.
Using gene ontology enrichment (Methods), we found that mRNA targets of miR-9-3p and
miR-155-5p were each enriched in TGF -β signaling (both p<0.01) , and miR-9-3p's mRNA
targets were associated with mesenchyme development (p<1x10 -5). MiR-155-5p has
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
8
previously been implicated in the TGF-β signaling cascade through targeting SMAD proteins,
changes to which HepG2 cells are sensitive.36,40–44 Together, these results show that single-
cell analysis can detect subtle changes in cell morphology, similarities in seed sequence can
reflect similarities of morphological changes, and cell morphology adds an orthogonal
informational resource for miRNA target annotation.
A shared seed in the hsa-miR-518 family mediates morphological
impact
While the above results suggested that a similarity in seed sequence may influence
morphological similarity, the case study of hsa-miR-518 shows that seed identity can outweigh
miRNA family membership. Of the 11 members of hsa-miR-518, five share an identical seed
sequence (Figure 5A). All five miRNAs led to a significant morphological difference (i.e., were
all a hit) in HEK293T, whereas the other six miRNAs did not. MiRNAs with the AAAGCGC
seed were phenotypically similar and clustered separately from miRNAs with a different seed
sequence ( Figure 5A). Among the features most changed by these five miRNAs was
mitochondrial granularity, which was significantly higher compared to the negative control
(Figure 5B). As for hsa-miR-155-5p, this effect is difficult to spot by eye (Figure 5C). Overall,
these findings confirm that identical seed sequences can mediate similar morphological
changes.
In vivo, miRNA duplexes are typically processed into the 3p and 5p arms, with passenger
strand (“star sequence”) degraded swiftly while the lead strand (“mature sequence”) can exert
its effect of downregulating mRNAs. This imbalance of expressed arms is governed by
sequence-specific properties, such as thermodynamics.
45,46 In our experiments, the use of
stabilized miRNA mimics allows testing of the passenger strand for biological relevance, too.
Using our hsa -miR-518 case study, we thus tested if morphological impact was related to
markers of arm dominance. To this end, we linked our results to information from
MirGeneDB47, miRSwitch 48, and miRNATissueAtlas 49. Endogenous expression of miR -518
members is specific to the placenta.48,50 Indeed, we find that the arm predominantly expressed
in the placenta tends to with significant morphological changes (Figure 5D). An exception is
represented by miR-518d, which features lower expression in the placenta. While the 3p arm
contains the shared seed sequence, its 5p arm is annotated as mature. However, our miRNA
experiments show that the seed sequence in its 3p arm still exerts the morphological change.
This suggests that expression patterns of miRNA arms are related to but not fully predictive of
morphological activity. We stress that here we used miRNA mimics which are chemically
stabilized, meaning that hsa -miR-518d-3p is unlikely to exert this effect endogenously.
Instead, the research of miRNA mimics using morphology in this way may reveal miRNAs of
therapeutic use that would go missed when relying on expression patterns as markers of
significance alone.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
9
Figure 5. A shared seed in the hsa -miR-518 miRNAs mediates morphological effects in HEK293T. A: The seed
AAAGCGC is harboured within 5/11 of the miRNAs. All five were considered a hit compared to negative control,
i.e. significantly altered cell morphology, while the other six did not. Clustering the miRNAs by phenotypic similarity
confirms that the five miRNAs induce similar morphological changes (dendrogram). B: Among the top features
affected by the five miRNAs is mitochondrial granularity in cells (“Cells_Granularity_6_ W5”). Horizontal lines in
violins indicate quantiles at 0.25, 0.5 (median), and 0.75. Compared to negative control, the miRNAs have
significantly higher mitochondrial granularity. Letters indicate results from pairwise Wilcoxon rank sum tests.
Groups that do not share a letter are significantly different at p < 0.05 after Benjamini-Hochberg (BH) correction51.
C: Images of DNA, AGP of cells treated with negative control or hsa -miR-518b show minor differences in cell
morphology. The scale bar is 25 μM. D: miR-518 miRNAs morphological changes are related to seed sequence
and endogenous expression patterns. MiR-518a, miR-518c and miR-518f show a consistent pattern of the 3p arm
being predominantly expressed in the placenta , the tissue predominantly expressing miR -518.48,50 MiR-518f
switches from 5p to 3p dominance in the placenta specifically, showcasing its tissue specific activity (1).
Endogenous expression alone is insufficient to predict morphological activity, however, with miR -518d-3p lowly
expressed natively but still showing significant morphological changes in our assay, suggesting that the seed
sequence is sufficient to induce morphological changes . (2 ) miR -518-5p is sometimes detected in deep
sequencing,47 but not annotated in miRBase and was not tested in our experiments. Expression in placenta from
miRNATissueAtlas.49
Trends of how miRNAs influence cellular morphology
Next, we investigate d trends of miRNA seed sequence, annotation confidence, and cell
toxicity. Motivated by results from miR-155-5p and miR-518 mimics (Figure 4 and Figure 5),
we investigated how morphological changes relate to miRNA seed sequence changes across
all miRNAs. Among miRNA hits identified in C2C12, the cell line with the most hits, we found
that similar miRNA seeds tended to yield more similar morphological profiles compared to the
negative control (Figure 6A, Mann-Whitney-U test, p<0.001 after Benjamini-Hochberg (BH)
correction51). Identical seeds and s eeds differing by one nucleotide induce similar
morphologies, but this similarity diminishes as more nucleotide changes are added. This seed
sequence-dependent effect mirrors similar findings for shRNAs.24
Differences in morphological features are negatively correlated with cell counts, indicating that
cytotoxicity is a predictor of morphological change. 52 In our miRNA experiments, we also
A
D
B
C
miR-518f-5p
miR-518e-3p
miR-518c-5p
miR-518d-5p
miR-518a-5p
miR-518e-5p
miR-518a-3p
miR-518f-3p
miR-518b
miR-518d-3p
miR-518c-3p
miRNA
0
2
Seed
AAAGCGC
AAAGCGC
AAAGCGC
AAAGCGC
AAAGCGC
UCUAGAG
UGCAAAG
UCUAGAG
CUCUGGA
AAGCGCU
UCUAGAG
KS statistic
Significance
5%
NS
Hit
KS
High
(0.2)
Low
(0)
Distance
ControlmiR-518b
a
b,c
b
b,c
c,d
d
0 10
Mitochondria granularity
(6) in cells
518c-3p
Control
518d-3p
518b
518f-3p
518a-3p
DNA AGP MitochondriamiRNA
5p
3p
5p
3p
5p
3p
5p
3p
5p
3p
5p
3p
Expression in placenta
log10(RPMM)
Mature
Seed
Expression
Low High
miR−518a-
miR−518b-
miR−518c-
miR−518d-
miR−518e-
miR−518f-
Mature (MirGeneDB)
Mature sequence
Star sequence
Dominantly expressed
(miRSwitch across tissues)
Yes
No
Seed
AAAGCGC
Other
Hit significance
5%
Not significant
Hit
(2)
Dominant
(1)
NA NA
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
10
observed a negative correlation between cell counts and morphological difference to negative
control, although this effect varied by cell line (Figure 6B and Supplementary Figure S7).
However, one may find miRNAs such as miR-19a- 5p escaping this trend by mediating large
morphological changes without altering cell count (Supplementary Figure S7). As observed in
our positive control experiments using cell death siRNAs, cytotoxicity is therefore associated
with morphological changes among surviving cells.
Figure 6. Trends of how miRNAs influence cellular morphology. A: Among C2C12 hits, miRNAs with identical seed
sequence yield more similar morphological changes compared to negative control s. Differences between seed
sequence were quantified with the Hamming distance. Letters indicate results from Mann-Whitney-U tests between
pairs of the 8 groups. Groups labelled with different letters are significantly different at p < 0.001 after BH correction.
B: Cell count (y-axis) is inversely correlated with cell morphological change (p<0.0001), as measured by the KS
statistic (x-axis; Methods). Results are per miRNA and were averaged across all cell lines. C: In C2C12, high
confidence miRNAs were more frequently significantly different (137/801) from controls than non-high confidence
miRNAs (143/1099), a difference which is significant (Chi-square test p<0.05).
We f inally investigated whether well -studied miRNAs tended to mediate stronger
morphological changes. miRBase annotates a subset of miRNAs as “high confidence” based
on experimental evidence, including their processing by Drosha and Dicer proteins.6 Of
miRNAs tested in this experiment, 861 (34%) human and 801 (42%) mouse miRNAs are
annotated as high confidence. In C2C12, 17% (137 of 801) of “high confidence” miRNAs were
a hit in both replicates ( Figure 6C), significantly higher than th is proportion among other
miRNAs (13%; 143 of 1099; Chi-square test, p < 0.05). This indicates that mimics of miRNAs
with experimental evidence are more likely to yield a discernible cellular morphological
response.
Discussion
We comprehensively profiled the morphological effects of 2,565 human and 1, 900 mouse
individual miRNA mimics on single cells for three human and two mouse cell lines. Of these,
227 human and 286 mouse miRNAs had a significant impact on cell morphology (Table 1),
including 18 miRNAs in multiple cell lines ( Figure 3). We envision that this comprehensive
Reference
of single-cell morphological profiles upon miRNA treatment of diverse cell lines will
begin to bridge the current gulf in understanding between miRNA cellular and molecular
function.
Seed sequence-similar miRNAs can mediate similar changes in cell morphology (Figure 4,
Figure 5, Figure 6A). miR -155-5p is expected to induce morphological effects similar to
miRNAs that differ slightly in the seed sequence. This is because about 40% of miR -155-5p
targets are bound with a mismatch or wobble in the seed sequence, most commonly in seed
positions 4 and 6.
53 Nevertheless, it was unexpected that miR -155-5p was more similar to
miR-137-3p than miR-9-3p in affecting HepG2 cell morphology, because miR -137-3p differs
in sequence elsewhere, at seed position 3. For hsa-miR-581 members, we observed a strong
similarity among miRNAs sharing the same seed, confirming the paradigm of the seed
sequence's crucial role.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
11
Our understanding of miRNAs remains limited due to : i) cell type specificity of their
expression,21 ii) cell type specific expression of their target mRNA s, iii) imperfect miRNA
annotation17,18,54 and, iv) inconsistencies in the miRNA literature.2,55 While much research has
sought to quantify miRNA abundance in various tissues, fewer studies have systematically
linked miRNAs to phenotypes. Also, t hose studies that were systematic often focused on
specific outcomes, such as cell proliferation, rather than measuring diverse phenotypes.56–58
Our large-scale single-cell study of miRNA effects on cellular morphological phenotypes
begins to overcome these limitations.
Single-cell analysis accounts for: i) incomplete transfection, because not all cells are
transfected and thus are expected to display altered phenotypes, ii) variability within cell lines,
because averaging profiles obscures dynamic processes such as mitosis ,46 and iii) the
possibility of miRNA-induced subpopulations, for example if miRNAs were to induce cell cycle
arrest. Our single -cell framework for hit calling identifies miRNAs that cause significant
morphological changes, in contrast to previous single -cell research which predicted
mechanism-of-action and dose-dependent phenotypic changes. 46 Further, the framework
reduces batch and plate -position effects through rigorous batch correction and feature
selection, thereby minimising technical artefacts. In these applications, our analysis methods
retain known biological differences, while allowing discovery of morphological changes for
subsets of cells. We anticipate that these single -cell analysis methods, consolidated in the
Python software scmorph ,29 prove useful for other complex morphological profiling
experiments when subpopulation -level changes are likely. Among the most obvious
applications are experiments with multiple cell lines,59,60 differentiating cells,61 and transfected
treatments (as in this work). In these applications, our analysis methods retain known
biological differences, while allowing discovery of morphological changes of a subset of cells.
Mullokandov et al. found that only the most highly expressed miRNAs suppressed their target
genes.62 In contrast, our findings show that, even at lower concentrations, hundreds of
miRNAs induce morphological change. This may be attributable to miRNAs conveying small
but coordinated effects on many mRNA targets, or to them regulating the abundance of
transcription factors, thereby changing the transcription landscape more widely .14,63 Further,
we showed that cytotoxic miRNAs induce stronger morphological shifts, mirroring similar
Results
in small molecule screens (Figure 6B).60 This suggests that miRNAs may mediate
morphological and cytotoxic effects at relatively modest concentrations.
Overall, across diverse cell lines, most miRNAs do not yield significant morphological
changes. This is consistent with results that over 60% of miRNAs do not discernibly suppress
their mRNA targets, as measured with a reporter gene linked to miRNA target sites. 62
Compared to simply counting cells, we identified miRNAs that alter cell morphology without
causing cytotoxicity (Supplementary Figure S7). Our approach provides several advantages
over previous large-scale studies of miRNA activity that focused on single cell lines, such as
HeLa,56 or on a specific outcome, most often cell proliferation or viability,56–58 even when using
high-content microscopy.64,65 Instead, we identified significant morphological changes across
five cell lines, including miRNAs that did not impact cell viability but still changed cell
morphology. This also highlighted that cell lines exhibit different sensitivity to perturbations, in
line with similar findings made by others.33,66
Our approach is limited by only considering miRNAs to be significant by their induction of
measured morphological changes, disregarding other cellular effects and molecular changes
such as mRNA levels which could provide complementary information. 67,68 Ideally, RNA and
morphology of the same cell could be quantified, yet this remains experimentally challenging.
A complement to this study would be a morphological profiling experiment of miRNA
knockouts, which is achievable due to recent advances. 69,70 An additional limitation is that
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
12
during hit calling we prioritised true positives over false negatives through stringent
significance testing, leading us to underestimate the true number of cell morphology-
associated miRNAs. Lastly, we performed the Cell Painting assay after 24h to avoid overly
dense growth of cells in the microwells, which could have masked morphological changes.
Later time points (e.g. 48-72h) would likely lead to stronger effects.31,32 The number of miRNAs
with a significant effect on morphology at later time points would therefore be expected to be
higher than found here.
This study provides a resource to help assess whether a particular miRNA affects cell
morphology in any one of 5 cell lines (Supplementary Table 4 ) and which features it affects
most (Supplementary Table 5). For more systematic miRNA research, our whole single- cell
dataset can be intersected with external information on miRNAs, such as their effect on the
transcriptome, or with data on miRNA-binding dynamics of target mRNAs.71 In summary, we
anticipate that this study will serve as a starting point for future systematic characterization s
of miRNAs.
Methods
miRNA Mimic Library
A mirVana miRNA mimic library containing all human and all mouse miRNAs (miRBase v21)
was obtained (ThermoFisher, 4464074, 0.25 nmol). This encompassed 2, 565 human and
1,900 mouse miRNA mimics (called “miRNAs” throughout for brevity). Handling of the miRNA
library used an automated liquid handler (Biomek FX). The library was prepared first as a 5
μM stock in DNase/RNase free dH 20, before creating intermediate plates with 2 μL miRNA
stock in 59 μL dH 20. miRNAs for transfections were taken from intermediate p lates. miRNA
sequences were obtained from miRBase (v21) and seed sequences were defined as positions
2-8.
Cell Culture
All cell lines were grown in DMEM Glutamax (Gibco, 31966021) supplemented with 10% fetal
calf serum, incubated at 37°C with 5% CO2. Cell numbers were optimised for 384-well optical
bottomed imaging plates (Greiner Bio-One, 781090). Prior to cell seeding, imaging plates were
coated with Poly-L-Lysine (Millipore, A-005-C) to ensure adherence of cells. Numbers of cells
seeded per well were: 1600 cells/well for HEK293T, 1500 cells/well for SHSY -5Y, 1900
cells/well for HepG2, 1000 cells/well for N2a and 900 cells/well for C2C12.
Cell Transfection
Cells were transfected with the miRNA mimics via reverse transfection. For each cell line a
transfection mix was made from an optimised concentration of Lipofectamine RNAiMAX
(ThermoFisher, 13778150) in Optimem (Gibco, 31985047) , using 12 μL RNAiMAX per 1 ml
Optimem for HEK293T, 18 μL for SH-SY5Y and C2C12, and 24 μL for HepG2 and N2a. 5 μL
of this transfection mix was added to each well of the 384 -well imaging plates. 2 μL of the
miRNA mimic stock was then added into the transfection mix and the plates vortexed. The
plates were left to incubate at room temperature for 30 min before addition of 25 μL of cells in
culture medium for a final miRNA mimic concentration of 10 nM. Note that the final intracellular
concentration of functional miRNAs depends on transfection efficiency and processing of
vesicles and will therefore vary across cell lines and conditions
72. Plates were transfected in
batches of up to 4 plates to ensure the timing of cell addition was optimal for transfection
efficiency. Cells were incubated for 24 hours prior to fixing.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
13
Experimental Design
The miRNA library was profiled in three human (HEK293T, SH -SY5Y and HepG2) and two
mouse cell lines (C2C12 and N2a). Within each cell line, each miRNA was profiled in two wells
(same plate, separate well) and two replicates (separate experimental batch and plate). During
analyses, the two wells on the same plate were pooled. miRNAs were split over 15 plates for
human and 11 plates for mouse cell lines (cf. Figure 1B for plate layout). Each plate included
multiple wells each of three negative and two positive controls: DMSO only, DMSO with
transfection mix, and DMSO with negative control mimic (i.e. scrambled miRNA, Thermofisher,
4464058) for negative controls, and AllStars HS Cell Death siRNA (Qiagen, 1027299) and 5
nM rotenone for positive controls. The positive controls were chosen to investigate transfection
efficiency and changes to mitochondrial networks, respectively. Throughout the study, the
main negative control referenced was scrambled mimics because all three negative controls
yielded similar separation from positive controls (see Results).
Cell Staining and Imaging
We followed the Cell Painting protocol as described by Bray et al. (2016)23, with changes only
to the mitochondrial staining procedure. Briefly, cells were fixed 24 h after transfection of
miRNA mimics by addition of an equal volume of 9% formaldehyde (Merck, 437533W) to give
a final concentration of 4% formaldehyde. Cells were incubated in this solution for 30 minutes
at room temperature and then washed twice with PBS. Cells were then permeabilised for 20
minutes at room temperature with 50 μL per well of 0.1% Triton-X in a solution of 1% bovine
serum albumin (BSA) (Sigma, A7030) in PBS. A Cell Painting solution was made up with dyes
as listed in Table 2 in 1% BSA/PBS. Of this solution, 20 μL was added to each well and
incubated away from light for 30 minutes. Wells were then washed three times with PBS and
the plates sealed and stored in the dark at 4°C until imaging.
Imaging was carried out on an ImageXpress micro XL (Molecular Devices, USA) equipped
with a robotic plate loader (Scara4, PAA, UK). Four images were captured per well with 20x
magnification and 5 fluorescent channels (see Table 2). Exposure times were optimised
separately for each cell line but kept consistent across the plates and replicates for each cell
line.
Stain Target Wavelength
(ex/em [nm])
Channel Assay
concentration
[μL/mL]
Cat. No.
Hoechst 33342 Nuclei 387/447 DAPI 0.2 H3570
Phalloidin 594 F-actin 531/593 TxRED 0.14 A12381
Wheat germ
agglutinin
Golgi and
plasma
membrane
562/624 TxRED 1 W11262
Concanavalin A
Alexa Fluor 488
Endoplasmic
reticulum
562/624 FITC 4 C11252
SYTO 14 Green
fluorescent nucleic
acid stain
RNA, nucleoli 462/520 SYTO14 0.6 S7576
MitoTracker
DeepRed
Mitochondria 628/692 CY5 0.6 M22426
Table 2. Fluorescent stains used for Cell Painting, cellular targets and imaging channels. Wavelength ex/em =
excitation and emission, respectively.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
14
Cell Segmentation
To detect cells from the 205,824 images we developed an approach based on CellPose ,73 a
deep-learning model for cell and nuclei segmentation, that accounts for diverse cell shapes,
multinucleation and segmentation errors. Briefly, it first detects cell boundaries from the TxRed
channel and in a second pass detects nuclei within these cells. More specifically, cell
boundaries were detected from images with the cyto2 CellPose model, with expected sizes
listed in Table 3. Cells touching image boundaries were removed from further analysis. To
detect nuclei within the cell boundaries, single- cell crops were created with pixels not
belonging to the targeted cell set to the background pixel value 0. Crops were padded with a
20 px boundary on each side to ensure the presence of sufficient background for CellPose
auto-normalization. Another round of segmentation of the DAPI channel to detect nuclei using
the cyto CellPose model was then conducted for each cell crop. The resulting nuclei
predictions were trimmed to ensure that detected nuclei f ell strictly within the detected cell
boundary. Note that this method may detect more than one nucleus per cell, as is the case for
multinucleated, mis-segmented and mitotic cells. In this study, only mono-nucleated cells were
analysed, because multinucleated cells tended to be either mis-segmented or M-phase cells
prior to cytoplasmic division . Nuclei masks were then gathered across crops to match the
original image dimensions. Lastly, cytoplasmic masks were created that represent cell masks
with the nuclear portion removed. These and other analytical steps until and including feature
extraction were facilitated by a Nextflow pipeline found at
https://github.com/edbiomedai/cellextract. Together, these steps yield a cell, nuclear, and
cytoplasm mask per image.
Expected size (μM) HEK293T HepG2 SH-SY5Y C2C12 N2a
Cells 17 30 17 34 17
Nuclei 10 15 10 17 10
Table 3. CellPose size parameters during cell and nuclei segmentation.
Image correction and QC
Fluorescent microscopy images frequently contend with uneven illumination within and across
images25. While the employed microscope integrated field-of-view illumination correction, we
noticed that, across images, background intensities varied, possibly due to minor differences
in focus. We aimed to remove the average background intensity from all pixels to ensure equal
measurements across all images. To this end, per image and channel, the 25 th percentile of
Background
pixels was subtracted from all pixels in the image. We chose the 25 th percentile
because background detection relies on good cell segmentation. Choosing a higher percentile
might therefore risk the inclusion of incorrectly segmented foreground when estimating
Background
intensities. After this correction step, pixels with negative intensities were set to
zero intensity, indicating background. Next, 78 image QC features were computed using
CellProfiler’s MeasureImageQuality module, a description of which can be found in the
CellProfiler manual
74 and in Bray et al. (2012).75 Features included object counts, descriptors
of image-level intensities and measures of focus. These well-established metrics can capture
a variety of quality issues, including precipitate and air bubbles.
Image QC was then performed using a kNN-based metric. The underlying assumption of this
approach was that images of low quality will result in outlier QC features (measured above)
and that detection of such outliers can therefore remove these images. Specifically, image QC
was performed per replicate by centre-scaling image QC features and computing the first three
principal components (PCs) from the 78 QC features. For each data point, i.e. image, in this
PC space, we then measured the distance to the 10
th nearest neighbour, arguing that images
of lower quality will have a lower neighbourhood density and thus larger distance to
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
15
neighbouring images. The image QC distance was defined as 𝐷𝐷!" =
#(%,%!")
√) , where 𝑑𝑑(𝑝𝑝, 𝑝𝑝*+)
is the Euclidean distance of an image 𝑝𝑝 to its 10th nearest neighbour, 𝑝𝑝*+ and M is the number
of PCs the distance 𝑑𝑑 is measured in. Images with a distance greater than 0.1 (arbitrary units)
were flagged and removed from further analysis. This QC step removed ~1.1M cells ( ~5%,
Supplementary Figure S1). Without performing the image QC step, the number of significant
hit miRNAs changes from 227 (with QC) to 334 (no QC) in human and from 286 (QC) to 263
(no QC) in mouse cell lines.
Single-cell feature extraction
Single-cell features were computed from images with matched nuclei, cell and cytoplasm
mask using the scikit -learn and skimage libraries, with granularity features computed via
CellProfiler. Features describing cell size and shape, channel intensities, distribution, texture
and granularity were computed for 1 ,386 features per cell. Features missing valid
measurements in >90% of cells were removed from analysis, leaving 1 ,377 features
(Supplementary Figure S1, Supplementary Table 6). We thus derived features for 22,999,877
cells, exporting results in AnnData format.76
Batch correction
To assess the magnitude of batch and treatment effects of cell death siRNAs in HEK293T (as
in Figure S2B), we tested the proportion of variance in position on PC1 explained by treatment
and batch (plate number) using ANOVA. The η² statistic was used to measure the variance
explained by treatment and batch.
We used a linear model, which maintains interpretability of features, to reduce inter -plate
variability (hereafter termed batch effects). Following Cole et al. 30, who developed a linear
model that can remove nested batch effects (equation 3 in Cole et al.30), we computed the
average batch effect per feature 𝑖𝑖 as
𝐸𝐸[𝑌𝑌|𝑉𝑉, 𝑈𝑈] = 𝛼𝛼, + 𝛽𝛽, + 𝛾𝛾, (Equation 1)
where 𝑉𝑉 and 𝛽𝛽 are the model matrix and coefficient of known factors of wanted variation, e.g.
cell line differences, 𝑈𝑈 and γ are the model matrix and coefficient of sources of unwanted
variation such as batch effects, and α is an intercept. Since the model is built across all single
cells 𝑛𝑛, the matrices 𝑌𝑌, 𝑉𝑉 and 𝑈𝑈 all have 𝑛𝑛 rows and respectively 𝐽𝐽, 𝑀𝑀, and 𝐻𝐻 columns,
containing one-hot encoded information of the covariates. This model is fit to negative control
cells across batches, which allows identification of the batch effect γ. Subsequently, batch
correction is performed for all single-cell profiles, treated or not, by removing the average batch
effect 𝑋𝑋<, = 𝑋𝑋, − 𝛾𝛾,. Note that the nested model can maintain differences between cell lines in
the form of the 𝛽𝛽 coefficient. However, we did not make use of this option in the present study,
because we did not directly compare cell morphologies between cell lines. Instead, we
performed batch correction for each cell line and replicate experiment separately
(Supplementary Figure S1). This method and others used for downstream analysis were
implemented in scmorph, our single-cell morphological profiling package in Python.29
Removing plate map effects
After batch correction, we observed considerable association of features with well positions
within a plate. We tested each feature for association with column or row position as well as
plate ID using a Kruskal-Wallis H -test per replicate and cell line. Note that this test was
performed on all wells including treated wells, because negative controls were not randomly
dispersed within plates. This means that features may incorrectly be associated with technical
variability when strong biological effects are associated with e.g. column position. This is
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
16
mitigated by testing across all plates in a replicate experiment, making individual treatments
less likely to have undue influence on the statistic. For each covariate 𝑗𝑗 (column, row, or plate),
we compute the 𝐻𝐻 statistic for each feature 𝑋𝑋,, yielding a test statistic 𝐻𝐻,- for each such
combination. The collection of all test statistics 𝐻𝐻,- across all features 𝑖𝑖 is referred to as 𝐻𝐻??⃑-.
Then, we retain only features that satisfy
𝐻𝐻,- < 𝑚𝑚𝑚𝑚𝑑𝑑D𝐻𝐻??⃑-E + 𝑀𝑀𝑀𝑀𝐷𝐷(𝐻𝐻??⃑-) (Equation 2)
in each cell line and replicate, where 𝑚𝑚𝑚𝑚𝑑𝑑 i
s the median and 𝑀𝑀𝑀𝑀𝐷𝐷 i
s the median absolute
deviation. Across all cell lines and replicates, 142 features never exceeded this threshold and
were therefore retained for further analysis (Supplementary Table 6, Supplementary Figure
S1). Features related to MitoTracker and SYTO 14 Green were more often removed than
features from other dyes, highlighting potential issues with experimental procedures (Figure
7). This is in line with previous observations that measures of fluorescence , including SYTO
14, are more often affected by positional effects.77 Only these 142 features not associated with
confounders were retained in further analyses.
Figure 7. For each feature, represented by a point, the Kruskal- Wallis (KW) test of association with known
covariates in HEK293T replicate experiment 1, including plate positions (row, column, left panel) and batch effects
(PlateID, right panel y-axis) was computed. MitoTracker and SYTO 14 were more often associated with technical
effects. Features measuring cell shape and size were the least frequently di scarded. Dashed lines indicate the
cutoff for each covariate above which features were discarded.
Hit calling
To identify treatments that change morphology compared to a negative control, a PCA-based
hit calling pipeline was devised. Hit calling was performed strictly per plate to reduce any
residual impact of inter- plate differences. First, for each plate, the cell × feature matrix of
morphological profiles was standardized (z-scored) and used to compute a PCA, retaining the
first 10 PCs (Supplementary Figure S8). Second, the centroid of negative control cells in each
PC was obtained. Third, the control cells’ covariance matrix in PCA space was estimated.
Fourth, using the inverse of this covariance matrix, the Mahalanobis distance from each cell
to the centroid was computed as:
𝐷𝐷
). /(𝑥𝑥 ⃑,, 𝐶𝐶⃑) = I(𝑥𝑥 ⃑, − 𝐶𝐶⃑)0Σ1*(𝑥𝑥 ⃑, − 𝐶𝐶⃑) (Equation 3)
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
17
Here 𝑥𝑥 ⃑, and 𝐶𝐶⃑ are the PCA -coordinate vector of the 𝑖𝑖th cell and negative control centroid,
respectively, and Σ1* is the inverse covariance matrix estimated from the PC coordinates of
negative control cells. Fifth, for each treatment including negative control, all single -cell
distances to the centroid were used to build an empirical cumulative distribution. Sixth, the
Kolmogorov–Smirnov (KS) statistic between ECDF from a treatment and the negative control
ECDF was computed. To assess statistical significance of observed morphological changes,
a null distribution was created by repeating steps two to six, retaining only negative control
cells and defining wells as treatments. By using each negative control well as a reference well
and computing the well -to-well KS statistic, an estimate of the heterogeneity present within
negative control cells was created. False discovery rate (FDR) was controlled per plate by
defining the 5
th percentile of p -values of KS tests from the null distribution as significance
threshold. Treatments were considered hits if the p-value of the KS test comparing the ECDF
of treated cells to negative control cells was lower than this threshold in both experimental
replicates. This pipeline was applied to each plate independently to further reduce the impact
of residual batch effects, with results being aggregated on the level of KS statistics after
controlling for false-discovery rate using the Benjamini-Hochberg method.
51
Sensitivity of hit calling depends on population size
We use the KS statistic and its p -value to assess significance of morphological changes. As
described above, we create a background distribution by performing pairwise tests between
negative control wells. The population size in these tests is the number of cells in each well,
limiting the lower bound of p-values achievable in the background distribution (Figure 8). When
testing treatments, wells on the same plate are pooled, thus increasing the population size
and increasing power. This means that a treatment with a KS statistic of e.g. 0.2 may be
considered significant even if a KS statistic of 0.25 was considered not significant in the
Background
distribution. However, the lower bound of the p-value asymptotically approaches
zero with sample sizes below the number of cells typically observed per well in negative
controls (320-810 for SH -SY5Y and HepG2, respectively) (Figure 8). Given these sample
sizes, it is unlikely that the lower bound of the p -values is restricting our power to detect
differences between control wells and we can therefore assume that our null distribution
accurately captures morphological heterogeneity naturally present in these experiments. That
said, at an expected KS statistic of 0.2, sample size impacts p-values by orders of magnitude
(Figure 9). It should therefore be assumed that the realised FDR is higher than the intended
α. To mitigate this problem, we additionally require treatments to be considered significant in
both replicates to be termed a hit.
Figure 8. The lower bound of the p-value of a two-sample two-sided KS test is determined by the sample sizes of
both samples, n and m.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
18
Figure 9. Sensitivity analysis of the KS test p-values, demonstrating that sample sizes of the two samples (n and
m) impact expected p-value by orders of magnitude.
Morphological profile clustering
The outlined hit calling methodology operates on PCs and is therefore not readily interpretable.
An orthogonal analysis therefore compared morphological profiles by computing the per -
treatment and per -feature t -statistic between treated and negative control cells. For each
treatment, cell line and replicate, a vector of 142 t-statistics describing per-feature differences
was thus derived. When comparing treatments, as in Figure 4B, the Spearman correlation
coefficient between two treatments’ t -statistic vectors was used as a descriptor of similarity.
Hierarchical clustering with average linkage and Euclidean distance was then used to cluster
treatments.
miRNA target enrichment
Predicted miRNA targets were obtained from TarBase 78 and filtered for interaction scores
greater than 0.7. Only expressed mRNA targets were considered, which were defined as
genes with at least 1 FPKM expression in baseline expression data. Baseline gene expression
data for HepG2 was obtained from Gene Expression Omnibus with accession number
GSM3478960. Ontology enrichment of targets was tested using g:Profiler ,
79 using as
Background
all expressed genes (>=1 FPKM) and with KEGG and all GO terms as sources.
Funding and Acknowledgments
We thank John C. Dawson for assistance in experimental procedures and Neil O. Carragher
for providing reagents and equipment. JW is supported by an MRC PhD studentship (grant
no. MC_ST_00035). This work was also funded by the Wellcome Trust (grant no.
106956/Z/15/Z). For the purpose of open access, the author has applied a Creative Commons
Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this
submission.
Bibliography
1. Bartel, D. P. MicroRNAs: Genomics, Biogenesis, Mechanism, and Function. Cell 116,
281–297 (2004).
2. Kilikevicius, A., Meister, G. & Corey, D. R. Reexamining assumptions about miRNA-
guided gene silencing. Nucleic Acids Res. 50, 617–634 (2022).
3. Shang, R., Lee, S., Senavirathne, G. & Lai, E. C. microRNAs in action: biogenesis,
function and regulation. Nat. Rev. Genet. 24, 816–833 (2023).
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
19
4. Valencia-Sanchez, M. A., Liu, J., Hannon, G. J. & Parker, R. Control of translation and
mRNA degradation by miRNAs and siRNAs. Genes Dev. 20, 515–524 (2006).
5. Friedman, R. C., Farh, K. K.-H., Burge, C. B. & Bartel, D. P. Most mammalian mRNAs
are conserved targets of microRNAs. Genome Res. 19, 92–105 (2009).
6. Kozomara, A., Birgaoanu, M. & Griffiths-Jones, S. miRBase: from microRNA sequences
to function. Nucleic Acids Res. 47, D155–D162 (2019).
7. Kent, O. A. & Mendell, J. T. A small piece in the cancer puzzle: microRNAs as tumor
suppressors and oncogenes. Oncogene 25, 6188–6196 (2006).
8. Mendell, J. T. & Olson, E. N. MicroRNAs in Stress Signaling and Human Disease. Cell
148, 1172–1187 (2012).
9. Zhang, P. et al. MiR-155 Is a Liposarcoma Oncogene That Targets Casein Kinase-1α
and Enhances β-Catenin Signaling. Cancer Res. 72, 1751–1762 (2012).
10. Wang, L. et al. A MicroRNA Linking Human Positive Selection and Metabolic Disorders.
Cell 183, 684-701.e14 (2020).
11. Rottiers, V. & Näär, A. M. MicroRNAs in metabolism and metabolic disorders. Nat. Rev.
Mol. Cell Biol. 13, 239–250 (2012).
12. Park, C. Y., Choi, Y. S. & McManus, M. T. Analysis of microRNA knockouts in mice.
Hum. Mol. Genet. 19, (2010).
13. Naeini, M. M. & Ardekani, A. M. Noncoding RNAs and Cancer. Avicenna J. Med.
Biotechnol. 1, 55–70 (2009).
14. Smillie, C. L., Sirey, T. & Ponting, C. P. Complexities of post-transcriptional regulation
and the modeling of ceRNA crosstalk. Crit. Rev. Biochem. Mol. Biol. 53, 231–245
(2018).
15. Fischer, S., Handrick, R., Aschrafi, A. & Otte, K. Unveiling the principle of microRNA-
mediated redundancy in cellular pathway regulation. RNA Biol. 12, 238–247 (2015).
16. DeVeale, B., Swindlehurst-Chan, J. & Blelloch, R. The roles of microRNAs in mouse
development. Nat. Rev. Genet. 22, 307–323 (2021).
17. Alles, J. et al. An estimate of the total number of true human miRNAs. Nucleic Acids
Res. 47, 3353–3364 (2019).
18. Kim, K. et al. A quantitative map of human primary microRNA processing sites. Mol. Cell
81, 3422-3439.e11 (2021).
19. Byrne, J. A. et al. Protection of the human gene research literature from contract
cheating organizations known as research paper mills. Nucleic Acids Res. 50, 12058–
12070 (2022).
20. Zhou, B. et al. EVLncRNAs 2.0: an updated database of manually curated functional
long non-coding RNAs validated by low-throughput experiments. Nucleic Acids Res. 49,
D86–D91 (2021).
21. Nowakowski, T. J. et al. Regulation of cell-type-specific transcriptomes by microRNA
networks during human brain development. Nat. Neurosci. 21, 1784–1792 (2018).
22. Gustafsdottir, S. M. et al. Multiplex Cytological Profiling Assay to Measure Diverse
Cellular States. PLOS ONE 8, e80999 (2013).
23. Bray, M.-A. et al. Cell Painting, a high-content image-based assay for morphological
profiling using multiplexed fluorescent dyes. Nat. Protoc. 11, 1757–1774 (2016).
24. Singh, S. et al. Morphological Profiles of RNAi-Induced Gene Knockdown Are Highly
Reproducible but Dominated by Seed Effects. PLOS ONE 10, e0131370 (2015).
25. Caicedo, J. C. et al. Data-analysis strategies for image-based cell profiling. Nat. Methods
14, 849–863 (2017).
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
20
26. Stossi, F. et al. SPACe: An open-source, single-cell analysis of Cell Painting data. Nat.
Commun. 15, 10170 (2024).
27. Serrano, E. et al. Progress and new challenges in image-based profiling. Preprint at
https://doi.org/10.48550/arXiv.2508.05800 (2025).
28. Arevalo, J. et al. Evaluating batch correction methods for image-based cell profiling. Nat.
Commun. 15, 6516 (2024).
29. Wagner, J., Warden, H., Khamseh, A. & Beentjes, S. V. scmorph: Single-cell
morphological analysis. J. Open Source Softw. https://doi.org/10.21105/joss.08324
(2025) doi:10.21105/joss.08324.
30. Cole, M. B. et al. Performance Assessment and Selection of Normalization Procedures
for Single-Cell RNA-Seq. Cell Syst. 8, 315-328.e8 (2019).
31. Forsgren, E., Cloarec, O., Jonsson, P., Lovell, G. & Trygg, J. A scalable, data analytics
workflow for image-based morphological profiles. Chemom. Intell. Lab. Syst. 254,
105232 (2024).
32. Sivagurunathan, S. et al. Alternate dyes for image-based profiling assays. Preprint at
https://doi.org/10.1101/2025.02.19.639058 (2025).
33. Wolff, C. et al. Morphological profiling data resource enables prediction of chemical
compound properties. iScience 28, (2025).
34. Pahl, A. et al. Morphological subprofile analysis for bioactivity annotation of small
molecules. Cell Chem. Biol. 30, 839-853.e7 (2023).
35. Li, N. et al. Mitochondrial complex I inhibitor rotenone induces apoptosis through
enhancing mitochondrial reactive oxygen species production. J. Biol. Chem. 278, 8516–
8525 (2003).
36. O’Connell, R. M. et al. MicroRNA-155 Promotes Autoimmune Inflammation by
Enhancing Inflammatory T Cell Development. Immunity 33, 607–619 (2010).
37. Bayraktar, R. & Van Roosbroeck, K. miR-155 in cancer drug resistance and as target for
miRNA-based therapeutics. Cancer Metastasis Rev. 37, 33–44 (2018).
38. Rodriguez, A. et al. Requirement of bic/microRNA-155 for Normal Immune Function.
Science 316, 608–611 (2007).
39. Mahesh, G. & Biswas, R. MicroRNA-155: A Master Regulator of Inflammation. J.
Interferon CYTOKINE Res. 39, 321–330 (2019).
40. Buenemann, C. L., Willy, C., Buchmann, A., Schmiechen, A. & Schwarz, M.
Transforming growth factor-β1-induced Smad signaling, cell-cycle arrest and apoptosis
in hepatoma cells. Carcinogenesis 22, 447–452 (2001).
41. Xu, N., Hurtig, M., Zhang, X.-Y., Ye, Q. & Nilsson-Ehle, P. Transforming growth factor-
beta down-regulates apolipoprotein M in HepG2 cells. Biochim. Biophys. Acta BBA -
Mol. Cell Biol. Lipids 1683, 33–37 (2004).
42. Busso, N., Chesne, C., Delers, F., Morel, F. & Guillouzo, A. Transforming growth-factor-
β (TGF-β) inhibits albumin synthesis in normal human hepatocytes and in hepatoma
HepG2 cells. Biochem. Biophys. Res. Commun. 171, 647–654 (1990).
43. Rai, D., Kim, S.-W., McKeller, M. R., Dahia, P. L. M. & Aguiar, R. C. T. Targeting of
SMAD5 links microRNA-155 to the TGF-β pathway and lymphomagenesis. Proc. Natl.
Acad. Sci. 107, 3111–3116 (2010).
44. Narita, M. et al. Chronic treatment of non-small-cell lung cancer cells with gefitinib leads
to an epigenetic loss of epithelial properties associated with reductions in microRNA-155
and -200c. PLOS ONE 12, e0172115 (2017).
45. Meijer, H. A., Smith, E. M. & Bushell, M. Regulation of miRNA strand selection: follow
the leader? Biochem. Soc. Trans. 42, 1135–1140 (2014).
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
21
46. Hutvagner, G. Small RNA asymmetry in RNAi: Function in RISC assembly and gene
regulation. FEBS Lett. 579, 5850–5857 (2005).
47. Clarke, A. W. et al. MirGeneDB 3.0: improved taxonomic sampling, uniform
nomenclature of novel conserved microRNA families and updated covariance models.
Nucleic Acids Res. 53, D116–D128 (2025).
48. Kern, F. et al. miRSwitch: detecting microRNA arm shift and switch events. Nucleic
Acids Res. 48, W268–W274 (2020).
49. Rishik, S., Hirsch, P., Grandke, F., Fehlmann, T. & Keller, A. miRNATissueAtlas 2025:
an update to the uniformly processed and annotated human and mouse non-coding
RNA tissue atlas. Nucleic Acids Res. 53, D129–D137 (2025).
50. Zhang, M., Muralimanoharan, S., Wortman, A. C. & Mendelson, C. R. Primate-specific
miR-515 family members inhibit key genes in human trophoblast differentiation and are
upregulated in preeclampsia. Proc. Natl. Acad. Sci. 113, E7069–E7076 (2016).
51. Benjamini, Y. & Hochberg, Y. Controlling the False Discovery Rate: A Practical and
Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B Methodol. 57, 289–300
(1995).
52. Seal, S. et al. Small molecule bioactivity benchmarks are often well-predicted by
counting cells. 2025.04.27.650853 Preprint at https://doi.org/10.1101/2025.04.27.650853
(2025).
53. Loeb, G. B. et al. Transcriptome-wide miR-155 Binding Map Reveals Widespread
Noncanonical MicroRNA Targeting. Mol. Cell 48, 760–770 (2012).
54. Fromm, B. et al. A Uniform System for the Annotation of Vertebrate microRNA Genes
and the Evolution of the Human microRNAome. Annu. Rev. Genet. 49, 213–242 (2015).
55. Fromm, B. et al. Quo vadis microRNAs? Trends Genet. 36, 461–463 (2020).
56. Eulalio, A. & Mano, M. MicroRNA Screening and the Quest for Biologically Relevant
Targets. J. Biomol. Screen. 20, 1003–1017 (2015).
57. Kurata, J. S. & Lin, R.-J. MicroRNA-focused CRISPR-Cas9 library screen reveals
fitness-associated miRNAs. RNA 24, 966–981 (2018).
58. Renikunta, H. V. et al. Large-scale microRNA functional high-throughput screening
identifies miR-515-3p and miR-519e-3p as inducers of human cardiomyocyte
proliferation. iScience 26, 106593 (2023).
59. Boyd, J. C., Pinheiro, A., Del Nery, E., Reyal, F. & Walter, T. Domain-invariant features
for mechanism of action prediction in a multi-cell-line drug screen. Bioinformatics 36,
1607–1613 (2020).
60. Elliott, R. J. R. et al. A comprehensive pharmacological survey across heterogeneous
patient-derived GBM stem cell models. 2024.11.27.625719 Preprint at
https://doi.org/10.1101/2024.11.27.625719 (2024).
61. Graham, R. E. et al. Single-cell morphological tracking of cell states to identify small-
molecule modulators of liver differentiation. iScience 28, 111871 (2025).
62. Mullokandov, G. et al. High-throughput assessment of microRNA activity and function
using microRNA sensor and decoy libraries. Nat. Methods 9, 840–846 (2012).
63. Cloonan, N. Re-thinking miRNA-mRNA interactions: Intertwining issues confound target
discovery. BioEssays 37, 379–388 (2015).
64. Rodrigues Lopes, I., Silva, R. J., Caramelo, I., Eulalio, A. & Mano, M. Shedding light on
microRNA function via microscopy-based screening. Methods 152, 55–64 (2019).
65. Boissinot, M. et al. Profiling cytotoxic microRNAs in pediatric and adult glioblastoma cells
by high-content screening, identification, and validation of miR-1300. Oncogene 39,
5292–5306 (2020).
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
22
66. Willis, C., Nyffeler, J. & Harrill, J. Phenotypic Profiling of Reference Chemicals across
Biologically Diverse Cell Types Using the Cell Painting Assay. SLAS Discov. 25, 755–
769 (2020).
67. Way, G. P. et al. Morphology and gene expression profiling provide complementary
information for mapping cell state. Cell Syst. 13, 911-923.e9 (2022).
68. Nyffeler, J. et al. Combining phenotypic profiling and targeted RNA-Seq reveals linkages
between transcriptional perturbations and chemical effects on cell morphology: Retinoic
acid as an example. Toxicol. Appl. Pharmacol. 444, 116032 (2022).
69. Sivanandan, S. et al. A Pooled Cell Painting CRISPR Screening Platform Enables de
Novo Inference of Gene Function by Self-supervised Deep Learning. Preprint at
https://doi.org/10.1101/2023.08.13.553051 (2023).
70. Ramezani, M. et al. A genome-wide atlas of human cell morphology. Nat. Methods 22,
621–633 (2025).
71. McGeary, S. E. et al. The biochemical basis of microRNA targeting efficacy. Science
366, eaav1741 (2019).
72. Thomson, D. W., Bracken, C. P., Szubert, J. M. & Goodall, G. J. On Measuring miRNAs
after Transient Transfection of Mimics or Antisense Inhibitors. PLOS ONE 8, e55214
(2013).
73. Pachitariu, M. & Stringer, C. Cellpose 2.0: How to train your own model. Nat. Methods
19, 1634–1641 (2022).
74. Stirling, D. R. et al. CellProfiler 4: Improvements in speed, utility and usability. BMC
Bioinformatics 22, 433 (2021).
75. Bray, M.-A., Fraser, A. N., Hasaka, T. P. & Carpenter, A. E. Workflow and Metrics for
Image Quality Control in Large-Scale High-Content Screens. SLAS Discov. 17, 266–274
(2012).
76. Virshup, I., Rybakov, S., Theis, F. J., Angerer, P. & Wolf, F. A. Anndata: Access and
store annotated data matrices. J. Open Source Softw. 9, 4371 (2024).
77. Pearson, Y. E. et al. A statistical framework for high-content phenotypic profiling using
cellular feature distributions. Commun. Biol. 5, 1–15 (2022).
78. Skoufos, G. et al. TarBase-v9.0 extends experimentally supported miRNA–gene
interactions to cell-types and virally encoded miRNAs. Nucleic Acids Res. 52, D304–
D310 (2024).
79. Kolberg, L. et al. g:Profiler—interoperable web service for functional enrichment analysis
and gene identifier mapping (2023 update). Nucleic Acids Res. 51, W207–W212 (2023).
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
23
Supplementary Figures
Figure S1. Pre-processing steps of single-cell morphological profiles (see Methods for details). A: Measurements
with missing values were removed. B: Image-level features were used for quality control to remove artefacts. Gray
lines indicate cutoff beyond which images were discarded due to quality concerns and percentages in title refer to
percent of images retained after quality control. C: ~5% of cells that were in images failing QC were removed,
leaving 23M single-cell profiles. D: Batch correction was conducted with a linear model that removed the average
between-plate effect per cell line and replicate. E: Kruskal-Wallis statistics were used to identify features affected
by plate position or residual batch effect. F: Before step E (feature selection), pairwise tests of negative control
wells yielded p-values that are heavily skewed towards zero, indicating false positives. After feature selection, the
skew was reduced, although not fully removed. To ensure a well-controlled false discovery rate, we used these p-
values as a null distribution (see Methods). G: Together, these steps quality -controlled single-cell morphological
profiles, leaving 23M cells and 142 features for analysis (see also Supplementary Table 6).
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
24
Figure S2. Choice of negative control has only a minor influence on effect size compared to positive controls.
Choosing either of the three negative controls as reference and comparing the two positive controls against them
shows that the KS statistic, quantifying morphological changes, varies only modestly. Points are mean averages
across all plates and experimental replicates; error bars are mean ± standard deviation. Using only DMSO as
negative control is associated with slightly larger standard deviation, possibly because there were only four DMSO-
only wells per plate, as opposed to eight wells of DMSO with transfection reagent (DMSO+TR) or with additional
scrambled miRNA.
Figure S3. A subset of miRNAs is considered significant compared to two negative controls (intersection) –
DMSO + transfection reagent (left circle) and scrambled miRNAs (right circle). In cell lines other than C2C12
there is strong congruence of miRNAs that did not induce significant morphological changes to either negative
control (bottom bar).
Figure S4. Cell death siRNAs significantly change morphology in HEK293T and C2C12, but less consistently rise
above statistical significance in the other cell lines. NS = not significant.
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
25
Figure S5. Rotenone significantly changes morphology in most HEK293T and many HepG2 replicates, but not in
other cell lines. NS = not significant.
Figure S6. Results for replicating miRNA hits using DMSO + transfection reagent as negative control resemble
Results
shown in Figure 3. A: Venn diagram showing number of miRNAs being a hit in each cell line, including
overlaps. Unlabelled segments indicate no shared miRNA. B: 18 miRNAs significantly alter cell morphology in
both experimental replicates for each of two cell lines, including six orthologues with perfect sequence identity
(“identical orth.”) and three orthologues with sequence identity in the seed and up two 2 nucleotide differences
(Levenshtein distance) outside the seed (“similar orth.”). NS = not significant, NT = not tested.
Figure S7. Morphological changes broadly correlate with cytotoxicity, though some miRNAs impact cell morphology
without toxicity. Strength of association of morphological changes and cytotoxicity depends on cell line. Each point
represents one miRNA; values of the two experimental replicates were averaged. For the “Averaged” panel, data
from all cell lines was averaged per treatment. Blue line indicates linear fit.
miRNA significance
NS
NT
5%
Replicating in
Human
Both (identical orth.)
Both (similar orth.)
C2C12
N2a
HepG2
SH-SY5Y
HEK293T
440
7
49
38
66
1
1
1
1
1
1
4
3
A B
let−7c−3p
miR−9−5p
miR−155−5p
miR−182−5p
miR−185−3p
miR−194−3p
miR−194−5p
miR−200c−3p
miR−206
miR−221−5p
miR−548e−5p
miR−598−3p
miR−624−5p
miR−708−5p
miR−5095
miR−5681a
miR−6828−3p
miR−8065
HEK293T
HepG2
SH−SY5Y
C2C12
N2a
2
1
miR−19a−5p
miR−19a−5p
miR−19a−5p
miR−19a−5p
miR−19a−5p
C2C12 N2a Averaged
HEK293T HepG2 SH−SY5Y
0.0 0.1 0.2 0.0 0.1 0.2 0.3 0.40.0 0.1 0.2
0.0 0.1 0.2 0.0 0.1 0.0 0.1 0.2 0.3
−5
0
5
−5
0
5
KS statistic
Cell counts (z−score)
R = -0.49R = -0.18
R = -0.5
R = -0.3 R = -0.31 R = -0.35
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
26
Figure S8. Scree plot for plate 1 of experimental replicate 1 of HEK293T shows that 61.7% of the variance is
explained by the first 10 PCs.
Supplementary Tables
Supplementary Table 1, hit calling and cell count results using DMSO+TR as negative
control.
Supplementary Table 2, top 3 differential features by treatment as assessed via t-test
compared to DMSO+TR as negative control.
Supplementary Table 3, feature names of 155 features retained after feature selection using
DMSO+TR as negative control.
Supplementary Table 4, hit calling and cell count results using scrambled miRNA as
negative control.
Supplementary Table 5, top 3 differential features by treatment as assessed via t-test using
scrambled miRNA as negative control.
Supplementary Table 6, feature names of 142 features retained after feature selection using
scrambled miRNA as negative control.
0
10
20
30
40
50
0
5
10
15
Principal Component
Variance
explained (%)
Variance
.CC-BY-NC-ND 4.0 International licenseperpetuity. It is made available under a
preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in
The copyright holder for thisthis version posted November 14, 2025. ; https://doi.org/10.1101/2025.11.14.687149doi: bioRxiv preprint
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.