Results
demonstrate that Volaria's gene-centered, cell type-aware genomic
representations encode outcome-relevant signals even in the absence of clinical
features, providing a complementary perspective to phenotypic risk modeling.
Embedding structure contributes to the predictive signal
To evaluate whether gene- and cell-type-specific embeddings provide predictive value
beyond unstructured representations, we compared Volaria to a series of embeddings
with simplified structures. Specifically, we constructed flat functional embeddings for the
same genes and variant scores by removing gene and cell-type-specific structure and
aggregating variant scores by taking the maximum across the dropped dimensions
(Methods). To isolate contributions from each signal type, we evaluated models using
only regulatory predictions (Volaria flat regulatory), only exonic predictions (Volaria flat
exonic), or providing both regulatory and exonic flat embeddings. Across all three renal
outcomes, full Volaria embedding consistently outperformed unstructured ones (Figure
3A-C, Supp. Figure 1), indicating that gene-centered, cell-type-aware representations
capture predictive signals not recoverable from unstructured inputs. Notably, the
performance of simplified embeddings varied by outcome: regulatory signals performed
comparably or better than exonic scores in predicting kidney failure and eGFR decline
(Figure 3A-B), but were less informative for predicting steroid resistance (Figure 3C),
suggesting task-specific relevance of variant classes.
7
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Figure 3 | Extended performance evaluation. (A-C) Volaria’s embedding structure contributes
to the predictive performance: (A) kidney failure, (B) eGFR decline, and (C) steroid resistance.
To evaluate contribution of the gene-level resolution and cell-type, we compare flattened
functional embeddings lacking gene-level or cell-type structure, separated into regulatory-only,
exonic-only, and combined input configurations, as well as a simple baseline of only sex as an
input. Volaria’s full structured embedding outperforms all non-structured variations, showing that
both gene resolution and cell type specificity contribute to the downstream performance. (D)
ROC AUC curves for predicting 40% eGFR decline using Volaria versus using polygenic scores
(PGS) for kidney failure from the PGS Catalog; (E) Average precision improvements over
baseline for the 40% eGFR decline using Volaria or each of the PGS scores. (F) GTEx cohort
evaluation of Volaria models trained on CureGN (ROC AUC for kidney failure prediction in GTEx
individuals). Volaria generalizes effectively across cohorts with different contexts and data
collection protocols.
8
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Comparison to genome-wide polygenic risk models
To benchmark Volaria against established approaches for genome-wide risk prediction,
we compared its performance to published polygenic scores for kidney-related traits.
Specifically, we evaluated all five polygenic scores (PGS) associated with kidney failure
or related phenotypes (acute and chronic renal failure) in the PGS Catalog: PGS000708
[18]; PGS004491, PGS004492, PGS004561, and PGS004562 [19]. These scores differ
in phenotype definitions and modeling strategies. For example, PGS000708 defines
kidney disease based on a combination of self-reported diagnoses and ICD-10 codes
(N17–N19) in the UK Biobank, including both acute and chronic renal failure. At the
same time, [19] did not perform aggregation over acute and chronic renal failure and
provide different ways to compute scores (PGS004491 and PGS004492 represent
acute and chronic renal failure, respectively, using LDpred2-based scores, while
PGS004561 and PGS004562 represent acute and chronic renal failure, respectively
and use elastic net regression, integrating disease-specific and risk factor polygenic
scores). In accordance with these previous studies we computed PGS values for each
of the five scores for each individual in the CureGN cohort. We then trained the same
outcome-specific classifiers as used for Volaria, with identical train/test splits, for both
40% eGFR decline and kidney failure outcomes (Figure 3D–E, Supp. Figure 2).
Across all five scores and both outcomes, Volaria consistently outperformed PGS-based
models in both ROC AUC and precision-recall metrics. For 40% eGFR decline, PGS
average performance was 0.51±0.03 and the best-performing PGS (PGS004492,
chronic renal failure LDpred2 [19]) achieved an ROC AUC of 0.56, substantially lower
than Volaria’s 0.67. Volaria’s corresponding AP was more than twice that of the
best-performing PGS model (Figure 3E). A similar pattern was observed for kidney
failure, where PGS average performance was 0.56±0.03, with best performing
PGS-based ROC AUC of 0.59, short of Volaria’s ROC AUC of 0.63, and with Volaria
also achieving a two-fold improvement in PR AUC vs the best PGS (Supp.Figure 2).
Volaria captures broad signal contributions
A subset of the CureGN participants have been previously reported to present with
monogenic forms of kidney disease driven by pathogenic exonic variants [20]. To
9
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
evaluate whether Volaria’s performance was driven solely by these high-penetrance
cases, we assessed the model performance with and without the individuals carrying
these variants.
The model accurately predicted outcomes for all individuals carrying monogenic
variants, achieving high performance in this small group (e.g., kidney failure AUC =
0.79). Removing carriers of reported monogenic variants had minimal impact on
predictive performance of the models in the rest of the patients for all three outcomes,
kidney failure, eGFR decline, and steroid resistance. Kidney failure prediction and
eGFR decline remained unchanged (ROC AUC = 0.63 for kidney failure and ROC AUC
= 0.67 for eGFR decline). Steroid resistance showed a minor increase after excluding
monogenic cases (AUC = 0.58) (Supp. Table 1). This demonstrates that Volaria
captures both known monogenic disease signals and other novel predictive features,
capturing broad polygenic and regulatory contributions to kidney disease progression.
Volaria is robust across specific disease diagnoses
The CureGN cohort includes individuals diagnosed with one of four rare glomerular
diseases (Focal Segmental Glomerulosclerosis (FSGS), IgA Nephropathy (IgAN),
Membranous Nephropathy (MN), and Minimal Change Disease (MCD)), each with
distinct histopathological characteristics, clinical presentations, and rates of progression,
but overlapping disease initiation and progression factors. To understand how predictive
performance varies by disease context, we evaluated model performance separately
within each diagnosis group for kidney failure and eGFR decline outcomes, the two
outcomes relevant to all four diseases. Volaria consistently achieved strong
performance for both kidney failure and eGFR decline for FSGS, IgAN, and MCD
(though the MCD group’s very small sample size limits interpretability of this result)
(Supp. Table 2). Performance in MN was lower, but precision-recall remained
comparable for eGFR, suggesting that the model still captures meaningful risk signals
and indicating that an MN-specific model would be advantageous as increased sample
size becomes available to facilitate its training.
10
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Cross-cohort generalization
To assess generalizability, we applied Volaria to the GTEx dataset [17] - a large cohort
with 440 participants with whole-genome sequencing and clinical annotations. While
structured longitudinal outcomes are not available in this cohort, we identified 51
participants with end stage renal disease (annotations for renal failure or dialysis)
(Methods). The Volaria model trained on CureGN data achieved renal failure prediction
ROC AUC = 0.75, despite differences in cohort design, population and phenotype
granularity (Figure 3F). This highlights Volaria’s ability to generalize across cohorts and
maintain predictive performance under real-world heterogeneity.
Feature importance highlights inflammatory and immune pathways
To investigate the biological underpinnings of Volaria's predictions, we performed
SHAP-based [21] feature importance analysis across all outcomes. This approach
quantifies the marginal contribution of each feature to model output by estimating
changes in predicted score when that feature is perturbed while all others are held
constant. We analyzed features with non-zero SHAP values across all random seeds to
ensure robustness. For kidney failure prediction, a total of 261 genes consistently
contributed to the model output across all diagnosis and genetic backgrounds. To
assess whether these features reflect shared molecular mechanisms, we applied
Louvain clustering on a tissue-specific functional gene network constructed from a large
compendium of functional genomic data [22]. This revealed three functionally coherent
modules (Figure 4A). The first module, characterized by IL-1 mediated inflammatory
signaling, was enriched for genes involved in IL-1 production and regulation, as well as
inositol lipid-mediated signaling, highlighting the contribution of inflammation in
progressive kidney injury. The second module represented cellular stress and immune
modulation, and included genes linked to cell cycle regulation, DNA damage
checkpoints, proteasomal catabolism, and negative regulation of T cell activation,
suggesting coordinated stress response and immune regulation mechanisms
associated with disease progression. The third module was linked to the negative
regulation of signaling receptor activity, and consisted of a smaller set of genes enriched
for modulation of receptor signaling processes.
11
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Figure 4 | Feature importance and computational perturbation . (A) Functional modules
derived from genes with non-zero SHAP-based feature importance for kidney failure prediction,
showing enrichment for immune regulation, inflammatory signaling, and modulation of receptor
activity. Community detection was performed on SHAP-selected genes using functional
networks. (B) Genomic features with non-zero SHAP importance scores in all three outcomes
(kidney failure, eGFR decline, and steroid resistance) ranked by scaled median contribution
across random seeds. (C) Schematic of the computational perturbation analysis. A single
feature is computationally modified within the embedding, and model predictions are
recomputed to quantify its isolated contribution (see Methods). (D) Predicted risk of eGFR
decline following computational perturbation of the RPS6KA1 coding score. Perturbation
analysis indicated a protective effect of strongly deleterious mutations: individuals with less
deleterious scores (Low group) showed significantly higher predicted risk than those with highly
deleterious values (p = 1.14 × 10⁻ ²⁴⁰). (E) Measured eGFR at intake for individuals in the cohort
within the top and bottom 10% of RPS6KA1 scores. Lower scores were associated with reduced
baseline kidney function (p = 3.87e-02), consistent with the model’s predicted direction of effect.
12
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
We next extended the analysis to all three outcomes. Four features were consistently
predictive across all three outcomes: coding disruptions in RPS6KA1 (p90 RSK) and
TMEM40, and regulatory dysregulations of AGRN and EYA2 in the glomerular
endothelium cell type. Specifically, RPS6KA1 showed strong contribution to both kidney
failure and eGFR decline, and non-zero contribution to steroid resistance (Figure 4B).
Computational perturbation reveals protective and adverse gene effects
We then examined whether individual gene-level features can have biologically
interpretable effects on clinical outcome prediction by modifying individual embeddings
to simulate perturbations affecting specific genes at the regulatory or coding level
(Figure 4C). Given its consistently high SHAP contributions, we focused on RPS6KA1
for targeted computational perturbation. We first perturbed the RPS6KA1 coding score,
setting it to zero ("Low") or scaling it up ("High") across all individuals. The modified
embeddings were passed through the fixed outcome model to quantify changes in
predicted risk. Individuals with low RPS6KA1 values showed significantly elevated
predicted risk of eGFR decline compared to the high group (p = 1.14 × 10⁻²⁴⁰, effect size
r = 0.74; Figure 4D), suggesting RPS6KA1 may be protective against eGFR decline.
This trend persisted both in training and in the held-out test set (Supp. Figure 3A).
To evaluate concordance of this prediction with clinical data, we stratified cohort
participants by their true RPS6KA1 coding scores and compared their measured eGFR
at study intake. Individuals in the top decile of RPS6KA1 scores had significantly higher
baseline eGFR than those in the bottom decile (p = 0.039; Figure 4E), consistent with
the direction of model predictions.
We repeated this analysis in a high-impact regulatory feature (Figure 4B), specifically
cell-type specific regulation of EYA2 in the glomerular endothelium. In contrast to the
protective effect observed with RPS6KA1, increasing the EYA2 regulatory disruptions in
the glomerular endothelium led to significantly higher predicted risk of eGFR decline
(Supp. Figure 3 B-C). Consistent with Volaria’s predictions, CureGN participants with
high baseline EYA2 scores had lower eGFR at intake (p = 0.05; Supp. Figure 3D).
These findings illustrate that Volaria’s structured embeddings encode interpretable, cell
type-specific and directionally consistent biological signals.
13
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
References
1. Whole-genome sequencing of 490,640 UK Biobank participants. Nature. 2025; 1–10.
2. The 100, 000 Genomes Project Pilot Investigators. 100,000 Genomes Pilot on
Rare-Disease Diagnosis in Health Care — Preliminary Report. New England Journal of
Medicine. 2021 [cited 7 Aug 2025]. doi:10.1056/NEJMoa2035790
3. Sherman CA, Claw KG, Lee S-B. Pharmacogenetic analysis of structural variation in the
1000 genomes project using whole genome sequences. Scientific Reports. 2024;14: 1–13.
4. Leong IUS, Cabrera CP, Cipriani V, Ross PJ, Turner RM, Stuckey A, et al. Large-Scale
Pharmacogenomics Analysis of Patients With Cancer Within the 100,000 Genomes Project
Combining Whole-Genome Sequencing and Medical Records to Inform Clinical Practice.
Journal of Clinical Oncology. 2025 [cited 7 Aug 2025]. doi:10.1200/JCO.23.02761
5. Wray NR, Goddard ME, Visscher PM. Prediction of individual genetic risk to disease from
genome-wide association studies. Genome Res. 2007;17: 1520–1528.
6. Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of
protein-coding genetic variation in 60,706 humans. Nature. 2016;536: 285–291.
7. Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. The
mutational constraint spectrum quantified from variation in 141,456 humans. Nature.
2020;581: 434–443.
8. Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits: from polygenic to
omnigenic. Cell. 2017;169: 1177.
9. Dornbos P, Koesterer R, Ruttenburg A, Nguyen T, Cole JB, Leong A, et al. A combined
polygenic score of 21,293 rare and 22 common variants improves diabetes diagnosis based
on hemoglobin A1C levels. Nature Genetics. 2022;54: 1609–1614.
10. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:
421–427.
11. Fritsche LG, Igl W, Bailey JN, Grassmann F, Sengupta S, Bragg-Gresham JL, et al. A large
genome-wide association study of age-related macular degeneration highlights
contributions of rare and common variants. Nature genetics. 2016;48. doi:10.1038/ng.3448
12. Chen KM, Wong AK, Troyanskaya OG, Zhou J. A sequence-based global map of regulatory
activity for deciphering human genetics. Nat Genet. 2022;54: 940–949.
13. Avsec Ž, Agarwal V, Visentin D, Ledsam JR, Grabska-Barwinska A, Taylor KR, et al.
Effective gene expression prediction from sequence by integrating long-range interactions.
Nature Methods. 2021;18: 1196–1203.
14. Sokolova K, Theesfeld CL, Wong AK, Zhang Z, Dolinski K, Troyanskaya OG. Atlas of
primary cell-type-specific sequence models of gene expression and variant effects. Cell
Rep Methods. 2023;3: 100580.
15. Cheng J, Novati G, Pan J, Bycroft C, Žemgulytė A, Applebaum T, et al. Accurate
22
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
proteome-wide missense variant effect prediction with AlphaMissense. Science. 2023 [cited
19 Mar 2025]. doi:10.1126/science.adg7492
16. Mariani LH, Bomback AS, Canetta PA, Flessner MF, Helmuth M, Hladunewich MA, et al.
CureGN Study Rationale, Design, and Methods: Establishing a Large Prospective
Observational Study of Glomerular Disease. American journal of kidney diseases : the
official journal of the National Kidney Foundation. 2018;73: 218.
17. Lonsdale J, Thomas J, Salvatore M, Phillips R, Lo E, Shad S, et al. The Genotype-Tissue
Expression (GTEx) project. Nature Genetics. 2013;45: 580–585.
18. Sinnott-Armstrong N, Tanigawa Y, Amar D, Mars N, Benner C, Aguirre M, et al. Genetics of
35 blood and urine biomarkers in the UK Biobank. Nature Genetics. 2021;53: 185–194.
19. Jung H, Jung H-U, Baek EJ, Kwon SY, Kang J-O, Lim JE, et al. Integration of risk factor
polygenic risk score with disease polygenic risk score for disease prediction.
Communications Biology. 2024;7: 1–13.
20. Elliott MD, Vena N, Marasa M, Cocchi E, Bheda S, Bogyo K, et al. Increased risk of kidney
failure in patients with genetic kidney disorders. The Journal of Clinical Investigation.
2024;134: e178573.
21. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local
explanations to global understanding with explainable AI for trees. Nature Machine
Intelligence. 2020;2: 56–67.
22. Krishnan A, Zhang R, Yao V, Theesfeld CL, Wong AK, Tadych A, et al. Genome-wide
prediction and functional characterization of the genetic basis of autism spectrum disorder.
Nature Neuroscience. 2016;19: 1454–1462.
23. Bertin N, Mendez M, Hasegawa A, Lizio M, Abugessaisa I, Severin J, et al. Linking
FANTOM5 CAGE peaks to annotations with CAGEscan. Scientific Data. 2017;4: 1–8.
24. Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, et al.
GENCODE: the reference human genome annotation for The ENCODE Project. Genome
Res. 2012;22. doi:10.1101/gr.135350.111
25. McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, et al. The Ensembl Variant
Effect Predictor. Genome Biol. 2016;17: 1–14.
26. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The
Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA
sequencing data. Genome Res. 2010;20. doi:10.1101/gr.107524.110
27. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn:
Machine Learning in Python. Journal of Machine Learning Research. 2011;12: 2825–2830.
28. Lambert SA, Wingfield B, Gibson JT, Gil L, Ramachandran S, Yvon F, et al. Enhancing the
Polygenic Score Catalog with tools for score calculation and ancestry normalization. Nature
Genetics. 2024;56: 1989–1994.
29. Di Tommaso P, Chatzou M, Floden EW, Barja PP, Palumbo E, Notredame C. Nextflow
enables reproducible computational workflows. Nature Biotechnology. 2017;35: 316–319.
23
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Supplement
Non monogenic Only monogenic Together
ROC AUC AP ROC AUC AP ROC AUC AP
kidney
failure
0.63 ± 0.01 0.39 ± 0.01 0.79 ± 0.09 0.57 ± 0.17 0.63 ± 0.02 0.39 ± 0.01
eGFR
decline
0.67 ± 0.02 0.53 ± 0.02 0.71 ± 0.28 0.63 ± 0.33 0.67 ± 0.02 0.53 ± 0.02
Steroid
resistance
0.58 ± 0.01 0.28 ± 0.01 0.64 ± 0.25 0.83 ± 0.12 0.56 ± 0.01 0.28 ± 0.01
Supp. Table 1 | Model performance on individuals with monogenic kidney disease and
after their exclusion. Metrics shown: ROC AUC and AP for each outcome in the full test set,
non-monogenic subset, and monogenic-only subset. Error is one standard deviation across
random seeds.
24
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Supp. Figure 1 | Average precision gains across baseline models. (A–C) Volaria
shows improved precision-recall performance over baseline models across three renal
outcomes: (A) kidney failure, (B) 40% eGFR decline, and (C) steroid resistance. Values
reflect the gain in average precision (AP) over a naive baseline equal to the prevalence
of positive cases. Baselines include flat aggregations of functional scores across
matched feature sets, separately for regulatory and exonic components, as well as
combined, and a clinical-only model using sex.
25
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Supp. Figure 2 | (left) ROC AUC curves for predicting kidney failure outcome using five
polygenic scores (PGS) from the PGS Catalog; (right) Average precision improvements over
baseline for the kidney failure using each of the PGS scores. Volaria achieves over twice the AP
of the best-performing PGS
26
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Predicted feature Group Total events
in test
Event rate in
test (%)
Event rate in
train (%) ROC AUC AP Gain in AP
(%)
eGFR decline ALL 63 30.6 30.9 0.67 ± 0.02 0.53 ± 0.02 73.4 ± 6.4
eGFR decline FSGS 22 40.0 47.6 0.69 ± 0.03 0.64 ± 0.02 59.9 ± 6.0
eGFR decline IgAN 20 28.6 23.7 0.69 ± 0.04 0.59 ± 0.03 105.0 ± 11.8
eGFR decline MN 17 38.6 37.4 0.59 ± 0.05 0.58 ± 0.02 49.4 ± 4.9
eGFR decline MCD 4 10.8 16.4 0.74 ± 0.06 0.39 ± 0.07 263.8 ± 61.4
kidney failure ALL 47 23.0 26.1 0.63 ± 0.02 0.39 ± 0.01 68.1 ± 5.8
kidney failure IgAN 18 25.7 21.3 0.68 ± 0.02 0.56 ± 0.01 118.3 ± 5.0
kidney failure FSGS 17 30.9 47.9 0.68 ± 0.02 0.48 ± 0.02 56.9 ± 4.9
kidney failure MN 10 23.8 21.6 0.46 ± 0.04 0.38 ± 0.01 60.0 ± 3.9
kidney failure MCD 2 5.4 11.6 0.73 ± 0.04 0.18 ± 0.02 241.6 ± 35.5
Steroid resistance ALL 40 19.9 19.2 0.56 ± 0.01 0.28 ± 0.01 39.1 ± 2.6
Steroid resistance FSGS 21 39.6 33.2 0.42 ± 0.03 0.39 ± 0.01 -2.5 ± 3.3
Steroid resistance MCD 12 28.6 29.0 0.68 ± 0.02 0.48 ± 0.02 69.4 ± 8.5
Steroid resistance IgAN 4 6.2 7.5 0.70 ± 0.04 0.27 ± 0.07 339.4 ± 107.1
Steroid resistance MN 3 7.3 11.7 0.77 ± 0.05 0.24 ± 0.05 230.0 ± 63.0
Supp. Table 2 | Model performance by diagnosis subgroup for each predicted clinical
outcome. Reported metrics are total number of events in the test set, event rates in test and
training sets, ROC AUC, AP, and percent gain in AP over baseline. Subgroup-specific metrics
highlight variation in outcome prevalence and performance, with high scores observed across
both common and rare subtypes.
27
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Supp. Figure 3 | Computational perturbation of the embeddings. (A) Perturbation of the
RPS6KA1 coding score in the test set showed significant differences in predicted eGFR decline
risk between high and low conditions, in agreement with train subset results. (B) In silico
modification of the EYA2 regulatory score in the glomerular endothelium, training set subset.
Higher values led to increased predicted risk of eGFR decline. (C) EYA2 computational
perturbation in the glomerular endothelium repeated in the test set showed consistent results,
with higher scores again associated with worse predictions. (D) Individuals with high baseline
non-modified EYA2 regulatory scores had lower eGFR at intake, supporting the adverse
directionality observed in the computational perturbations.
28
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
CureGN Collaborators
The CureGN Consortium members listed below, from within the four Participating Clinical Center
networks and Data Coordinating Center, are acknowledged by the authors as Collaborators.
**CureGN Principal Investigators; *CureGN Site Principal Investigators; +CureGN Pathologists,
#CureGN Lead Coordinators.
CureGN Participating Clinical Centers (PCC) through Columbia University:
Columbia University, New York, NY, US: Gerald Appel, Revekka Babayev, Ibrahim Batal +,
Andrew Bomback**, Pietro Canetta, Brenda Chan, Vivette Denise D'Agati +, Samitri Dogra, Hilda
Fernandez, Gabriele Gaggero+, Ali Gharavi**, William Hines, Syed Ali Husain, Krzysztof Kiryluk**,
Satoru Kudose +, Fangming Lin, Victoria Kolupaeva#, Maddalena Marasa, Glen Markowitz +,
Mariela Naarro-Torres, Hila Milo Rasouly, Sumit Mohan, Nicola Mongera, Jordan Nestor,
Thomas Nickolas, Jai Radhakrishnan, Maya Rao, Maya Sabatello, Simone Sanna-Cherchi,
Dominick Santoriello+, Miroslav Sekulic +, Shayan Shirazian, Michael Barry Stokes+, Natalie Uy,
Natalie Vena, Benjamin Wooden
University of Warsaw, Warszawa, Poland: Bartosz Foroncewicz, Natalia Krata, Barbara
Moszczuk, Krzysztof Mucha*, Agnieszka Perkowska-Ptasińska, Elżbieta Ryszkowska
IRCCS Giannina Gaslini, Genoa, Italy: Gian Marco Ghiggeri*, Francesca Lugani, Valerio
Vellone+
CureGN Participating Clinical Centers (PCC) through the Pediatric Nephrology Research
Consortium:
Children’s Hospital of Michigan, Detroit, MI, USA: Rossana Baracco, Amrish Jain*
Children’s Hospital of New Orleans/ LSU Health, New Orleans, LA, USA: Diego Aviles*
Children’s Mercy Hospital, Kansas City, MO, USA: Tarak Srivastava*, Alexander Katz+
Children’s National Medical Center, Wmcgashington DC, USA: Sun-Young Ahn*
Cincinnati Children’s Hospital Cincinnati, OH, USA: Prasad Devarajan, Elif Erkan*, Donna
Claes, Hillarey Stone
Connecticut Children’s Medical Center, Hartford, CT, USA: Sherene Mason*
East Carolina University Brody School of Medicine, Greenville, NC, USA: Liliana
Gomez-Mendez*
Emory University, Atlanta, GA, USA: Larry Greenbaum**, Chia-shi Wang, Hong (Julie) Yin+
Helen DeVos Children’s Hospital, Grand Rapids, MI, USA: Goebel Jens*, Julia Steinke
Levine Children’s Hospital/Atrium Health, Charlotte, NC, USA: Donald Weaver*
Lurie Children’s Hospital, Chicago IL, USA: Jerome Lane*
Mayo Clinic, Rochester, MN, USA: Carl Cramer*
Medical College of Wisconsin, Milwaukee, WI, USA: Cindy Pan, Neil Paloian, Rajasree
Sreedharan*
Medical University of South Carolina, Charleston SC, USA: David Selewski, Katherine
Twombley*, Sally Self+
Nationwide Children’s Hospital, Columbus, OH, USA: Samantha Martinek-Bundt#, Dawson
Carmean#, Mary Dreher#, Aria Dockham#, Mahmoud Kallash*, John Mahan, Samantha Sharpe#,
William Smoyer**, Laura Biederman+
Oregon Health and Science University, Portland, OR, USA: Amira Al-Uzri*, Sandra Iragorri
29
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Riley Children’s Hospital, Indianapolis, IN, USA: Myda Khalid**
Cardinal Glennon Children’s Medical Center/ St. Louis University, St. Louis, MO, USA: Craig
Belsha*
Texas Children’s Hospital, Houston, TX, USA: Elizabeth Onugha*, Michael Braun, AC Gomez
Texas Tech Health Sciences Center, Amarillo, TX, USA: Tetyana Vasylyeva*
Children’s of Alabama, University of Alabama, Birmingham, AL, USA: Daniel Feig*
University of Colorado Children’s Hospital, Colorado, Aurora, CO, USA: Melisha Hannah*
University of Iowa Children’s Hospital, Iowa City, IA, USA: Carla Nester*
University of Kentucky, Lexington, KY, USA: Aftab Chishti*
University of Louisville, Louisville, KY, USA: Jon Klein**
Holtz Medical Center, University of Miami, Miami, FL, USA: Chryso Katsoufis, Wacharee
Seeherunvong*
University of Minnesota Children’s Hospital, Minneapolis, MN, USA: Michelle Rheault**
University of New Mexico Health Sciences Center, Albuquerque, NM, USA: Craig Wong*
University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA: Qassim Abid*
University of Virginia, Charlottesville, VA, USA: John Barcia*, Agnes Swiatecka-Urban
University of Wisconsin, Madison, WI, USA: Sharon Bartosh*
Vanderbilt Children’s Hospital, Nashville TN, USA: Tracy Hunley*
Washington University in St. Louis, St. Louis, MO, USA: Vikas Dharnidharka*, Brian Stotter,
Joseph, Gaut +
CureGN Participating Clinical Centers (PCC) through the University of North Carolina:
Hôpital Maisonneuve-Rosemont, Montreal, Canada: Louis-Philippe Laurin*, Virginie Royal+,
Mathieu Latour+, Natlie (Natacha) Patey +
Medical University of South Carolina, Charleston, SC, USA: Anand Achanti, Milos
Budisavljevic*
Northwestern University, Chicago, IL, USA: Cybele Ghossein, Yonatan Peleg *
Ohio State University, Columbus, OH, USA: Salem Almaani*, Isabelle Ayoub, Samir Parikh,
Brad Rovin, Anjali Satoskar+
University of Chicago, Chicago, IL, USA: Anthony Chang+
University of Alabama at Birmingham, Birmingham, AL, USA: Huma Fatima+, Jan Novak,
Matthew Renfrow, Dana Rizk*
University of North Carolina Kidney Center, Chapel Hill, NC, USA: Dhruti Chen, Vimal
Derebail**, Ronald Falk**, Keisha Gibson, Dorey Glenn, Susan Hogan, Koyal Jain, J. Charles
Jennette+, Amy Mottl, Caroline Poulton#, Monica Reynolds, Manish Kanti Saha, Nicole E. Wyatt
Vanderbilt University, Nashville, TN, USA: Agnes Fogo+, Neil Sanghani*
Virginia Commonwealth University, Richmond, VA, USA: Jason Kidd*, Selvaraj Muthusamy+
CureGN Participating Clinical Centers (PCC) through the University of Pennsylvania:
Children’s Hospital of Philadelphia, Philadelphia, PA, USA: Michelle Denburg, Amy Kogon,
Kevin Meyers*, Madhura Pradhan
Cleveland Clinic, Cleveland, OH, CA: Raed Bou Matar*, John O'Toole, John Sedor
Cohen Children’s Medical Center, New Hyde Park, NY, USA: Christine Sethna*^, Suzanne
Vento#
30
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint
Johns Hopkins University, Baltimore, MD, USA: Mohamed Atta, Serena Bagnasco+, Alicia Neu,
John Sperati*
Lundquist Institute at Harbor-UCLA Medical Center, Torrance, CA, USA: Sharon Adler*, Tiane
Dai, Ram Dukkipati
Mayo Clinic, Rochester, MN, USA: Fernando Fervenza*, Sanjeev Sethi +
Montefiore Medical Center, The Bronx, New York, NY, USA: Frederick Kaskel, Kaye Brathwaite,
Kimberly Reidy*
New York University, New York, NY, USA: Joseph Weisstuch, Ming Wu +, Olga Zhdanova
Spokane Providence Medical Center, Spokane, WA, USA: Katherine Tuttle*
Stanford University, Palo Alto, CA, USA: Jill Krissberg, Richard Lafayette*, Kamal Fahmeedah,
Elizabeth Talley
Sunnybrook Health Sciences Centre, Toronto, Canada: Michelle Hladunewich*
The Hospital for Sick Children, Toronto, Canada: Rulan Parekh*
University Health Network, Toronto, Canada: Carmen Avila-Casado+, Daniel Cattran*, Reich
Heather, Philip Boll
University of Miami, Miami, FL, USA: Yelena Drexler, Alessia Fornoni*
University of Michigan, Ann Arbor, MI, USA: Jeffrey Hodgin+, Andrea Oliverio*
University of Pennsylvania, Philadelphia, PA, USA: Jon Hogan, Lawrence Holzman**, Matthew
Palmer +, Gaia Coppock
University of Pittsburgh School of Medicine, Pittsburgh, PA, USA: Michael Mortiz*
University of Washington, Seattle, WA, USA: Charles Alpers+, J. Ashley Jefferson*
UT Southwestern, Dallas, TX, USA: Kamal Sambandam*, Bethany Roehm
Data Coordinating Center (DCC):
Cedar Sinai Medical Center, Los Angeles, CA, USA: Cynthia Nast+, Jean Hou+
Duke University, Durham, NC, USA: Laura Barisoni
Cleveland Clinic, Cleveland, OH, USA: Crystal Gadegbeku**
Northwestern University, Chicago, IL, USA: Abigail Smith**
University of Michigan, Ann Arbor, MI, USA: Brenda Gillespie, Bruce Robinson, Matthias
Kretzler, Zubin Modi, Laura Mariani**
Steering Committee Chair: Lisa M. Guay-Woodford, Children’s Hospital of Pennsylvania,
Philadelphia, PA, USA
31
.CC-BY-NC 4.0 International licenseavailable under a
(which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made
The copyright holder for this preprintthis version posted January 29, 2026. ; https://doi.org/10.64898/2026.01.27.701808doi: bioRxiv preprint