Loss-of-function phenomics, ncORFs, and ambiguity of mutant phenotypes in Medicago truncatula

preprint OA: closed
Full text JSON View at publisher
Full text 3,087 characters · extracted from oa-doi-fallback · click to expand
SUMMARY Non-canonical open reading frames (ncORFs) are an emerging area of research that is quickly gaining momentum. Many peptides and proteins missed in initial annotation efforts (ncProts) were subsequently shown to be crucial for a wide range of biological processes. The discovery of ncORFs continues to improve the accuracy of loss-of-function studies because they often occupy the same genomic spaces as annotated ORFs. While databases of mutant phenotypes linked to genomic loci exist in a few species, none of these databases integrate the information on ncORFs present in already characterized loci. In this study, we introduce a nearly comprehensive loss-of-function phenomics dataset of Medicago truncatula (673 loci characterized over the past 30 years), which was integrated as a new track into the genome browser of this organism. This dataset helped critically analyze the potential contribution of ncORFs to published phenotypes. We detected mass spectrometry (MS)-validated ncORFs in 10 characterized genes, including major regulators of development and symbiotic relationships. We also found conserved ncORFs in 113 characterized genes, including four genes with highly conserved ncORFs. In some studies, the contribution of these ncORFs can be ruled out, while in others it cannot. Using real examples, we systematized ambiguities associated with ncORFs. Furthermore, we highlighted little-known trans effects of insertional mutagenesis on splicing as contributors to that ambiguity. Finally, our meta-analysis of published phenotypes revealed that different protein classes have significantly different (unique) proportions of unconditional, conditional, and neutral phenotypes, potentially reflecting their relative functional importance. Significance statement This study is the first to merge a nearly comprehensive inventory of loss-of-function studies in a eukaryotic organism with the information on novel MS-validated and conserved ncORFs. Competing Interest Statement The authors have declared no competing interest. Footnotes In the revised version, we report the integration of our resource as a new track in the Medicago truncatula genome browser, which took place on 10 April 2026. We have made necessary changes throughout many files to reflect this integration. We have also emphasized that the track will be regularly updated and curated manually by the corresponding author. In addition, we have corrected a few statements regarding the recommended methodology to study ncORFs that overlap with refORFs. Finally, we improved wording throughout the text to make the manuscript clearer. https://github.com/umutcakir/ncORFs_in_studied_genes_of_Medicago_truncatula Abbreviations - ncORF - non-canonical open reading frame - refORF - reference open reading frame - ncProt - non-canonical protein - refProt - reference protein - mRNA - messenger - RNA - ncRNA, non-coding - RNA - rRNA, ribosomal - RNA - tRNA, transfer - RNA - CDS, coding sequence - UTR - untranslated region - Aa - amino acids - Bp - base pairs - Nt - nucleotides - MS - mass spectrometry

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00