Disentangling the genetic causes of autism

preprint OA: closed
📄 Open PDF Full text JSON View at publisher
Full text 5,953 characters · extracted from oa-doi-fallback · 2 sections · click to expand

Background

By suspecting that the scarce disease association results obtained by several genome-wide association studies, even in highly heritable diseases, could be due to wrong assumptions inferred from reference genomes but not true in affected individuals, we re-analyzed data from the Autism Genome Project consortium starting from the genotype calling phase.

Methods

We identified patterns of raw genotype intensities associated to the disease, mainly lack of hybridization. In light of it, we re-designed the polygenic models of individual risk and were able to explain most of the variance – AUC above 0.9 – in two independent datasets of whole exome sequencing (WES) provided by the Autism Sequencing and Human Autism Genetics consortia, by measuring differences in missing rates and Mendelian inconsistencies. Findings Analytical results when using a missingness test, pointed out to several highly significant positions, most of them having the autistic individuals as the group with the unknown variants. Results from Mendelian tests under our polygenic models, showed high accuracy and robustness as well, revealing a surprising pattern of higher rates of Mendelian inconsistencies –when considering rare variants– in unaffected offspring than in affected ones, even with affected and unaffected children belonging to the same family, and confirm autism as highly heritable. Missingness tests performed on copy number variation (cnv)s in sex chromosomes within array-based datasets, revealed that, in males, a missing cnv only in one gene (IL3RA) within pseudoautosomal region (PAR) 1 was able to explain the whole variance (AUC = 1) while in females it was also necessary but not enough another missing cnv, in gene IL9R, within PAR 2. Another cnv deletion, within a gene in chromosome Y very close to IL3RA, PRKY, is also highly associated to autism, and explains the large rates of aneuploidies in sex chromosomes of autistic individuals. Expression heatmaps performed with genes within autosomes found associated to autism, reveal cerebral cortex and testis as the regions most targeted by these genes. We also analyzed sequenced genotypes of Asperger and PDD-NOS individuals and found out that they are closer to individuals with typical development (AUC around 0.7) than to strict autistic ones (AUC above 0.9). Interpretations Although the origin of autism may be associated to deletions of a cnv at IL3R in males (common macrocephaly in autism may be explained by this), the strong linkage disequilibrium with PRKY, the close interaction between autosomes and sex chromosomes through evolution and large epistatic effects, result in several associated genes in autosomes strongly expressed in cerebral cortex and testis. In light of these results, autism may be confirmed as a spectrum of traits with thousand genome-wide genetic variants but with the origin in the deletion of a cnv in IL3RA in chromosome Y (PAR1), which may cause macrocephaly. Very often it is accompanied by the deletion of cnvs in PRKY, because of its strong linkage disequilibrium with IL3RA, which may cause, at least long-range, aneuplodies. The disease is transmitted to females because of PAR1 recombination but it also needs other genes in sex chromosomes (IL3RA together with IL9R in PAR2 are not enough) for the disease onset in females. Missingness tests showing almost no genetic differences between parents and affected offspring in autosomes (AUC around 0.5) seem to confirm that autism is a complex disease that requires not only the presence of the appropriate genetic variants, thousand of them perhaps, but environmental factors for the disease onset. Competing Interest Statement The authors have declared no competing interest. Funding Statement This study did not receive any funding Author Declarations I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes The details of the IRB/oversight body that provided approval or exemption for the research described are given below: All data used are openly available human data. They were obtained from the IHPM (HapMap Consortium, T. I. The international hapmap project. Nature 426, pp. 789-796, 2003) and from dbGaP at http://www.ncbi.nlm.nih.gov/gap through dbGaP accession number, phs000267.v5.p2 (AGP consortium) and phs000298.v4.p3 (ASC). Submission of the data, phs000267.v5.p2, to dbGaP was provided by Dr. Bernie Devlin on behalf of the Autism Genome Project (AGP). Collection and submission of the data to dbGaP were supported by a grant from the Medical Research Council (G0601030) and the Wellcome Trust (075491/Z/04), Anthony P. Monaco, P.I., University of Oxford. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes Footnotes Now it includes results from sex chromosomes Data Availability All data results produced in the present work are contained in the manuscript. The software we developed is available online at https://bios.ugr.es/IGC/

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00