Protein structure, a genetic encoding for glycosylation

preprint OA: closed
📄 Open PDF Full text JSON View at publisher
Full text 2,106 characters · extracted from oa-doi-fallback · click to expand
Abstract Unlike DNA, RNA, and protein biosynthesis, dogma describes glycosylation as primarily determined by intrinsic cellular limitations, such as glycosyltransferase expression and precursor availability. However, this cannot explain the commonly-observed differences between glycans on the same protein. By examining site-specific glycosylation on diverse human proteins, we detected associations between protein structure and glycan structure, broadly generalizable to human-expressed glycoproteins. Through structural analysis of site-specific glycosylation data, we found protein-sequence and structural features consistently correlated with specific glycan features. To quantify these relationships, we present a new amino acid substitution matrix describing “glycoimpact”, i.e., the association of primary protein structure and glycosylation. High-glycoimpact amino acids co-evolve with glycosites, and glycoimpact is high when estimates of amino acid conservation and variant pathogenicity diverge. We report thousands of disease variants near glycosites with high-glycoimpact, including several with known links to aberrant glycosylation (e.g., Oculocutaneous Albinism, Jakob-Creutzfeldt disease, Gerstmann-Straussler-Scheinker, and Gaucher’s Disease). Finally, glycoimpact quantification is validated by studying oligomannose-complex glycan ratios on HIV ENV, differential sialylation on IgG3 Fc, differential glycosylation on SARS-CoV-2 Spike, and fucose-modulated function of a tuberculosis monoclonal antibody. Finally, to test the causality of protein-glycan associations, we created 5 glycoimpact-designed novel Rituximab variants, 4 of which substantially changed glycoprofiles as predicted. In all, we report that site-specific glycan biosynthesis is influenced by underlying protein structure, enabling glycan structure prediction and genetic sequence-guided glycoengineering. Competing Interest Statement This work is associated with a provisional patent filed by the authors, and Augment Biologics, founded by BK and NEL. Footnotes Additional experimental results and framing revision

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00