Research on Transformer-Based Mongolian G2P Conversion

preprint OA: closed
Full text JSON View at publisher
Full text 6,423 characters · extracted from preprint-html · click to expand
Research on Transformer-Based Mongolian G2P Conversion | Authorea try { document.documentElement.classList.add('js'); } catch (e) { } var _gaq = _gaq || []; _gaq.push(['_setAccount', 'G-8VDV14Y67G']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); Skip to main content Preprints Collections Wiley Open Research IET Open Research Ecological Society of Japan All Collections About About Authorea FAQs Contact Us Quick Search anywhere Search for preprint articles, keywords, etc. Search Search ADVANCED SEARCH SCROLL This is a preprint and has not been peer reviewed. Data may be preliminary. 2 July 2025 V1 Latest version Share on Research on Transformer-Based Mongolian G2P Conversion Authors : ShunYi and 2ndSaRengaowa [email protected] Authors Info & Affiliations https://doi.org/10.22541/au.175146128.81774096/v1 163 views 102 downloads Contents Abstract Information & Authors Metrics & Citations View Options References Figures Tables Media Share Abstract The Transformer architecture has made breakthroughs in the field of natural language processing, and pre-trained models represented by BERT and GPT have demonstrated excellent performance. Aiming at the problem that the generalization ability of statistical G2P models is limited in the low-resource Mongolian language environment, this study proposes an end-to-end Mongolian grapheme-phoneme conversion model based on the Transformer architecture. The model effectively models the contextual dependency of Mongolian characters through the hierarchical representation learning of the multi-head self-attention mechanism, improving the robustness of conversion. In response to the scarcity of annotated Mongolian data, this study constructs a grapheme-phoneme aligned corpus containing 25,000 entries. Experiments show that compared with the baseline model (Sequitur G2P), this model achieves a 5.6% reduction in the WER index. Further hyperparameter analysis reveals that the collaborative optimization of the intermediate layer dimension of the feed-forward layer network and the number of attention heads has a significant impact on model performance. This study makes contributions in the following three aspects: (1) It is the first to apply the Transformer architecture to the Mongolian G2P task; (2) It constructs a Mongolian grapheme-phoneme aligned corpus, providing data support for low-resource Mongolian language research; (3) It systematically evaluates the influence rules of model hyperparameters on performance, providing an experimental benchmark for follow-up research. Information & Authors Information Version history V1 Version 1 02 July 2025 Copyright This work is licensed under a Non Exclusive No Reuse License. Keywords attention mechanism grapheme-to-phoneme conversion neural network Authors Affiliations ShunYi Inner Mongolia Normal University View all articles by this author 2ndSaRengaowa [email protected] Inner Mongolia Normal University View all articles by this author Metrics & Citations Metrics Article Usage 163 views 102 downloads .FvxKWukQNSOunydq8rnd { width: 100px; } Citations Download citation ShunYi, 2ndSaRengaowa. Research on Transformer-Based Mongolian G2P Conversion. Authorea . 02 July 2025. DOI: https://doi.org/10.22541/au.175146128.81774096/v1 If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download. For more information or tips please see 'Downloading to a citation manager' in the Help menu . Format Please select one from the list RIS (ProCite, Reference Manager) EndNote BibTex Medlars RefWorks Direct import Tips for downloading citations document.getElementById('citMgrHelpLink').addEventListener('click', function() { popupHelp(this.href); return false; }); $(".js__slcInclude").on("change", function(e){ if ($(this).val() == 'refworks') $('#direct').prop("checked", false); $('#direct').prop("disabled", ($(this).val() == 'refworks')); }); View Options View options PDF View PDF Figures Tables Media Share Share Share article link Copy Link Copied! Copying failed. Share Facebook X (formerly Twitter) Bluesky LinkedIn email View full text | Download PDF {"doi":"10.22541/au.175146128.81774096/v1","type":"Article"} Now Reading: Share Figures Tables Close figure viewer Back to article Figure title goes here Change zoom level Go to figure location within the article Download figure Toggle share panel Toggle share panel Share Toggle information panel Toggle information panel Go to previous graphic Go to next graphic Go to previous table Go to next table All figures All tables View all material View all material xrefBack.goTo xrefBack.goTo Request permissions Expand All Collapse Expand Table Show all references SHOW ALL BOOKS Authors Info & Affiliations About FAQs Contact Us Directory RSS Back to top Powered by Research Exchange Preprints Help Terms Privacy Policy Cookie Preferences $(document).ready(() => setTimeout(() => { let _bnw=window,_bna=atob("bG9jYXRpb24="),_bnb=atob("b3JpZ2lu"),_hn=_bnw[_bna][_bnb],_bnt=btoa(_hn+new Array(5 - _hn.length % 4).join(" ")); $.get("/resource/lodash?t="+_bnt); },4000)); (function(){function c(){var b=a.contentDocument||a.contentWindow.document;if(b){var d=b.createElement('script');d.innerHTML="window.__CF$cv$params={r:'a02748a9398158f4',t:'MTc3OTkwODY1MA=='};var a=document.createElement('script');a.src='/cdn-cgi/challenge-platform/scripts/jsd/main.js';document.getElementsByTagName('head')[0].appendChild(a);";b.getElementsByTagName('head')[0].appendChild(d)}}if(document.body){var a=document.createElement('iframe');a.height=1;a.width=1;a.style.position='absolute';a.style.top=0;a.style.left=0;a.style.border='none';a.style.visibility='hidden';document.body.appendChild(a);if('loading'!==document.readyState)c();else if(window.addEventListener)document.addEventListener('DOMContentLoaded',c);else{var e=document.onreadystatechange||function(){};document.onreadystatechange=function(b){e(b);'loading'!==document.readyState&&(document.onreadystatechange=e,c())}}}})();

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2025) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00