Semantic Interoperability at National Scale: The SPHN Federated Clinical Routine Dataset | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Semantic Interoperability at National Scale: The SPHN Federated Clinical Routine Dataset Jan Armida, Vasundra Touré, Philip Krauss, Deepak Unni, Harald Witte, and 16 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-8250886/v2 This work is licensed under a CC BY 4.0 License Status: Posted Version 2 posted You are reading this latest preprint version Show more versions Abstract Over the past eight years, the Swiss Personalized Health Network (SPHN) has established a national federated framework enabling semantically interoperable health-related data, with a primary focus on hospital clinical routine data. Rather than centralizing patient-level information, hospitals locally perform semantic coding and standardization and store SPHN-compliant data in dedicated triple stores. To promote discoverability, descriptive metadata and summary statistics derived from these local datasets are then centralized in the SPHN Metadata Catalog, which follows the SPHN Metadata Catalog Schema and aligns with European Health Data Space metadata standards. As of 2025, the SPHN Federated Clinical Routine Dataset encompasses information from more than 800,000 patients who provided broad consent, covering the period from 2018 to present. Across the first six participating hospitals, the infrastructure holds over 12.5 billion (10^9) RDF triples mapped to 125 SPHN semantic concepts including demographics, diagnoses, procedures, medications, laboratory results, vital signs, clinical scores, allergies, microbiology, intensive care data, oncology, and biological samples. This federated approach ensures that health data remain FAIR (Findable, Accessible, Interoperable, and Reusable) while safeguarding patient privacy by avoiding centralizing information. In this paper, we present the design, implementation, and scope of the SPHN Federated Clinical Routine Dataset, and its role in supporting data discoverability for research and clinical applications. Federated datasets Metadata Knowledge Graph RDF Terminology Usage Real World Data Full Text Additional Declarations The authors declare no competing interests. Cite Share Download PDF Status: Posted Version 2 posted You are reading this latest preprint version Show more versions Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-8250886","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":557339332,"identity":"c23dcc6b-85e1-4bcb-9160-58c0ab7de11f","order_by":0,"name":"Jan Armida","email":"","orcid":"https://orcid.org/0009-0007-4250-3983","institution":"SIB Swiss Institute of Bioinformatics,","correspondingAuthor":false,"prefix":"","firstName":"Jan","middleName":"","lastName":"Armida","suffix":""},{"id":557339333,"identity":"8322cd72-483e-40c7-88eb-0edbe31f9d23","order_by":1,"name":"Vasundra Touré","email":"","orcid":"https://orcid.org/0000-0003-4639-4431","institution":"SIB Swiss Institute of Bioinformatics,","correspondingAuthor":false,"prefix":"","firstName":"Vasundra","middleName":"","lastName":"Touré","suffix":""},{"id":557339334,"identity":"1fd8d770-990f-458d-a279-bfa6e1b45a05","order_by":2,"name":"Philip Krauss","email":"","orcid":"","institution":"Accenture AG","correspondingAuthor":false,"prefix":"","firstName":"Philip","middleName":"","lastName":"Krauss","suffix":""},{"id":557339335,"identity":"595c2e2f-89dd-4464-ab64-7f47454090e5","order_by":3,"name":"Deepak Unni","email":"","orcid":"https://orcid.org/0000-0002-3583-7340","institution":"SIB Swiss Institute of Bioinformatics,","correspondingAuthor":false,"prefix":"","firstName":"Deepak","middleName":"","lastName":"Unni","suffix":""},{"id":558099911,"identity":"cb459a74-4f49-4997-a205-760535619c9e","order_by":4,"name":"Harald Witte","email":"","orcid":"","institution":"SIB Swiss Institute of Bioinformatics","correspondingAuthor":false,"prefix":"","firstName":"Harald","middleName":"","lastName":"Witte","suffix":""},{"id":557339336,"identity":"7f1eb159-34d1-43b5-a023-f2d9508a0807","order_by":5,"name":"Davide Chiarugi","email":"","orcid":"","institution":"SIB Swiss Institute of Bioinformatics","correspondingAuthor":false,"prefix":"","firstName":"Davide","middleName":"","lastName":"Chiarugi","suffix":""},{"id":557339337,"identity":"1bdc3686-3304-4133-bb54-026ee6d205c3","order_by":6,"name":"Andrea Brites Marto","email":"","orcid":"","institution":"SIB Swiss Institute of Bioinformatics","correspondingAuthor":false,"prefix":"","firstName":"Andrea","middleName":"Brites","lastName":"Marto","suffix":""},{"id":557339338,"identity":"6527e1bd-3eab-4be6-a3a6-0e9884c0a081","order_by":7,"name":"Julia Mauer","email":"","orcid":"","institution":"Swiss Academy of Medical Science","correspondingAuthor":false,"prefix":"","firstName":"Julia","middleName":"","lastName":"Mauer","suffix":""},{"id":557339339,"identity":"fb14909d-df85-4e9d-a44a-51465a334bb4","order_by":8,"name":"Thomas Geiger","email":"","orcid":"","institution":"Swiss Academy of Medical Science","correspondingAuthor":false,"prefix":"","firstName":"Thomas","middleName":"","lastName":"Geiger","suffix":""},{"id":557339340,"identity":"211d57f6-464a-4609-9890-d029eb3640bf","order_by":9,"name":"Henning Beywl","email":"","orcid":"","institution":"Inselspital, Bern University Hospital","correspondingAuthor":false,"prefix":"","firstName":"Henning","middleName":"","lastName":"Beywl","suffix":""},{"id":557339341,"identity":"982b43ad-cb46-4efa-a77a-2d4b2990a506","order_by":10,"name":"Marc Daverat","email":"","orcid":"","institution":"University Hospital of Geneva","correspondingAuthor":false,"prefix":"","firstName":"Marc","middleName":"","lastName":"Daverat","suffix":""},{"id":557339342,"identity":"e02dc62c-c631-4403-9b14-5f42217e0ac8","order_by":11,"name":"Xeni Deligianni","email":"","orcid":"","institution":"University Hospital Basel","correspondingAuthor":false,"prefix":"","firstName":"Xeni","middleName":"","lastName":"Deligianni","suffix":""},{"id":557339343,"identity":"63955c4b-07ea-451e-9e47-15296908290b","order_by":12,"name":"Dominique Furrer","email":"","orcid":"","institution":"Inselspital, Bern University Hospital","correspondingAuthor":false,"prefix":"","firstName":"Dominique","middleName":"","lastName":"Furrer","suffix":""},{"id":557339344,"identity":"95c17331-f357-4635-8887-a3e75f628a64","order_by":13,"name":"Mathias Gassner","email":"","orcid":"","institution":"University Children’s Hospital Zurich","correspondingAuthor":false,"prefix":"","firstName":"Mathias","middleName":"","lastName":"Gassner","suffix":""},{"id":557339345,"identity":"f62b352e-1ce9-4602-990c-37efc06d656d","order_by":14,"name":"Matthias Joos","email":"","orcid":"","institution":"University Hospital Zurich","correspondingAuthor":false,"prefix":"","firstName":"Matthias","middleName":"","lastName":"Joos","suffix":""},{"id":557339346,"identity":"6b5bc04a-67d7-41c3-985c-59120041bf16","order_by":15,"name":"Katie Kalt","email":"","orcid":"","institution":"University Hospital Zurich","correspondingAuthor":false,"prefix":"","firstName":"Katie","middleName":"","lastName":"Kalt","suffix":""},{"id":557339347,"identity":"7d1b3a1d-2781-4579-a0ec-b6e19f6a5b24","order_by":16,"name":"Janshah Veettuvalappil Ikbal","email":"","orcid":"","institution":"Inselspital, Bern University Hospital","correspondingAuthor":false,"prefix":"","firstName":"Janshah","middleName":"Veettuvalappil","lastName":"Ikbal","suffix":""},{"id":557339348,"identity":"318ca399-9d33-4b02-b37d-ebee17f6c054","order_by":17,"name":"Helena Peic Tukuljac","email":"","orcid":"","institution":"University Children’s Hospital Zurich","correspondingAuthor":false,"prefix":"","firstName":"Helena","middleName":"Peic","lastName":"Tukuljac","suffix":""},{"id":557339349,"identity":"49b03734-38c7-4bc5-8a38-b854d9faadf8","order_by":18,"name":"Gaëlle Vuaridel-Thurre","email":"","orcid":"","institution":"University Hospital Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Gaëlle","middleName":"","lastName":"Vuaridel-Thurre","suffix":""},{"id":557339350,"identity":"e6344e01-9359-4406-8792-152cdfd78b6e","order_by":19,"name":"Solange Zoergiebel","email":"","orcid":"","institution":"University Hospital Lausanne","correspondingAuthor":false,"prefix":"","firstName":"Solange","middleName":"","lastName":"Zoergiebel","suffix":""},{"id":557339351,"identity":"67d12cb1-c748-4198-9e31-60107b85de38","order_by":20,"name":"Sabine Österle","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABCUlEQVRIie2RsWrDMBRFnxAkixyvymL9wgteMoT2V2QMyVKXjB5KEBSc0Wu2/oInr7URdHLI6jV0LYXSpV7aynRICBF47KADAklwdO9DAA7Hv4RBdTqsIQDK+h0folCzEMJBCpwrkTrdXEds97VepxD4/qF+TXGxetp6Femy+a1Q1xVs7qXeNRBOdzGdNbhMCj2R1Mt4VFQWBe5QexlERUtHU4U6KShDSkou0VYsfzPKN0TPBz3uFP6sxCND0pXcWgzaPkWZFIhHRGElQTMEr+RE2Yq171KzFx7yNg5NsXjWz2Jy7bOIPNGf7GER+Hl9/FDpjRD5vj5+NRt7sT8uPs7yvsPhcDiG8gs25lZvJ5WXjAAAAABJRU5ErkJggg==","orcid":"https://orcid.org/0000-0003-3248-7899","institution":"SIB Swiss Institute of Bioinformatics","correspondingAuthor":true,"prefix":"","firstName":"Sabine","middleName":"","lastName":"Österle","suffix":""}],"badges":[],"createdAt":"2025-12-01 13:30:49","currentVersionCode":2,"declarations":{"humanSubjects":true,"vertebrateSubjects":false,"conflictsOfInterestStatement":false,"humanSubjectEthicalGuidelines":true,"humanSubjectConsent":true,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-8250886/v2","doiUrl":"https://doi.org/10.21203/rs.3.rs-8250886/v2","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":98441716,"identity":"d3abcc34-29cf-47a8-b35a-ea06f0e2eb6f","added_by":"auto","created_at":"2025-12-17 17:05:44","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":745506,"visible":true,"origin":"","legend":"","description":"","filename":"Manuscript250826FederatedDatasetsV312.pdf","url":"https://assets-eu.researchsquare.com/files/rs-8250886/v2_covered_ba344202-dd99-4641-bdd0-ff645a9881e7.pdf"}],"financialInterests":"The authors declare no competing interests.","formattedTitle":"Semantic Interoperability at National Scale: The SPHN Federated Clinical Routine Dataset","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"Swiss Institute of Bioinformatics","isAcceptedByJournal":false,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"Federated datasets, Metadata, Knowledge Graph, RDF, Terminology Usage, Real World Data ","lastPublishedDoi":"10.21203/rs.3.rs-8250886/v2","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-8250886/v2","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eOver the past eight years, the Swiss Personalized Health Network (SPHN) has established a national federated framework enabling semantically interoperable health-related data, with a primary focus on hospital clinical routine data. Rather than centralizing patient-level information, hospitals locally perform semantic coding and standardization and store SPHN-compliant data in dedicated triple stores. To promote discoverability, descriptive metadata and summary statistics derived from these local datasets are then centralized in the SPHN Metadata Catalog, which follows the SPHN Metadata Catalog Schema and aligns with European Health Data Space metadata standards.\u003c/p\u003e\n\u003cp\u003eAs of 2025, the SPHN Federated Clinical Routine Dataset encompasses information from more than 800,000 patients who provided broad consent, covering the period from 2018 to present. Across the first six participating hospitals, the infrastructure holds over 12.5 billion (10^9) RDF triples mapped to 125 SPHN semantic concepts including demographics, diagnoses, procedures, medications, laboratory results, vital signs, clinical scores, allergies, microbiology, intensive care data, oncology, and biological samples.\u003c/p\u003e\n\u003cp\u003eThis federated approach ensures that health data remain FAIR (Findable, Accessible, Interoperable, and Reusable) while safeguarding patient privacy by avoiding centralizing information. In this paper, we present the design, implementation, and scope of the SPHN Federated Clinical Routine Dataset, and its role in supporting data discoverability for research and clinical applications.\u003c/p\u003e","manuscriptTitle":"Semantic Interoperability at National Scale: The SPHN Federated Clinical Routine Dataset","msid":"","msnumber":"","nonDraftVersions":[{"code":2,"date":"2025-12-17 14:40:28","doi":"10.21203/rs.3.rs-8250886/v2","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}},{"code":1,"date":"2025-12-09 04:05:05","doi":"10.21203/rs.3.rs-8250886/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"93559477-d016-4961-a8bd-f29882e2ad4d","owner":[],"postedDate":"December 17th, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[],"tags":[],"updatedAt":"2025-12-09T04:05:05+00:00","versionOfRecord":[],"versionCreatedAt":"2025-12-17 14:40:28","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v2","identity":"rs-8250886","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-8250886","identity":"rs-8250886","version":["v2"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.