Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 8, 2025
Date Accepted: Dec 20, 2025

The final, peer-reviewed published version of this preprint can be found here:

Extended Grammar of Systematized Nomenclature of Medicine – Clinical Terms for Semantic Representation of Clinical Data: Methodological Study

Gaudet-Blavignac C, Ehrsam J, Baumann M, Bensahla A, Mattei M, Zheng Y, Lovis C

Extended Grammar of Systematized Nomenclature of Medicine – Clinical Terms for Semantic Representation of Clinical Data: Methodological Study

J Med Internet Res 2026;28:e80314

DOI: 10.2196/80314

PMID: 41605503

PMCID: 12895155

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Escaping Babel: A formal descriptive grammar for universal representation of clinical data

  • Christophe Gaudet-Blavignac; 
  • Julien Ehrsam; 
  • Monika Baumann; 
  • Adel Bensahla; 
  • Mirjam Mattei; 
  • Yuanyuan Zheng; 
  • Christian Lovis

ABSTRACT

Background:

Interoperability has been a challenge for half a century. Led by an informatic view of the world, the quest for interoperability has evolved from typing and categorizing data, to building increasingly complex models. In parallel with the development of these models, the field of terminologies and ontologies emerged to refine granularity and introduce notions of hierarchy. Clinical data models and terminology systems are inherently limited. They vary in purpose, and their fixed categories shape and constrain representation, which inevitably leads to information loss.

Objective:

Despite these efforts, true semantic interoperability remains out of reach and meaning remains lacking. Achieving it is essential for effective data reuse but requires more than rich terminologies and standardized models. This article proposes a paradigm shift: by mimicking natural languages, we formalize a shared grammar that defines how clinical reality can be expressed in coherent, meaningful sentences, rather than fixing the boundaries of representable reality a priori.

Methods:

This work builds on a decade of semantic representation efforts following established frameworks to develop a formal descriptive grammar through expert-guided and consensus-based refinement.

Results:

This formal descriptive grammar offers a dynamic decentralized framework, enabling large scale representation without the need of a central authority. This ensures machine-readability and enables meaningful representation of evolving medical knowledge, adapting through ongoing use to accurately represent reality. It addresses limitations in representation such as negation, scalar values, uncertainty, temporality, and integrates external vocabularies.

Conclusions:

After years of fragmented attempts at capturing meaning, this radically different approach proposes a unifying, computable framework to consistently represent data across sources and use cases.


 Citation

Please cite as:

Gaudet-Blavignac C, Ehrsam J, Baumann M, Bensahla A, Mattei M, Zheng Y, Lovis C

Extended Grammar of Systematized Nomenclature of Medicine – Clinical Terms for Semantic Representation of Clinical Data: Methodological Study

J Med Internet Res 2026;28:e80314

DOI: 10.2196/80314

PMID: 41605503

PMCID: 12895155

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.