JMIR Preprints #99806: From Data-Driven Nomograms to Knowledge-Driven Clinical AI: Comparative Validation of Bayesian and LLM-Based Models in Preoperative Risk Prediction

From Data-Driven Nomograms to Knowledge-Driven Clinical AI: Comparative Validation of Bayesian and LLM-Based Models in Preoperative Risk Prediction

Olivier Cussenot;
Josephine Renand;
Deborah Bret;
Thimotée Furet;
Agata Sujela;
Shahrokh Shariat;
Géraldine Cancel-Tassin;
Georges Fournier;
Antoine Valeri

ABSTRACT

Background:

Clinical prediction tools are commonly derived from retrospective datasets using statistical learning approaches. Although effective, these data-driven nomograms may be population-specific, static, and only partially aligned with causal clinical reasoning. Recent advances in artificial intelligence enable knowledge-driven approaches in which expert assumptions, probabilistic structures, and domain reasoning contribute directly to model construction.

Objective:

To compare the performance of two knowledge-driven AI models with established data-driven nomograms for preoperative prediction of lymph node invasion (LNI) in localized prostate cancer.

Methods:

A retrospective dataset of 229 consecutive patients with clinically localized prostate cancer (cT2 on clinical examination and MRI) who underwent radical prostatectomy with extended pelvic lymph node dissection served as the use case. Histopathological LNI was the reference standard. Predictors included PSA density, MRI extracapsular extension, biopsy ISUP grade group, and maximum tumor diameter on MRI. Three conventional models (Briganti, Yale, Roach) were compared with two knowledge-driven systems: (1) an LLM-assisted logistic equation generated from predefined clinical constraints, and (2) a Bayesian network parameterized through structured expert/AI elicitation. Discrimination, threshold metrics, predictive values, and decision utility were assessed.
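The evaluation metrics named above can be illustrated with a minimal sketch. It assumes that discrimination means the area under the ROC curve, that threshold metrics include the Youden index, and that decision utility is measured as net benefit in the decision-curve sense; the abstract does not spell out these definitions. All function names and the toy data are hypothetical and are not taken from the study.

```python
from typing import Dict, List


def auc(y_true: List[int], scores: List[float]) -> float:
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def threshold_metrics(y_true: List[int], scores: List[float],
                      threshold: float) -> Dict[str, float]:
    """Sensitivity, specificity, Youden index, NPV and net benefit
    when predicting LNI for every score >= threshold (0 < threshold < 1)."""
    tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
    fn = sum(1 for y, s in zip(y_true, scores) if s < threshold and y == 1)
    tn = sum(1 for y, s in zip(y_true, scores) if s < threshold and y == 0)
    n = len(y_true)
    sens = tp / (tp + fn)
    spec = tn / (tn + fp)
    return {
        "sensitivity": sens,
        "specificity": spec,
        "youden": sens + spec - 1.0,
        # NPV is undefined when no patient is predicted negative.
        "npv": tn / (tn + fn) if (tn + fn) else float("nan"),
        # Net benefit weighs false positives by the odds of the threshold.
        "net_benefit": tp / n - (fp / n) * threshold / (1.0 - threshold),
    }


def best_youden_threshold(y_true: List[int], scores: List[float]) -> float:
    """Observed score that maximizes the Youden index."""
    return max(sorted(set(scores)),
               key=lambda t: threshold_metrics(y_true, scores, t)["youden"])


if __name__ == "__main__":
    # Toy data only: 1 = histopathological LNI, scores = predicted risk.
    y = [0, 0, 0, 0, 0, 0, 1, 0, 1, 1]
    risk = [0.02, 0.03, 0.05, 0.06, 0.08, 0.10, 0.07, 0.15, 0.20, 0.30]
    print("AUC:", round(auc(y, risk), 3))
    t = best_youden_threshold(y, risk)
    print("Best threshold:", t, threshold_metrics(y, risk, t))
```

The pairwise AUC is quadratic in the number of patients, which is acceptable for a cohort of a few hundred; a rank-based implementation would be preferable for larger datasets.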

Results:

The Briganti model achieved the highest discrimination (AUC 0.721); the LLM-assisted logistic model (AUC 0.697) and the Bayesian network (AUC 0.689) performed similarly. The Bayesian model achieved the highest Youden index (0.346), indicating the best balance between sensitivity and specificity, and showed strong clinical utility. Negative predictive values exceeded 0.89 for all models.
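For readers less familiar with these threshold metrics, the standard definitions are sketched below, with Se for sensitivity, Sp for specificity, and TN and FN for true and false negative counts. The abstract reports neither the operating threshold nor the individual sensitivity and specificity, so only their sum can be inferred from the reported Youden index.

```latex
\[
J = \mathrm{Se} + \mathrm{Sp} - 1,
\qquad
\mathrm{NPV} = \frac{TN}{TN + FN}
\]
\[
J = 0.346 \;\Longrightarrow\; \mathrm{Se} + \mathrm{Sp} = 1.346
\]
```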

Conclusions:

Knowledge-driven AI models achieved performance comparable to established nomograms while offering interpretability and probabilistic reasoning. These findings support prospective evaluation of hybrid data- and knowledge-driven clinical decision-support systems.

Citation

Please cite as:

Cussenot O, Renand J, Bret D, Furet T, Sujela A, Shariat S, Cancel-Tassin G, Fournier G, Valeri A

From Data-Driven Nomograms to Knowledge-Driven Clinical AI: Comparative Validation of Bayesian and LLM-Based Models in Preoperative Risk Prediction

JMIR Preprints. 29/04/2026:99806

DOI: 10.2196/preprints.99806

URL: https://preprints.jmir.org/preprint/99806

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Currently submitted to: Journal of Medical Internet Research

Date Submitted: Apr 29, 2026
