Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Dec 17, 2025
Date Accepted: Apr 23, 2026

The final, peer-reviewed published version of this preprint can be found here:

Clinical Evaluation of the Clinical Reasoning Process of Large Language Models in Nephrology: Comparative Evaluation Study

Yano Y, Kakizaki H, Nagasu H, Kishi S, Koshida T, Nihei Y, Hirano A, Nangaku M, Mori H, Naito T, Ohashi M, Maruyama S, Matsui I, Isaka Y, Okada H, Suzuki Y, Kashihara N

Clinical Evaluation of the Clinical Reasoning Process of Large Language Models in Nephrology: Comparative Evaluation Study

JMIR Form Res 2026;10:e89726

DOI: 10.2196/89726

PMID: 42234872

Clinical Evaluation of the Clinical Reasoning Process of Large Language Models in Nephrology: Comparative Evaluation Study

  • Yuichiro Yano; 
  • Hiroaki Kakizaki; 
  • Hajime Nagasu; 
  • Seiji Kishi; 
  • Takeo Koshida; 
  • Yoshihito Nihei; 
  • Akira Hirano; 
  • Masaomi Nangaku; 
  • Hirotake Mori; 
  • Toshio Naito; 
  • Mizuki Ohashi; 
  • Shoichi Maruyama; 
  • Isao Matsui; 
  • Yoshitaka Isaka; 
  • Hirokazu Okada; 
  • Yusuke Suzuki; 
  • Naoki Kashihara

ABSTRACT

We introduce a step-by-step framework to evaluate LLM clinical reasoning using complex nephrology cases, revealing model-specific weaknesses in advanced reasoning tasks and demonstrating that high reasoning performance does not require high computational cost.


 Citation

Please cite as:

Yano Y, Kakizaki H, Nagasu H, Kishi S, Koshida T, Nihei Y, Hirano A, Nangaku M, Mori H, Naito T, Ohashi M, Maruyama S, Matsui I, Isaka Y, Okada H, Suzuki Y, Kashihara N

Clinical Evaluation of the Clinical Reasoning Process of Large Language Models in Nephrology: Comparative Evaluation Study

JMIR Form Res 2026;10:e89726

DOI: 10.2196/89726

PMID: 42234872

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.