Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 17, 2025
Date Accepted: Nov 14, 2025

The final, peer-reviewed published version of this preprint can be found here:

Promoting Responsible DeepSeek Deployment in Health Care: Scoping Review Comparing Grey and White Literature

Jiang W, Wang D, Zeng Y, Huang J, Xu C, Liu C

Promoting Responsible DeepSeek Deployment in Health Care: Scoping Review Comparing Grey and White Literature

J Med Internet Res 2025;27:e80770

DOI: 10.2196/80770

PMID: 41348941

PMCID: 12680131

Promoting Responsible DeepSeek Deployment in Healthcare: A Scoping Review Comparing Grey and White Literature of Chinese Leading Hospitals Disclosure and Studies

  • Wang Jiang; 
  • Dan Wang; 
  • Yihang Zeng; 
  • Jiaqi Huang; 
  • Chang Xu; 
  • Chenxi Liu

ABSTRACT

Background:

The rapid deployment of DeepSeek, an open-source large language model has sparked concerns of its impact on patient outcomes and safety. However, little is known about how DeepSeek is used and regulated in these facilities.

Objective:

This study aimed to 1) systematically review the characteristics of deployed DeepSeek in the top 100 hospitals in China; and 2) compare performances and risks from hospital disclosure with research evidence.

Methods:

We performed a scoping review of gray and white literature, collecting data from the top 100 Chinese hospitals. We extracted basic characteristics of DeepSeek, its aim, evaluation approach, performance, risk and hospital regulation. A coding framework was developedcovering LLMs application scenario, evaluation dimension and source of risk.

Results:

We identified a total of 58 DeepSeek models in 48 out of the top 100 Chinese hospitals as well as 27 studies. We observed deployed DeepSeek mainly intended to assist clinical decision making, such as patient diagnosis and treatment recommendation. However, only 36.2% hospital-deployed models clearly indicated a pre-deployment assessment, 22.4% presented assessment results, and 8.6% identified potential risks and countermeasures. We found poor transparency in hospital reporting, with none presenting evaluation details. Hospitals were likely to report DeepSeek’s higher performance and fewer risks.

Conclusions:

The irresponsible deployment of DeepSeek in Chinese leading hospitals poses potential risks to patient outcomes and safety. We highlight the urgent need that existing regulations should be expanded to the downstream developers and users and hospitals need to perform a more rigorous validation and transparent reporting.


 Citation

Please cite as:

Jiang W, Wang D, Zeng Y, Huang J, Xu C, Liu C

Promoting Responsible DeepSeek Deployment in Health Care: Scoping Review Comparing Grey and White Literature

J Med Internet Res 2025;27:e80770

DOI: 10.2196/80770

PMID: 41348941

PMCID: 12680131

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.