Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Feb 12, 2026
Date Accepted: Jun 2, 2026

The final, peer-reviewed published version of this preprint can be found here:

Comparative Evaluation of ChatGPT, Google Translate, and UD Talk for Chinese-to-Japanese Translation in Cardiology and Pulmonology Outpatient Consultations: Prospective Observational Study

Chao KY, Huang CH, Liu WL, Chen CY, Lim AY, Sasaoka A, Lin CY

Comparative Evaluation of ChatGPT, Google Translate, and UD Talk for Chinese-to-Japanese Translation in Cardiology and Pulmonology Outpatient Consultations: Prospective Observational Study

J Med Internet Res 2026;28:e93082

DOI: 10.2196/93082

PMID: 42315139

Comparative Evaluation of ChatGPT, Google Translate, and UD Talk for Chinese-to-Japanese Translation in Cardiology and Pulmonology Outpatient Consultations

  • Ke-Yun Chao; 
  • Chua-Hui Huang; 
  • Wei-Lun Liu; 
  • Chao-Yu Chen; 
  • Ai Yin Lim; 
  • Atsuko Sasaoka; 
  • Chung-Yu Lin

ABSTRACT

Background:

Language barriers between health-care providers and patients can compromise communication quality, patient safety, and health-care equity. When professional interpreter services are limited, particularly in outpatient settings, artificial intelligence–based translation tools may serve as supplementary communication aids.

Objective:

This study compared 3 Chinese-Japanese translation tools, ChatGPT, Google Translate, and UD Talk, in terms of their performance in real-world cardiology and pulmonology outpatient consultations.

Methods:

In this single-center prospective observational study, audio-recorded outpatient consultations between December 2024 and November 2025 were analyzed. Verbatim physician-patient dialogues were translated using the 3 systems. Selected dialogue exchanges were evaluated by professional medical interpreters for translation accuracy and by Japanese-speaking lay participants for translation satisfaction by using a 6-point Likert scale. Similarity of translation outputs was assessed at the dialogue-exchange level.

Results:

In Chinese-Japanese outpatient medical consultations, large language model–based translation tools, particularly ChatGPT, demonstrated higher translation accuracy and user satisfaction than did conventional translation systems. These findings indicate that such tools may support multilingual clinical communication when used as complementary aids alongside professional interpreter services.

Conclusions:

Large language model–based translation demonstrated superior performance and usability compared with conventional translation tools in Chinese–Japanese medical consultations, suggesting its potential role in supporting multilingual clinical communication and interpreter training. Clinical Trial: NCT06934031


 Citation

Please cite as:

Chao KY, Huang CH, Liu WL, Chen CY, Lim AY, Sasaoka A, Lin CY

Comparative Evaluation of ChatGPT, Google Translate, and UD Talk for Chinese-to-Japanese Translation in Cardiology and Pulmonology Outpatient Consultations: Prospective Observational Study

J Med Internet Res 2026;28:e93082

DOI: 10.2196/93082

PMID: 42315139

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.