JMIR Preprints #64845: Clinical Laboratory Parameter–Driven Machine Learning for Participant Selection in Bioequivalence Studies Among Patients With Gastric Cancer: Framework Development and Validation Study

Clinical Laboratory Parameter–Driven Machine Learning for Participant Selection in Bioequivalence Studies Among Patients With Gastric Cancer: Framework Development and Validation Study

Byungeun Shon;
Sook Jin Seong;
Eun Jung Choi;
Mi-Ri Gwon;
Hae Won Lee;
Jaechan Park;
Ho-Young Chung;
Sungmoon Jeong;
Young-Ran Yoon

Background:

Insufficient participant enrollment is a major factor responsible for clinical trial failure.

Objective:

We formulated a machine learning (ML)-based framework that uses clinical laboratory parameters to identify participants eligible for enrollment in a bioequivalence study.

Methods:

We acquired records of 11,592 patients with gastric cancer from the electronic medical records of Kyungpook National University Hospital in Korea. The ML model was developed using eight clinical laboratory parameters, including complete blood count, liver function tests, and kidney function tests, along with the dates on which they were acquired. Two datasets were collected: a training dataset to design the ML-based candidate selection method and a test dataset to evaluate its performance. The generalization performance of the ML-based method was assessed using the F1 score and the area under the curve (AUC). The proposed model was compared with a random selection method to evaluate its efficacy in recruiting participants.
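The two evaluation metrics named above can be illustrated with a minimal sketch. It assumes that AUC refers to the area under the receiver operating characteristic curve and that the F1 score is computed for the "valid candidate" class after thresholding the model's scores. The labels, scores, and the 0.5 threshold are hypothetical toy values, not data or settings from the study.

```python
from typing import Sequence


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 for the positive class (1): harmonic mean of precision and recall."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the probability that a random positive outranks a random
    negative (ties count as half), i.e. the Mann-Whitney U formulation."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = valid candidate, 0 = not valid.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

f1 = f1_score(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(f"F1 = {f1:.3f}, AUC = {auc:.3f}")
```

The F1 score depends on the chosen threshold, whereas the AUC summarizes ranking quality across all thresholds, which is presumably why both are reported.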

Results:

The Weighted Ensemble model achieved an F1 score above 0.8 and an AUC exceeding 0.8, demonstrating its ability to accurately identify valid clinical trial candidates while minimizing misclassification. Its high sensitivity further enhances the model's efficiency in prioritizing patients for screening. In a case study, the proposed ML model reduced the workload by 57%: it identified 150 valid patients from a pool of 209, whereas random selection required 485 patients.
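The 57% figure can be checked with simple arithmetic. The sketch below assumes that workload is measured as the number of patients who must be reviewed to find 150 valid candidates (209 with the model versus 485 with random selection); the function name is illustrative and not taken from the study.

```python
def workload_reduction(reviewed_with_model: int, reviewed_at_random: int) -> float:
    """Fraction of review effort saved relative to random selection."""
    if reviewed_at_random <= 0:
        raise ValueError("reviewed_at_random must be positive")
    return 1.0 - reviewed_with_model / reviewed_at_random


# Figures reported in the case study.
valid_found = 150
reduction = workload_reduction(209, 485)
model_yield = valid_found / 209   # share of model-selected patients who were valid
random_yield = valid_found / 485  # share expected under random selection

print(f"Workload reduction: {reduction:.0%}")
print(f"Yield with model: {model_yield:.0%}, with random selection: {random_yield:.0%}")
```

Under this reading, roughly 72% of the patients prioritized by the model were valid, compared with about 31% under random selection.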

Conclusions:

The proposed ML-based framework using clinical laboratory parameters can be used to identify patients eligible for a clinical trial, enabling faster participant enrollment.

Citation

Please cite as:

Shon B, Seong SJ, Choi EJ, Gwon MR, Lee HW, Park J, Chung HY, Jeong S, Yoon YR

Clinical Laboratory Parameter–Driven Machine Learning for Participant Selection in Bioequivalence Studies Among Patients With Gastric Cancer: Framework Development and Validation Study

JMIR AI 2025;4:e64845

DOI: 10.2196/64845

PMID: 40605831

PMCID: 12223687

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR AI

Date Submitted: Jul 28, 2024

Date Accepted: Mar 26, 2025
