JMIR Preprints #64279: Development and validation of deep learning models to predict diagnostic and billing codes following visits to family medicine

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Development and validation of deep learning models to predict diagnostic and billing codes following visits to family medicine

Akshay Rajaram;
Michael Judd;
David Barber

ABSTRACT

Background:

Despite significant time spent on billing, family physicians routinely make errors and miss billing opportunities. In other disciplines, machine learning models have predicted current procedural terminology codes with high accuracy.

Objective:

Our objective was to derive machine learning models capable of predicting diagnosis and billing codes from notes recorded in the electronic medical record.

Methods:

We conducted a retrospective algorithm development and validation study involving an academic family medicine practice. Visits between July 1, 2015 and June 30, 2020 containing a physician-authored note and an invoice in the electronic medical record were eligible for inclusion. We trained two deep learning models and compared their predictions to codes submitted for reimbursement. We calculated accuracy, recall, precision, F1 score and area under the receiver operating curve.

Results:

245,045 visits were eligible for inclusion. 198,802 (81%) were included in model development. Accuracy was 99.7% and 99.6% for the diagnosis and billing code models, respectively. Recall was 66.4% and 70.2% for the diagnosis and billing code models, respectively. Precision was 38.0% and 72.8% for the diagnosis and billing code models, respectively. Area under the curve was 0.988 for the diagnosis code model and 0.933 for the billing code model.

Conclusions:

We developed models capable of predicting diagnosis and billing codes from electronic notes following visits to family medicine. The billing model outperformed the diagnosis model in terms of recall and precision likely due to fewer codes being predicted. Work is underway to further enhance model performance and assess the generalizability of these models to other family medicine practices.

Citation

Please cite as:

Rajaram A, Judd M, Barber D

Deep Learning Models to Predict Diagnostic and Billing Codes Following Visits to a Family Medicine Practice: Development and Validation Study

JMIR AI 2025;4:e64279

DOI: 10.2196/64279

PMID: 40605560

PMCID: 12231501

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR AI

Date Submitted: Jul 13, 2024

Date Accepted: Feb 8, 2025

Development and validation of deep learning models to predict diagnostic and billing codes following visits to family medicine

ABSTRACT

Citation

Copyright