Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR AI

Date Submitted: Mar 3, 2025
Date Accepted: Mar 28, 2026

The final, peer-reviewed published version of this preprint can be found here:

A Language Model for Pediatric Occupational Therapy Documentation: Model Development and Pilot Study

DiMaio R, Tuinstra T, Yu T, Manda I, Wylie-Toal B, Tripp B

A Language Model for Pediatric Occupational Therapy Documentation: Model Development and Pilot Study

JMIR AI 2026;5:e73274

DOI: 10.2196/73274

PMID: 42139718

A Language Model for Pediatric Occupational Therapy Documentation: Model Development and Pilot

  • Rachel DiMaio; 
  • Tia Tuinstra; 
  • Trevor Yu; 
  • Ilona Manda; 
  • Brendan Wylie-Toal; 
  • Bryan Tripp

ABSTRACT

Background:

In occupational therapy, progress notes and other client-related administrative tasks are essential for providing treatment but are time consuming. Therapists spend at least as much time on these tasks as providing care, which contributes to growing wait lists.

Objective:

We aimed to create a custom large language model to make the process of writing progress notes more efficient by converting point-form scratch notes from pediatric occupational therapy treatment sessions into draft documentation in subjective-objective-assessment-plan (SOAP) format.

Methods:

Using a dataset of redacted historical progress notes, various training methods, including domain-adaptive pre-training and Low-rank Adaptation (LoRA) fine-tuning, were applied to train Llama 2 and 3 models. Since the historical notes lacked corresponding scratch notes, few-shot prompting with human-in-the-loop evaluations was used to generate synthetic scratch notes. This pairing of historical notes and generated scratch notes enabled effective fine-tuning of the Llama models on the desired task. The final model, a fine-tuned Llama 3 8B Instruct model, was piloted in a pediatric rehabilitation center and compared with Microsoft Copilot. Ten therapists used both models for three weeks each.

Results:

The custom model notes scored higher than manually written notes on clarity, completeness, relevance, and organization (p <.001 in each case) and similarly on conciseness. They scored higher than those from Copilot on conciseness (p <.001). However, there was no evidence from the pilot study that either model saved therapists’ time. Follow-up investigation revealed that coaching therapists to write brief scratch notes was crucial for reducing time spent on the process, but also that they tended to revert to detailed scratch notes after coaching.

Conclusions:

The model had the capacity to save time when therapists provided brief input to the model. However, in practice, therapists preferred to provide detailed input. Used in this way, the model improved note quality rather than saving time.


 Citation

Please cite as:

DiMaio R, Tuinstra T, Yu T, Manda I, Wylie-Toal B, Tripp B

A Language Model for Pediatric Occupational Therapy Documentation: Model Development and Pilot Study

JMIR AI 2026;5:e73274

DOI: 10.2196/73274

PMID: 42139718

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.