Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently submitted to: Journal of Medical Internet Research

Date Submitted: May 12, 2026
Open Peer Review Period: May 14, 2026 - Jul 9, 2026
(currently open for review)

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Development and Preliminary Validation of CLEAR: A Framework for Evaluating Patient-Friendly AI-Generated Clinical Documentation

  • Priyanka Solanki; 
  • Batia Wiesenfeld; 
  • Jonah Zaretsky; 
  • Dennis Kurian; 
  • Katherine Kellogg; 
  • William Small; 
  • Jared Silberlust; 
  • Jacob Martin; 
  • Christopher Sonne; 
  • Alyssa Pradhan; 
  • Marina de Pablo; 
  • Kathleen Evanovich Zavotsky; 
  • Rebecca Borjas; 
  • Melissa Oliveras; 
  • Nilufar Tursnova; 
  • Jeong Min Kim; 
  • Lucille Fenelon; 
  • Kellie Owens; 
  • Alyssa Gutjahr; 
  • Marisa Lewis; 
  • Jonathan Austrian; 
  • Paul Testa; 
  • Jonah Feldman

ABSTRACT

Background:

Generative artificial intelligence (GenAI) is increasingly used to produce patient-friendly clinical documentation, yet evaluation of these outputs remains inconsistent and difficult to scale. Patient-friendliness is commonly reduced to narrow readability metrics, such as Flesch-Kincaid grade level, without accounting for clinical accuracy, completeness, or the patient perspective. No standardized framework exists to evaluate the quality and safety of AI-generated patient-friendly documentation across document types or the full documentation lifecycle.

Objective:

To develop and preliminarily validate CLEAR (Clinical Language Evaluation and AI Documentation Review), a theoretically grounded evaluation framework for AI-generated patient-friendly clinical documentation across the generation, review, and monitoring stages of the AI documentation lifecycle.

Methods:

CLEAR was developed using Messick's validity framework across four stages: content validation, response process, internal structure, and consequences. Domains were identified through a targeted literature review and reviewed by a panel of six clinical and operational experts. An iterative, consensus-based process involving four board-certified internists across 10 rounds refined domain definitions and scoring instructions. Inter-rater reliability was assessed on 50 AI-generated patient-friendly discharge summaries using Cohen's kappa and Gwet's AC1 for binary domains and intraclass correlation coefficients (ICC) and Gwet's AC2 for continuous domains. Additionally, 19 semi-structured stakeholder interviews with clinicians, informaticists, institutional leaders, and patient education experts explored operational needs and implementation contexts.

Results:

CLEAR comprises five domains for evaluating patient-friendly AI documentation: readability, understandability, patient-centeredness, accuracy, and completeness. Inter-rater reliability was good to almost perfect across all subjectively scored domains per Gwet's agreement coefficients. Stakeholder interviews independently identified three operational gaps aligned with the CLEAR lifecycle: lack of structured guidance for prompt engineering, subjectivity in human review, and absence of scalable monitoring infrastructure, directly validating the framework's real-world relevance. CLEAR was applied across three illustrative implementation contexts: prompt engineering for patient-friendly echocardiogram reports, structured human review of discharge summaries, and development of LLM-as-judge automated monitoring tools.

Conclusions:

CLEAR provides a preliminarily validated evaluation framework designed to span the full AI documentation lifecycle, from prompt engineering through human review to automated monitoring. By conceptualizing patient-friendliness as a multidimensional construct that integrates communication quality with patient safety, CLEAR offers practical infrastructure for consistent and scalable governance of patient-facing AI documentation in healthcare systems.


 Citation

Please cite as:

Solanki P, Wiesenfeld B, Zaretsky J, Kurian D, Kellogg K, Small W, Silberlust J, Martin J, Sonne C, Pradhan A, de Pablo M, Zavotsky KE, Borjas R, Oliveras M, Tursnova N, Kim JM, Fenelon L, Owens K, Gutjahr A, Lewis M, Austrian J, Testa P, Feldman J

Development and Preliminary Validation of CLEAR: A Framework for Evaluating Patient-Friendly AI-Generated Clinical Documentation

JMIR Preprints. 12/05/2026:101110

DOI: 10.2196/preprints.101110

URL: https://preprints.jmir.org/preprint/101110

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.