Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Aug 20, 2025
Date Accepted: Feb 25, 2026
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Use of an AI-Based Tool (Human Experiences and Reflections Data Connector) to Improve Discovery and Reuse of Archived Qualitative Data: Protocol for an Algorithm Development and Validation Study

Leonard T, Stimpson JP, Cano MA, Shi W, Zhang S

Use of an AI-Based Tool (Human Experiences and Reflections Data Connector) to Improve Discovery and Reuse of Archived Qualitative Data: Protocol for an Algorithm Development and Validation Study

JMIR Res Protoc 2026;15:e82725

DOI: 10.2196/82725

PMID: 42013421

Human Experiences and Reflections (HEARs) Data Connector Protocol: Development and Validation of an AI-Based Tool to Improve Discovery and Reuse of Archived Qualitative Data

  • Tammy Leonard; 
  • Jim P. Stimpson; 
  • Miguel Angel Cano; 
  • Wenqi Shi; 
  • Song Zhang

ABSTRACT

Background:

Despite growing emphasis on open science and equity in research, qualitative data capturing diverse human experiences and perspectives are rarely reused beyond the original study. Increasingly data repositories are utilized to make this data publicly available, but it is unclear whether this data can effectively be identified by researchers interested in secondary data analysis.

Objective:

We describe a protocol for identifying and characterizing archived qualitative datasets in leading public repositories, developing an AI-based tool to enhance qualitative data reuse, and validating that tool using existing data.

Methods:

We will search 4 leading repositories to assess the scope and identifiability of existing publicly available qualitative data sets. We will subsequently build the HEARs Archive, a directory of de-identified study data that is only accessible indirectly through the use of the HEARs Portal. The HEARs Portal will be supported by large language model (LLM)-based tools using retrieval-augmented generation (RAG). The AI tools’ performance will be assessed across 3 domains: relevance of identified studies, validity as evaluated by comparison with human qualitative data analysis, and robustness against the addition of irrelevant information.

Results:

Preliminary review of existing data repositories has begun. Anticipated study completion is December 31, 2026.

Conclusions:

The proposed project intends to fill important gaps related to improving capacity for qualitative data dissemination and reuse. Clinical Trial: This is not a trial; not applicable.


 Citation

Please cite as:

Leonard T, Stimpson JP, Cano MA, Shi W, Zhang S

Use of an AI-Based Tool (Human Experiences and Reflections Data Connector) to Improve Discovery and Reuse of Archived Qualitative Data: Protocol for an Algorithm Development and Validation Study

JMIR Res Protoc 2026;15:e82725

DOI: 10.2196/82725

PMID: 42013421

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.