Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.
Who will be affected?
Readers: No access to all 28 journals. We recommend accessing our articles via PubMed Central
Authors: No access to the submission form or your user account.
Reviewers: No access to your user account. Please download manuscripts you are reviewing for offline reading before Wednesday, July 01, 2020 at 7:00 PM.
Editors: No access to your user account to assign reviewers or make decisions.
Copyeditors: No access to user account. Please download manuscripts you are copyediting before Wednesday, July 01, 2020 at 7:00 PM.
Use of an AI-Based Tool (Human Experiences and Reflections Data Connector) to Improve Discovery and Reuse of Archived Qualitative Data: Protocol for an Algorithm Development and Validation Study
Human Experiences and Reflections (HEARs) Data Connector Protocol: Development and Validation of an AI-Based Tool to Improve Discovery and Reuse of Archived Qualitative Data
Tammy Leonard;
Jim P. Stimpson;
Miguel Angel Cano;
Wenqi Shi;
Song Zhang
ABSTRACT
Background:
Despite growing emphasis on open science and equity in research, qualitative data capturing diverse human experiences and perspectives are rarely reused beyond the original study. Increasingly data repositories are utilized to make this data publicly available, but it is unclear whether this data can effectively be identified by researchers interested in secondary data analysis.
Objective:
We describe a protocol for identifying and characterizing archived qualitative datasets in leading public repositories, developing an AI-based tool to enhance qualitative data reuse, and validating that tool using existing data.
Methods:
We will search 4 leading repositories to assess the scope and identifiability of existing publicly available qualitative data sets. We will subsequently build the HEARs Archive, a directory of de-identified study data that is only accessible indirectly through the use of the HEARs Portal. The HEARs Portal will be supported by large language model (LLM)-based tools using retrieval-augmented generation (RAG). The AI tools’ performance will be assessed across 3 domains: relevance of identified studies, validity as evaluated by comparison with human qualitative data analysis, and robustness against the addition of irrelevant information.
Results:
Preliminary review of existing data repositories has begun. Anticipated study completion is December 31, 2026.
Conclusions:
The proposed project intends to fill important gaps related to improving capacity for qualitative data dissemination and reuse. Clinical Trial: This is not a trial; not applicable.
Citation
Please cite as:
Leonard T, Stimpson JP, Cano MA, Shi W, Zhang S
Use of an AI-Based Tool (Human Experiences and Reflections Data Connector) to Improve Discovery and Reuse of Archived Qualitative Data: Protocol for an Algorithm Development and Validation Study