Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Mental Health

Date Submitted: Mar 13, 2025
Open Peer Review Period: Mar 21, 2025 - May 16, 2025
Date Accepted: Jun 26, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Directory of Public Datasets for Youth Mental Health to Enhance Research Through Data, Accessibility, and Artificial Intelligence: Scoping Review

Min H, Jing X, Tao C, Williams JE, Griffinb SF, Esposito-Smythers C, Chorpita B

Directory of Public Datasets for Youth Mental Health to Enhance Research Through Data, Accessibility, and Artificial Intelligence: Scoping Review

JMIR Ment Health 2025;12:e73852

DOI: 10.2196/73852

PMID: 40921091

PMCID: 12422525

Directory of Public Datasets for Youth Mental Health: A Brief Review to Enhance Research through Data, Accessibility, and Artificial Intelligence

  • Hua Min; 
  • Xia Jing; 
  • Cui Tao; 
  • Joel E. Williams; 
  • Sarah F Griffinb; 
  • Christianne Esposito-Smythers; 
  • Bruce Chorpita

ABSTRACT

Background:

Youth mental health issues have been recognized as a pressing crisis in the USA in recent years. Effective, evidence-based mental health research and interventions require access to integrated datasets that consolidate diverse and fragmented data sources. However, researchers face challenges due to the lack of centralized, publicly available datasets, limiting the potential for comprehensive analysis and data-driven decision-making.

Objective:

This paper introduces a curated directory of publicly available datasets focused on youth mental health (under 18 years old). The directory is designed to serve as critical infrastructure to enhance research, inform policy-making, and support the application of artificial intelligence (AI) and machine learning (ML) in youth mental health research.

Methods:

Unlike a systematic review, this paper offers a brief overview of open data resources, addressing the challenges of fragmented health data in youth mental health research. We conducted a structured search using three approaches: targeted searches on reputable health organization websites (e.g., NIH, CDC), librarian consultation to identify hard-to-find datasets, and expert knowledge from prior research. Identified datasets were curated with key details, including name, description, components, format, access information, and study type, with a focus on freely available resources.

Results:

A curated list of publicly available datasets on youth mental health and school policies was compiled. While not exhaustive, it highlights key resources relevant to youth mental health research. Our findings identify major national survey series conducted by organizations such as the CDC, SAMHSA, and the U.S. Census Bureau, which focus on youth mental health and substance use. Additionally, we include data on state and school health policies, offering varying scopes and granularities. Valuable health data repositories such as ICPSR, Data.gov, Healthdata.gov, Data.CDC.gov, OpenFDA, and Data.CMS.gov host a wide range of research data, including surveys, longitudinal studies, and individual research projects.

Conclusions:

Publicly accessible health data is essential for improving youth mental health outcomes. Compiling and centralizing these resources streamlines access, enhances research impact, and informs interventions and policies. By improving data integration and accessibility, it encourages interdisciplinary collaboration and supports evidence-based interventions. Clinical Trial: N/A


 Citation

Please cite as:

Min H, Jing X, Tao C, Williams JE, Griffinb SF, Esposito-Smythers C, Chorpita B

Directory of Public Datasets for Youth Mental Health to Enhance Research Through Data, Accessibility, and Artificial Intelligence: Scoping Review

JMIR Ment Health 2025;12:e73852

DOI: 10.2196/73852

PMID: 40921091

PMCID: 12422525

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.