Accepted for/Published in: JMIR Mental Health
Date Submitted: Mar 13, 2025
Open Peer Review Period: Mar 21, 2025 - May 16, 2025
Date Accepted: Jun 26, 2025
(closed for review but you can still tweet)
Directory of Public Datasets for Youth Mental Health: A Brief Review to Enhance Research through Data, Accessibility, and Artificial Intelligence
ABSTRACT
Background:
Youth mental health issues have been recognized as a pressing crisis in the USA in recent years. Effective, evidence-based mental health research and interventions require access to integrated datasets that consolidate diverse and fragmented data sources. However, researchers face challenges due to the lack of centralized, publicly available datasets, limiting the potential for comprehensive analysis and data-driven decision-making.
Objective:
This paper introduces a curated directory of publicly available datasets focused on youth mental health (under 18 years old). The directory is designed to serve as critical infrastructure to enhance research, inform policy-making, and support the application of artificial intelligence (AI) and machine learning (ML) in youth mental health research.
Methods:
Unlike a systematic review, this paper offers a brief overview of open data resources, addressing the challenges of fragmented health data in youth mental health research. We conducted a structured search using three approaches: targeted searches on reputable health organization websites (e.g., NIH, CDC), librarian consultation to identify hard-to-find datasets, and expert knowledge from prior research. Identified datasets were curated with key details, including name, description, components, format, access information, and study type, with a focus on freely available resources.
Results:
A curated list of publicly available datasets on youth mental health and school policies was compiled. While not exhaustive, it highlights key resources relevant to youth mental health research. Our findings identify major national survey series conducted by organizations such as the CDC, SAMHSA, and the U.S. Census Bureau, which focus on youth mental health and substance use. Additionally, we include data on state and school health policies, offering varying scopes and granularities. Valuable health data repositories such as ICPSR, Data.gov, Healthdata.gov, Data.CDC.gov, OpenFDA, and Data.CMS.gov host a wide range of research data, including surveys, longitudinal studies, and individual research projects.
Conclusions:
Publicly accessible health data is essential for improving youth mental health outcomes. Compiling and centralizing these resources streamlines access, enhances research impact, and informs interventions and policies. By improving data integration and accessibility, it encourages interdisciplinary collaboration and supports evidence-based interventions. Clinical Trial: N/A
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.