Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 26, 2022
Open Peer Review Period: May 26, 2022 - Jun 2, 2022
Date Accepted: Jul 8, 2022
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Deciphering the Diversity of Mental Models in Neurodevelopmental Disorders: Knowledge Graph Representation of Public Data Using Natural Language Processing

Kaur M, Costello J, Willis E, Kelm K, Reformat MZ, Bolduc FV

Deciphering the Diversity of Mental Models in Neurodevelopmental Disorders: Knowledge Graph Representation of Public Data Using Natural Language Processing

J Med Internet Res 2022;24(8):e39888

DOI: 10.2196/39888

PMID: 35930346

PMCID: 9391978

Deciphering Diversity of Mental Models in Neurodevelopmental Disorders: Knowledge Graph Representation of Public Data Using Natural Language Processing

  • Manpreet Kaur; 
  • Jeremy Costello; 
  • Elyse Willis; 
  • Karen Kelm; 
  • Marek Z. Reformat; 
  • Francois V. Bolduc

ABSTRACT

Background:

Understanding how individuals think about a topic can help to significantly improve communication. This is especially true when it comes to the medical domain where emotion and implications are high. Neurodevelopmental disorders (NDD) represent a group of diagnoses, affecting up to 18% of the population, involving differences in the development of cognitive or social functions and including attention deficit hyperactivity disorder (ADHD) as well as autism spectrum disorders (ASD). Both are complex conditions involving multiple symptoms and interventions where parents and health professionals interact. There is a gap in our global understanding of how each of those stakeholders differs in their preoccupations, making it difficult to address needs in knowledge mobilization.

Objective:

We aim to use Natural Language Processing techniques to build the Knowledge Graph from online information related to each stakeholder to help accelerate the identification of shared concerns and points of divergence between them. Ultimately, online information could be used to target knowledge mobilization and improve communication and care for individuals with ADHD and ASD.

Methods:

We created two datasets by collecting the posts from ASD and ADHD related online forums and PubMed abstracts and utilized the Unified Medical Language System (UMLS) to detect the biomedical concepts. Positive Pointwise mutual information (PPMI) followed by truncated Singular Value Decomposition (SVD) was applied to obtain the corpus-based UMLS concept embeddings for forums and PubMed. Property graph models were used for the Knowledge Graph representation of forums and PubMed. Semantic relatedness between concepts and the ASD condition or ADHD condition was calculated to rank the related concepts and stored as weight of edges. Additionally, UMLS semantic types were used to group concepts as well as to provide additional categorical information about concept’s domain.

Results:

Public forums on ADHD and ASD provide us with a wide range of concepts across multiple domains. Using Knowledge Graphs allows us to illustrate overlapping concepts between health professional literature (PubMed) and parental concerns (forums) with similar relevance scores, as the edge weight, and different co-occurrence frequency with the condition in each corpus, as the node size. Further, Knowledge Graphs also identify concepts with significantly different relevance scores between the stakeholders.

Conclusions:

Understanding the complex needs of families dealing with ASD or ADHD plays an important role in better communication between health professionals and families. Online public data, which is a source of information from large numbers of individuals, can provide significant insights into a condition. Moreover, it allows us to capture diversity in preoccupations and identify most relevant concepts for each stakeholder. Future research will be needed to identify how overlapping concepts may interact differently between each other for each stakeholder.


 Citation

Please cite as:

Kaur M, Costello J, Willis E, Kelm K, Reformat MZ, Bolduc FV

Deciphering the Diversity of Mental Models in Neurodevelopmental Disorders: Knowledge Graph Representation of Public Data Using Natural Language Processing

J Med Internet Res 2022;24(8):e39888

DOI: 10.2196/39888

PMID: 35930346

PMCID: 9391978

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.