Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently submitted to: JMIR Serious Games

Date Submitted: Apr 16, 2026
Open Peer Review Period: Apr 20, 2026 - Jun 15, 2026
(currently open for review)

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Detection of Developmental Delay in Children from VR-Derived Facial Action Units: A Comparative Study of Machine Learning, Deep Learning, and Transformer Models

  • Ahsan Aziz; 
  • Awais Khan; 
  • Syed Ali Naqi Raza; 
  • Sokea Teng; 
  • Chomyong Kim; 
  • Eun Young Kim; 
  • Seungsoo Kim; 
  • Sehoon Shim; 
  • Joungmin Kim; 
  • Yongwon Cho; 
  • Yunyoung Nam

ABSTRACT

Background:

Developmental delay is a disorder in which a child does not grasp developmental milestones in areas like: motor skills, cognition, Speech, and societal interaction at an anticipated age. These delays include cognitive, speech and language, motor, and societal and emotional delays commonly caused by aspects like genetic circumstances, premature birth or low birth weight, neurological conditions, and environmental reasons.

Objective:

In this study, we used the Meta Quest Pro headset to generate an immersive virtual reality environment through task-based interactions. Including microexpressions, muscle activities, and eye-tracking data, were collected as time series data, apprehending dynamic facial gesture fluctuations.

Methods:

Subsequent data preprocessing, the primary analysis path leveraged statistical methods, which include: Kruskal–Wallis test, ReliefF, ANOVA, and minimum redundancy maximum relevance (MRMR), to extract and select the furthermost relevant features. The secondary path employed a transformer for automated feature extraction, then both feature sets were afterward classified using machine learning classifiers to evaluate their effectiveness.

Results:

Using the MRMR method, we achieved 86.4% accuracy with the Coarse Tree classifier with a ratio of 80:20, and 95.6% accuracy with quadratic support vector machine (SVM) with a ratio of 60:40. Whereas the transformer-based approach achieved an accuracy of 86.4% with Quadratic SVM and 95.5% with the Fine KNN classifier at the 60:40 split.

Conclusions:

This study contributes to the advancement of affordable and accessible screening tools for children worldwide.


 Citation

Please cite as:

Aziz A, Khan A, Raza SAN, Teng S, Kim C, Kim EY, Kim S, Shim S, Kim J, Cho Y, Nam Y

Detection of Developmental Delay in Children from VR-Derived Facial Action Units: A Comparative Study of Machine Learning, Deep Learning, and Transformer Models

JMIR Preprints. 16/04/2026:98445

DOI: 10.2196/preprints.98445

URL: https://preprints.jmir.org/preprint/98445

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.