Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently submitted to: Journal of Medical Internet Research

Date Submitted: Jan 23, 2026
Open Peer Review Period: Jan 25, 2026 - Mar 22, 2026
(currently open for review)

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Development of a novel musculoskeletal hypothesis in the ADVANCE cohort: application of sparse Group Factor Analysis methodology

  • Fraje CE Watson; 
  • Fabio S Ferreira; 
  • Balasundaram Kadirvelu; 
  • Alex N Bennett; 
  • Aldo A Faisal; 
  • Neil Graham; 
  • Harriet Kemp; 
  • Paul Cullinan; 
  • Christopher Boos; 
  • Nicola T Fear; 
  • Anthony MJ Bull

ABSTRACT

Background:

Musculoskeletal conditions are a leading global cause of disability, yet the factors influencing long-term musculoskeletal health, particularly following trauma, remain incompletely understood. Machine learning could be applied to identify previously unknown patterns in large-scale multimodal datasets.

Objective:

Test the ability of a new sparse Group Factor Analysis method to uncover hidden patterns in large-scale multi-modal datasets and generate testable, clinically relevant hypotheses.

Methods:

This study applies sparse Group Factor Analysis, a hierarchical unsupervised machine learning method, to the ADVANCE cohort—a longitudinal dataset of 1445 UK Afghanistan War servicemen—to identify latent structures in multimodal clinical data. Study 1 validated the approach by rediscovering known group-level patterns between combat-injured and non-injured participants, including poorer outcomes in pain, mobility, and bone health among those with lower limb loss. Study 2 explored the Injured, non-amputee subgroup without prespecified labels to identify new hypothesis-generating clusters that could subsequently be tested using standard hypothesis testing methods.

Results:

A subgroup of 125 individuals with worse musculoskeletal outcomes was uncovered. This group had greater body mass, higher injury severity, and a higher prevalence of head injury. These findings led to a novel hypothesis: that head injury, including potential traumatic brain injury, is associated with long-term musculoskeletal deterioration. This hypothesis is supported by literature in both athletic and military populations and will be tested in follow-up analyses.

Conclusions:

Our findings demonstrate how sparse Group Factor Analysis, combined with clinical insight, can uncover hidden patterns in large-scale datasets and generate testable, clinically relevant hypotheses that inform prevention, treatment, and rehabilitation strategies.


 Citation

Please cite as:

Watson FC, Ferreira FS, Kadirvelu B, Bennett AN, Faisal AA, Graham N, Kemp H, Cullinan P, Boos C, Fear NT, Bull AM

Development of a novel musculoskeletal hypothesis in the ADVANCE cohort: application of sparse Group Factor Analysis methodology

JMIR Preprints. 23/01/2026:91958

DOI: 10.2196/preprints.91958

URL: https://preprints.jmir.org/preprint/91958

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.