JMIR Preprints #56682: Quantifying similarities between MediaPipe and a known standard to address issues in tracking 2D upper limb trajectories: a proof of concept study

Quantifying similarities between MediaPipe and a known standard to address issues in tracking 2D upper limb trajectories: a proof of concept study

Vaidehi Wagh;
Matthew W. Scott;
Sarah N. Kraeutner

ABSTRACT

Background:

Markerless motion tracking methods show promise for use in a range of domains, including clinical settings where traditional marker-based systems for human pose estimation are not feasible. MediaPipe is an artificial intelligence-based system that offers a markerless, lightweight approach to motion capture and includes MediaPipe Hands for recognition of hand landmarks. However, the accuracy of MediaPipe for tracking fine upper limb movements involving the hand has not been explored.

Objective:

Here we aimed to evaluate the 2-dimensional accuracy of MediaPipe against a known standard.

Methods:

Participants (N = 10) performed trials, in blocks, of a touchscreen-based shape-tracing task. Each trial was simultaneously captured by a video camera. Trajectories for each trial were extracted from the touchscreen and compared to those predicted by MediaPipe. Specifically, following resampling, normalization, and Procrustes transformations, root mean squared error (RMSE; primary outcome measure) was calculated for coordinates generated by MediaPipe versus the touchscreen computer.
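The comparison steps named in the Methods (resampling, normalization, Procrustes alignment, RMSE) can be sketched as follows. This is a minimal illustration and not the authors' code. The function names, the default of 100 resampled points, the arc-length resampling, and the unit centroid-size normalization are all assumptions. The alignment allows rotation and scaling but not reflection. Because the normalization convention is assumed, values from this sketch are not comparable with the RMSE reported in the Results.

```python
import math

def resample(points, n):
    """Resample a 2D polyline to n points spaced evenly along its arc length."""
    dists = [0.0]  # cumulative arc length at each vertex
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dists.append(dists[-1] + math.hypot(x1 - x0, y1 - y0))
    total = dists[-1]
    out, j = [], 0
    for i in range(n):
        target = total * i / (n - 1)
        # advance to the segment that contains the target arc length
        while j < len(points) - 2 and dists[j + 1] < target:
            j += 1
        seg = dists[j + 1] - dists[j]
        t = 0.0 if seg == 0 else (target - dists[j]) / seg
        (x0, y0), (x1, y1) = points[j], points[j + 1]
        out.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return out

def normalize(points):
    """Center on the centroid and scale to unit centroid size (assumed convention).
    Points are returned as complex numbers x + iy."""
    z = [complex(x, y) for x, y in points]
    centroid = sum(z) / len(z)
    z = [p - centroid for p in z]
    size = math.sqrt(sum(abs(p) ** 2 for p in z))
    return [p / size for p in z]

def procrustes_rmse(reference, candidate, n=100):
    """RMSE between two trajectories after resampling, normalization, and a
    least-squares rotation and scaling of the candidate onto the reference."""
    a = normalize(resample(reference, n))
    b = normalize(resample(candidate, n))
    # In 2D, one complex factor w encodes the best-fitting rotation and scale.
    w = sum(p * q.conjugate() for p, q in zip(a, b)) / sum(abs(q) ** 2 for q in b)
    fitted = [w * q for q in b]
    return math.sqrt(sum(abs(p - f) ** 2 for p, f in zip(a, fitted)) / n)

# Hypothetical traces: a square, and the same square rotated, scaled, and shifted.
square = [(0, 0), (1, 0), (1, 1), (0, 1), (0, 0)]
moved = [(5 - 2 * y, 3 + 2 * x) for x, y in square]
print(procrustes_rmse(square, moved))
```

The rotated, scaled, and shifted copy should yield an RMSE that is zero up to floating-point error, since Procrustes alignment removes exactly those differences and leaves only differences in shape.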

Results:

The resultant mean RMSE was 0.28 ± 0.064 normalized px. Equivalence testing revealed that accuracy differed between MediaPipe and the touchscreen, but that the true difference was between 0 and 0.30 normalized px (t(114) = -3.02, p = 0.002).
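One common form of equivalence testing is the two one-sided tests (TOST) procedure. The abstract does not state which procedure was used, so the sketch below is only an illustration of the general idea. The function name `tost_one_sample`, the input values, and the use of a normal approximation in place of the t distribution (reasonable only for large samples) are assumptions. The input data are synthetic and do not reproduce the reported statistics.

```python
import math
from statistics import NormalDist, mean, stdev

def tost_one_sample(values, low, high):
    """Two one-sided tests of H0: mean <= low and H0: mean >= high.
    Rejecting both supports the claim that the mean lies within (low, high)."""
    n = len(values)
    se = stdev(values) / math.sqrt(n)
    t_low = (mean(values) - low) / se    # large positive: mean is above low
    t_high = (mean(values) - high) / se  # large negative: mean is below high
    # Assumption: normal approximation to the t distribution with n - 1 df.
    z = NormalDist()
    p_low = 1.0 - z.cdf(t_low)
    p_high = z.cdf(t_high)
    # The TOST p-value is the larger of the two one-sided p-values.
    return {"df": n - 1, "t_low": t_low, "t_high": t_high, "p": max(p_low, p_high)}

# Synthetic per-trial RMSE values (hypothetical, for illustration only).
rmse_values = [0.2 + 0.001 * i for i in range(100)]
print(tost_one_sample(rmse_values, low=0.0, high=0.30))
```

With equivalence bounds of 0 and 0.30, a small p-value here would indicate that the mean error is above 0 yet below the upper bound, which is the pattern of conclusion described in the Results.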

Conclusions:

Overall, we quantify similarities between MediaPipe and a known standard for tracking fine upper limb movements, informing applications of MediaPipe in domains such as clinical and research settings. Future work should address accuracy in 3 dimensions to further validate the use of MediaPipe in such domains.

Citation

Please cite as:

Wagh V, Scott MW, Kraeutner SN

Quantifying Similarities Between MediaPipe and a Known Standard to Address Issues in Tracking 2D Upper Limb Trajectories: Proof of Concept Study

JMIR Form Res 2024;8:e56682

DOI: 10.2196/56682

PMID: 39696897

PMCID: 11683656

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Formative Research

Date Submitted: Jan 23, 2024

Date Accepted: Sep 3, 2024
