Currently accepted at: JMIR Aging
Date Submitted: Jul 4, 2025
Open Peer Review Period: Jul 8, 2025 - Aug 1, 2025
Date Accepted: Dec 16, 2025
(closed for review but you can still tweet)
This paper has been accepted and is currently in production.
It will appear shortly on 10.2196/80102
The final accepted version (not copyedited yet) is in this tab.
Integrating Care Context with Skeleton and Depth Information for Elderly Activity Recognition in Care Facility Utilizing Care-assessment-aware Spatio-Temporal Transformer: A Method and Validation Study
ABSTRACT
Background:
Elderly activity recognition is a critical task in long-term care monitoring, yet it remains challenging due to postural deformities and health-related variability. These factors cause different activities to appear visually similar, or the same activity to appear dissimilar, undermining the effectiveness of traditional human activity recognition (HAR) models developed for the general population.
Objective:
This study aims to improve elderly activity recognition by incorporating care assessment information to enable personalized and context-aware monitoring in real-world care environments.
Methods:
We propose a care-assessment-aware spatio-temporal transformer (CSTT) model that integrates body keypoints, heatmaps, and care level data. The model dynamically adjusts its attention mechanism based on care level context to improve recognition accuracy. CSTT was trained and validated on a real-world elderly motion dataset collected from 28 participants during natural, privacy-preserving mealtime sessions without external intervention.
Results:
Despite data imbalance and considerable intra-class variation due to differing care needs, the proposed CSTT model achieved an F1 score and accuracy of 0.96. Analysis revealed that movement patterns differ significantly across care levels and that similar motions occur in distinct activities, highlighting the importance of care-aware modeling.
Conclusions:
Incorporating care level information into activity recognition models significantly enhances performance in elderly care settings. The proposed CSTT framework demonstrates the value of personalized, context-sensitive approaches for accurate and ethical monitoring in long-term care environments.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.