Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Jun 26, 2025
Date Accepted: Feb 4, 2026

The final, peer-reviewed published version of this preprint can be found here:

Multimodal Sentiment and Emotion Analysis Framework for Personalized Health Coaching Messages: Proof-of-Concept Study

Md Zuki MA, Mohamad Ali N, Chaw JK

Multimodal Sentiment and Emotion Analysis Framework for Personalized Health Coaching Messages: Proof-of-Concept Study

JMIR Form Res 2026;10:e79558

DOI: 10.2196/79558

PMID: 36346735

Multi-Modal Sentiment and Emotion Analysis Framework for Personalized Health Coaching Messages: An Experiment

  • Muhammad Aiman Md Zuki; 
  • Nazlena Mohamad Ali; 
  • Jun Kit Chaw

ABSTRACT

Text generation approaches in healthcare communication have evolved along two major paths. The first involves Generative Adversarial Networks (GANs), progressing from basic architectures to specialized variants like TT-GAN and TFGAN, which address challenges in discrete text generation through techniques such as Gumbel-Softmax and Reinforcement Learning. The second path emerges from transformer-based architectures, particularly GPT-2, which uses extensive pre-training and self-attention mechanisms to generate contextually appropriate text. GPT-2's transformer architecture enhances persuasive health communication by generating personalized messages using various strategies like task support, dialogue support, and social support for effective health interventions. The objective of this experiment is to use GPT2 as a generative method to construct the persuasive text in the dataset and compare the performance of Semantic Analysis and Emotion Detection Analysis. We combine sentiment analysis tools (VADER and TextBlob) with emotion detection methods (Text2Emotion and NRCLex) to analyze health coaching messages across different persuasive types: reminders, rewards, suggestions, and praise. TextBlob achieved perfect scores (1.0) in accuracy, precision, recall, and F1-score, outperforming VADER's moderate performance (0.6 accuracy). RoBERTa-sentiment leads with 88% accuracy and 0.8264 F1 score. While transformers excel in accuracy, lexicon-based models like VADER offer better performance-efficiency balance for real-time health communication systems. For emotion detection, anger showed nearly perfect performance across all metrics, while anticipation scored lowest with metrics below 0.5. The emotion detection analysis revealed varying success rates across different emotions, with some categories, like anger, showing near-perfect detection while others, like anticipation, proved more challenging. This research contributes to understanding the emotional dynamics of persuasive health communication and highlights both the capabilities and limitations of current natural language processing tools in analyzing health-related persuasive messaging.


 Citation

Please cite as:

Md Zuki MA, Mohamad Ali N, Chaw JK

Multimodal Sentiment and Emotion Analysis Framework for Personalized Health Coaching Messages: Proof-of-Concept Study

JMIR Form Res 2026;10:e79558

DOI: 10.2196/79558

PMID: 36346735

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.