Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 19, 2023
Date Accepted: Dec 11, 2023

The final, peer-reviewed published version of this preprint can be found here:

Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study

FU Z, Hsu YC, Chan CS, Lau CM, Liu J, Yip PSF

Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study

J Med Internet Res 2024;26:e51069

DOI: 10.2196/51069

PMID: 38289662

PMCID: 10865189

Efficacy of ChatGPT in Cantonese Sentiment Analysis: A Comparative Study

  • Ziru FU; 
  • Yu Cheng Hsu; 
  • Christian Shaunlyn Chan; 
  • Chaak Ming Lau; 
  • Joyce Liu; 
  • Paul Siu Fai Yip

ABSTRACT

Background:

Sentiment analysis, in general, and in Cantonese, in particular, remain a significant yet challenging task in natural language processing. One major barrier is Cantonese’s lack of standardised corpus and its nature as a spoken language.

Objective:

Our study investigated using ChatGPT for sentiment analysis in online Cantonese counseling text and compared its performance with other mainstream methods.

Methods:

This study investigated the application of ChatGPT/GPT for Cantonese sentiment analysis using transcripts from an online text-based counseling service in Hong Kong. A total of 131 individual counseling sessions, with 6,169 messages between counselors and help-seekers were included in this study. First, a codebook was developed for human annotation. Then a simple prompt, “Is the sentiment of this Cantonese text positive, neutral, or negative? Respond with the sentiment label only.”, was given to ChatGPT 3.5 and GPT-4 to label each message’s sentiment. ChatGPT’s performance was compared with one lexicon-based method and three state-of-the-art models.

Results:

The accuracy in identifying positive, neutral, and negative feelings of ChatGPT 3.5 and GPT-4 was 92.1% and 95.3%, respectively, which outperformed the lexicon-based methods and machine learning models.

Conclusions:

ChatGPT/GPT stands out among many existing text analysis techniques in terms of accuracy and could be considered a useful tool for analyzing Cantonese sentiments.


 Citation

Please cite as:

FU Z, Hsu YC, Chan CS, Lau CM, Liu J, Yip PSF

Efficacy of ChatGPT in Cantonese Sentiment Analysis: Comparative Study

J Med Internet Res 2024;26:e51069

DOI: 10.2196/51069

PMID: 38289662

PMCID: 10865189

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.