Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Previously submitted to: Journal of Medical Internet Research (no longer under consideration since Feb 03, 2022)

Date Submitted: Sep 26, 2021
Open Peer Review Period: Sep 26, 2021 - Nov 21, 2021
(closed for review but you can still tweet)

NOTE: This is an unreviewed Preprint

Warning: This is a unreviewed preprint (What is a preprint?). Readers are warned that the document has not been peer-reviewed by expert/patient reviewers or an academic editor, may contain misleading claims, and is likely to undergo changes before final publication, if accepted, or may have been rejected/withdrawn (a note "no longer under consideration" will appear above).

Peer-review me: Readers with interest and expertise are encouraged to sign up as peer-reviewer, if the paper is within an open peer-review period (in this case, a "Peer-Review Me" button to sign up as reviewer is displayed above). All preprints currently open for review are listed here. Outside of the formal open peer-review period we encourage you to tweet about the preprint.

Citation: Please cite this preprint only for review purposes or for grant applications and CVs (if you are the author).

Final version: If our system detects a final peer-reviewed "version of record" (VoR) published in any journal, a link to that VoR will appear below. Readers are then encourage to cite the VoR instead of this preprint.

Settings: If you are the author, you can login and change the preprint display settings, but the preprint URL/DOI is supposed to be stable and citable, so it should not be removed once posted.

Submit: To post your own preprint, simply submit to any JMIR journal, and choose the appropriate settings to expose your submitted version as preprint.

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Tracing Unemployment Rate of South Africa during the COVID-19 Pandemic Using Twitter Data

  • Zahra Movahedi Nia; 
  • Ali Asgary; 
  • Nicola Bragazzi; 
  • Bruce Melado; 
  • James Orbinski; 
  • Jianhong Wu; 
  • Jude Dzevela Kong

ABSTRACT

Background:

Global economy has been hardly hit by the COVID-19 pandemic. Many countries are experiencing a severe and destructive recession. Unemployment rate is very important to policy makers as it provide a key indicator of overall labour market and wider economic conditions. Despite its relevance, there is usually a delay in the availability of the indicator as it is traditionally based on a survey of households over several months. The speed at which the economy in most countries decline at the onset of COVID-19 highlights the importance of timely information about the labour market during the onset of a recession. In the coming year, there will be uncertainty about the timing and extent of any improvement in labour market outcomes that will also highlight the value of timely information.

Objective:

The main goal of this study is to provide policy- and decision-makers with additional and real-time information about the labor market flow during a prolonged pandemic. The first objective of the study is to find the missing unemployment rates in cases where census measurements are incomplete. The second objective is to estimate the unemployment rate in real-time since it usually takes months for formal unemployment data to be published. In this paper, we use social media data, particularly, Twitter to trace and nowcast the unemployment rate of South Africa during the COVID-19 pandemic.

Methods:

Unemployment rate in South Africa is estimated quarterly. We first used Google mobility index to interpolate it and find the monthly values. Next, we created a dataset of unemployment related tweets in South Africa using certain keywords such as employed, unemployed, and retrench. Principal Component Regression (PCR) was applied to estimate the unemployment rate using the tweets and their sentiment scores.

Results:

Numerical results indicate that the number of tweets is highly correlated with the unemployment rate during and before the COVID-19 pandemic. In addition, the trend of the normalized sum of the sentiment scores of the tweets is negatively correlated with the unemployment rate of South Africa. Moreover, the estimated unemployment rate using PCR is highly correlated with the actual unemployment rate of South Africa and has a low Root Mean Square Error (RMSE) and Mean Absolute Error (MAE).

Conclusions:

The results of this study show that social media information can be used to reasonably estimate one of the key labor market indicators, especially during disaster events such as a prolonged pandemic. This information can be used to rapidly understand and manage the impacts of the pandemic on the economy and people’s life.


 Citation

Please cite as:

Nia ZM, Asgary A, Bragazzi N, Melado B, Orbinski J, Wu J, Kong JD

Tracing Unemployment Rate of South Africa during the COVID-19 Pandemic Using Twitter Data

JMIR Preprints. 26/09/2021:33843

DOI: 10.2196/preprints.33843

URL: https://preprints.jmir.org/preprint/33843

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.