Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 17, 2020
Open Peer Review Period: May 17, 2020 - May 25, 2020
Date Accepted: Sep 2, 2020
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review

Milne-Ives M, de Cock C, Lim E, Shehadeh MH, de Pennington N, Mole G, Meinert E

The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review

J Med Internet Res 2020;22(10):e20346

DOI: 10.2196/20346

PMID: 33090118

PMCID: 7644372

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

The effectiveness of artificial intelligence conversational agents in healthcare: a systematic review

  • Madison Milne-Ives; 
  • Caroline de Cock; 
  • Ernest Lim; 
  • Melissa Harper Shehadeh; 
  • Nick de Pennington; 
  • Guy Mole; 
  • Edward Meinert

ABSTRACT

Background:

High demand on healthcare services and the growing capability of artificial intelligence has led to the development of conversational agents designed to support a variety of health-related activities - including behaviour change, treatment support, health monitoring, training, triage, and screening support. Automation of these tasks could free clinicians to focus on more complex work and increase accessibility to healthcare services for the general public. An overarching assessment of the acceptability, usability, and effectiveness of these agents in healthcare is needed to collate the evidence so that future development can target areas for improvement and potential for sustainable adoption.

Objective:

This systematic review aimed to assess the effectiveness and usability of conversational agents in healthcare and identify the elements that users like and dislike, to inform future research and development of these agents.

Methods:

PubMed, Medline (Ovid), EMBASE, CINAHL, Web of Science, and ACM Digital Library were systematically searched for articles published since 2008 that evaluated unconstrained natural language processing conversational agents used in healthcare. Endnote (version X9; Clarivate Analytics) reference management software was used for initial screening, then full-text screening was conducted by one reviewer. Data was extracted and risk of bias was assessed by one reviewer and validated by another.

Results:

A total of 31 studies were selected and included a variety of conversational agents - 14 chatbots (two of which were voice chatbots), 6 embodied conversational agents, 3 each of interactive voice response calls, virtual patients, and speech recognition screening systems, as well as one contextual question answering agent and one voice recognition triage system. Overall, the evidence reported was mostly positive or mixed. Usability and satisfaction performed well (27/30 and 26/31) and positive or mixed effectiveness was found in three quarters of the studies (23/30), but there were several limitations of the agents highlighted in specific qualitative feedback.

Conclusions:

The studies generally reported positive or mixed evidence for the effectiveness, usability, and satisfactoriness of the conversational agents investigated, but qualitative user perceptions were more mixed. The quality of many of the studies was limited, and improved study design and reporting is necessary to more accurately evaluate the usefulness of the agents in healthcare and identify key areas for improvement. Further research should also analyse the cost-effectiveness, privacy, and security of the agents. Clinical Trial: N/A


 Citation

Please cite as:

Milne-Ives M, de Cock C, Lim E, Shehadeh MH, de Pennington N, Mole G, Meinert E

The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review

J Med Internet Res 2020;22(10):e20346

DOI: 10.2196/20346

PMID: 33090118

PMCID: 7644372

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.