JMIR Preprints #15823: Responses of Conversational Agents to Health and Lifestyle Prompts: An Investigation of Appropriateness and Presentation Structures

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Responses of Conversational Agents to Health and Lifestyle Prompts: An Investigation of Appropriateness and Presentation Structures

Ahmet Baki Kocaballi;
Juan C Quiroz;
Shlomo Berkovsky;
Dana Rezazadegan;
Farah Magrabi;
Enrico Coiera;
Liliana Laranjo

ABSTRACT

Background:

Conversational agents (CAs) are systems that mimic human conversations using text or spoken language. Their widely-used examples include voice-activated systems like Apple Siri, Google Assistant, Amazon Alexa, or Microsoft Cortana. The use of CAs in healthcare has been on the rise, but concerns about their potential safety risks often remain under-studied.

Objective:

In this work we set out to analyze how commonly available, general-purpose CAs on smartphones and smart speakers respond to health and lifestyle prompts (questions and open-ended statements), examining their responses in terms of content and structure alike.

Methods:

We followed a piloted script to ask eight CAs health- and lifestyle-related prompts. The CAs’ responses were assessed for their appropriateness based on the prompt type: responses to safety-critical prompts were deemed appropriate if they included a referral to a health professional or service, while responses to lifestyle prompts were deemed appropriate if they provided relevant information to address the problem prompted. The response structure was also examined according to information sources (web-search-based or pre-coded), response content style (informative and/or directive), confirmation of prompt recognition, and empathy.

Results:

The eight studied CAs provided in total 240 responses to 30 prompts. They collectively responded appropriately to 41% (46/112) of the safety-critical and 39% (37/96) of the lifestyle prompts. The ratio of appropriate responses deteriorated when safety-critical prompts were re-phrased or when the agent used a voice-only interface. The appropriate responses included mostly directive content and empathy statements for the safety-critical prompts, and a mix of informative and directive content for the lifestyle prompts.

Conclusions:

Our results suggest that the commonly available, general-purpose CAs on smartphones and smart speakers with unconstrained natural language interfaces are limited in their ability to advise on both the safety-critical health prompts and lifestyle prompts. Our study also identified some response structures the CAs employed to present their appropriate responses. Further investigation is needed to establish guidelines for designing suitable response structures for different prompt types.

Citation

Please cite as:

Kocaballi AB, Quiroz JC, Berkovsky S, Rezazadegan D, Magrabi F, Coiera E, Laranjo L

Responses of Conversational Agents to Health and Lifestyle Prompts: Investigation of Appropriateness and Presentation Structures

J Med Internet Res 2020;22(2):e15823

DOI: 10.2196/15823

PMID: 32039810

PMCID: 7055771

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Aug 9, 2019

Date Accepted: Dec 16, 2019

Responses of Conversational Agents to Health and Lifestyle Prompts: An Investigation of Appropriateness and Presentation Structures

ABSTRACT

Citation