JMIR Preprints #48966: ChatGPT vs. Google for Queries Related to Dementia and Other Memory Problems: Results Comparison

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

ChatGPT vs. Google for Queries Related to Dementia and Other Memory Problems: Results Comparison

Vagelis Hristidis;
Nicole Ruggiano;
Ellen L Brown;
Sai Rithesh Reddy Ganta;
Selena Stewart

ABSTRACT

Background:

People living with dementia (PLWD) or other memory problems, and their caregivers increasingly rely on the Internet to find information about their condition, available resources and services. The recent advancements in Large Language Models (LLMs), such as ChatGPT, provide a new alternative to the more traditional Web search engines like Google.

Objective:

This study compares the quality of the results of ChatGPT and Google for a collection of PLWD-related queries.

Methods:

A set of 30 informational and 30 service delivery (transactional) PLWD-related queries were selected and submitted to both Google and ChatGPT. Three domain experts assessed the results for their currency of the information, reliability of the source, objectivity, relevance to the query, and similarity of their response. The readability of the results was also analyzed. Interrater reliability coefficients were calculated for all outcomes.

Results:

Google had superior currency, as the results of ChatGPT may be out-of-date and often do not specify a validity timestamp. Google also had higher reliability, because ChatGPT rarely includes the source of a result. ChatGPT’s results were evaluated to be more objective, because Google sometimes returns results based on commercial entities. ChatGPT has significantly higher response relevance, because Google often drew upon sources that were referral services for dementia care or service providers themselves. The readability was low for both platforms, especially for ChatGPT (mean grade level 12.17 vs. 9.86 for Google). The similarity between the content of ChatGPT and Google responses were rated as high for 13(21.7%) responses, medium for 16(26.7%) responses, and low for 31(51.6%) responses.

Conclusions:

Both Google and ChatGPT have strengths and weaknesses. Google more often provides a date for and a known reliable source of the response compared to ChatGPT, whereas ChatGPT supplies more relevant responses to queries . The readability scores for both indicate responses are often not appropriate for persons with low health literacy skills. In the future, the addition of both the source and date of the health-related information and availability in other languages may increase the value of these platforms for both non-medical and medical professionals.

Citation

Please cite as:

Hristidis V, Ruggiano N, Brown EL, Ganta SRR, Stewart S

ChatGPT vs Google for Queries Related to Dementia and Other Cognitive Decline: Comparison of Results

J Med Internet Res 2023;25:e48966

DOI: 10.2196/48966

PMID: 37490317

PMCID: 10410383

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 15, 2023

Date Accepted: Jul 10, 2023

ChatGPT vs. Google for Queries Related to Dementia and Other Memory Problems: Results Comparison

ABSTRACT

Citation

Copyright