Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 15, 2023
Date Accepted: Jul 10, 2023

The final, peer-reviewed published version of this preprint can be found here:

ChatGPT vs Google for Queries Related to Dementia and Other Cognitive Decline: Comparison of Results

Hristidis V, Ruggiano N, Brown EL, Ganta SRR, Stewart S

ChatGPT vs Google for Queries Related to Dementia and Other Cognitive Decline: Comparison of Results

J Med Internet Res 2023;25:e48966

DOI: 10.2196/48966

PMID: 37490317

PMCID: 10410383

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

ChatGPT vs. Google for Queries Related to Dementia and Other Memory Problems: Results Comparison

  • Vagelis Hristidis; 
  • Nicole Ruggiano; 
  • Ellen L Brown; 
  • Sai Rithesh Reddy Ganta; 
  • Selena Stewart

ABSTRACT

Background:

People living with dementia (PLWD) or other memory problems, and their caregivers increasingly rely on the Internet to find information about their condition, available resources and services. The recent advancements in Large Language Models (LLMs), such as ChatGPT, provide a new alternative to the more traditional Web search engines like Google.

Objective:

This study compares the quality of the results of ChatGPT and Google for a collection of PLWD-related queries.

Methods:

A set of 30 informational and 30 service delivery (transactional) PLWD-related queries were selected and submitted to both Google and ChatGPT. Three domain experts assessed the results for their currency of the information, reliability of the source, objectivity, relevance to the query, and similarity of their response. The readability of the results was also analyzed. Interrater reliability coefficients were calculated for all outcomes.

Results:

Google had superior currency, as the results of ChatGPT may be out-of-date and often do not specify a validity timestamp. Google also had higher reliability, because ChatGPT rarely includes the source of a result. ChatGPT’s results were evaluated to be more objective, because Google sometimes returns results based on commercial entities. ChatGPT has significantly higher response relevance, because Google often drew upon sources that were referral services for dementia care or service providers themselves. The readability was low for both platforms, especially for ChatGPT (mean grade level 12.17 vs. 9.86 for Google). The similarity between the content of ChatGPT and Google responses were rated as high for 13(21.7%) responses, medium for 16(26.7%) responses, and low for 31(51.6%) responses.

Conclusions:

Both Google and ChatGPT have strengths and weaknesses. Google more often provides a date for and a known reliable source of the response compared to ChatGPT, whereas ChatGPT supplies more relevant responses to queries . The readability scores for both indicate responses are often not appropriate for persons with low health literacy skills. In the future, the addition of both the source and date of the health-related information and availability in other languages may increase the value of these platforms for both non-medical and medical professionals.


 Citation

Please cite as:

Hristidis V, Ruggiano N, Brown EL, Ganta SRR, Stewart S

ChatGPT vs Google for Queries Related to Dementia and Other Cognitive Decline: Comparison of Results

J Med Internet Res 2023;25:e48966

DOI: 10.2196/48966

PMID: 37490317

PMCID: 10410383

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.