JMIR Preprints #51229: Comparisons of quality, correctness, and similarity between abstracts generated by ChatGPT and real abstracts for the same basic research: a cross-sectional study

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Comparisons of quality, correctness, and similarity between abstracts generated by ChatGPT and real abstracts for the same basic research: a cross-sectional study

Shu-Li Cheng;
Shih-Jen Tsai;
Ya-Mei Bai;
Chih-Hung Ko;
Chih-Wei Hsu;
Fu-Chi Yang;
Chia-Kuang Tsai;
Yu-Kang Tu;
Szu-Nian Yang;
Ping-Tao Tseng;
Tien-Wei Hsu;
Chih-Sung Liang;
Kuan-Pin Su

ABSTRACT

Background:

ChatGPT may act as a research assistant to help organize the direction of thinking and summarize research findings.

Objective:

However, few studies have examined the quality, similarity, plagiarism, and accuracy of the abstracts generated by ChatGPT when researchers provide full-text basic research papers.

Methods:

We selected 30 basic research papers from Nature, Genome Biology, and Biological Psychiatry. Excluding abstracts, we fed the full text into the ChatGPT model (using ChatPDF) and prompted it to generate abstracts with the same style as those in the original papers. Eight experts were invited to evaluate the quality of these abstracts (Likert scale:0–10) and identified which abstracts were generated by ChatGPT using a blind approach. These abstracts were also evaluated for similarity, plagiarism, and accuracy of the AI content.

Results:

The quality of the ChatGPT-generated abstracts was lower than that of the actual abstracts (Likert scale of quality, mean (standard deviation), 4.72 (2.09) vs 8.09 (1.03); p<0.001), and the quality difference was large in the unstructured format (mat (-4.33; -4.79 to -3.86) and small in the four-subheading structured format (-2.33; -2.79 to -1.86). Among the 30 ChatGPT-generated abstracts, three showed wrong conclusions and ten were identified as AI content. Similarity (2.10%-4.40%) and plagiarism (5.01%-6.70%) were not high. The blinded reviewers achieved a 93% accuracy rate in guessing which abstract was written using ChatGPT.

Conclusions:

Using ChatGPT to generate a scientific abstract may not produce the problems of similarity and plagiarism when feeding real full-texts written by humans. However, the quality of the ChatGPT-generated abstract was sub-optimal, and the accuracy was not 100%.

Citation

Please cite as:

Cheng SL, Tsai SJ, Bai YM, Ko CH, Hsu CW, Yang FC, Tsai CK, Tu YK, Yang SN, Tseng PT, Hsu TW, Liang CS, Su KP

Comparisons of Quality, Correctness, and Similarity Between ChatGPT-Generated and Human-Written Abstracts for Basic Research: Cross-Sectional Study

J Med Internet Res 2023;25:e51229

DOI: 10.2196/51229

PMID: 38145486

PMCID: 10760418

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 25, 2023

Date Accepted: Nov 20, 2023

Comparisons of quality, correctness, and similarity between abstracts generated by ChatGPT and real abstracts for the same basic research: a cross-sectional study

ABSTRACT

Citation

Copyright