Accepted for/Published in: JMIR Cancer
Date Submitted: Jul 14, 2021
Open Peer Review Period: Jul 14, 2021 - Jul 22, 2021
Date Accepted: Oct 4, 2021
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Natural Language Processing based Medical Needs Extraction for Breast Cancer Patients from Question and Answer Services
ABSTRACT
Background:
Currently, a large number of patient narratives are available on various web services. On web question and answer (QA) services, patient questions often relate to medical needs. Therefore, we expect these questions to provide clues to understanding patients’ medical needs.
Objective:
This study aims to extract patient needs and classify them into thematic categories. To clarify the patient's needs would be the first step to solve social issues for cancer patients.
Methods:
The material of this study is patient question texts containing the keyword “breast cancer" in the Yahoo! Japan QA service, Yahoo! Chiebukuro, which contains over 60,000 questions on cancer. First, we convert the question text into a vector representation; then, the relevance between patient needs and existing cancer needs categories are calculated based on cosine similarity.
Results:
The proportion of correct classifications in our proposed method is approximately 70%. We reveal the variation and the number of needs from the results of classifying questions.
Conclusions:
There are various clinical applications to applying the proposed method such as identifying the side effect signaling of drugs and the unmet needs of cancer patients. Revealing these needs is important to satisfy the medical needs of cancer patients.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.