Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Oct 22, 2019
Date Accepted: Jan 24, 2020
Date Submitted to PubMed: Feb 18, 2020

The final, peer-reviewed published version of this preprint can be found here:

Analysis of Massive Online Medical Consultation Service Data to Understand Physicians’ Economic Return: Observational Data Mining Study

Jiang J, Cameron AF, Yang M

Analysis of Massive Online Medical Consultation Service Data to Understand Physicians’ Economic Return: Observational Data Mining Study

JMIR Med Inform 2020;8(2):e16765

DOI: 10.2196/16765

PMID: 32069213

PMCID: 7055801

Analysis of Massive Online Medical Consultation Service Data to Understand Physicians’ Economic Return: Data-Mining Study

  • Jinglu Jiang; 
  • Ann-Frances Cameron; 
  • Ming Yang

ABSTRACT

Background:

Online healthcare consultation has become increasingly popular, and is considered a potential solution to healthcare resource shortages and inefficient resource distribution. However, many online medical consultation platforms are struggling to attract and retain patients who are willing to pay, and healthcare providers on the platform have the additional challenge of standing out in a crowd of physicians who can provide comparable services.

Objective:

This study uses machine learning (ML) approaches to mine massive service data to (1) identify the important features that are associated with patient payment, as opposed to free-trial-only appointments; (2) explore the relative importance of these features, and (3) understand how these features interact, linearly or non-linearly, in relation to payment.

Methods:

The dataset is from the largest China-based online medical consultation platform, which covers 1,582,564 consultation records between patient-physician pairs from 2009 to 2018. ML techniques (i.e., hyperparameter tuning, model training, and validation) were applied with four classifiers – logistic regression, decision tree, random forest, and gradient boost – to identify the most important features and their relative importance for predicting paid versus free-only appointments.

Results:

After applying the ML feature selection procedures, we identified 11 key features on the platform that are potentially useful to predict payment. For the binary ML classification task (paid vs. free services), the 11 features as a whole system achieved very good prediction performance across all four classifiers. Decision tree analysis further identified five distinct subgroups of patients delineated by five top-ranked features: previous offline connection, total dialogue, physician response rate, patient privacy concern, and social return. These subgroups interact with the physician differently, resulting in different payment outcomes.

Conclusions:

The results show that, as compared to features related to physician reputation, service-related features such as service delivery quality (e.g., consultation dialogue intensity, physician response rate), patient source (e.g., online versus offline returning patients) and patient involvement (e.g., provide social returns, reveal previous treatment) appear to contribute more to patient’s payment decision. Promoting multiple timely responses in patient-provider interactions is essential to encourage payment.


 Citation

Please cite as:

Jiang J, Cameron AF, Yang M

Analysis of Massive Online Medical Consultation Service Data to Understand Physicians’ Economic Return: Observational Data Mining Study

JMIR Med Inform 2020;8(2):e16765

DOI: 10.2196/16765

PMID: 32069213

PMCID: 7055801

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.