Accepted for/Published in: JMIR Formative Research
Date Submitted: Apr 30, 2024
Open Peer Review Period: Jun 10, 2024 - Aug 5, 2024
Date Accepted: Oct 1, 2024
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
A Pathological Diagnosis Method for FUO based on Multi-Path Hierarchical Classification: Model Design and Validation
ABSTRACT
Background:
Fever of Unknown Origin (FUO) is a significant challenge for the medical community due to its association with a wide range of diseases, the complexity of diagnosis, and the likelihood of misdiagnosis. Machine learning can extract valuable information from the extensive data of patient indicators, aiding doctors in diagnosing the underlying cause of FUO.
Objective:
A hierarchical classification method based on multipath feature selection was proposed to address the problem that the causes of FUO are diverse and difficult to diagnose accurately.
Methods:
The dataset was obtained from the clinical diagnostic records of patients with FUO who were admitted to the First Affiliated Hospital of Xi'an Jiaotong University between the years 2011 and 2020. The dataset contains 564 samples, with 5 broad categories of etiologies and subcategories, totaling 16 precise etiologies. An intelligent diagnostic algorithm for FUO was constructed based on hierarchical classification to identify 16 precise etiologies and two improvements were made: (1) A multi-path prediction model to minimize the probability of error propagation; (2) A utilization of L1,2 regularization constraint to effectively identify the most suitable subset of features to eliminate redundancy and interference in the hierarchical classification process. To confirm the proposed method's validity, we conducted ablation experiments and compared it with other flat and hierarchical classification algorithms.
Results:
According to the ablation experiments, the proposed method achieves the optimal result when the intermediate path is 3, with an accuracy of 72.35%, FH of 85.01%, and FLCA of 83.29%. This performance is superior to the traditional single-path hierarchical classification, showing improvements of 5.69%, 2.89%, and 3.39%, respectively. The feature selection based on the L1,2 regularization constraint can further enhance the model's performance. The best results were achieved by filtering 25% of the features, with an accuracy of 76.08%, FH of 86.72%, and FLCA of 85.39%. Based on the comparison experiments, the proposed method outperforms the seven flat algorithms and four hierarchical classification algorithms that were compared. For example, the accuracy can be improved by at least 5.39% compared to ELM and up to 24.7% compared to KNN.
Conclusions:
The proposed algorithm greatly improves the classification performance compared to existing machine learning algorithms. It can better predict the cause of FUO, assisting physicians in their work.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.