Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Apr 9, 2025
Date Accepted: Oct 23, 2025
Predicting Metabolic Dysfunction-Associated Fatty Liver Disease Phenotypes among Adults: A Two-Stage Contrastive Learning Method
ABSTRACT
Background:
Metabolic dysfunction-associated fatty liver disease (MAFLD) is a leading cause of chronic disease and may develop into liver fibrosis or hepatocellular carcinoma. It has distinct subtypes—obesity, diabetic, and lean—each associated with divergent fibrosis burden and complications. However, existing analytics methods often overlook the multisystem nature, intra-phenotype heterogeneity and disease dynamics. These limitations hinder accurate risk stratification and personalized intervention planning for MAFLD patients.
Objective:
This study develops a novel, two-stage, contrastive learning–based method to estimate the phenotype of metabolic dysfunction-associated fatty liver disease (MAFLD) among adults. The proposed method leverages multi-view contrastive learning; it models individual heterogeneity and important relationships in clinical and survey-based data to make effective phenotype predictions in support of clinical decision-making and personalized care.
Methods:
Demographic, clinical, lifestyle-related, and family genetics history–oriented data of 4,408 adults reveal how capturing important relationships in the data from different sources can transform individual-level representations into multiple, complementary views. The comparative evaluation of the predictive efficacy of the proposed method, in comparison with eight prevalent methods, relies on recall, precision, F-measure, and area under the curve.
Results:
The proposed method consistently and significantly outperforms all benchmark methods. It attains the highest F-measure value, with a 34.0% improvement for the non-diabetics phenotype and 22.2% improvement for the diabetes phenotype than the respective best-performing benchmarks. These results demonstrate the clinical value and utilities of integrating clinical and survey-based data to better estimate MAFLD phenotypes among adults.
Conclusions:
The proposed method provides a viable means for MAFLD phenotype estimates. It is more efficacious for identifying at-risk adults than many prevalent data-driven methods and thereby can enhance clinical decision-making and personalized care.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.