Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Bioinformatics and Biotechnology

Date Submitted: May 30, 2024
Date Accepted: Oct 2, 2024

The final, peer-reviewed published version of this preprint can be found here:

Eco-Evolutionary Drivers of Vibrio parahaemolyticus Sequence Type 3 Expansion: Retrospective Machine Learning Approach

Campbell AM, Hauton C, van Aerle R, Martinez-Urtaza J

Eco-Evolutionary Drivers of Vibrio parahaemolyticus Sequence Type 3 Expansion: Retrospective Machine Learning Approach

JMIR Bioinform Biotech 2024;5:e62747

DOI: 10.2196/62747

PMID: 39607996

PMCID: 11638695

A machine learning approach to identify eco-evolutionary drivers of Vibrio parahaemolyticus ST3 expansion

  • Amy Marie Campbell; 
  • Chris Hauton; 
  • Ronny van Aerle; 
  • Jaime Martinez-Urtaza

ABSTRACT

Background:

Environmentally-sensitive pathogens exhibit ecological and evolutionary responses to climate change, that result in the emergence and global expansion of well-adapted variants. It is imperative to understand the mechanisms that facilitate pathogen emergence and expansion, and the drivers behind these, to understand and prepare for future pandemic expansions.

Objective:

The unique, rapid global expansion of a clonal complex of Vibrio parahaemolyticus (VpST3), a marine bacteria causing gastroenteritis infections, provides an opportunity to explore eco-evolutionary drivers of pathogen expansion.

Methods:

The global expansion of VpST3 was reconstructed using VpST3 genomes which were then classified into metrics characterizing stages of this expansion process, indicative of stages of emergence and establishment. We used machine learning to test a range of ecological and evolutionary drivers for their potential in predicting these VpST3 expansion dynamics, using a random forest classifier.

Results:

We identified a range of evolutionary features, including mutations in the core genome and accessory gene presence, associated with expansion dynamics. A range of random forest classifier approaches were tested to predict expansion classification metrics for each genome. The highest predictive accuracy (ranging from 0.722 to 0.967) was achieved for models using a combined eco-evolutionary approach. While population structure and the difference between introduced and established isolates could be predicted to a high accuracy, our model reported multiple false positives when predicting the success of an introduced isolate, inferring potential limiting factors not represented in our eco-evolutionary features. Regional models produced for two countries reporting the most VpST3 genomes had varying success, reflecting impacts of class imbalance.

Conclusions:

These novel insights into evolutionary features and ecological conditions related to stages of VpST3 expansion showcase the potential of machine learning models using genomic data, and will contribute to future understanding of eco-evolutionary pathways of climate-sensitive pathogens.


 Citation

Please cite as:

Campbell AM, Hauton C, van Aerle R, Martinez-Urtaza J

Eco-Evolutionary Drivers of Vibrio parahaemolyticus Sequence Type 3 Expansion: Retrospective Machine Learning Approach

JMIR Bioinform Biotech 2024;5:e62747

DOI: 10.2196/62747

PMID: 39607996

PMCID: 11638695

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.