A comparison of machine learning models for predicting Vibrio parahaemolyticus in oysters

Ndraha, Nodali and Hsiao, Hsin-I (2025) A comparison of machine learning models for predicting Vibrio parahaemolyticus in oysters. Microbial Risk Analysis, 30. p. 100345. ISSN 23523522

Full text not available from this repository. (Request a copy)

Abstract

Vibrio parahaemolyticus, a major seafood pathogen, threatens public health as oyster consumption rises. We evaluated 14 machine learning models to predict its concentrations in oysters, achieving high accuracy (Concordance Correlation Coefficient, CCC > 0.85 training, > 0.9 testing, except bag-MARS) across diverse algorithms. Processing times varied from 23 min (KNN) to 162 min (bag-RPart), highlighting computational trade-offs. Five top models—Elastic Net (EN), Random Forest (RF), XGBoost, Light Gradient-Boosting Machine (L-GBM), and Cubist (39–92 min)—were selected for their performance and efficiency, forming a robust toolkit for shellfish safety monitoring. Variable importance and partial dependence plots identified sea surface temperature (SST) and wind as primary drivers, with SST thresholds of 16–26 °C driving proliferation and wind showing mixed effects (negative >4 m/s, positive >6 m/s). Precipitation, salinity (>19 ppm), and pH (7.5–7.7) played supplementary roles. Lagged variables (e.g., SST_imX_25) underscored temporal dynamics, supporting real-time monitoring and risk assessment strategies.

Item Type: Article
Uncontrolled Keywords: Predictive model; Foodborne pathogen; Seafood; Food safety; Machine learning model; Temporal lags
Subjects: Medicine & Biology > Microbiology
Depositing User: Maria Regina Karunia
Date Deposited: 25 Feb 2026 11:02
Last Modified: 25 Feb 2026 11:02
URI: https://karya.brin.go.id/id/eprint/57745

Actions (login required)

View Item
View Item