02831nas a2200241 4500000000100000008004100001260001200042653002100054653003900075653001800114653003100132653002100163653002500184653001000209100001400219700001200233245018900245856007500434300001100509490000700520520204800527022001402575 2026 d c03/202610amachine learning10aNeglected tropical diseases (NTDs)10aCo-endemicity10aSoil-transmitted helminths10aSchistosomiasis 10aLymphatic filariasis10aKenya1 aNyerere N1 aMulwa D00aAdvanced machine learning approaches for predicting Neglected Tropical Disease co-endemicity in Kenya: A focus on soil-transmitted helminths, schistosomiasis, and lymphatic filariasis. uhttps://pmc.ncbi.nlm.nih.gov/articles/PMC13046276/pdf/pntd.0014156.pdf a1 - 220 v203 a
BACKGROUND:
Neglected Tropical Diseases (NTDs) affect 1.5 billion people worldwide with 39% of the global burden occurring in Africa. In Kenya, NTDs remain endemic despite control efforts, with co-endemicity of soil-transmitted helminths (STH), schistosomiasis (SCH), and lymphatic filariasis (LF) complicating intervention strategies. This study developed machine learning models to predict high-risk co-endemic areas using demographic and Water, Sanitation, and Hygiene (WASH) indicators.
METHODOLOGY:
The study analyzed Kenya's 2022 NTD co-endemicity data from the Expanded Special Project for Elimination of Neglected Tropical Diseases, incorporating WASH and population variables. Three machine learning algorithms, Random Forest, Gradient Boosting Machine, and Extreme Gradient Boosting (XGBoost) were trained to classify regions by STH prevalence levels and co-endemicity status. Model performance was evaluated using cross-validation, Receiver Operating Characteristic - Area under the Curve (AUC) and variable importance analysis.
RESULTS:
The RF model achieved the highest predictive performance (AUC = 0.70), followed by XGBoost (AUC = 0.66) and GBM (AUC = 0.62). Key predictors included improved sanitation access (mean importance score: 0.24), population density (0.21), and co-endemicity with LF/SCH (0.18). Spatial analysis identified Eastern and North-Eastern Kenya as persistent hotspots, correlating with low WASH coverage (<40%).
CONCLUSION:
Machine learning models effectively identified the high-risk NTD co-endemic areas in Kenya, with RF outperforming other models. These findings support targeted interventions integrating WASH improvements with mass drug administration in identified hotspots. We propose a real-time dashboard for dynamic risk mapping to optimize resource allocation; a strategy aligned with Kenya's NTD Elimination Strategic Plan 2030.
a1935-2735