02683nas a2200205   4500000000100000008004100001260001200042653001400054653002100068653001500089653001700104653001300121100001500134700001400149245007400163856008300237300001100320520213200331022001402463       2026                            d  c04/202610aBlindness10amachine learning10aR-workflow10aSurveillance10aTrachoma1 aKulohoma B1 aWesonga C00aScaling trachoma surveillance in endemic areas using machine learning  uhttps://link.springer.com/content/pdf/10.1186/s12879-026-13215-8_reference.pdf  a1 - 123 a<p><strong>Background</strong></p>

<p>Trachoma remains a leading infectious cause of blindness in endemic regions despite progress towards global elimination. Accurate and scalable diagnosis remains challenging in areas with limited ophthalmic expertise. We developed and evaluated a reproducible transfer learning model using eyelid image-based data.</p>

<p><strong>Methods</strong></p>

<p>We retrospectively analysed anonymised inner eyelid photographs collected from trachoma prevalence surveys conducted in Ethiopia, Tanzania, Australia, Solomon Islands, Colombia, the Gambia, and Guatemala (<em>n</em> = 572 images). Images were categorized as trachoma (<em>n</em> = 251) or not trachoma (<em>n</em> = 321) based on consensus grades from the Global Trachoma Mapping Project (GMTP) certified graders. Data were processed and analysed in R (version 4.4.1) using the keras and tensorflow packages interfaced with Python (version 3.10) through reticulate. A ResNet50 convolutional neural network pretrained on ImageNet was fine-tuned for binary classification. The model was trained for up to 30 epochs with early stopping and adaptive learning-rate reduction. Performance on a held-out validation set (<em>n</em> = 62, 12%) was evaluated using accuracy, sensitivity, specificity, and Cohen’s κ.</p>

<p><strong>Results</strong></p>

<p>The ResNet50 model achieved an overall accuracy of 85.4% (95% CI 76.3–92%) on the validation set. The model achieved a sensitivity of 80.9% for detecting trachoma and a specificity of 90.5% for correctly identifying non-trachoma eyes. Agreement between predictions and grader labels was substantial (κ = 0.71, 95% CI 0.58–0.84).</p>

<p><strong>Conclusion</strong></p>

<p>Our findings demonstrate the feasibility of using a reproducible R-based deep learning pipeline for automated trachoma classification in large-scale surveys. These findings demonstrate the feasibility of automated trachoma classification using a transfer-learning framework. Larger datasets will be required before operational deployment in surveillance programmes.</p>
  a1471-2334