TY - JOUR
KW - Machine learning
KW - Imbalanced Data
KW - Classification Algorithms
KW - Particle Swarm Optimization
KW - Stroke Prediction
KW - Chagas' disease
AU - Coimbra AG
AU - Oliveira CG
AU - Libório MP
AU - Mannan H
AU - Santos LI
AU - Fusco E
AU - D’Angelo MF
AU - Bukhari SNH
AB - Machine learning has increasingly gained prominence in the healthcare sector due to its ability to address various challenges. However, a significant issue remains unresolved in this field: the handling of imbalanced data. This process is crucial for ensuring the efficiency of algorithms that utilize classification techniques, which are commonly applied in risk management, monitoring, diagnosis, and prognosis of patient health. This study conducts a comparative analysis of techniques for handling imbalanced data and evaluates their effectiveness in combination with a set of classification algorithms, specifically focusing on stroke prediction. Additionally, a new approach based on Particle Swarm Optimization (PSO) and Naive Bayes was proposed. This approach was applied to the real problem of Chagas disease. The application of these techniques aims to improve the quality of life for individuals, reduce healthcare costs, and allocate available resources more efficiently, making it a preventive action.
BT - PLOS One
DO - 10.1371/journal.pone.0320966
IS - 5
LA - eng
M3 - Research Article
N2 - Machine learning has increasingly gained prominence in the healthcare sector due to its ability to address various challenges. However, a significant issue remains unresolved in this field: the handling of imbalanced data. This process is crucial for ensuring the efficiency of algorithms that utilize classification techniques, which are commonly applied in risk management, monitoring, diagnosis, and prognosis of patient health. This study conducts a comparative analysis of techniques for handling imbalanced data and evaluates their effectiveness in combination with a set of classification algorithms, specifically focusing on stroke prediction. Additionally, a new approach based on Particle Swarm Optimization (PSO) and Naive Bayes was proposed. This approach was applied to the real problem of Chagas disease. The application of these techniques aims to improve the quality of life for individuals, reduce healthcare costs, and allocate available resources more efficiently, making it a preventive action.
PB - Public Library of Science (PLoS)
PY - 2025
SP - 1
EP - 19
T2 - PLOS One
TI - Approaches for handling imbalanced data used in machine learning in the healthcare field: A case study on Chagas disease database prediction
UR - https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0320966
VL - 20
SN - 1932-6203
ER -