01566nas a2200205 4500000000100000008004100001260000900042653002300051653002100074653003000095653001300125653001200138653001800150653004700168653002200215100001200237700001200249245009900261520100000360 2023 d bIEEE10aData preprocessing10aMedical services10aElectronic Health Records10aAdaBoost10aXGBoost10aRandom Forest10aClassification and Regression Trees (CART)10aensemble learning1 aMehta J1 aKalla M00aPredicting Severity from Electronic Health Records of Leprosy Patients using Ensemble Learning3 a

Electronic Health Records (EHRs) are speedily being enforced by healthcare providers in recent years. Leprosy is a specially listed neglected tropical disease that continues as a major health problem in India. The delay in the diagnosis can lead to increase disability rate among patients. This paper intends to identify various risk factors from EHRs by applying ensemble machine learning techniques. The EHRs are included with the first sign of symptoms and various diagnosis details of leprosy cases. This information is used to determine the severity of leprosy cases and classify them into 3 categories, namely mild, moderate, and severe. To predict the severity, AdaBoost and XGBoost ensemble classifiers are applied in this paper. The performance of these classifiers is compared with Classification and Regression Trees (CART) and Random Forest (RF) techniques. The results show that AdaBoost gives with 97% accuracy and 97% precision. XGBoost gives 97% accuracy and 99% recall.