Saputra, Dimas Chaerul Ekty and Ratnaningsih, Tri and Futri, Irianna and Muryadi, Elvaro Islami and Phann, Raksmey and Tun, Su Sandi Hla and Caibigan, Ritchie Natuan (2024) HNIHA: Hybrid Nature-Inspired Imbalance Handling Algorithm to Addressing Imbalanced Datasets for Improved Classification: In Case of Anemia Identification. Buletin Ilmiah Sarjana Teknik Elektro, 6 (3). pp. 254-270.
11306-Article Text-44791-3-10-20250425.pdf - Published Version
Download (918kB)
Abstract
This study presents a comprehensive evaluation of three ensemble models designed to handle imbalanced datasets. Each model incorporates the hybrid nature-inspired imbalance handling algorithm (HNIHA) with matthews correlation coefficient and synthetic minority oversampling technique in conjunction with different base classifiers: support vector machine, random forest, and LightGBM. Our focus is to address the challenges posed by imbalanced datasets, emphasizing the balance between sensitivity and specificity. The HNIHA algorithm-guided support vector machine ensemble demonstrated superior performance, achieving an impressive matthews correlation coefficient of 0.8739, showcasing its robustness in balancing true positives and true negatives. The f1-score, precision, and recall metrics further validated its accuracy, precision, and sensitivity, attaining values of 0.9767, 0.9545, and 1.0, respectively. The ensemble demonstrated its ability to minimize prediction errors by minimizing the mean squared error and root mean squared error to 0.0384 and 0.1961, respectively. The HNIHA-guided random forest ensemble and HNIHA-guided LightGBM ensemble also exhibited strong performances.
| Item Type: | Article |
|---|---|
| Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering |
| Depositing User: | BISTE UAD |
| Date Deposited: | 20 May 2026 03:45 |
| Last Modified: | 20 May 2026 03:45 |
| URI: | https://alxiv.org/id/eprint/899 |
