Ensemble learning approach with explainable AI for improved heart disease prediction
Adekoya, Ayomide and Saeed, Faisal and Ghaban, Wad and Qasem, Sultan Noman (2025) Ensemble learning approach with explainable AI for improved heart disease prediction. Frontiers in Pharmacology, 16. ISSN 1663-9812
Text: fphar-16-1654681.pdf - Published Version. Available under License Creative Commons Attribution.
Abstract
Introduction
Heart disease remains a leading cause of global morbidity and mortality, motivating the development of predictive models that are both accurate and clinically interpretable. We introduce the Interpretable Ensemble Learning Framework (IELF), which integrates Explainable Boosting Machines (EBM) with XGBoost, SHAP-based explanations, and LIME for enhanced local interpretability.
Methods
IELF was evaluated on two benchmark datasets: Cleveland (n = 303) and Framingham (n = 4,240). Model assessment included 5-fold cross-validation, held-out test sets, calibration, subgroup analyses, and explanation stability evaluation using Kendall’s τ and Overlap@10.
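As an illustration of the two explanation-stability measures named above, the following minimal sketch compares feature-importance scores from two cross-validation folds. It is not the authors' implementation: the function names, the toy importance values, and the exact definition of Overlap@k used here (shared fraction of the top-k features) are assumptions, and Kendall's τ is computed in its simple tau-a form without tie correction.

```python
from itertools import combinations


def kendall_tau(scores_a, scores_b):
    """Kendall's tau-a between two equal-length score lists (no tie correction)."""
    if len(scores_a) != len(scores_b):
        raise ValueError("score lists must have the same length")
    n = len(scores_a)
    if n < 2:
        raise ValueError("need at least two items to compare rankings")
    concordant = discordant = 0
    for i, j in combinations(range(n), 2):
        # A pair is concordant when both lists order items i and j the same way.
        sign = (scores_a[i] - scores_a[j]) * (scores_b[i] - scores_b[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)


def overlap_at_k(importance_a, importance_b, k=10):
    """Fraction of features shared by the top-k of two importance dicts."""
    k = min(k, len(importance_a), len(importance_b))
    top_a = set(sorted(importance_a, key=importance_a.get, reverse=True)[:k])
    top_b = set(sorted(importance_b, key=importance_b.get, reverse=True)[:k])
    return len(top_a & top_b) / k


# Hypothetical importances from two folds (illustrative values only).
fold_a = {"age": 0.30, "chol": 0.20, "thalach": 0.10, "sex": 0.05}
fold_b = {"age": 0.28, "chol": 0.08, "thalach": 0.15, "sex": 0.04}

features = list(fold_a)
tau = kendall_tau([fold_a[f] for f in features], [fold_b[f] for f in features])
overlap = overlap_at_k(fold_a, fold_b, k=2)
print(f"Kendall tau: {tau:.3f}, Overlap@2: {overlap:.2f}")
```

In practice both measures would be averaged over all pairs of folds; values of τ and Overlap@k close to 1 indicate that the explanation rankings change little from fold to fold.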
Results
IELF achieved robust discrimination (AUC 0.899, accuracy 88.5% on Cleveland; AUC 0.696, accuracy 82.6% on Framingham) with balanced precision–recall profiles. Compared with EBM, IELF significantly improved recall, F1, and AUC on the Framingham dataset (p < 0.05), while differences versus XGBoost were less consistent. IELF produced transparent feature rankings aligned with established cardiovascular risk factors and stable explanations across folds.
Discussion
IELF is, to our knowledge, the first framework to combine EBM and XGBoost with SHAP and LIME under strict nested cross-validation and calibration procedures. Although headline accuracies are lower than some recent >97% reports, IELF was developed under stricter methodological controls that enhance reproducibility, interpretability, and clinical reliability. These findings position IELF as a trustworthy benchmark for translational AI in cardiovascular risk prediction, complementing high-accuracy but less transparent models.
| Item Type: | Article |
|---|---|
| Identification Number: | 10.3389/fphar.2025.1654681 |
| Dates: | 11 December 2025 (Published Online); 18 November 2025 (Accepted) |
| Uncontrolled Keywords: | cardiovascular disease, explainable AI, heart disease, ensemble learning, XGBoost, ensemble |
| Subjects: | CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science |
| Divisions: | Architecture, Built Environment, Computing and Engineering > Computer Science |
| Depositing User: | Gemma Tonks |
| Date Deposited: | 14 May 2026 12:16 |
| Last Modified: | 14 May 2026 12:16 |
| URI: | https://www.open-access.bcu.ac.uk/id/eprint/17042 |