Ensemble learning approach with explainable AI for improved heart disease prediction

Adekoya, Ayomide; Saeed, Faisal; Ghaban, Wad; Qasem, Sultan Noman

Ensemble learning approach with explainable AI for improved heart disease prediction

Adekoya, Ayomide and Saeed, Faisal and Ghaban, Wad and Qasem, Sultan Noman (2025) Ensemble learning approach with explainable AI for improved heart disease prediction. Frontiers in Pharmacology, 16. ISSN 1663-9812

Preview

Text
fphar-16-1654681.pdf - Published Version
Available under License Creative Commons Attribution.
Download (3MB)

Official URL: https://www.frontiersin.org/journals/pharmacology/...

Abstract

Introduction

Heart disease remains a leading cause of global morbidity and mortality, motivating the development of predictive models that are both accurate and clinically interpretable. We introduce the Interpretable Ensemble Learning Framework (IELF), which integrates Explainable Boosting Machines (EBM) with XGBoost, SHAP-based explanations, and LIME for enhanced local interpretability.

Methods

IELF was evaluated on two benchmark datasets: Cleveland (n = 303) and Framingham (n = 4,240). Model assessment included 5-fold cross-validation, held-out test sets, calibration, subgroup analyses, and explanation stability evaluation using Kendall’s τ and Overlap@10.

Results

IELF achieved robust discrimination (AUC 0.899, accuracy 88.5% on Cleveland; AUC 0.696, accuracy 82.6% on Framingham) with balanced precision–recall profiles. Compared with EBM, IELF significantly improved recall, F1, and AUC on the Framingham dataset (p < 0.05), while differences versus XGBoost were less consistent. IELF produced transparent feature rankings aligned with established cardiovascular risk factors and stable explanations across folds.

Discussion

IELF is, to our knowledge, the first framework to combine EBM and XGBoost with SHAP and LIME under strict nested cross-validation and calibration procedures. Although headline accuracies are lower than some recent >97% reports, IELF was developed under stricter methodological controls that enhance reproducibility, interpretability, and clinical reliability. These findings position IELF as a trustworthy benchmark for translational AI in cardiovascular risk prediction, complementing high-accuracy but less transparent models.

Item Type:	Article
Identification Number:	10.3389/fphar.2025.1654681
Dates:	Date Event 11 December 2025 Published Online 18 November 2025 Accepted
Uncontrolled Keywords:	cardiovascular disease, explainable AI, heart disease, ensemble learning, XGBoost, ensemble
Subjects:	CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
Divisions:	Architecture, Built Environment, Computing and Engineering > Computer Science
Depositing User:	Gemma Tonks
Date Deposited:	14 May 2026 12:16
Last Modified:	14 May 2026 12:16
URI:	https://www.open-access.bcu.ac.uk/id/eprint/17042