Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.14 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)
Chronic diseases cause 71% of global deaths (41 million annually) and consume 75% of healthcare costs. Traditional risk scores (Framingham C-statistic 0.76-0.79, QRISK3 0.86-0.88) demonstrate limited accuracy with linear assumptions and population-specific miscalibration. We developed an AI-driven prediction system using supervised machine learning on publicly available Electronic Health Record datasets (UCI Heart Disease and Pima Indians Diabetes) with clinically relevant features. Methods included comprehensive preprocessing (k-NN imputation, one-hot encoding, z-score normalization, SMOTE balancing), comparison of four algorithms (Logistic Regression, Random Forest, XGBoost, Neural Networks), and stratified 5-fold cross-validation with grid search optimization. XGBoost achieved superior performance: 92.4% accuracy (95% CI: 91.8-93.0%), 91.2%
recall, 0.964 ROC-AUC (95% CI: 0.958-0.970), outperforming Framingham by 16 percentage points and QRISK3 by 14 points. SHAP analysis identified clinically meaningful predictors: HbA1c (0.187), age (0.152), systolic blood pressure (0.134), BMI (0.121), fasting glucose (0.108). Risk stratification into low (<0.3, 62% population), medium (0.3-0.7, 28%), and high (≥0.7, 10%) categories demonstrated excellent calibration (Hosmer-Lemeshow p=0.42). Subgroup analysis showed consistent fairness across demographics (all ROC-AUC >0.94). Computational performance of 87ms latency enables real-time clinical deployment. This system provides superior accuracy, interpretability, and fairness for early high-risk identification and targeted prevention.
Keywords:
Chronic disease prediction, Electronic health records, Machine learning, XGBoost, Risk stratification, SHAP, Clinical decision support
Cite Article:
"AI-Driven Risk Prediction System for Chronic Diseases: A Supervised Machine Learning Approach Using Electronic Health Records", International Journal for Research Trends and Innovation (www.ijrti.org), ISSN:2456-3315, Vol.11, Issue 4, page no.b211-b217, April-2026, Available :http://www.ijrti.org/papers/IJRTI2604167.pdf
Downloads:
00041
ISSN:
2456-3315 | IMPACT FACTOR: 8.14 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.14 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator