FreshRSS

🔒
❌ Acerca de FreshRSS
Hay nuevos artículos disponibles. Pincha para refrescar la página.
Hoy — Abril 4th 2026Tus fuentes RSS

Synthetic data-augmented machine learning for 30-day readmission prediction in patients with chronic conditions: a retrospective real-world study

Por: Brothers · T. · Adhikari · K. · Ramazani · M. · Imtiaz · A. · Al-Mamun · M.
Objectives

To develop and evaluate an explainable machine learning framework enhanced with synthetic data generation to predict unplanned 30-day hospital readmissions among patients with chronic obstructive pulmonary disease (COPD), heart failure (HF) and type 2 diabetes mellitus (T2DM), and to identify key clinical and social predictors of readmission.

Design

A retrospective cohort study using electronic health record data incorporating both structured variables and information extracted from unstructured clinical notes. Synthetic data were generated using advanced resampling and deep learning-based techniques to address outcome imbalance and improve model training.

Setting

Intensive care unit and general ward admissions at a single tertiary academic medical centre included in the MIMIC-IV (Medical Information Mart for Intensive Care IV) database.

Participants

Adult patients (≥18 years) were admitted with a primary diagnosis of COPD (n=14 050), HF (n=7097) or T2DM (n=12 735) between 2008 and 2019, with complete 30-day follow-up and no in-hospital mortality during the index admission.

Primary and secondary outcome measures

The primary outcome was unplanned all-cause hospital readmission within 30-days of discharge. Predictors were drawn from six domains, including demographics, comorbidities, clinical acuity, therapies, behavioural factors and care continuity. Predictive performance was evaluated using multiple machine learning methods and fivefold cross-validation, with model interpretability assessed using established goal and local explanation approaches.

Results

Ensemble-based machine learning models demonstrated the strongest predictive performance across all three disease cohorts. Key predictors of readmission included higher illness severity, greater comorbidity burden, medication non-adherence, gaps in preventive care and limited social support. Models incorporating synthetic data augmentation showed improved discrimination compared with models trained on original data alone.

Conclusions

An explainable synthetic-data driven framework incorporating clinical, behavioural and social data can support prediction of 30-day readmissions among patients with common chronic conditions using routinely available electronic health record data.

❌