Multimodal Prediction of COVID-19 ICU Admissions and Demand Using Clinical, Governmental, and Social Media Data with GBM and LSTM Models

T. T. Sabin; B. S. Sunitha

doi:10.48084/etasr.12756

Authors

T. T. Sabin Department of Computer Science & Engineering, PES Institute of Technology and Management, Shivamogga, Karnataka, India | Visvesvaraya Technological University, Belagavi, Karnataka, India
B. S. Sunitha Department of Computer Science Engineering (Data Science), PES Institute of Technology and Management, Shivamogga, Karnataka, India | Visvesvaraya Technological University, Belagavi, Karnataka, India

Volume: 15 | Issue: 5 | Pages: 28108-28113 | October 2025 | https://doi.org/10.48084/etasr.12756

Received: 15 June 2025 | Revised: 2 July 2025, 12 July 2025, 27 July 2025, 9 August 2025, and 22 August 2025 | Accepted: 25 August 2025 | Online: 17 September 2025
Corresponding author: T. T. Sabin

Abstract

During pandemics such as COVID-19, accurate and timely prediction of disease severity is essential for allocating resources and delivering patient care effectively. This study proposes a dual-model machine learning framework that combines three data sources, namely Electronic Health Records (EHRs), government-reported case statistics, and social media sentiment trends, to predict ICU admissions at both the patient and population levels. A Gradient Boosting Machine (GBM) classifier was trained on structured clinical features, including age, comorbidities (such as diabetes and hypertension), pneumonia status, and the need for intubation, to predict the likelihood of a patient being admitted to the ICU. This model achieved 93% accuracy and an AUC of 94.5%. SHAP-based feature importance analysis identified age, hypertension, and pneumonia as the strongest predictors. A Long Short-Term Memory (LSTM) model was developed to forecast population-level ICU demand over time from trends in hospitalization rates, changes in public policy, and social media sentiment. This model achieved 95% accuracy with a ±10% error margin when predicting ICU admissions over the next 14 days. All data were temporally aligned and merged using region-level tags. To improve model performance, data preprocessing and hyperparameter tuning (using grid search and Bayesian optimization) were performed, and the models were compared with baselines such as ARIMA and linear regression. The approach shows how multimodal, interpretable AI models can support healthcare decision-making for real-time patient triage and hospital capacity planning.
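
The region-level temporal alignment and the 14-day forecasting setup described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names (align_sources, make_windows), the 7-day lookback, the dictionary layout keyed by (region, day), and all sample values are assumptions made for illustration, and the sketch presumes contiguous daily records per region.

```python
from collections import defaultdict
from datetime import date, timedelta


def align_sources(hospital, policy, sentiment):
    """Inner-join three {(region, day): value} mappings on their shared keys.

    Hypothetical helper: only (region, day) pairs present in all three
    sources are kept, grouped by region and sorted by day.
    """
    shared = hospital.keys() & policy.keys() & sentiment.keys()
    by_region = defaultdict(list)
    for region, day in sorted(shared):
        key = (region, day)
        by_region[region].append((day, (hospital[key], policy[key], sentiment[key])))
    return dict(by_region)


def make_windows(rows, icu, region, lookback=7, horizon=14):
    """Pair each lookback-day feature window with the next `horizon` days of ICU demand.

    Assumes `rows` holds contiguous daily records; lookback=7 is an
    illustrative choice, horizon=14 mirrors the forecast span in the abstract.
    """
    samples = []
    for i in range(lookback, len(rows) - horizon + 1):
        window = [features for _, features in rows[i - lookback:i]]
        targets = [icu[(region, day)] for day, _ in rows[i:i + horizon]]
        samples.append((window, targets))
    return samples


# Synthetic usage: 30 days of made-up values for one region tag.
start = date(2021, 1, 1)
days = [start + timedelta(days=k) for k in range(30)]
hospital = {("KA", d): float(k) for k, d in enumerate(days)}  # hospitalization rate
policy = {("KA", d): 1.0 for d in days}                       # policy stringency
sentiment = {("KA", d): 0.5 for d in days}                    # sentiment score
icu = {("KA", d): k for k, d in enumerate(days)}              # ICU demand target

aligned = align_sources(hospital, policy, sentiment)
samples = make_windows(aligned["KA"], icu, "KA")
```

Each sample pairs a sequence of daily feature tuples with a 14-value target vector, the shape a sequence model such as an LSTM would typically consume. An inner join is used here so that days missing from any one source are dropped rather than imputed; a real pipeline might prefer interpolation.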

Keywords:

COVID-19 severity prediction, ICU admission forecasting, machine learning in healthcare, Gradient Boosting Machine (GBM), Long Short-Term Memory (LSTM), feature importance analysis, disease severity classification, time-series forecasting, healthcare resource optimization, pandemic preparedness
