Transforming Air Quality Index Prediction Using Machine Learning: Key Insights from the Taj Trapezium Zone (TTZ)

Authors

  • Swati Varshney Department of Computer Application, Invertis University, Bareilly, Uttar Pradesh, India
  • Jitendra Nath Shrivastava Department of Computer Science and Engineering, Invertis University, Bareilly, Uttar Pradesh, India
  • Neha Gupta School of Computer Science and Information Technology, Symbiosis University of Applied Science, Indore, Madhya Pradesh, India
Volume: 15 | Issue: 5 | Pages: 26741-26749 | October 2025 | https://doi.org/10.48084/etasr.12168

Abstract

The exponential growth of air pollution is alarming for today's modern society. The Air Quality Index (AQI) is the main index used to gauge the severity of air pollution. In light of the recent Commission for Air Quality Management (CAQM) mandate for three days' advance upgrading of the Graded Response Action Plan (GRAP) stage, this research focuses on predicting the AQI to minimize negative economic losses through the correct adoption of the GRAP stage. The current research paper aims to predict the AQI in the Taj Trapezium Zone (TTZ) using machine learning algorithms. In this study, the main air contaminants that affect air quality—PM2.5, PM10, CO, NO2, NH3, NOx, O3, SO2, benzene, and toluene—along with meteorological factors such as temperature, humidity, wind speed, wind direction, and pressure, are used for data analysis and AQI prediction using machine learning algorithms. The study also aims to identify the most dominant pollutants in the TTZ area. The data source is a real-time dataset from air quality monitoring stations operated by the Central Pollution Control Board (CPCB) in Agra, one of the key cities within the TTZ area. This study uses four machine learning algorithms: AdaBoost, XGBoost, CatBoost, and LightGBM to calculate and compare AQI prediction accuracy. Four statistical performance metrics are used, namely: R2 score, Root Mean Square Error (RMSE), Mean Square Error (MSE), and Mean Absolute Error (MAE). The findings show that the XGBoost algorithm forecasts the AQI with the highest accuracy and best performance in the TTZ region. The study also found that PM10 is the most dominant pollutant in the TTZ area. This indicates that more stringent control measures are needed to curb PM10 pollution and improve the AQI. The use of these predicted AQI values may help CAQM in real-time GRAP stage decision making.

Keywords:

Air Quality Index (AQI), Graded Response Action Plan (GRAP), machine learning, Commission for Air Quality Management (CAQM), ensemble techniques, XGBoost, AdaBoost, LightGBM, CatBoost

Downloads

Download data is not yet available.

References

"National Air Quality Index." Central Pollution Control Board. https://airquality.cpcb.gov.in/AQI_India/.

"Stage-IV Of GRAP In Delhi." Pwonlyias. https://pwonlyias.com/current-affairs/stage-iv-of-grap-in-delhi/.

"Continuous Stations Status." Central Pollution Control Board. https://airquality.cpcb.gov.in/ccr/#/caaqm-dashboard/caaqm-landing.

A. Ansari and A. R. Quaff, "Advanced Machine Learning Techniques for Precise hourly Air Quality Index (AQI) Prediction in Azamgarh, India," International Journal of Environmental Research, vol. 19, no. 1, Nov. 2024, Art. no. 15.

G. Sharma, S. Khurana, N. Saina, Shivansh, and G. Gupta, "Comparative Analysis of Machine Learning Techniques in Air Quality Index (AQI) prediction in smart cities," International Journal of System Assurance Engineering and Management, vol. 15, no. 7, pp. 3060–3075, July 2024.

H. Jing and Y. Wang, "Research on Urban Air Quality Prediction Based on Ensemble Learning of XGBoost," E3S Web of Conferences, vol. 165, May 2020, Art. no. 02014.

K. Kumar and B. P. Pande, "Air pollution prediction with machine learning: a case study of Indian cities," International Journal of Environmental Science and Technology, vol. 20, no. 5, pp. 5333–5348, May 2023.

Q. Liu, B. Cui, and Z. Liu, "Air Quality Class Prediction Using Machine Learning Methods Based on Monitoring Data and Secondary Modeling," Atmosphere, vol. 15, no. 5, May 2024, Art. no. 553.

V. Devasekhar and P. Natarajan, "Prediction of Air Quality and Pollution using Statistical Methods and Machine Learning Techniques," International Journal of Advanced Computer Science and Applications, vol. 14, no. 4, pp. 927–937, 39/29 2023.

Y.-C. Liang, Y. Maimury, A. H.-L. Chen, and J. R. C. Juarez, "Machine Learning-Based Prediction of Air Quality," Applied Sciences, vol. 10, no. 24, Dec. 2020, Art. no. 9151.

M. Emeç and M. Yurtsever, "A novel ensemble machine learning method for accurate air quality prediction," International Journal of Environmental Science and Technology, vol. 22, no. 1, pp. 459–476, Jan. 2025.

A. Bekkar, B. Hssina, S. Douzi, and K. Douzi, "Air-pollution prediction in smart city, deep learning approach," Journal of Big Data, vol. 8, no. 1, Dec. 2021, Art. no. 161.

C.-C. Wei and W.-J. Kao, "Establishing a Real-Time Prediction System for Fine Particulate Matter Concentration Using Machine-Learning Models," Atmosphere, vol. 14, no. 12, p. 1817, Dec. 2023.

S. K. Natarajan, P. Shanmurthy, D. Arockiam, B. Balusamy, and S. Selvarajan, "Optimized machine learning model for air quality index prediction in major cities in India," Scientific Reports, vol. 14, no. 1, Mar. 2024, Art. no. 6795.

D. Zhu, C. Cai, T. Yang, and X. Zhou, "A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization," Big Data and Cognitive Computing, vol. 2, no. 1, Mar. 2018, Art. no. 5.

N. S. Gupta, Y. Mohta, K. Heda, R. Armaan, B. Valarmathi, and G. Arulkumaran, "Prediction of Air Quality Index Using Machine Learning Techniques: A Comparative Analysis," Journal of Environmental and Public Health, vol. 2023, no. 1, 2023, Art. no. 4916267.

S. Simu et al., "Air Pollution Prediction using Machine Learning," in 2020 IEEE Bombay Section Signature Conference, Mumbai, India, 2020, pp. 231–236.

K. M. O. Nahar, M. A. Ottom, F. Alshibli, and M. M. A. Shquier, "Air quality index using machine learning – A Jordan case study," Compusoft: An International Journal of Advanced Computer Technology, vol. 9, no. 9, pp. 3831–3840, Sept. 2020.

P. Bhalgat, S. Pitale, and S. Bhoite, "Air Quality Prediction using Machine Learning Algorithms," International Journal of Computer Applications Technology and Research, vol. 8, no. 9, pp. 367–370, Sept. 2019.

T. Madan, S. Sagar, and D. Virmani, "Air Quality Prediction using Machine Learning Algorithms –A Review," in 2020 2nd International Conference on Advances in Computing, Communication Control and Networking, Greater Noida, India, 2020, pp. 140–145.

H. A. Al-Jamimi, S. Al-Azani, and T. A. Saleh, "Supervised machine learning techniques in the desulfurization of oil products for environmental protection: A review," Process Safety and Environmental Protection, vol. 120, pp. 57–71, Nov. 2018.

M. Castelli, F. M. Clemente, A. Popovič, S. Silva, and L. Vanneschi, "A Machine Learning Approach to Predict Air Quality in California," Complexity, vol. 2020, no. 1, Aug. 2020, Art. no. 8049504.

U. Mahalingam, K. Elangovan, H. Dobhal, C. Valliappa, S. Shrestha, and G. Kedam, "A Machine Learning Model for Air Quality Prediction for Smart Cities," in 2019 International Conference on Wireless Communications Signal Processing and Networking, Chennai, India, 2019, pp. 452–457.

D. Sanjeev, "Implementation of Machine Learning Algorithms for Analysis and Prediction of Air Quality," International Journal of Engineering Research & Technology, vol. 10, no. 3, pp. 433–538, Mar. 2021.

M. R. Delavar et al., "A Novel Method for Improving Air Pollution Prediction Based on Machine Learning Approaches: A Case Study Applied to the Capital City of Tehran," ISPRS International Journal of Geo-Information, vol. 8, no. 2, Feb. 2019, Art. no. 99.

V. M. Madhuri, G. G. H. Samyama, and S. Kamalapurkar, "Air Pollution Prediction Using Machine Learning Supervised Learning Approach," International Journal of Scientific and Technology Research, vol. 9, no. 4, pp. 118–123, Apr. 2020.

Downloads

How to Cite

[1]
S. Varshney, J. N. Shrivastava, and N. Gupta, “Transforming Air Quality Index Prediction Using Machine Learning: Key Insights from the Taj Trapezium Zone (TTZ)”, Eng. Technol. Appl. Sci. Res., vol. 15, no. 5, pp. 26741–26749, Oct. 2025.

Metrics

Abstract Views: 8
PDF Downloads: 2

Metrics Information