Transforming Air Quality Index Prediction Using Machine Learning: Key Insights from the Taj Trapezium Zone (TTZ)
Received: 16 May 2025 | Revised: 23 June 2025 | Accepted: 9 July 2025 | Online: 6 October 2025
Corresponding author: Swati Varshney
Abstract
The exponential growth of air pollution is alarming for today's modern society. The Air Quality Index (AQI) is the main index used to gauge the severity of air pollution. In light of the recent Commission for Air Quality Management (CAQM) mandate for three days' advance upgrading of the Graded Response Action Plan (GRAP) stage, this research focuses on predicting the AQI to minimize negative economic losses through the correct adoption of the GRAP stage. The current research paper aims to predict the AQI in the Taj Trapezium Zone (TTZ) using machine learning algorithms. In this study, the main air contaminants that affect air quality—PM2.5, PM10, CO, NO2, NH3, NOx, O3, SO2, benzene, and toluene—along with meteorological factors such as temperature, humidity, wind speed, wind direction, and pressure, are used for data analysis and AQI prediction using machine learning algorithms. The study also aims to identify the most dominant pollutants in the TTZ area. The data source is a real-time dataset from air quality monitoring stations operated by the Central Pollution Control Board (CPCB) in Agra, one of the key cities within the TTZ area. This study uses four machine learning algorithms: AdaBoost, XGBoost, CatBoost, and LightGBM to calculate and compare AQI prediction accuracy. Four statistical performance metrics are used, namely: R2 score, Root Mean Square Error (RMSE), Mean Square Error (MSE), and Mean Absolute Error (MAE). The findings show that the XGBoost algorithm forecasts the AQI with the highest accuracy and best performance in the TTZ region. The study also found that PM10 is the most dominant pollutant in the TTZ area. This indicates that more stringent control measures are needed to curb PM10 pollution and improve the AQI. The use of these predicted AQI values may help CAQM in real-time GRAP stage decision making.
Keywords:
Air Quality Index (AQI), Graded Response Action Plan (GRAP), machine learning, Commission for Air Quality Management (CAQM), ensemble techniques, XGBoost, AdaBoost, LightGBM, CatBoostDownloads
References
"National Air Quality Index." Central Pollution Control Board. https://airquality.cpcb.gov.in/AQI_India/.
"Stage-IV Of GRAP In Delhi." Pwonlyias. https://pwonlyias.com/current-affairs/stage-iv-of-grap-in-delhi/.
"Continuous Stations Status." Central Pollution Control Board. https://airquality.cpcb.gov.in/ccr/#/caaqm-dashboard/caaqm-landing.
A. Ansari and A. R. Quaff, "Advanced Machine Learning Techniques for Precise hourly Air Quality Index (AQI) Prediction in Azamgarh, India," International Journal of Environmental Research, vol. 19, no. 1, Nov. 2024, Art. no. 15.
G. Sharma, S. Khurana, N. Saina, Shivansh, and G. Gupta, "Comparative Analysis of Machine Learning Techniques in Air Quality Index (AQI) prediction in smart cities," International Journal of System Assurance Engineering and Management, vol. 15, no. 7, pp. 3060–3075, July 2024.
H. Jing and Y. Wang, "Research on Urban Air Quality Prediction Based on Ensemble Learning of XGBoost," E3S Web of Conferences, vol. 165, May 2020, Art. no. 02014.
K. Kumar and B. P. Pande, "Air pollution prediction with machine learning: a case study of Indian cities," International Journal of Environmental Science and Technology, vol. 20, no. 5, pp. 5333–5348, May 2023.
Q. Liu, B. Cui, and Z. Liu, "Air Quality Class Prediction Using Machine Learning Methods Based on Monitoring Data and Secondary Modeling," Atmosphere, vol. 15, no. 5, May 2024, Art. no. 553.
V. Devasekhar and P. Natarajan, "Prediction of Air Quality and Pollution using Statistical Methods and Machine Learning Techniques," International Journal of Advanced Computer Science and Applications, vol. 14, no. 4, pp. 927–937, 39/29 2023.
Y.-C. Liang, Y. Maimury, A. H.-L. Chen, and J. R. C. Juarez, "Machine Learning-Based Prediction of Air Quality," Applied Sciences, vol. 10, no. 24, Dec. 2020, Art. no. 9151.
M. Emeç and M. Yurtsever, "A novel ensemble machine learning method for accurate air quality prediction," International Journal of Environmental Science and Technology, vol. 22, no. 1, pp. 459–476, Jan. 2025.
A. Bekkar, B. Hssina, S. Douzi, and K. Douzi, "Air-pollution prediction in smart city, deep learning approach," Journal of Big Data, vol. 8, no. 1, Dec. 2021, Art. no. 161.
C.-C. Wei and W.-J. Kao, "Establishing a Real-Time Prediction System for Fine Particulate Matter Concentration Using Machine-Learning Models," Atmosphere, vol. 14, no. 12, p. 1817, Dec. 2023.
S. K. Natarajan, P. Shanmurthy, D. Arockiam, B. Balusamy, and S. Selvarajan, "Optimized machine learning model for air quality index prediction in major cities in India," Scientific Reports, vol. 14, no. 1, Mar. 2024, Art. no. 6795.
D. Zhu, C. Cai, T. Yang, and X. Zhou, "A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization," Big Data and Cognitive Computing, vol. 2, no. 1, Mar. 2018, Art. no. 5.
N. S. Gupta, Y. Mohta, K. Heda, R. Armaan, B. Valarmathi, and G. Arulkumaran, "Prediction of Air Quality Index Using Machine Learning Techniques: A Comparative Analysis," Journal of Environmental and Public Health, vol. 2023, no. 1, 2023, Art. no. 4916267.
S. Simu et al., "Air Pollution Prediction using Machine Learning," in 2020 IEEE Bombay Section Signature Conference, Mumbai, India, 2020, pp. 231–236.
K. M. O. Nahar, M. A. Ottom, F. Alshibli, and M. M. A. Shquier, "Air quality index using machine learning – A Jordan case study," Compusoft: An International Journal of Advanced Computer Technology, vol. 9, no. 9, pp. 3831–3840, Sept. 2020.
P. Bhalgat, S. Pitale, and S. Bhoite, "Air Quality Prediction using Machine Learning Algorithms," International Journal of Computer Applications Technology and Research, vol. 8, no. 9, pp. 367–370, Sept. 2019.
T. Madan, S. Sagar, and D. Virmani, "Air Quality Prediction using Machine Learning Algorithms –A Review," in 2020 2nd International Conference on Advances in Computing, Communication Control and Networking, Greater Noida, India, 2020, pp. 140–145.
H. A. Al-Jamimi, S. Al-Azani, and T. A. Saleh, "Supervised machine learning techniques in the desulfurization of oil products for environmental protection: A review," Process Safety and Environmental Protection, vol. 120, pp. 57–71, Nov. 2018.
M. Castelli, F. M. Clemente, A. Popovič, S. Silva, and L. Vanneschi, "A Machine Learning Approach to Predict Air Quality in California," Complexity, vol. 2020, no. 1, Aug. 2020, Art. no. 8049504.
U. Mahalingam, K. Elangovan, H. Dobhal, C. Valliappa, S. Shrestha, and G. Kedam, "A Machine Learning Model for Air Quality Prediction for Smart Cities," in 2019 International Conference on Wireless Communications Signal Processing and Networking, Chennai, India, 2019, pp. 452–457.
D. Sanjeev, "Implementation of Machine Learning Algorithms for Analysis and Prediction of Air Quality," International Journal of Engineering Research & Technology, vol. 10, no. 3, pp. 433–538, Mar. 2021.
M. R. Delavar et al., "A Novel Method for Improving Air Pollution Prediction Based on Machine Learning Approaches: A Case Study Applied to the Capital City of Tehran," ISPRS International Journal of Geo-Information, vol. 8, no. 2, Feb. 2019, Art. no. 99.
V. M. Madhuri, G. G. H. Samyama, and S. Kamalapurkar, "Air Pollution Prediction Using Machine Learning Supervised Learning Approach," International Journal of Scientific and Technology Research, vol. 9, no. 4, pp. 118–123, Apr. 2020.
Downloads
How to Cite
License
Copyright (c) 2025 Swati Varshney, Jitendra Nath Shrivastava, Neha Gupta

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.