Utilizing Deep Learning Algorithms for the Prompt Identification of Chronic Obstructive Pulmonary Disease

Authors

Volume: 15 | Issue: 4 | Pages: 24940-24949 | August 2025 | https://doi.org/10.48084/etasr.10738

Abstract

This study presents a Deep Learning (DL)-based approach for the early detection of Chronic Obstructive Pulmonary Disease (COPD) using a novel dual-branch Convolutional Neural Network (CNN) architecture.DL techniques are leveraged to recognize complex, early-stage patterns of the disease that may be overlooked by conventional medical assessments or traditional machine learning models, which are prone to misclassifying COPD as other lung conditions. To ensure robust model training, a pre-filtered dataset of lung sound recordings was used. These recordings, each 20 s in duration, were cleaned, standardized, and converted into two-dimensional representations using Mel spectrograms and Mel Frequency Cepstral Coefficients (MFCCs). These image-like features served as the input for the CNN model, enhancing its ability to distinguish COPD-specific acoustic patterns.To address the issue of class imbalance in the dataset, two data augmentation techniques, pitch-shifted noise injection and time-frequency masking, were applied, contributing to improved model generalization. The proposed CNN model achieved promising results, with a precision of 97.75%, an accuracy of 96.0%, a sensitivity of 97.96%, and an F1-score of 96.97% during validation. These performance metrics outperform those obtained from widely used CNN architectures, such as InceptionV3 and ResNet, highlighting the effectiveness of the proposed model. Overall, the proposed approach demonstrates significant potential as a reliable diagnostic support tool for early COPD detection.

Keywords:

deep learning, chronic obstructive pulmonary disease, data augmentation, double-branch, convolutional neural network, mel frequency cepstral coefficients

Downloads

Download data is not yet available.

References

World Health Organization, World Health Statistics 2021: Monitoring Health for the SDGs, Sustainable Development Goals, 1st ed. Geneva: World Health Organization, 2021.

T. Siddiqui, M. Latif, M. U. Farooq, M. A. Baig, and Y. S. Hassan, "Chronic Obstructive Pulmonary Disease Diagnosis with Bagging Ensemble Learning and ANN Classifiers," Engineering, Technology & Applied Science Research, vol. 14, no. 3, pp. 14741–14746, Jun. 2024. DOI: https://doi.org/10.48084/etasr.7106

R. Karla and R. Yalavarthi, "A Hybrid RNN-based Deep Learning Model for Lung Cancer and COPD Detection," Engineering, Technology & Applied Science Research, vol. 14, no. 5, pp. 16847–16853, Oct. 2024. DOI: https://doi.org/10.48084/etasr.8181

J. Y. Choi and C. K. Rhee, "Diagnosis and Treatment of Early Chronic Obstructive Lung Disease (COPD)," Journal of Clinical Medicine, vol. 9, no. 11, Oct. 2020, Art. no. 3426. DOI: https://doi.org/10.3390/jcm9113426

P. J. Patel et al., "Performance analysis of deep learning algorithms for classifying chronic obstructive pulmonary disease," Journal of Integrated Science and Technology, vol. 12, no. 2, 2024, Art. no. 745.

ADVOCATEHEALTH. "Chronic obstructive pulmonary disease (COPD)." Advocate Health Care. [Online]. Available: https://www.advocatehealth.com/health-services/lung-respiratory-care/chronic-obstructive-pulmonary-disease-copd.

M. Elbarbary et al., "Ambient air pollution, lung function and COPD: cross-sectional analysis from the WHO Study of AGEing and adult health wave 1," BMJ Open Respiratory Research, vol. 7, no. 1, Dec. 2020, Art. no. e000684. DOI: https://doi.org/10.1136/bmjresp-2020-000684

O. R. Sánchez-Ccoyllo, C. G. Ordoñez-Aquino, J. Arratea-Morán, N. S. Marín-Huachaca, and W. Reátegui-Romero, "Describing Aerosol and Assessing Health Effects in Lima, Peru," International Journal of Environmental Science and Development, vol. 12, no. 12, pp. 355–362, 2021. DOI: https://doi.org/10.18178/ijesd.2021.12.12.1361

V. Kim and G. J. Criner, "Chronic Bronchitis and Chronic Obstructive Pulmonary Disease," American Journal of Respiratory and Critical Care Medicine, vol. 187, no. 3, pp. 228–237, Feb. 2013. DOI: https://doi.org/10.1164/rccm.201210-1843CI

Y. S. Lee, J. Y. Oh, K. H. Min, S. Y. Lee, K. H. Kang, and J. J. Shim, "The association between living below the relative poverty line and the prevalence of chronic obstructive pulmonary disease," Journal of Thoracic Disease, vol. 11, no. 2, pp. 427–437, Feb. 2019. DOI: https://doi.org/10.21037/jtd.2019.01.40

S. W. Ali, M. Asif, M. Rashid, S. Tanvir, S. Shams, and S. Abid, "Detection of Crackle and Wheeze in Lung Sound using Machine Learning Technique for Clinical Decision Support System," VAWKUM Transactions on Computer Sciences, vol. 11, no. 1, pp. 67–78, Mar. 2023, https://doi.org/10.21015/vtcs.v11i1.1384. DOI: https://doi.org/10.21015/vtcs.v11i1.1384

S. Ali, S. Tanweer, S. Khalid, and N. Rao, "Mel Frequency Cepstral Coefficient: A Review," in Proceedings of the 2nd International Conference on ICT for Digital, Smart, and Sustainable Development, ICIDSSD 2020, 27-28 February 2020, Jamia Hamdard, New Delhi, India, New Delhi, India, 2021. DOI: https://doi.org/10.4108/eai.27-2-2020.2303173

L. Fraiwan, O. Hassanin, M. Fraiwan, B. Khassawneh, A. M. Ibnian, and M. Alkhodari, "Automatic identification of respiratory diseases from stethoscopic lung sound signals using ensemble classifiers," Biocybernetics and Biomedical Engineering, vol. 41, no. 1, pp. 1–14, Jan. 2021. DOI: https://doi.org/10.1016/j.bbe.2020.11.003

G. Petmezas et al., "Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function," Sensors, vol. 22, no. 3, Feb. 2022, Art. no. 1232. DOI: https://doi.org/10.3390/s22031232

Z. Sun, "ICBHI 2017 challenge." Harvard Dataverse, 2023, Art. no 1978998275.

M. Fraiwan, L. Fraiwan, B. Khassawneh, and A. Ibnian, "A dataset of lung sounds recorded from the chest wall using an electronic stethoscope," Data in Brief, vol. 35, Apr. 2021, Art. no. 106913. DOI: https://doi.org/10.1016/j.dib.2021.106913

S. Reichert, R. Gass, C. Brandt, and E. Andrès, "Analysis of Respiratory Sounds: State of the Art," Clinical medicine. Circulatory, respiratory and pulmonary medicine, vol. 2, Jan. 2008, Art. no. CCRPM.S530. DOI: https://doi.org/10.4137/CCRPM.S530

T. H. Falk, W.Y. Chan, E. Sejdic, and T. Chau, "Spectro-Temporal Analysis of Auscultatory Sounds," in New Developments in Biomedical Engineering, D. Campolo, Ed. InTech, 2010. DOI: https://doi.org/10.5772/7624

M. E. Akbiyik, "Data Augmentation in Training CNNs: Injecting Noise to Images." arXiv, 2023.

G. Zhou, Y. Chen, and C. Chien, "On the analysis of data augmentation methods for spectral imaged based heart sound classification using convolutional neural networks," BMC Medical Informatics and Decision Making, vol. 22, no. 1, Aug. 2022, Art. no. 226. DOI: https://doi.org/10.1186/s12911-022-01942-2

E. Flórez, S. Cardona, and L. Jordi, "Selecting the temporal window in the short time Fourier transforms used in the vibration signal analysis to determine flats at train’s wheels," Revista Facultad de Ingeniería Universidad de Antioquia, no. 50, pp. 145–158, Mar. 2013. DOI: https://doi.org/10.17533/udea.redin.14940

S. Guha, A. Das, P. K. Singh, A. Ahmadian, N. Senu, and R. Sarkar, "Hybrid Feature Selection Method Based on Harmony Search and Naked Mole-Rat Algorithms for Spoken Language Identification From Audio Signals," IEEE Access, vol. 8, pp. 182868–182887, 2020. DOI: https://doi.org/10.1109/ACCESS.2020.3028121

B. Barai, D. Das, N. Das, S. Basu, and M. Nasipuri, "VQ/GMM-Based Speaker Identification with Emphasis on Language Dependency," in Advanced Computing and Systems for Security, vol. 883, R. Chaki, A. Cortesi, K. Saeed, and N. Chaki, Eds. Singapore: Springer Singapore, 2019, pp. 125–141. DOI: https://doi.org/10.1007/978-981-13-3702-4_8

R. Mushi and Y.-P. Huang, "Assessment of Mel-Filter Bank Features on Sound Classifications Using Deep Convolutional Neural Network," in 2021 International Conference on System Science and Engineering (ICSSE), Ho Chi Minh City, Vietnam, Aug. 2021, pp. 334–339. DOI: https://doi.org/10.1109/ICSSE52999.2021.9538433

M. Del Coco, P. Carcagnì, M. Leo, P. Spagnolo, P. L. Mazzeo, and C. Distante, "Multi-branch CNN for Multi-scale Age Estimation," in Image Analysis and Processing - ICIAP 2017, vol. 10485, S. Battiato, G. Gallo, R. Schettini, and F. Stanco, Eds. Cham: Springer International Publishing, 2017, pp. 234–244. DOI: https://doi.org/10.1007/978-3-319-68548-9_22

A. Ganjdanesh, S. Gao, and H. Huang, "EffConv: Efficient Learning of Kernel Sizes for Convolution Layers of CNNs," in Proceedings of the AAAI Conference on Artificial Intelligence, Washington, D.C., USA, Jun. 2023, vol. 37, pp. 7604–7612. DOI: https://doi.org/10.1609/aaai.v37i6.25923

C. Garbin, X. Zhu, and O. Marques, "Dropout vs. batch normalization: an empirical study of their impact to deep learning," Multimedia Tools and Applications, vol. 79, no. 19–20, pp. 12777–12815, May 2020. DOI: https://doi.org/10.1007/s11042-019-08453-9

W. M. Fatihia, A. Fariza, and T. Karlita, "CNN with Batch Normalization Adjustment for Offline Hand-written Signature Genuine Verification," JOIV : International Journal on Informatics Visualization, vol. 7, no. 1, Feb. 2023, Art. no. 200. DOI: https://doi.org/10.30630/joiv.7.1.1443

M. Islam, S. S. Arora, R. Chatterjee, P. Rindal, and M. Shirvanian, "Compact: Approximating Complex Activation Functions for Secure Computation," Proceedings on Privacy Enhancing Technologies, vol. 2024, no. 3, pp. 25–41, Jul. 2024. DOI: https://doi.org/10.56553/popets-2024-0065

S. Mehra, G. Raut, R. D. Purkayastha, S. K. Vishvakarma, and A. Biasizzo, "An Empirical Evaluation of Enhanced Performance Softmax Function in Deep Learning," IEEE Access, vol. 11, pp. 34912–34924, 2023. DOI: https://doi.org/10.1109/ACCESS.2023.3265327

C. Peel and T. K. Moon, "Algorithms for Optimization [Bookshelf]," IEEE Control Syst., vol. 40, no. 2, pp. 92–94, Apr. 2020. DOI: https://doi.org/10.1109/MCS.2019.2961589

C. Szegedy et al., "Going deeper with convolutions," in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, Jun. 2015, pp. 1–9. DOI: https://doi.org/10.1109/CVPR.2015.7298594

C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the Inception Architecture for Computer Vision," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, Jun. 2016, pp. 2818–2826. DOI: https://doi.org/10.1109/CVPR.2016.308

K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, Jun. 2016, pp. 770–778. DOI: https://doi.org/10.1109/CVPR.2016.90

Downloads

How to Cite

[1]
C. Medina-Ramos, N. Sare-Vargas, W. Reategui-Romero, K. Paucar-Cuba, D. Carbonel-Olazabal, and J. Betetta-Gomez, “Utilizing Deep Learning Algorithms for the Prompt Identification of Chronic Obstructive Pulmonary Disease”, Eng. Technol. Appl. Sci. Res., vol. 15, no. 4, pp. 24940–24949, Aug. 2025.

Metrics

Abstract Views: 382
PDF Downloads: 285

Metrics Information