A Comparative Study of ResNet50 and YOLOv9 for Face Detection and Gender Classification

Authors

  • Aseil Nadhim Kadhim Faculty of Artificial Intelligence, Universiti Teknologi Malaysia, Kuala Lumpur, Malaysia
  • Syahid Anuar Faculty of Artificial Intelligence, Universiti Teknologi Malaysia, Kuala Lumpur, Malaysia
  • Saiful Adli Bin Ismail Faculty of Artificial Intelligence, Universiti Teknologi Malaysia, Kuala Lumpur, Malaysia
Volume: 15 | Issue: 5 | Pages: 27581-27586 | October 2025 | https://doi.org/10.48084/etasr.13079

Abstract

Gender classification based on facial features plays a central role in numerous intelligent applications such as surveillance cameras, interactive advertising, and human-computer interaction. This study aimed to compare two deep models, YOLOv9 and ResNet50, on face detection and gender classification, focusing on accuracy and inference speed. YOLOv9 performed well in terms of speed, with an inference time of 332 ms per image and a processing speed of 3 fps, and had a precision of 86.8%, 86.1% of recall, and 86.54% of F1-score. These performance characteristics make YOLOv9 suitable for real-time applications with high-speed response demands, even with moderately low classification accuracy. Conversely, ResNet50 was applied directly to gender classification after data preparation on images and had high classification accuracy, with a precision of 93.6%, 92% of recall, and 92.79% of F1-score. Its inference time was slower at 446.33 ms per image, with a 2.24 fps processing speed and a long training time of 9 hours and 18 minutes. These results show that YOLOv9 has high performance within a time scope of face detection, with reference to detecting enough faces within short timeframes with a limited number of computational resources, whereas ResNet50 has better classification accuracy. Depending on particular use case scenario demands, one corresponding model with a preferred feature can be selected: YOLOv9, if high-speed response is a concern during real-time applications, and ResNet50, if high classification accuracy is a concern.

Keywords:

object detection, face detection, gender classification, YOLO, ResNet50

Downloads

Download data is not yet available.

References

X. Ming, X. Nanfeng, Z. Mengjun, and Y. Qunyong, "Optimized Convolutional Neural Network-Based Object Recognition for Humanoid Robot," Journal of Robotics and Automation, vol. 4, no. 1, Feb. 2020.

K. Jiang et al., "An Attention Mechanism-Improved YOLOv7 Object Detection Algorithm for Hemp Duck Count Estimation," Agriculture, vol. 12, no. 10, Oct. 2022, Art. no. 1659.

Y. Xia, M. Nguyen, and W. Q. Yan, "A Real-Time Kiwifruit Detection Based on Improved YOLOv7," in Image and Vision Computing, vol. 13836, W. Q. Yan, M. Nguyen, and M. Stommel, Eds. Springer Nature Switzerland, 2023, pp. 48–61.

J. Redmon and A. Farhadi, "YOLO9000: Better, Faster, Stronger," in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, Jul. 2017, pp. 6517–6525.

U. Nepal and H. Eslamiat, "Comparing YOLOv3, YOLOv4 and YOLOv5 for Autonomous Landing Spot Detection in Faulty UAVs," Sensors, vol. 22, no. 2, Jan. 2022, Art. no. 464.

Y. Wang and L. Pan, "YOLOV5s-Face face detection algorithm," in 2022 China Automation Congress (CAC), Xiamen, China, Nov. 2022, pp. 1107–1112.

A. Dhillon and G. K. Verma, "Convolutional neural network: a review of models, methodologies and applications to object detection," Progress in Artificial Intelligence, vol. 9, no. 2, pp. 85–112, Jun. 2020.

H. Jiang and E. Learned-Miller, "Face Detection with the Faster R-CNN," in 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), Washington, DC, DC, USA, May 2017, pp. 650–657.

S. D. Meena, C. S. Siri, P. S. Lakshmi, N. S. Doondı, and J. Sheela, "Real time DNN-based Face Mask Detection System using MobileNetV2 and ResNet50," in 2023 International Conference on Inventive Computation Technologies (ICICT), Lalitpur, Nepal, Apr. 2023, pp. 1007–1015.

B. Mandal, A. Okeukwu, and Y. Theis, "Masked Face Recognition using ResNet-50." arXiv, Apr. 19, 2021.

R. Tolosana, R. Vera-Rodriguez, J. Fierrez, A. Morales, and J. Ortega-Garcia, "DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection." arXiv, Jun. 18, 2020.

Z. Yu, H. Huang, W. Chen, Y. Su, Y. Liu, and X. Wang, "YOLO-FaceV2: A Scale and Occlusion Aware Face Detector." arXiv, Aug. 04, 2022.

W. Chen, H. Huang, S. Peng, C. Zhou, and C. Zhang, "YOLO-face: a real-time face detector," The Visual Computer, vol. 37, no. 4, pp. 805–813, Apr. 2021.

T. W. Shen, D. Wang, K. W. K. Cheung, M. C. Chan, K. H. Chiu, and Y. K. Li, "A Real-Time Single-Shot Multi-Face Detection, Landmark Localization, and Gender Classification," in 2021 3rd International Conference on Image Processing and Machine Vision (IPMV), Hong Kong, China, May 2021, pp. 1–4.

D. K. Srivastava, E. Gupta, S. Shrivastav, and R. Sharma, "Detection of Age and Gender from Facial Images Using CNN," in Proceedings of 3rd International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications, vol. 540, V. K. Gunjan and J. M. Zurada, Eds. Springer Nature Singapore, 2023, pp. 481–491.

H. Guan, H. Li, R. Li, M. Qi, and V. Velmurugan, "Face Detection System Based on Deep Learning," in Proceedings of the 2nd International Conference on Cognitive Based Information Processing and Applications (CIPA 2022), 2023, pp. 525–531.

M. DhivyaShree, K. R. Sarumathi, and R. S. V. Durai, "An Ensemble Model for Face Mask Detection Using Faster RCNN with ResNet50," in Artificial Intelligence and Speech Technology, vol. 1546, A. Dev, S. S. Agrawal, and A. Sharma, Eds. Springer International Publishing, 2022, pp. 593–603.

B. Li and D. Lima, "Facial expression recognition via ResNet-50," International Journal of Cognitive Computing in Engineering, vol. 2, pp. 57–64, Jun. 2021.

Dr. S. Gothane, "A Practice for Object Detection Using YOLO Algorithm," International Journal of Scientific Research in Computer Science, Engineering and Information Technology, pp. 268–272, Apr. 2021.

D. Qi, W. Tan, Q. Yao, and J. Liu, "YOLO5Face: Why Reinventing a Face Detector." arXiv, Jan. 27, 2022.

S. Ennaama, H. Silkan, A. Bentajer, and A. Tahiri, "Enhanced Real-Time Object Detection using YOLOv7 and MobileNetv3," Engineering, Technology & Applied Science Research, vol. 15, no. 1, pp. 19181–19187, Feb. 2025.

D. Fajalia, R. Satkar, A. Jejurkar, and S. Dhanake, "Video Based Face Mask Detection and Face Recognition using CNN, YOLO and Google Facenet," International Research Journal of Modernization in Engineering Technology and Science, vol. 2, no. 6, pp. 259–266, May 2023.

Nandita and B. Jain, "A Comprehensive Review of Machine Learning Techniques for Voice-Based Gender Recognition," in 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), Delhi, India, Jul. 2023, pp. 1–7.

R. Bhatia and N. P. Singh, "Gender Recognition by Voice Using Machine Learning," in Advanced Network Technologies and Intelligent Computing, 2022, pp. 307–318.

K. Jain, M. Chawla, A. Gadhwal, R. Jain, and P. Nagrath, "Age and Gender Prediction Using Convolutional Neural Network," in Proceedings of First International Conference on Computing, Communications, and Cyber-Security (IC4S 2019), vol. 121, P. K. Singh, W. Pawłowski, S. Tanwar, N. Kumar, J. J. P. C. Rodrigues, and M. S. Obaidat, Eds. Springer Singapore, 2020, pp. 247–259.

M. J. Awan, A. Raza, A. Yasin, H. M. F. Shehzad, and I. Butt, "The Customized Convolutional Neural Network of Face Emotion Expression Classification," Annals of R.S.C.B., vol. 25, no. 6, pp. 5296–5304, 2021.

S. Mittal, K. Thakral, P. Majumdar, M. Vatsa, and R. Singh, "Are Face Detection Models Biased?," in 2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG), Waikoloa Beach, HI, USA, Jan. 2023, pp. 1–7.

R. Li and J. Yang, "Improved YOLOv2 Object Detection Model," in 2018 6th International Conference on Multimedia Computing and Systems (ICMCS), Rabat, Morocco, May 2018, pp. 1–6.

E. Suherman, B. Rahman, D. Hindarto, and H. Santoso, "Implementation of ResNet-50 on End-to-End Object Detection (DETR) on Objects," SinkrOn, vol. 8, no. 2, pp. 1085–1096, Apr. 2023.

T. N. V. S. Praveen, D. Sivathmika, G. Jahnavi, and J. Bolledu, "An In-depth Exploration of ResNet-50 for Complex Emotion Recognition to Unraveling Emotional States," in 2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT), Gharuan, India, May 2023, pp. 1–5.

S. Shivadekar, B. Kataria, S. Hundekari, K. Wanjale, V. P. Balpande, and R. Suryawanshi, "Deep Learning Based Image Classification of Lungs Radiography for Detecting COVID-19 using a Deep CNN and ResNet 50," International Journal of Intelligent Systems and Applications in Engineering, vol. 11, no. 1s, pp. 241–250, Jan. 2023.

G. Boesch, "YOLOv9: Advancements in Real-time Object Detection," viso.ai, Sep. 27, 2024. https://viso.ai/computer-vision/yolov9/.

C. Y. Wang, I. H. Yeh, and H. Y. M. Liao, "YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information." arXiv, Feb. 29, 2024.

S. Chaudhuri et al., "Infrared Thermography of Turbulence Patterns of Operational Wind Turbine Rotor Blades Supported With High‐Resolution Photography: KI‐VISIR Dataset," Wind Energy, vol. 28, no. 1, Jan. 2025, Art. no. e2958.

X. Xue et al., "Design and Analysis of a Deep Learning Ensemble Framework Model for the Detection of COVID-19 and Pneumonia Using Large-Scale CT Scan and X-ray Image Datasets," Bioengineering, vol. 10, no. 3, Mar. 2023, Art. no. 363.

E. Yildirim, "ResNet-based Gender Recognition on Hand Images," Engineering, Technology & Applied Science Research, vol. 14, no. 6, pp. 17969–17972, Dec. 2024.

Downloads

How to Cite

[1]
A. N. Kadhim, S. Anuar, and S. A. B. Ismail, “A Comparative Study of ResNet50 and YOLOv9 for Face Detection and Gender Classification”, Eng. Technol. Appl. Sci. Res., vol. 15, no. 5, pp. 27581–27586, Oct. 2025.

Metrics

Abstract Views: 5
PDF Downloads: 4

Metrics Information