A Congolese Swahili Task-Oriented Dialogue System for Addressing Humanitarian Crises

Authors

  • Ussen Kimanuka Institute for Basic Sciences, Technology and Innovation, Pan African University, Kenya
  • Ciira wa Maina Dedan Kimathi University of Technology, Kenya | Center for Data Science and Artificial Intelligence (DSAIL), Kenya
  • Osman Büyük Izmir Demokrasi University, Türkiye
  • Masika Kassay Godelive Benevolencija, Congo
Volume: 15 | Issue: 5 | Pages: 27387-27399 | October 2025 | https://doi.org/10.48084/etasr.12403

Abstract

As Artificial Intelligence (AI) advances, conversational agents are increasingly used across sectors, including humanitarian response. However, current systems and datasets mainly support high-resource languages and open-domain tasks, resulting in significant limitations in addressing low-resource, domain-specific needs. This study addresses this gap by focusing on a Congolese Swahili corpus collected from Short Message Service (SMS) messages and call-center humanitarian questions to develop an effective conversational agent for low-resource languages that supports communication during humanitarian crises. The goal of this research is to develop an effective Task-Oriented Dialogue System (ToDS) to assist displaced persons seeking humanitarian information in Congolese Swahili. We built a pipeline-based ToDS that converts natural language into SPARQL by utilizing a trained Named Entity Recognition (NER) model and a Dual Intent and Entity Transformer (DIET) classifier. This ToDS includes a humanitarian-specific ontology and dynamically queries a local triple store with data derived from the Humanitarian Data Exchange (HDX). The preliminary results indicate high accuracy in entity recognition and intent classification, which enables precise and timely information responses. The agent effectively provides context-relevant answers to humanitarian questions in crisis interactions. The findings demonstrate that applying Natural Language Understanding (NLU) methods in a low-resource, crisis-based context is viable and impactful. This ToDS offers a scalable solution for improving information accessibility in humanitarian emergencies and during forced internal displacements.

Keywords:

conversational Artificial Intelligence (AI), Task-Oriented Dialogue System (ToDS), natural language query formalization, SPARQL, resource data framework, ontology, low-resource languages, humanitarian crisis scenarios

Downloads

Download data is not yet available.

References

L. Qin et al., "End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions," in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023, pp. 5925–5941.

A. Arora, A. Shrivastava, M. Mohit, L. S.-M. Lecanda, and A. Aly, "Cross-lingual Transfer Learning for Intent Detection of Covid-19 Utterances." OpenReview, Aug. 12, 2020.

S. Schuster, S. Gupta, R. Shah, and M. Lewis, "Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog," in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, MN, USA, 2019, pp. 3795–3805.

J. Ni, T. Young, V. Pandelea, F. Xue, and E. Cambria, "Recent advances in deep learning based dialogue systems: a systematic survey," Artificial Intelligence Review, vol. 56, no. 4, pp. 3055–3155, Apr. 2023.

L. Qin, X. Xu, W. Che, and T. Liu, "AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slot Filling," in Findings of the Association for Computational Linguistics: EMNLP 2020, Online, 2020, pp. 1807–1816.

L. Jacqmin, L. M. Rojas-Barahona, and B. Favre, "‘Do you follow me?’: A Survey of Recent Approaches in Dialogue State Tracking." arXiv, Jul. 29, 2022.

W.-C. Kwan, H.-R. Wang, H.-M. Wang, and K.-F. Wong, "A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning," Machine Intelligence Research, vol. 20, no. 3, pp. 318–334, Jun. 2023.

Y. Li, K. Yao, L. Qin, W. Che, X. Li, and T. Liu, "Slot-consistent NLG for Task-oriented Dialogue Systems with Iterative Rectification Network," in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 2020, pp. 97–106.

T. Zhao and M. Eskenazi, "Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning," in Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Los Angeles, CA, USA, 2016, pp. 1–10.

B. Peng, C. Li, J. Li, S. Shayandeh, L. Liden, and J. Gao, "Soloist: Building Task Bots at Scale with Transfer Learning and Machine Teaching," Transactions of the Association for Computational Linguistics, vol. 9, pp. 807–824, Aug. 2021.

Y. Yang, Y. Li, and X. Quan, "UBAR: Towards Fully End-to-End Task-Oriented Dialog System with GPT-2," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 16, pp. 14230–14238, May 2021.

Y. Lee, "Improving End-to-End Task-Oriented Dialog System with A Simple Auxiliary Task," in Findings of the Association for Computational Linguistics: EMNLP 2021, Punta Cana, Dominican Republic, 2021, pp. 1296–1303.

H. Le, D. Sahoo, C. Liu, N. Chen, and S. C. H. Hoi, "UniConv: A Unified Conversational Neural Architecture for Multi-domain Task-oriented Dialogues," in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, Online, 2020, pp. 1860–1877.

L. Qin, X. Xu, W. Che, Y. Zhang, and T. Liu, "Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog," in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 2020, pp. 6344–6354.

D. M. Eberhard, G. F. Simons, and C. Fennig, Ethnologue: Languages of the World, 22nd ed. Dallas, TX, USA: SIL International, 2019.

M. A. Hedderich, L. Lange, H. Adel, J. Strötgen, and D. Klakow, "A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios," in Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Online, 2021, pp. 2545–2568.

D. Mbaye and M. Diallo, "Task-Oriented Dialog Systems for the Senegalese Wolof Language," in Proceedings of the 31st International Conference on Computational Linguistics, Abu Dhabi, UAE, 2025, pp. 4803–4812.

"Chatbots against COVID-19: Using chatbots to answer questions on COVID-19 in the user’s language." World Health Organization. https://www.who.int/news-room/feature-stories/detail/scicom-compilation-chatbot.

A. Stopper. "Lifting Up Women Through Land Ownership." Mozilla Foundation. https://www.mozillafoundation.org/en/blog/lifting-up-women-through-land-ownership/.

"Common Voice Kiswahili Awards." Mozilla Foundation. https://www.mozillafoundation.org/en/what-we-fund/programs/common-voice-kiswahili-awards/awards/.

A. Stopper. "Growing Skills, Confidence, and ‘Quality Potatoes' in Tanzania." Mozilla Foundation. https://www.mozillafoundation.org/en/blog/growing-skills-confidence-and-quality-potatoes-in-tanzania/.

A. Stopper. "‘Wezesha na Kabambe:’ Building Tech With Smallholder Farmers in Western Kenya." Mozilla Foundation. https://www.mozillafoundation.org/en/blog/wezesha-na-kabambe-building-tech-with-smallholder-farmers-in-western-kenya/.

A. Stopper. "Chatbots, Local Weather Reports, and a Boon for Kenya’s Smallholder Farmers." Mozilla Foundation. https://www.mozillafoundation.org/en/blog/chatbots-local-weather-reports-and-a-boon-for-kenyas-smallholder-farmers/.

"Back in the Early Days...." Woebot Health. https://woebothealth.com/back-in-the-early-days/.

Digital-Umuganda. "Mbaza-chatbot: Kinyarwanda-chatbot." GitHub. https://github.com/Digital-Umuganda/Mbaza-chatbot/tree/master/Kinyarwanda-chatbot.

"Nivi Platform." Nivi. https://www.nivi.io/platform.

"Annual Report 2024." Reach Digital Health. https://www.reachdigitalhealth.org/annual-report-2024.

"Crowdsourcing Solutions to Empower Communities." Ushahidi. https://www.ushahidi.com//.

N. N. Amir Sjarif, N. F. Mohd Azmi, S. Chuprat, H. M. Sarkan, Y. Yahya, and S. M. Sam, "SMS Spam Message Detection using Term Frequency-Inverse Document Frequency and Random Forest Algorithm," Procedia Computer Science, vol. 161, pp. 509–515, Jan. 2019.

D. A. Oyeyemi and A. K. Ojo, "SMS Spam Detection and Classification to Combat Abuse in Telephone Networks Using Natural Language Processing," Journal of Advances in Mathematics and Computer Science, vol. 38, no. 10, pp. 144–156, Oct. 2023.

O. Okolloh, "Ushahidi, or ‘testimony’: Web 2.0 tools for crowdsourcing crisis information," Participatory learning and action, vol. 59, pp. 65–70, 2009.

L. Palen, S. R. Hiltz, and S. B. Liu, "Online forums supporting grassroots participation in emergency preparedness and response," Communications of the ACM, vol. 50, no. 3, pp. 54–58, Mar. 2007.

P. Meier, Digital Humanitarians: How Big Data Is Changing the Face of Humanitarian Response, 1st ed. New York, NY, USA: Routledge, 2015.

J. FitzGerald et al., "MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages," in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Toronto, Canada, 2023, pp. 4277–4302.

T. Bunk, D. Varshneya, V. Vlasov, and A. Nichol, "DIET: Lightweight Language Understanding for Dialogue Systems." arXiv, May 11, 2020.

A. Öktem, E. DeLuca, R. Bashizi, E. Paquin, and G. Tang, "Congolese Swahili Machine Translation for Humanitarian Response." arXiv, Mar. 19, 2021.

L. Besacier, E. Barnard, A. Karpov, and T. Schultz, "Automatic speech recognition for under-resourced languages: A survey," Speech Communication, vol. 56, pp. 85–100, Jan. 2014.

U. Kimanuka, C. wa Maina, and O. Büyük, "Speech recognition datasets for low-resource Congolese languages," Data in Brief, vol. 52, Feb. 2024, Art. no. 109796.

T. Bocklisch, J. Faulkner, N. Pawlowski, and A. Nichol, "Rasa: Open Source Language Understanding and Dialogue Management." arXiv, Dec. 15, 2017.

F. Feng, Y. Yang, D. Cer, N. Arivazhagan, and W. Wang, "Language-agnostic BERT Sentence Embedding," in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Dublin, Ireland, 2022, pp. 878–891.

J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, MN, USA, 2019, pp. 4171–4186.

A. D. Vincentio and S. Hansun, "A Fine-Tuned BART Pre-trained Language Model for the Indonesian Question-Answering Task," Engineering, Technology & Applied Science Research, vol. 15, no. 2, pp. 21398–21403, Apr. 2025.

D. I. Adelani et al., "IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models," in Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Albuquerque, NM, USA, 2025, pp. 2732–2757.

P. Liu, W. Yuan, J. Fu, Z. Jiang, H. Hayashi, and G. Neubig, "Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing," ACM Computing Surveys, vol. 55, no. 9, Jan. 2023, Art. no. 195.

B. Xu et al., "ExpertPrompting: Instructing Large Language Models to be Distinguished Experts." arXiv, Mar. 05, 2025.

E. P. Tsili, "A Voice Assistant for cooking based on a natural language to SPARQL transformation pipeline using Rasa," M.S. thesis, Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece, 2023.

Downloads

How to Cite

[1]
U. Kimanuka, C. wa Maina, O. Büyük, and M. K. Godelive, “A Congolese Swahili Task-Oriented Dialogue System for Addressing Humanitarian Crises”, Eng. Technol. Appl. Sci. Res., vol. 15, no. 5, pp. 27387–27399, Oct. 2025.

Metrics

Abstract Views: 89
PDF Downloads: 41

Metrics Information