Staff

The many faces of GESIS

Vita

Abdelhalim Hafedh Dahou recently joined the FAIR Data and Human Information Interaction Team at GESIS as a PhD student. His research covers information extraction, deep learning, natural language processing, resource construction, and social media data mining.

Abdelhalim completed his Bachelor's degree in computer science at the University of Ahmed Draia, Adrar, Algeria, in 2017. During his studies, he implemented an algorithm for optimizing sensor networks. In 2019, he received his Master's degree in computer science and Intelligent Systems from the same university. For his master's thesis, he developed a system and a corpus for resolving anaphora in Arabic, covering the pronominal and verbal types using a computational technique.

After completing his master's degree, Abdelhalim pursued a second Master's degree, this time specializing in NLP at the IDMC at the University of Lorraine, France. He also completed an internship at the ATILF laboratory, which specializes in linguistics and language studies. During his internship, he worked on the identification of discourse markers in French spoken corpora using deep learning approaches and pre-trained models such as CamemBERT and FlauBERT.

Currently, Abdelhalim volunteers with the NLP lab at the University of Ahmed Draia, where he focuses on Arabic natural language processing. He works on several applications, including fake news detection, sentiment analysis, text normalization, named entity recognition, and resource construction.




Publications

Journal article

Dahou, Abdelghani, Mohamed Abd Elaziz, Mohamed Haibaoui, Abdelhalim Hafedh Dahou, Mohammed A. A. Al-qaness, Mohamed Ghetas, Ahmed Ewess, and Zhonglong Zheng. 2024. "Linguistic Feature Fusion for Arabic Fake News Detection and Named Entity Recognition Using Reinforcement Learning and Swarm Optimization." Neurocomputing 598 (14 September 2024): 128078. doi: https://doi.org/10.1016/j.neucom.2024.128078.

Dahou, Abdelhalim Hafedh, and Mohamed Amine Cheragui. 2023. "DzNER: A large Algerian named entity recognition dataset." Natural Language Processing Journal 3 (June 2023): 100005. doi: https://doi.org/10.1016/j.nlp.2023.100005.

Abdedaiem, Amin, Abdelhalim Hafedh Dahou, and Mohamed Amine Cheragui. 2023. "Fake News Detection in Low Resource Languages using SetFit Framework." Inteligencia Artificial 26 (72): 178-201. doi: https://doi.org/10.4114/intartif.vol26iss72pp178-201.

Ben Aichaoui, Shaimaa, Nawel Hiri, Abdelhalim Hafedh Dahou, and Mohamed Amine Cheragui. 2022. "Automatic Building of a Large Arabic Spelling Error Corpus." SN Computer Science 2 (4): 108. doi: https://doi.org/10.1007/s42979-022-01499-x.

Chapter in an edited book

Dahou, Abdelhalim Hafedh, and Brigitte Mathiak. 2024 (Forthcoming). "Automatic Categorization of Software Repository Domains with Minimal Resources."

Dahou, Abdelhalim Hafedh, Mohamed Amine Cheragui, Amin Abdedaiem, and Brigitte Mathiak. 2024. "Enhancing Model Performance through Translation-based Data Augmentation in the context of Fake News Detection." In ACLing 2024: 6th International Conference on AI in Computational Linguistics, edited by Khaled Shaalan and Samhaa El-Beltagy, Procedia Computer Science 244, 342-352. Elsevier. doi: https://doi.org/10.1016/j.procs.2024.10.208.

Abdedaiem, Amin, Abdelhalim Hafedh Dahou, Mohamed Amine Cheragui, and Brigitte Mathiak. 2024. "FASSILA: A Corpus for Algerian Dialect Fake News Detection and Sentiment Analysis." In ACLing 2024: 6th International Conference on AI in Computational Linguistics, edited by Khaled Shaalan and Samhaa El-Beltagy, Procedia Computer Science 244, 397-407. Elsevier. doi: https://doi.org/10.1016/j.procs.2024.10.214.

Cheragui, Mohamed Amine, Abdelhalim Hafedh Dahou, and Amin Abdedaiem. 2024 (Forthcoming). "Full Arabic diacritics restoration based on Statistical machine translation approach." In 6th Mediterranean Conference on Pattern Recognition and Artificial Intelligence.

Cheragui, Mohamed Amine, Abdelhalim Hafedh Dahou, and Amin Abdedaiem. 2023. "Exploring BERT Models for Part-of-Speech Tagging in the Algerian Dialect: A Comprehensive Study." In Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), 140-150. https://aclanthology.org/2023.icnlsp-1.14.pdf.

Dahou, Abdelhalim Hafedh. 2023. "Identifying Discourse Markers in French Spoken Corpora: Using Machine Learning and Rule-Based Approaches." In Intelligent Systems and Pattern Recognition: Third International Conference, ISPR 2023, Hammamet, Tunisia, May 11–13, 2023, Revised Selected Papers, Part II, edited by Akram Bennour, Ahmed Bouridane, and Lotfi Chaari, Communications in Computer and Information Science 1941, 288–299. Cham: Springer. doi: https://doi.org/10.1007/978-3-031-46338-9_22.

Dahou, Abdelhalim Hafedh, and Mohamed Amine Cheragui. 2023. "Named Entity Recognition for Algerian Arabic Dialect in Social Media." In 12th International Conference on Information Systems and Advanced Technologies “ICISAT 2022”: Intelligent Information, Data Science and Decision Support System, edited by Mohamed Ridda Laouar, Valentina Emilia Balas, Brahim Lejdel, Sean Eom, and Mohamed Amine Boudia, 135-145. Cham: Springer. doi: https://doi.org/10.1007/978-3-031-25344-7_13.

Dahou, Abdelhalim Hafedh, Mohamed Amine Cheragui, and Ahmed Abdelali. 2023. "Performance Analysis of Arabic Pre-Trained Models on Named Entity Recognition Task." In Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, edited by Ruslan Mitkov and Galia Angelova, 458–467. Shoumen: INCOMA Ltd. https://aclanthology.org/2023.ranlp-1.51.pdf.

Dahou, Abdelhalim Hafedh, and Brigitte Mathiak. 2023. "Subject Classification of Software Repository." In Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - KDIR, 1, 30-38. SciTePress. doi: https://doi.org/10.5220/0012159600003598.

Dahou, Abdelhalim Hafedh, Mohamed Abdelmoazz, and Mohamed Amine Cheragui. 2022. "Arabic Anaphora Resolution System Using New Features: Pronominal and Verbal Cases." In Analysis and Application of Natural Language and Speech Processing, 101-121. Springer International Publishing. doi: https://doi.org/10.1007/978-3-031-11035-1_5.

Dahou, Abdelhalim Hafedh, and Mohamed Amine Cheragui. 2022. "Impact of Normalization and Data Augmentation in NER for Algerian Arabic Dialect." In Modelling and Implementation of Complex Systems: Proceedings of the 7th International Symposium, MISC 2022, Mostaganem, Algeria, October 30–31, 2022, 249-262. Springer International Publishing. doi: https://doi.org/10.1007/978-3-031-18516-8_18.

Dahou, Abdelhalim Hafedh. 2021. "A3C: Arabic Anaphora Annotated Corpus." In Proceedings of the 4th International Conference on Natural Language and Speech Processing (ICNLSP 2021), 147–155. Association for Computational Linguistics.

Working and discussion paper

Sihler, Florian, Lukas Pietzschmann, Raphael Straub, Matthias Tichy, Andor Diera, and Abdelhalim Hafedh Dahou. 2024. On the Anatomy of Real-World R Code for Static Analysis. arXiv preprint. https://arxiv.org/abs/2401.16228.

Diera, Andor, Abdelhalim Hafedh Dahou, Lukas Galke, Fabian Karl, Florian Sihler, and Ansgar Scherp. 2023. GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding. Proceedings of the 1st GenBench Workshop on (Benchmarking) Generalisation in NLP. Association for Computational Linguistics (ACL). doi: https://doi.org/10.18653/v1/2023.genbench-1.2.