Publications
2025
[1] Amirhossein Abaskohi, Spandana Gella, Giuseppe Carenini, Issam H Laradji
FM2DS: Few-Shot Multimodal Multihop Data Synthesis with Knowledge Distillation for Question Answering
arXiv preprint arXiv:2412.07030
2024
[2] Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi, Tianyu Zhang, Aarash Feizi, Abhay Puri, Akshay Kalkunte, Francois Savard, Amirhossein Abaskohi (Second Author With Equal Contribution), Ahmed Masry, Perampalli Shravan Nayak, Mahsa Massoud, Rabiul Awal, Pierre-André Noël, Mats L. Richter, Saverio Vadacchino, Shubham Agarwal, Sanket Biswas, Ying Zhang, Sathwik Tejaswi Madhusudhan, João Monteiro, Krishnamurthy (Dj) Dvijotham, Torsten Scholak, Nicolas Chapados, Sean Hughes, Tamer Özsu, Aishwarya Agrawal, Marco Pedersoli, Christopher Pal, Perouz Taslakian, David Vazquez, Issam H. Laradji, Spandana Gella, Sai Rajeswar Mudumba
BigDocs: A Permissively-Licensed Dataset for Training Vision-Language Models on Document and Code Tasks
RBFM@NeurIPS 2024 & ICLR 2025
[Paper] / [Website] / [Huggingface]
[3] Gaurav Sahu, Abhay Puri, Juan Rodriguez, Amirhossein Abaskohi, Mohammad Chegini, Alexandre Drouin, Perouz Taslakian, Valentina Zantedeschi, Alexandre Lacoste, David Vazquez, Nicolas Chapados, Christopher Pal, Sai Rajeswar Mudumba, Issam Hadj Laradji
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
arXiv preprint arXiv:2407.06423
[Paper] / [Code]
[4] Amirhossein Abaskohi, Sara Baruni, Mostafa Masoudi, Nesa Abbasi, Mohammad Hadi Babalou, Ali Edalat, Sepehr Kamahi, Samin Mahdizadeh Sani, Nikoo Naghavian, Danial Namazifard, Pouya Sadeghi, and Yadollah Yaghoobzadeh
Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT
LREC-COLING 2024
[Paper] / [Code]
[5] Amirhossein Abaskohi*, Amirhossein Dabiriaghdam*, Lele Wang, Giuseppe Carenini
BCAmirs at SemEval-2024 Task 4: Beyond Words: A Multimodal and Multilingual Exploration of Persuasion in Memes
Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) @ NAACL 2024.
[Paper] / [Code] / [Huggingface]
[6] Pouya Sadeghi*, Amirhossein Abaskohi*, Yadollah Yaghoobzadeh
uTeBC-NLP at SemEval-2024 Task 9: Can LLMs be Lateral Thinkers?
Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) @ NAACL 2024.
[Paper] / [Code]
2023
[7] Amirhossein Abaskohi, Sascha Rothe, and Yadollah Yaghoobzadeh
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning
Proceeding of Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
[Paper] / [Code]
[8] Amirhossein Abaskohi*, Alireza Salemi*, Sara Tavakoli, Yadollah Yaghoobzadeh, Azadeh Shakery
PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation
Proceedings of the The Sixth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2023) @ EACL 2023.
[Paper] / [Code]
[9] Taha ShabaniMirzaei, Houmaan Chamani, Amirhossein Abaskohi, Zhivar Sourati Hassan Zadeh, and Behnam Bahrak
A Large-Scale Analysis of Persian Tweets Regarding Covid-19 Vaccination
Springer’s Social Network Analysis and Mining Journal.
[Paper]
2022
[10] Amirhossein Abaskohi, Sabri Nazanin, and Bahrak Behnam
Persian Emotion Detection using ParsBERT and Imbalanced Data Handling Approaches
arXiv preprint arXiv:2211.08029.
[Paper] / [Code]
[11] Amirhossein Abaskohi, Arash Rasouli*, Tanin Zeraati*, and Behnam Bahrak
UTNLP at SemEval-2022 Task 6: A Comparative Analysis of Sarcasm Detection Using Generative-based and Mutation-based Data Augmentation
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022) @ NAACL 2022.
[Paper] / [Code]
[12] Amirhossein Abaskohi, Fatemeh Mortazavi, and Hadi Moradi
Automatic Speech Recognition for Speech Assessment of Persian Preschool Children
arXiv preprint arXiv:2203.12886
[Paper] / [Code]