Shammur Absar Chowdhury

Conversational AI | Representation Learning | Spoken Language Processing | Natural Language Processing

Shammur_Chowdury_1.jpg

⌜⌟ Research Scientist, Qatar Computing Research Institute (QCRI)

🗨️ Contact: shchowdhury@hbku.edu.qa

✍ Research Interest and Expertise:

  • Speech Processing: Representation Learning, Self-Supervised Models, Atypical Human Interaction, Speech Discourse & Turn-taking, and Spoken Language Understanding.

  • Natural Language Processing: (Large) Language Models and their task understanding capabilities, Benchmarking.

  • Explainable and Inclusive Speech Technology:
    Dialectal and Accented Speech Recognition, Pronunciation Assessment, Children Speech Recognition, Multilingual Models.



🧩 Current Projects:

🌐 Current Platforms:


Download CV Download CV


👁️‍🗨️ Short Bio:

Dr. Chowdhury specializes in designing Conversational AI models, primarily addressing complex challenges such as multispeaker interactions, nuanced multilingual and dialect variations, and code-switching, among various other intricate conversational dynamics. She is currently the leading the speech technology in Fanar - Arabic AI Large Language Model and also LPI on the QVoice project, which empowers speakers—both native and non-native of all ages alike—to learn spoken Arabic. Dr. Chowdhury has received numerous awards and grants, including the NVIDIA Academic Hardware Grant for her research in simulating human language learning capabilities using DNN-based language models, a study that was also conducted as a part of the TRAILs project, funded by PRIN MIUR. As a key contributor to the EU-funded projects SENSEI and PortDial, Dr. Chowdhury developed conversational models adept at understanding human conversation, facilitating automatic summarization and mental health screening. She authored over 60 peer-reviewed publications in top-tier conferences and journals and played an active role in the research community by organizing shared tasks, challenges, and workshops, as well as serving on the committees of top-tier conferences and special interest groups. She co-founded the Bangla Language Processing Community and MyVoice, a crowdsourced platform, designed to bridge the gaps between standard and dialectal Arabic resources.

news

Aug 02, 2024

CFP for Workshop on Detecting AI Generated Content at COLING 2025 is out!

Jul 24, 2024 Check out our INTERSPEECH 2024 paper: Children’s Speech Recognition through Discrete Token Enhancement
Jul 07, 2024 Check out our ACL 2024 paper: Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic
Mar 24, 2024 Gave a tutorial at EACL 2024 on Multimodal LLMs. Tutorial titled: LMs for Low Resource Languages in Multilingual, Multimodal and Dialectal Settings
Mar 04, 2024 Gave an invited talk at SIG SLATE webinar titled: Towards L1-aware Multilingual Mispronunciation Detection Modeling