Badr M. Abdullah

prof_pic_4.png

I am a post-doctoral researcher in the Language Science and Technology Department at Saarland University. I am a member of the collaborative research center SFB 1102: Information Density and Linguistic Encoding. I work with Dietrich Klakow and Bernd Möbius.

My research primarily focuses on representation learning for speech and language processing. Concretely, I am interested in (1) analyzing and understanding the encoding of linguistic structure in speech and language models, (2) developing linguistically-informed approaches for cross-lingual transfer learning, and (3) bridging between human language processing and computational language modeling.

In addition, I am always excited to chat and learn more about …

  • Non-native speech processing
  • Multilinguality, linguistic typology, and language variation
  • Information theory and language structure
  • Interpretability of neural networks

 

news

Aug 23, 2023 I presented our paper in Interspeech 2023 in Dublin! :shamrock: :ireland: :beers:
Jul 19, 2023 PhD thesis submitted :sparkles: :partying_face: :confetti_ball:
May 6, 2023 I presented our extended abstract in SIGTYP workshop during EACL 2023 conference in Dubrovnik! :croatia: :european_castle:
Dec 10, 2021 I gave an invited talk in the SIGTYP lecture series on Capturing Cross-linguistic Similarity with Speech Representation Learning :green_circle: :orange_circle:
Feb 1, 2021 With Elizabeth Salesky and Sabrina J. Mielke, I co-organized a shared task in SIGTYP 2021 on robust language identification in speech :speech_balloon: :speaking_head:

selected publications

  1. An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech
    Badr M. Abdullah, Mohammed Maqsood Shaik, Bernd Möbius, and 1 more author
    In Proceedings of INTERSPEECH, 2023
  2. Information-Theoretic Characterization of Vowel Harmony: A Cross-Linguistic Study on Word Lists
    Julius Steuer, Badr M. Abdullah, Johann-Mattis List, and 1 more author
    In Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP), May 2023
  3. How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings
    Badr M. Abdullah, Iuliia Zaitova, Tania Avgustinova, and 2 more authors
    In Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, Nov 2021
  4. SIGTYP 2021 Shared Task: Robust Spoken Language Identification
    Elizabeth Salesky, Badr M. Abdullah, Sabrina Mielke, and 6 more authors
    In Proceedings of the Third Workshop on Computational Typology and Multilingual NLP (SIGTYP), Jun 2021