publications

Publications and preprints in reverse chronological order. For an always up-to-date list check my Google Scholar.

2024

  1. Preprint
    LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders BehnamGhader, Parishad, Adlakha, Vaibhav, Mosbach, Marius, Bahdanau, Dzmitry, Chapados, Nicolas, and Reddy, Siva arXiv 2024 [Abs] [HTML] [Code]
  2. Insights 2024
    What explains the success of cross-modal fine-tuning with ORCA? García-de-Herreros, Paloma, Gautam, Vagrant, Slusallek, Philipp, Klakow, Dietrich, and Mosbach, Marius Workshop on Insights from Negative Results in NLP @ ACL 2024 [Abs] [HTML]
  3. Preprint
    The Hidden Space of Transformer Language Adapters Jesujoba O., Alabi, Mosbach, Marius, Eyal, Matan, Klakow, Dietrich, and Geva, Mor arXiv 2024 [Abs] [HTML]
  4. Preprint
    The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis Zhang, Miaoran, Gautam, Vagrant, Wang, Mingyang, Alabi, Jesujoba O., Shen, Xiaoyu, Klakow, Dietrich, and Mosbach, Marius arXiv 2024 [Abs] [HTML]

2023

  1. BabyLM 2023 Best paper
    Large GPT-like Models are Bad Babies: A Closer Look at the Relationship between Linguistic Competence and Psycholinguistic Measures Steuer, Julius, Mosbach, Marius, and Klakow, Dietrich In 2023 [Abs] [HTML]
  2. ACL 2023
    Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation Mosbach, Marius, Pimentel, Tiago, Ravfogel, Shauli, Klakow, Dietrich, and Elazar, Yanai In Findings of the Association for Computational Linguistics: ACL 2023 2023 [Abs] [HTML]
  3. ACL 2023 Best paper
    Weaker Than You Think: A Critical Look at Weakly Supervised Learning Zhu, Dawei, Shen, Xiaoyu, Mosbach, Marius, Stephan, Andreas, and Klakow, Dietrich In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2023 [Abs] [HTML]

2022

  1. COLING 2022 Best paper
    Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning Alabi, Jesujoba O., Adelani, David Ifeoluwa, Mosbach, Marius, and Klakow, Dietrich In Proceedings of the 29th International Conference on Computational Linguistics 2022 [Abs] [HTML]
  2. NAACL 2022
    MCSE: Multimodal Contrastive Learning of Sentence Embeddings Zhang, Miaoran, Mosbach, Marius, Adelani, David Ifeoluwa, Hedderich, Michael A., and Klakow, Dietrich NAACL 2022 2022 [Abs] [HTML]
  3. SpaNLP 2022
    Knowledge Base Index Compression via Dimensionality and Precision Reduction Zouhar, Vilém, Mosbach, Marius, and Klakow, Dietrich SpaNLP workshop @ ACL 2022 2022 [Abs] [HTML]

2021

  1. AKBC 2021
    Artefact Retrieval: Overview of NLP Models with Knowledge Base Access Zouhar, Vilém, Mosbach, Marius, Biswas, Debanjali, and Klakow, Dietrich CSKB workshop @ AKBC 2021 [Abs] [HTML] [Code]
  2. Interspeech 2021
    Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study Abdullah, Badr M., Mosbach, Marius, Zaitova, Iuliia, Möbius, Bernd, and Klakow, Dietrich Interspeech 2021 [Abs] [HTML] [Code]
  3. ICLR 2021
    On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines Mosbach, Marius, Andriushchenko, Maksym, and Klakow, Dietrich ICLR 2021 [Abs] [HTML] [Code]

2020

  1. COLING 2020
    A Closer Look at Linguistic Knowledge in Masked Language Models: The Case of Relative Clauses in American English Mosbach, Marius, Degaetano-Ortlieb, Stefania, Krielke, Marie-Pauline, Abdullah, Badr M., and Klakow, Dietrich COLING 2020 [Abs] [HTML] [Code]
  2. EMNLP 2020
    On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers Mosbach, Marius, Khokhlova, Anna, Hedderich, Michael A., and Klakow, Dietrich Findings of EMNLP and BlackboxNLP 2020 [Abs] [HTML] [Code]
  3. ICML 2020
    Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation Mogadala, Aditya, Mosbach, Marius, and Klakow, Dietrich ICML Workshop on Bridge Between Perception and Reasoning: Graph Neural Networks & Beyond 2020 [Abs] [HTML]

2019

  1. NoDaLiDa 2019
    Some steps towards the generation of diachronic WordNets Bizzoni, Yuri, Mosbach, Marius, Klakow, Dietrich, and Degaetano-Ortlieb, Stefania In Proceedings of the 22nd Nordic Conference on Computational Linguistics 2019 [Abs] [HTML]
  2. RANLP 2019
    incom.py - A Toolbox for Calculating Linguistic Distances and Asymmetries between Related Languages Mosbach, Marius, Stenger, Irina, Avgustinova, Tania, and Klakow, Dietrich In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019) 2019 [Abs] [HTML] [Code]

2018

  1. NeurIPS 2018
    Logit pairing methods can fool gradient-based attacks Mosbach, Marius, Andriushchenko, Maksym, Trost, Thomas, Hein, Matthias, and Klakow, Dietrich NeurIPS Workshop on Security in Machine Learning 2018 [Abs] [HTML] [Code]