Publications | MaiNLP research lab

Selected Publications

dec 2023

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation

Litschko, Robert and Müller-Eberstein, Max and van der Goot, Rob and Weber-Genzel, Leon and Plank, Barbara
jul 2023

ActiveAED: A Human in the Loop Improves Annotation Error Detection

Weber, Leon and Plank, Barbara

All Publications

nov 2025

MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs
Zhao, Raoyuan and Chen, Beiduo and Plank, Barbara and Hedderich, Michael A.

Findings of the Association for Computational Linguistics: EMNLP 2025
nov 2025

Make Every Letter Count: Building Dialect Variation Dictionaries from Monolingual Corpora
Litschko, Robert and Blaschke, Verena and Burkhardt, Diana and Plank, Barbara and Frassinelli, Diego

Findings of the Association for Computational Linguistics: EMNLP 2025
nov 2025

RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
Testoni, Alberto and Plank, Barbara and Fernández, Raquel

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
nov 2025

Disentangling Subjectivity and Uncertainty for Hate Speech Annotation and Modeling using Gaze
Alacam, Özge and Hoeken, Sanne and Säuberli, Andreas and Gröner, Hannes and Frassinelli, Diego and Zarrieß, Sina and Plank, Barbara

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
nov 2025

The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It
Bertolazzi, Leonardo and Mondorf, Philipp and Plank, Barbara and Bernardi, Raffaella

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
nov 2025

Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation
Chen, Beiduo and Liu, Yang Janet and Korhonen, Anna and Plank, Barbara

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
nov 2025

LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference
Hong, Pingjun and Chen, Beiduo and Peng, Siyao and de Marneffe, Marie-Catherine and Plank, Barbara

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
nov 2025

Crossing Domains without Labels: Distant Supervision for Term Extraction
Senger, Elena and Campbell, Yuri and van der Goot, Rob and Plank, Barbara

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
nov 2025

What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse
Zhou, Shijia and Peng, Siyao and Luebke, Simon M. and Haßler, Jörg and Haim, Mario and Mohammad, Saif M. and Plank, Barbara

Findings of the Association for Computational Linguistics: EMNLP 2025
nov 2025

BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods
Mondorf, Philipp and Wang, Mingyang and Gerstner, Sebastian and Hakimi, Ahmad Dawar and Liu, Yihong and Veloso, Leonor and Zhou, Shijia and Schuetze, Hinrich and Plank, Barbara

Proceedings of the 8th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP
nov 2025

Relevant for the Right Reasons? Investigating Lexical Biases in Zero-Shot and Instruction-Tuned Rerankers
Mao, Yuchen and Plank, Barbara and Litschko, Robert

Proceedings of the 5th Workshop on Multilingual Representation Learning (MRL 2025)
nov 2025

Revisiting Active Learning under (Human) Label Variation
Gruber, Cornelia and Alber, Helen and Bischl, Bernd and Kauermann, Göran and Plank, Barbara and Aßenmacher, Matthias

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
nov 2025

Aligning NLP Models with Target Population Perspectives using PAIR: Population-Aligned Instance Replication
Eckman, Stephanie and Ma, Bolei and Kern, Christoph and Chew, Rob and Plank, Barbara and Kreuter, Frauke

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
nov 2025

LeWiDi-2025 at NLPerspectives: The Third Edition of the Learning with Disagreements Shared Task
Leonardelli, Elisa and Casola, Silvia and Peng, Siyao and Rizzi, Giulia and Basile, Valerio and Fersini, Elisabetta and Frassinelli, Diego and Jang, Hyewon and Pavlovic, Maja and Plank, Barbara and Poesio, Massimo

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
nov 2025

BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Ruiz, Tomas and Peng, Siyao and Plank, Barbara and Schwemmer, Carsten

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
nov 2025

Tracing Multilingual Factual Knowledge Acquisition in Pretraining
Liu, Yihong and Wang, Mingyang and Kargaran, Amir Hossein and Körner, Felicia and Nie, Ercong and Plank, Barbara and Yvon, François and Schuetze, Hinrich

Findings of the Association for Computational Linguistics: EMNLP 2025
nov 2025

Reason to Rote: Rethinking Memorization in Reasoning
Du, Yupei and Mondorf, Philipp and Casola, Silvia and Yao, Yuekun and Litschko, Robert and Plank, Barbara

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
nov 2025

M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
Wu, ChengYan and Ma, Bolei and Liu, Yihong and Zhang, Zheyu and Deng, Ningyuan and Li, Yanshu and Chen, Baolan and Zhang, Yi and Xue, Yun and Plank, Barbara

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
nov 2025

DistaLs: a Comprehensive Collection of Language Distance Measures
van der Goot, Rob and Ploeger, Esther and Blaschke, Verena and Samardzic, Tanja

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
sep 2025

Human-centered LLMs for Inclusive Language Technology: The Need to Embrace Variation Holistically in NLP
Plank, Barbara

Proceedings of the 20th Conference on Computer Science and Intelligence Systems (FedCSIS)
aug 2025

A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation
Blaschke, Verena and Winkler, Miriam and Förster, Constantin and Wenger-Glemser, Gabriele and Plank, Barbara

Interspeech 2025
jul 2025

Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study
Ma, Bolei and Yoztyurk, Berk and Haensch, Anna-Carolina and Wang, Xinpeng and Herklotz, Markus and Kreuter, Frauke and Plank, Barbara and Aßenmacher, Matthias

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
jul 2025

Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges
Ma, Bolei and Li, Yuting and Zhou, Wei and Gong, Ziwei and Liu, Yang Janet and Jasinskaja, Katja and Friedrich, Annemarie and Hirschberg, Julia and Kreuter, Frauke and Plank, Barbara

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
jul 2025

A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI
Chen, Beiduo and Peng, Siyao and Korhonen, Anna and Plank, Barbara

Findings of the Association for Computational Linguistics: ACL 2025
jul 2025

Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models
Mondorf, Philipp and Wold, Sondre and Plank, Barbara

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
jul 2025

Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
Eichin, Florian and Liu, Yang Janet and Plank, Barbara and Hedderich, Michael A.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
jul 2025

What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns
Hedderich, Michael A. and Wang, Anyi and Zhao, Raoyuan and Eichin, Florian and Fischer, Jonas and Plank, Barbara

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
jul 2025

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Bavaresco, Anna and Bernardi, Raffaella and Bertolazzi, Leonardo and Elliott, Desmond and Fernández, Raquel and Gatt, Albert and Ghaleb, Esam and Giulianelli, Mario and Hanna, Michael and Koller, Alexander and Martins, Andre and Mondorf, Philipp and Neplenbroek, Vera and Pezzelle, Sandro and Plank, Barbara and Schlangen, David and Suglia, Alessandro and Surikuchi, Aditya K and Takmaz, Ece and Testoni, Alberto

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
jul 2025

Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
Säuberli, Andreas and Frassinelli, Diego and Plank, Barbara

Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
jul 2025

Analyzing the Effect of Linguistic Similarity on Cross-Lingual Transfer: Tasks and Experimental Setups Matter
Blaschke, Verena and Fedzechkina, Masha and Ter Hoeve, Maartje

Findings of the Association for Computational Linguistics: ACL 2025
may 2025

Methods and Resources in Germanic Variationist Linguistics
Nerbonne, John and Blaschke, Verena and Schütze, Hinrich and Plank, Barbara

Oxford Research Encyclopedia of Linguistics
apr 2025

Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum
Shim, Ryan Soh-Eun and Plank, Barbara

Findings of the Association for Computational Linguistics: NAACL 2025
jan 2025

Evaluating Pixel Language Models on Non-Standardized Languages
Muñoz-Ortiz, Alberto and Blaschke, Verena and Plank, Barbara

Proceedings of the 31st International Conference on Computational Linguistics
jan 2025

Cross-Dialect Information Retrieval: Information Access in Low-Resource and High-Variance Languages
Litschko, Robert and Kraus, Oliver and Blaschke, Verena and Plank, Barbara

Proceedings of the 31st International Conference on Computational Linguistics
jan 2025

KARRIEREWEGE: A large scale Career Path Prediction Dataset
Senger, Elena and Campbell, Yuri and van der Goot, Rob and Plank, Barbara

Proceedings of the 31st International Conference on Computational Linguistics: Industry Track
jan 2025

Neural Text Normalization for Luxembourgish Using Real-Life Variation Data
Lutgen, Anne-Marie and Plum, Alistair and Purschke, Christoph and Plank, Barbara

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
jan 2025

Neural Text Normalization for Luxembourgish Using Real-Life Variation Data
Lutgen, Anne-Marie and Plum, Alistair and Purschke, Christoph and Plank, Barbara

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
jan 2025

Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study
Krückl, Xaver Maria and Blaschke, Verena and Plank, Barbara

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
jan 2025

Add Noise, Tasks, or Layers? MaiNLP at the VarDial 2025 Shared Task on Norwegian Dialectal Slot and Intent Detection
Blaschke, Verena and Körner, Felicia and Plank, Barbara

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
dec 2024

Fine-grained Sexism Detection in Italian Newspapers
Manzi, Federica and Weber-Genzel, Leon and Plank, Barbara

Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
nov 2024

The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Ma, Bolei and Wang, Xinpeng and Hu, Tiancheng and Haensch, Anna-Carolina and Hedderich, Michael A. and Plank, Barbara and Kreuter, Frauke

Findings of the Association for Computational Linguistics: EMNLP 2024
nov 2024

“Seeing the Big through the Small”: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Chen, Beiduo and Wang, Xinpeng and Peng, Siyao and Litschko, Robert and Korhonen, Anna and Plank, Barbara

Findings of the Association for Computational Linguistics: EMNLP 2024
nov 2024

To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity
Sedova, Anastasiia and Litschko, Robert and Frassinelli, Diego and Roth, Benjamin and Plank, Barbara

Findings of the Association for Computational Linguistics: EMNLP 2024
nov 2024

Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models
Mondorf, Philipp and Plank, Barbara

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
oct 2024

Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models - A Survey
Mondorf, Philipp and Plank, Barbara

First Conference on Language Modeling
oct 2024

Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think
Wang, Xinpeng and Hu, Chengzhi and Ma, Bolei and Rottger, Paul and Plank, Barbara

First Conference on Language Modeling
aug 2024

Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification
Xu, Shanshan and T.y.s.s, Santosh and Ichim, Oana and Plank, Barbara and Grabmair, Matthias

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
aug 2024

What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects
Blaschke, Verena and Purschke, Christoph and Schuetze, Hinrich and Plank, Barbara

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
aug 2024

“My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
Wang, Xinpeng and Ma, Bolei and Hu, Chengzhi and Weber-Genzel, Leon and Röttger, Paul and Kreuter, Frauke and Hovy, Dirk and Plank, Barbara

Findings of the Association for Computational Linguistics ACL 2024
aug 2024

Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning
Mondorf, Philipp and Plank, Barbara

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
aug 2024

VariErr NLI: Separating Annotation Error from Human Label Variation
Weber-Genzel, Leon and Peng, Siyao and De Marneffe, Marie-Catherine and Plank, Barbara

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
aug 2024

CLIMATELI: Evaluating Entity Linking on Climate Change Data
Zhou, Shijia and Peng, Siyao and Plank, Barbara

Proceedings of the 1st Workshop on Natural Language Processing Meets Climate Change (ClimateNLP 2024)
jul 2024

Position: Insights from Survey Methodology can Improve Training Data
Eckman, Stephanie and Plank, Barbara and Kreuter, Frauke

Forty-first International Conference on Machine Learning
jun 2024

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark
Mayhew, Stephen and Blevins, Terra and Liu, Shuheng and Suppa, Marek and Gonen, Hila and Imperial, Joseph Marvin and Karlsson, Börje and Lin, Peiqin and Ljubešić, Nikola and Miranda, Lester James and Plank, Barbara and Riabi, Arij and Pinter, Yuval

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
jun 2024

MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness
Zhou, Shijia and Shan, Huangyan and Plank, Barbara and Litschko, Robert

Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
may 2024

MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Blaschke, Verena and Kovačić, Barbara and Peng, Siyao and Schütze, Hinrich and Plank, Barbara

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
may 2024

Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data
Peng, Siyao and Sun, Zihang and Shan, Huangyan and Kolm, Marie and Blaschke, Verena and Artemova, Ekaterina and Plank, Barbara

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
may 2024

Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants
Winkler, Miriam and Juozapaityte, Virginija and van der Goot, Rob and Plank, Barbara

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
mar 2024

EEVEE: An Easy Annotation Tool for Natural Language Processing
Sorensen, Axel and Peng, Siyao and Plank, Barbara and Van Der Goot, Rob

Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)
mar 2024

Donkii: Characterizing and Detecting Errors in Instruction-Tuning Datasets
Weber, Leon and Litschko, Robert and Artemova, Ekaterina and Plank, Barbara

Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)
mar 2024

Proceedings of the 1st Workshop on Uncertainty-Aware NLP (UncertaiNLP 2024)
Vázquez, Raúl and Celikkanat, Hande and Ulmer, Dennis and Tiedemann, Jörg and Swayamdipta, Swabha and Aziz, Wilker and Plank, Barbara and Baan, Joris and de Marneffe, Marie-Catherine
mar 2024

Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?
Baan, Joris and Fernández, Raquel and Plank, Barbara and Aziz, Wilker

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers)
mar 2024

NNOSE: Nearest Neighbor Occupational Skill Extraction
Zhang, Mike and Goot, Rob and Kan, Min-Yen and Plank, Barbara

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
mar 2024

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties
Artemova, Ekaterina and Blaschke, Verena and Plank, Barbara

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
mar 2024

Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings
Senger, Elena and Zhang, Mike and Goot, Rob and Plank, Barbara

Proceedings of the First Workshop on Natural Language Processing for Human Resources (NLP4HR 2024)
mar 2024

Entity Linking in the Job Market Domain
Zhang, Mike and Goot, Rob and Plank, Barbara

Findings of the Association for Computational Linguistics: EACL 2024
mar 2024

Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations
Peng, Siyao and Sun, Zihang and Loftus, Sebastian and Plank, Barbara

Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language
mar 2024

More Labels or Cases? Assessing Label Variation in Natural Language Inference
Gruber, Cornelia and Hechinger, Katharina and Assenmacher, Matthias and Kauermann, Göran and Plank, Barbara

Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language
mar 2024

MaiBaam Annotation Guidelines
Blaschke, Verena and Kovačić, Barbara and Peng, Siyao and Plank, Barbara

arXiv
dec 2023

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training
Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara and Titov, Ivan

Findings of the Association for Computational Linguistics: EMNLP 2023
dec 2023

What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
Giulianelli, Mario and Baan, Joris and Aziz, Wilker and Fernández, Raquel and Plank, Barbara

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
dec 2023

From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification
Xu, Shanshan and T.y.s.s, Santosh and Ichim, Oana and Risini, Isabella and Plank, Barbara and Grabmair, Matthias

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
dec 2023

ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation
Wang, Xinpeng and Plank, Barbara

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
dec 2023

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Litschko, Robert and Müller-Eberstein, Max and van der Goot, Rob and Weber-Genzel, Leon and Plank, Barbara

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
dec 2023

Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis
Teodorescu, Daniela and Mohammad, Saif

Findings of the Association for Computational Linguistics: EMNLP 2023
dec 2023

Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers
Teodorescu, Daniela and Cheng, Tiffany and Fyshe, Alona and Mohammad, Saif

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
oct 2023

LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation
Zhang, Shengqiang and Wicke, Philipp and Şenel, Lütfi Kerem and Figueredo, Luis and Naceri, Abdeldjallil and Haddadin, Sami and Plank, Barbara and Schuetze, Hinrich

arXiv
jul 2023

ActiveAED: A Human in the Loop Improves Annotation Error Detection
Weber, Leon and Plank, Barbara

Findings of the Association for Computational Linguistics: ACL 2023
jul 2023

Silver Syntax Pre-training for Cross-Domain Relation Extraction
Bassignana, Elisa and Ginter, Filip and Pyysalo, Sampo and van der Goot, Rob and Plank, Barbara

Findings of the Association for Computational Linguistics: ACL 2023
jul 2023

Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data
Litschko, Robert and Artemova, Ekaterina and Plank, Barbara

Findings of the Association for Computational Linguistics: ACL 2023
jul 2023

SemEval-2023 Task 11: Learning with Disagreements (LeWiDi)
Leonardelli, Elisa and Abercrombie, Gavin and Almanea, Dina and Basile, Valerio and Fornaciari, Tommaso and Plank, Barbara and Rieser, Verena and Uma, Alexandra and Poesio, Massimo

Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
jul 2023

How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives
Wang, Xinpeng and Weissweiler, Leonie and Schütze, Hinrich and Plank, Barbara

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
jul 2023

ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain
Zhang, Mike and van der Goot, Rob and Plank, Barbara

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
jul 2023

Uncertainty in Natural Language Generation: From Theory to Applications
Baan, Joris and Daheim, Nico and Ilia, Evgenia and Ulmer, Dennis and Li, Haau-Sing and Fernández, Raquel and Plank, Barbara and Sennrich, Rico and Zerva, Chrysoula and Aziz, Wilker

arXiv
may 2023

A Survey of Corpora for Germanic Low-Resource Languages and Dialects
Blaschke, Verena and Schuetze, Hinrich and Plank, Barbara

Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
may 2023

Low-resource Bilingual Dialect Lexicon Induction with Large Language Models
Artemova, Ekaterina and Plank, Barbara

Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
may 2023

Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction
Bassignana, Elisa and Ginter, Filip and Pyysalo, Sampo and van der Goot, Rob and Plank, Barbara

Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
may 2023

Findings of the VarDial Evaluation Campaign 2023
Aepli, Noëmi and Çöltekin, Çağrı and Van Der Goot, Rob and Jauhiainen, Tommi and Kazzaz, Mourhaf and Ljubešić, Nikola and North, Kai and Plank, Barbara and Scherrer, Yves and Zampieri, Marcos

Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)
may 2023

Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
Blaschke, Verena and Schütze, Hinrich and Plank, Barbara

Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)
dec 2022

CrossRE: A Cross-Domain Dataset for Relation Extraction
Bassignana, Elisa and Plank, Barbara

Findings of the Association for Computational Linguistics: EMNLP 2022
dec 2022

Experimental Standards for Deep Learning in Natural Language Processing Research
Ulmer, Dennis and Bassignana, Elisa and Müller-Eberstein, Max and Varab, Daniel and Zhang, Mike and van der Goot, Rob and Hardmeier, Christian and Plank, Barbara

Findings of the Association for Computational Linguistics: EMNLP 2022
dec 2022

On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
Samardžić, Tanja and Gutierrez-Vasques, Ximena and van der Goot, Rob and Müller-Eberstein, Max and Pelloni, Olga and Plank, Barbara

Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL)
dec 2022

The “Problem” of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation
Plank, Barbara

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
dec 2022

Spectral Probing
Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
dec 2022

Evidence > Intuition: Transferability Estimation for Encoder Selection
Bassignana, Elisa and Müller-Eberstein, Max and Zhang, Mike and Plank, Barbara

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
dec 2022

Stop Measuring Calibration When Humans Disagree
Baan, Joris and Aziz, Wilker and Plank, Barbara and Fernandez, Raquel

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
oct 2022

An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper
Kucher, Kostiantyn and Sultanum, Nicole and Daza, Angel and Simaki, Vasiliki and Skeppstedt, Maria and Plank, Barbara and Fekete, Jean-Daniel and Mahyar, Narges

2022 IEEE Evaluation and Beyond - Methodological Approaches for Visualization (BELIV)
sep 2022

Skill Extraction from Job Postings using Weak Supervision
Zhang, Mike and Jensen, Kristian Nørgaard and van der Goot, Rob and Plank, Barbara

Proceedings of the 2nd Workshop on Recommender Systems for Human Resources (RecSys-in-HR 2022)
jul 2022

SkillSpan: Hard and Soft Skill Extraction from English Job Postings
Zhang, Mike and Jensen, Kristian and Sonniks, Sif and Plank, Barbara

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
jul 2022

Sort by Structure: Language Model Ranking as Dependency Probing
Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
jul 2022

Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget
Plank, Barbara

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
jun 2022

Fine-tuning vs From Scratch: Do Vision & Language Models Have Similar Capabilities on Out-of-Distribution Visual Question Answering?
Jensen, Kristian Nørgaard and Plank, Barbara

Proceedings of the Thirteenth Language Resources and Evaluation Conference
jun 2022

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embeddings
van der Goot, Rob and Müller-Eberstein, Max and Plank, Barbara

Proceedings of the Thirteenth Language Resources and Evaluation Conference
jun 2022

Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning
Zhang, Mike and Jensen, Kristian Nørgaard and Plank, Barbara

Proceedings of the Thirteenth Language Resources and Evaluation Conference
may 2022

What Do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification
Bassignana, Elisa and Plank, Barbara

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
may 2022

Probing for Labeled Dependency Trees
Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Selected Publications

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation

ActiveAED: A Human in the Loop Improves Annotation Error Detection

All Publications