Publications | MaiNLP research lab

Selected Publications

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Robert Litschko, Max Müller‑Eberstein, Rob van der Goot, Leon Weber‑Genzel & Barbara Plank

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
ActiveAED: A Human in the Loop Improves Annotation Error Detection
Leon Weber & Barbara Plank

Findings of the Association for Computational Linguistics: ACL 2023

All Publications

MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs
Raoyuan Zhao, Beiduo Chen, Barbara Plank & Michael A. Hedderich

Findings of the Association for Computational Linguistics: EMNLP 2025
Make Every Letter Count: Building Dialect Variation Dictionaries from Monolingual Corpora
Robert Litschko, Verena Blaschke, Diana Burkhardt, Barbara Plank & Diego Frassinelli

Findings of the Association for Computational Linguistics: EMNLP 2025
RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
Alberto Testoni, Barbara Plank & Raquel Fernández

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Disentangling Subjectivity and Uncertainty for Hate Speech Annotation and Modeling using Gaze
Özge Alacam, Sanne Hoeken, Andreas Säuberli, Hannes Gröner, Diego Frassinelli, Sina Zarrieß & Barbara Plank

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It
Leonardo Bertolazzi, Philipp Mondorf, Barbara Plank & Raffaella Bernardi

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation
Beiduo Chen, Yang Janet Liu, Anna Korhonen & Barbara Plank

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
LiTEx: A Linguistic Taxonomy of Explanations for Understanding Within-Label Variation in Natural Language Inference
Pingjun Hong, Beiduo Chen, Siyao Peng, Marie‑Catherine de Marneffe & Barbara Plank

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Crossing Domains without Labels: Distant Supervision for Term Extraction
Elena Senger, Yuri Campbell, Rob van der Goot & Barbara Plank

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
What Media Frames Reveal About Stance: A Dataset and Study about Memes in Climate Change Discourse
Shijia Zhou, Siyao Peng, Simon M. Luebke, Jörg Haßler, Mario Haim, Saif M. Mohammad & Barbara Plank

Findings of the Association for Computational Linguistics: EMNLP 2025
BlackboxNLP-2025 MIB Shared Task: Exploring Ensemble Strategies for Circuit Localization Methods
Philipp Mondorf, Mingyang Wang, Sebastian Gerstner, Ahmad Dawar Hakimi, Yihong Liu, Leonor Veloso, Shijia Zhou, Hinrich Schuetze & Barbara Plank

Proceedings of the 8th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP
Relevant for the Right Reasons? Investigating Lexical Biases in Zero-Shot and Instruction-Tuned Rerankers
Yuchen Mao, Barbara Plank & Robert Litschko

Proceedings of the 5th Workshop on Multilingual Representation Learning (MRL 2025)
Revisiting Active Learning under (Human) Label Variation
Cornelia Gruber, Helen Alber, Bernd Bischl, Göran Kauermann, Barbara Plank & Matthias Aßenmacher

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
Aligning NLP Models with Target Population Perspectives using PAIR: Population-Aligned Instance Replication
Stephanie Eckman, Bolei Ma, Christoph Kern, Rob Chew, Barbara Plank & Frauke Kreuter

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
LeWiDi-2025 at NLPerspectives: The Third Edition of the Learning with Disagreements Shared Task
Elisa Leonardelli, Silvia Casola, Siyao Peng, Giulia Rizzi, Valerio Basile, Elisabetta Fersini, Diego Frassinelli, Hyewon Jang, Maja Pavlovic, Barbara Plank & Massimo Poesio

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Tomas Ruiz, Siyao Peng, Barbara Plank & Carsten Schwemmer

Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP
Tracing Multilingual Factual Knowledge Acquisition in Pretraining
Yihong Liu, Mingyang Wang, Amir Hossein Kargaran, Felicia Körner, Ercong Nie, Barbara Plank, François Yvon & Hinrich Schuetze

Findings of the Association for Computational Linguistics: EMNLP 2025
Reason to Rote: Rethinking Memorization in Reasoning
Yupei Du, Philipp Mondorf, Silvia Casola, Yuekun Yao, Robert Litschko & Barbara Plank

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
ChengYan Wu, Bolei Ma, Yihong Liu, Zheyu Zhang, Ningyuan Deng, Yanshu Li, Baolan Chen, Yi Zhang, Yun Xue & Barbara Plank

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
DistaLs: a Comprehensive Collection of Language Distance Measures
Rob van der Goot, Esther Ploeger, Verena Blaschke & Tanja Samardzic

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
Human-centered LLMs for Inclusive Language Technology: The Need to Embrace Variation Holistically in NLP
Barbara Plank

Proceedings of the 20th Conference on Computer Science and Intelligence Systems (FedCSIS)
A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation
Verena Blaschke, Miriam Winkler, Constantin Förster, Gabriele Wenger‑Glemser & Barbara Plank

Interspeech 2025
Algorithmic Fidelity of Large Language Models in Generating Synthetic German Public Opinions: A Case Study
Bolei Ma, Berk Yoztyurk, Anna‑Carolina Haensch, Xinpeng Wang, Markus Herklotz, Frauke Kreuter, Barbara Plank & Matthias Aßenmacher

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges
Bolei Ma, Yuting Li, Wei Zhou, Ziwei Gong, Yang Janet Liu, Katja Jasinskaja, Annemarie Friedrich, Julia Hirschberg, Frauke Kreuter & Barbara Plank

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
A Rose by Any Other Name: LLM-Generated Explanations Are Good Proxies for Human Explanations to Collect Label Distributions on NLI
Beiduo Chen, Siyao Peng, Anna Korhonen & Barbara Plank

Findings of the Association for Computational Linguistics: ACL 2025
Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models
Philipp Mondorf, Sondre Wold & Barbara Plank

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set
Florian Eichin, Yang Janet Liu, Barbara Plank & Michael A. Hedderich

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns
Michael A. Hedderich, Anyi Wang, Raoyuan Zhao, Florian Eichin, Jonas Fischer & Barbara Plank

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Anna Bavaresco, Raffaella Bernardi, Leonardo Bertolazzi, Desmond Elliott, Raquel Fernández, Albert Gatt, Esam Ghaleb, Mario Giulianelli, Michael Hanna, Alexander Koller, Andre Martins, Philipp Mondorf, Vera Neplenbroek, Sandro Pezzelle, Barbara Plank, David Schlangen, Alessandro Suglia, Aditya K Surikuchi, Ece Takmaz & Alberto Testoni

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
Andreas Säuberli, Diego Frassinelli & Barbara Plank

Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
Analyzing the Effect of Linguistic Similarity on Cross-Lingual Transfer: Tasks and Experimental Setups Matter
Verena Blaschke, Masha Fedzechkina & Maartje ter Hoeve

Findings of the Association for Computational Linguistics: ACL 2025
Methods and Resources in Germanic Variationist Linguistics
John Nerbonne, Verena Blaschke, Hinrich Schütze & Barbara Plank

Oxford Research Encyclopedia of Linguistics
Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum
Ryan Soh‑Eun Shim & Barbara Plank

Findings of the Association for Computational Linguistics: NAACL 2025
Evaluating Pixel Language Models on Non-Standardized Languages
Alberto Muñoz‑Ortiz, Verena Blaschke & Barbara Plank

Proceedings of the 31st International Conference on Computational Linguistics
Cross-Dialect Information Retrieval: Information Access in Low-Resource and High-Variance Languages
Robert Litschko, Oliver Kraus, Verena Blaschke & Barbara Plank

Proceedings of the 31st International Conference on Computational Linguistics
KARRIEREWEGE: A large scale Career Path Prediction Dataset
Elena Senger, Yuri Campbell, Rob van der Goot & Barbara Plank

Proceedings of the 31st International Conference on Computational Linguistics: Industry Track
Neural Text Normalization for Luxembourgish Using Real-Life Variation Data
Anne‑Marie Lutgen, Alistair Plum, Christoph Purschke & Barbara Plank

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
Neural Text Normalization for Luxembourgish Using Real-Life Variation Data
Anne‑Marie Lutgen, Alistair Plum, Christoph Purschke & Barbara Plank

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study
Xaver Maria Krückl, Verena Blaschke & Barbara Plank

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
Add Noise, Tasks, or Layers? MaiNLP at the VarDial 2025 Shared Task on Norwegian Dialectal Slot and Intent Detection
Verena Blaschke, Felicia Körner & Barbara Plank

Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects
Fine-grained Sexism Detection in Italian Newspapers
Federica Manzi, Leon Weber‑Genzel & Barbara Plank

Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Bolei Ma, Xinpeng Wang, Tiancheng Hu, Anna‑Carolina Haensch, Michael A. Hedderich, Barbara Plank & Frauke Kreuter

Findings of the Association for Computational Linguistics: EMNLP 2024
“Seeing the Big through the Small”: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Beiduo Chen, Xinpeng Wang, Siyao Peng, Robert Litschko, Anna Korhonen & Barbara Plank

Findings of the Association for Computational Linguistics: EMNLP 2024
To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity
Anastasiia Sedova, Robert Litschko, Diego Frassinelli, Benjamin Roth & Barbara Plank

Findings of the Association for Computational Linguistics: EMNLP 2024
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models
Philipp Mondorf & Barbara Plank

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models - A Survey
Philipp Mondorf & Barbara Plank

First Conference on Language Modeling
Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think
Xinpeng Wang, Chengzhi Hu, Bolei Ma, Paul Rottger & Barbara Plank

First Conference on Language Modeling
Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification
Shanshan Xu, Santosh T.y.s.s, Oana Ichim, Barbara Plank & Matthias Grabmair

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects
Verena Blaschke, Christoph Purschke, Hinrich Schuetze & Barbara Plank

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
“My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber‑Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy & Barbara Plank

Findings of the Association for Computational Linguistics ACL 2024
Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning
Philipp Mondorf & Barbara Plank

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
VariErr NLI: Separating Annotation Error from Human Label Variation
Leon Weber‑Genzel, Siyao Peng, Marie‑Catherine de Marneffe & Barbara Plank

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
CLIMATELI: Evaluating Entity Linking on Climate Change Data
Shijia Zhou, Siyao Peng & Barbara Plank

Proceedings of the 1st Workshop on Natural Language Processing Meets Climate Change (ClimateNLP 2024)
Position: Insights from Survey Methodology can Improve Training Data
Stephanie Eckman, Barbara Plank & Frauke Kreuter

Forty-first International Conference on Machine Learning
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark
Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Suppa, Hila Gonen, Joseph Marvin Imperial, Börje Karlsson, Peiqin Lin, Nikola Ljubešić, Lester James Miranda, Barbara Plank, Arij Riabi & Yuval Pinter

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness
Shijia Zhou, Huangyan Shan, Barbara Plank & Robert Litschko

Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze & Barbara Plank

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data
Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova & Barbara Plank

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants
Miriam Winkler, Virginija Juozapaityte, Rob van der Goot & Barbara Plank

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
EEVEE: An Easy Annotation Tool for Natural Language Processing
Axel Sorensen, Siyao Peng, Barbara Plank & Rob van der Goot

Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)
Donkii: Characterizing and Detecting Errors in Instruction-Tuning Datasets
Leon Weber, Robert Litschko, Ekaterina Artemova & Barbara Plank

Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)
Proceedings of the 1st Workshop on Uncertainty-Aware NLP (UncertaiNLP 2024)
Raúl Vázquez, Hande Celikkanat, Dennis Ulmer, Jörg Tiedemann, Swabha Swayamdipta, Wilker Aziz, Barbara Plank, Joris Baan & Marie‑Catherine de Marneffe
Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?
Joris Baan, Raquel Fernández, Barbara Plank & Wilker Aziz

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers)
NNOSE: Nearest Neighbor Occupational Skill Extraction
Mike Zhang, Rob Goot, Min‑Yen Kan & Barbara Plank

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties
Ekaterina Artemova, Verena Blaschke & Barbara Plank

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings
Elena Senger, Mike Zhang, Rob Goot & Barbara Plank

Proceedings of the First Workshop on Natural Language Processing for Human Resources (NLP4HR 2024)
Entity Linking in the Job Market Domain
Mike Zhang, Rob Goot & Barbara Plank

Findings of the Association for Computational Linguistics: EACL 2024
Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations
Siyao Peng, Zihang Sun, Sebastian Loftus & Barbara Plank

Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language
More Labels or Cases? Assessing Label Variation in Natural Language Inference
Cornelia Gruber, Katharina Hechinger, Matthias Assenmacher, Göran Kauermann & Barbara Plank

Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language
MaiBaam Annotation Guidelines
Verena Blaschke, Barbara Kovačić, Siyao Peng & Barbara Plank

arXiv
Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training
Max Müller‑Eberstein, Rob van der Goot, Barbara Plank & Ivan Titov

Findings of the Association for Computational Linguistics: EMNLP 2023
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández & Barbara Plank

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification
Shanshan Xu, Santosh T.y.s.s, Oana Ichim, Isabella Risini, Barbara Plank & Matthias Grabmair

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation
Xinpeng Wang & Barbara Plank

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Robert Litschko, Max Müller‑Eberstein, Rob van der Goot, Leon Weber‑Genzel & Barbara Plank

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis
Daniela Teodorescu & Saif Mohammad

Findings of the Association for Computational Linguistics: EMNLP 2023
Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers
Daniela Teodorescu, Tiffany Cheng, Alona Fyshe & Saif Mohammad

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation
Shengqiang Zhang, Philipp Wicke, Lütfi Kerem Şenel, Luis Figueredo, Abdeldjallil Naceri, Sami Haddadin, Barbara Plank & Hinrich Schuetze

arXiv
ActiveAED: A Human in the Loop Improves Annotation Error Detection
Leon Weber & Barbara Plank

Findings of the Association for Computational Linguistics: ACL 2023
Silver Syntax Pre-training for Cross-Domain Relation Extraction
Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot & Barbara Plank

Findings of the Association for Computational Linguistics: ACL 2023
Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data
Robert Litschko, Ekaterina Artemova & Barbara Plank

Findings of the Association for Computational Linguistics: ACL 2023
SemEval-2023 Task 11: Learning with Disagreements (LeWiDi)
Elisa Leonardelli, Gavin Abercrombie, Dina Almanea, Valerio Basile, Tommaso Fornaciari, Barbara Plank, Verena Rieser, Alexandra Uma & Massimo Poesio

Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives
Xinpeng Wang, Leonie Weissweiler, Hinrich Schütze & Barbara Plank

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain
Mike Zhang, Rob van der Goot & Barbara Plank

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Uncertainty in Natural Language Generation: From Theory to Applications
Joris Baan, Nico Daheim, Evgenia Ilia, Dennis Ulmer, Haau‑Sing Li, Raquel Fernández, Barbara Plank, Rico Sennrich, Chrysoula Zerva & Wilker Aziz

arXiv
A Survey of Corpora for Germanic Low-Resource Languages and Dialects
Verena Blaschke, Hinrich Schuetze & Barbara Plank

Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Low-resource Bilingual Dialect Lexicon Induction with Large Language Models
Ekaterina Artemova & Barbara Plank

Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction
Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot & Barbara Plank

Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Findings of the VarDial Evaluation Campaign 2023
Noëmi Aepli, Çağrı Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubešić, Kai North, Barbara Plank, Yves Scherrer & Marcos Zampieri

Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)
Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages
Verena Blaschke, Hinrich Schütze & Barbara Plank

Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)
CrossRE: A Cross-Domain Dataset for Relation Extraction
Elisa Bassignana & Barbara Plank

Findings of the Association for Computational Linguistics: EMNLP 2022
Experimental Standards for Deep Learning in Natural Language Processing Research
Dennis Ulmer, Elisa Bassignana, Max Müller‑Eberstein, Daniel Varab, Mike Zhang, Rob van der Goot, Christian Hardmeier & Barbara Plank

Findings of the Association for Computational Linguistics: EMNLP 2022
On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers
Tanja Samardžić, Ximena Gutierrez‑Vasques, Rob van der Goot, Max Müller‑Eberstein, Olga Pelloni & Barbara Plank

Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL)
The “Problem” of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation
Barbara Plank

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Spectral Probing
Max Müller‑Eberstein, Rob van der Goot & Barbara Plank

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Evidence > Intuition: Transferability Estimation for Encoder Selection
Elisa Bassignana, Max Müller‑Eberstein, Mike Zhang & Barbara Plank

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Stop Measuring Calibration When Humans Disagree
Joris Baan, Wilker Aziz, Barbara Plank & Raquel Fernandez

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper
Kostiantyn Kucher, Nicole Sultanum, Angel Daza, Vasiliki Simaki, Maria Skeppstedt, Barbara Plank, Jean‑Daniel Fekete & Narges Mahyar

2022 IEEE Evaluation and Beyond - Methodological Approaches for Visualization (BELIV)
Skill Extraction from Job Postings using Weak Supervision
Mike Zhang, Kristian Nørgaard Jensen, Rob van der Goot & Barbara Plank

Proceedings of the 2nd Workshop on Recommender Systems for Human Resources (RecSys-in-HR 2022)
SkillSpan: Hard and Soft Skill Extraction from English Job Postings
Mike Zhang, Kristian Jensen, Sif Sonniks & Barbara Plank

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Sort by Structure: Language Model Ranking as Dependency Probing
Max Müller‑Eberstein, Rob van der Goot & Barbara Plank

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget
Barbara Plank

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
Fine-tuning vs From Scratch: Do Vision & Language Models Have Similar Capabilities on Out-of-Distribution Visual Question Answering?
Kristian Nørgaard Jensen & Barbara Plank

Proceedings of the Thirteenth Language Resources and Evaluation Conference
Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embeddings
Rob van der Goot, Max Müller‑Eberstein & Barbara Plank

Proceedings of the Thirteenth Language Resources and Evaluation Conference
Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning
Mike Zhang, Kristian Nørgaard Jensen & Barbara Plank

Proceedings of the Thirteenth Language Resources and Evaluation Conference
What Do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification
Elisa Bassignana & Barbara Plank

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
Probing for Labeled Dependency Trees
Max Müller‑Eberstein, Rob van der Goot & Barbara Plank

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)