Publications

All Publications

  1. jan 2025
    Evaluating Pixel Language Models on Non-Standardized Languages

    Muñoz-Ortiz, Alberto and Blaschke, Verena and Plank, Barbara

    Proceedings of the 31st International Conference on Computational Linguistics


  2. jan 2025
    Cross-Dialect Information Retrieval: Information Access in Low-Resource and High-Variance Languages

    Litschko, Robert and Kraus, Oliver and Blaschke, Verena and Plank, Barbara

    Proceedings of the 31st International Conference on Computational Linguistics


  3. jan 2025
    KARRIEREWEGE: A large scale Career Path Prediction Dataset

    Senger, Elena and Campbell, Yuri and van der Goot, Rob and Plank, Barbara

    Proceedings of the 31st International Conference on Computational Linguistics: Industry Track


  4. jan 2025
    Neural Text Normalization for Luxembourgish Using Real-Life Variation Data

    Lutgen, Anne-Marie and Plum, Alistair and Purschke, Christoph and Plank, Barbara

    Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects


  5. jan 2025
    Neural Text Normalization for Luxembourgish Using Real-Life Variation Data

    Lutgen, Anne-Marie and Plum, Alistair and Purschke, Christoph and Plank, Barbara

    Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects


  6. jan 2025
    Improving Dialectal Slot and Intent Detection with Auxiliary Tasks: A Multi-Dialectal Bavarian Case Study

    Krückl, Xaver Maria and Blaschke, Verena and Plank, Barbara

    Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects


  7. jan 2025
    Add Noise, Tasks, or Layers? MaiNLP at the VarDial 2025 Shared Task on Norwegian Dialectal Slot and Intent Detection

    Blaschke, Verena and Körner, Felicia and Plank, Barbara

    Proceedings of the 12th Workshop on NLP for Similar Languages, Varieties and Dialects


  8. dec 2024
    Fine-grained Sexism Detection in Italian Newspapers

    Manzi, Federica and Weber-Genzel, Leon and Plank, Barbara

    Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)


  9. nov 2024
    The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models

    Ma, Bolei and Wang, Xinpeng and Hu, Tiancheng and Haensch, Anna-Carolina and Hedderich, Michael A. and Plank, Barbara and Kreuter, Frauke

    Findings of the Association for Computational Linguistics: EMNLP 2024


  10. nov 2024
    “Seeing the Big through the Small”: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?

    Chen, Beiduo and Wang, Xinpeng and Peng, Siyao and Litschko, Robert and Korhonen, Anna and Plank, Barbara

    Findings of the Association for Computational Linguistics: EMNLP 2024


  11. nov 2024
    To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity

    Sedova, Anastasiia and Litschko, Robert and Frassinelli, Diego and Roth, Benjamin and Plank, Barbara

    Findings of the Association for Computational Linguistics: EMNLP 2024


  12. nov 2024
    Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models

    Mondorf, Philipp and Plank, Barbara

    Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing


  13. oct 2024
    Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models - A Survey

    Mondorf, Philipp and Plank, Barbara

    First Conference on Language Modeling


  14. oct 2024
    Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think

    Wang, Xinpeng and Hu, Chengzhi and Ma, Bolei and Rottger, Paul and Plank, Barbara

    First Conference on Language Modeling


  15. aug 2024
    Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification

    Xu, Shanshan and T.y.s.s, Santosh and Ichim, Oana and Plank, Barbara and Grabmair, Matthias

    Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)


  16. aug 2024
    What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects

    Blaschke, Verena and Purschke, Christoph and Schuetze, Hinrich and Plank, Barbara

    Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)


  17. aug 2024
    “My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models

    Wang, Xinpeng and Ma, Bolei and Hu, Chengzhi and Weber-Genzel, Leon and Röttger, Paul and Kreuter, Frauke and Hovy, Dirk and Plank, Barbara

    Findings of the Association for Computational Linguistics ACL 2024


  18. aug 2024
    Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning

    Mondorf, Philipp and Plank, Barbara

    Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)


  19. aug 2024
    VariErr NLI: Separating Annotation Error from Human Label Variation

    Weber-Genzel, Leon and Peng, Siyao and De Marneffe, Marie-Catherine and Plank, Barbara

    Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)


  20. aug 2024
    CLIMATELI: Evaluating Entity Linking on Climate Change Data

    Zhou, Shijia and Peng, Siyao and Plank, Barbara

    Proceedings of the 1st Workshop on Natural Language Processing Meets Climate Change (ClimateNLP 2024)


  21. jul 2024
    Position: Insights from Survey Methodology can Improve Training Data

    Eckman, Stephanie and Plank, Barbara and Kreuter, Frauke

    Forty-first International Conference on Machine Learning


  22. jun 2024
    Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark

    Mayhew, Stephen and Blevins, Terra and Liu, Shuheng and Suppa, Marek and Gonen, Hila and Imperial, Joseph Marvin and Karlsson, Börje and Lin, Peiqin and Ljubešić, Nikola and Miranda, Lester James and Plank, Barbara and Riabi, Arij and Pinter, Yuval

    Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)


  23. jun 2024
    MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness

    Zhou, Shijia and Shan, Huangyan and Plank, Barbara and Litschko, Robert

    Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)


  24. may 2024
    MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank

    Blaschke, Verena and Kovačić, Barbara and Peng, Siyao and Schütze, Hinrich and Plank, Barbara

    Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)


  25. may 2024
    Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data

    Peng, Siyao and Sun, Zihang and Shan, Huangyan and Kolm, Marie and Blaschke, Verena and Artemova, Ekaterina and Plank, Barbara

    Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)


  26. may 2024
    Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants

    Winkler, Miriam and Juozapaityte, Virginija and van der Goot, Rob and Plank, Barbara

    Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)


  27. mar 2024
    EEVEE: An Easy Annotation Tool for Natural Language Processing

    Sorensen, Axel and Peng, Siyao and Plank, Barbara and Van Der Goot, Rob

    Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)


  28. mar 2024
    Donkii: Characterizing and Detecting Errors in Instruction-Tuning Datasets

    Weber, Leon and Litschko, Robert and Artemova, Ekaterina and Plank, Barbara

    Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)


  29. mar 2024
    Proceedings of the 1st Workshop on Uncertainty-Aware NLP (UncertaiNLP 2024)

    Vázquez, Raúl and Celikkanat, Hande and Ulmer, Dennis and Tiedemann, Jörg and Swayamdipta, Swabha and Aziz, Wilker and Plank, Barbara and Baan, Joris and de Marneffe, Marie-Catherine


  30. mar 2024
    Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?

    Baan, Joris and Fernández, Raquel and Plank, Barbara and Aziz, Wilker

    Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 2: Short Papers)


  31. mar 2024
    NNOSE: Nearest Neighbor Occupational Skill Extraction

    Zhang, Mike and Goot, Rob and Kan, Min-Yen and Plank, Barbara

    Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)


  32. mar 2024
    Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties

    Artemova, Ekaterina and Blaschke, Verena and Plank, Barbara

    Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)


  33. mar 2024
    Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings

    Senger, Elena and Zhang, Mike and Goot, Rob and Plank, Barbara

    Proceedings of the First Workshop on Natural Language Processing for Human Resources (NLP4HR 2024)


  34. mar 2024
    Entity Linking in the Job Market Domain

    Zhang, Mike and Goot, Rob and Plank, Barbara

    Findings of the Association for Computational Linguistics: EACL 2024


  35. mar 2024
    Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations

    Peng, Siyao and Sun, Zihang and Loftus, Sebastian and Plank, Barbara

    Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language


  36. mar 2024
    More Labels or Cases? Assessing Label Variation in Natural Language Inference

    Gruber, Cornelia and Hechinger, Katharina and Assenmacher, Matthias and Kauermann, Göran and Plank, Barbara

    Proceedings of the Third Workshop on Understanding Implicit and Underspecified Language


  37. mar 2024
    MaiBaam Annotation Guidelines

    Blaschke, Verena and Kovačić, Barbara and Peng, Siyao and Plank, Barbara

    arXiv


  38. dec 2023
    Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training

    Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara and Titov, Ivan

    Findings of the Association for Computational Linguistics: EMNLP 2023


  39. dec 2023
    What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability

    Giulianelli, Mario and Baan, Joris and Aziz, Wilker and Fernández, Raquel and Plank, Barbara

    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing


  40. dec 2023
    From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification

    Xu, Shanshan and T.y.s.s, Santosh and Ichim, Oana and Risini, Isabella and Plank, Barbara and Grabmair, Matthias

    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing


  41. dec 2023
    ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation

    Wang, Xinpeng and Plank, Barbara

    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing


  42. dec 2023
    Establishing Trustworthiness: Rethinking Tasks and Model Evaluation

    Litschko, Robert and Müller-Eberstein, Max and van der Goot, Rob and Weber-Genzel, Leon and Plank, Barbara

    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing


  43. dec 2023
    Evaluating Emotion Arcs Across Languages: Bridging the Global Divide in Sentiment Analysis

    Teodorescu, Daniela and Mohammad, Saif

    Findings of the Association for Computational Linguistics: EMNLP 2023


  44. dec 2023
    Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers

    Teodorescu, Daniela and Cheng, Tiffany and Fyshe, Alona and Mohammad, Saif

    Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing


  45. oct 2023
    LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation

    Zhang, Shengqiang and Wicke, Philipp and Şenel, Lütfi Kerem and Figueredo, Luis and Naceri, Abdeldjallil and Haddadin, Sami and Plank, Barbara and Schuetze, Hinrich

    arXiv


  46. jul 2023
    ActiveAED: A Human in the Loop Improves Annotation Error Detection

    Weber, Leon and Plank, Barbara

    Findings of the Association for Computational Linguistics: ACL 2023


  47. jul 2023
    Silver Syntax Pre-training for Cross-Domain Relation Extraction

    Bassignana, Elisa and Ginter, Filip and Pyysalo, Sampo and van der Goot, Rob and Plank, Barbara

    Findings of the Association for Computational Linguistics: ACL 2023


  48. jul 2023
    Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data

    Litschko, Robert and Artemova, Ekaterina and Plank, Barbara

    Findings of the Association for Computational Linguistics: ACL 2023


  49. jul 2023
    SemEval-2023 Task 11: Learning with Disagreements (LeWiDi)

    Leonardelli, Elisa and Abercrombie, Gavin and Almanea, Dina and Basile, Valerio and Fornaciari, Tommaso and Plank, Barbara and Rieser, Verena and Uma, Alexandra and Poesio, Massimo

    Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)


  50. jul 2023
    How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives

    Wang, Xinpeng and Weissweiler, Leonie and Schütze, Hinrich and Plank, Barbara

    Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)


  51. jul 2023
    ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain

    Zhang, Mike and van der Goot, Rob and Plank, Barbara

    Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)


  52. jul 2023
    Uncertainty in Natural Language Generation: From Theory to Applications

    Baan, Joris and Daheim, Nico and Ilia, Evgenia and Ulmer, Dennis and Li, Haau-Sing and Fernández, Raquel and Plank, Barbara and Sennrich, Rico and Zerva, Chrysoula and Aziz, Wilker

    arXiv


  53. may 2023
    A Survey of Corpora for Germanic Low-Resource Languages and Dialects

    Blaschke, Verena and Schuetze, Hinrich and Plank, Barbara

    Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)


  54. may 2023
    Low-resource Bilingual Dialect Lexicon Induction with Large Language Models

    Artemova, Ekaterina and Plank, Barbara

    Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)


  55. may 2023
    Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction

    Bassignana, Elisa and Ginter, Filip and Pyysalo, Sampo and van der Goot, Rob and Plank, Barbara

    Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)


  56. may 2023
    Findings of the VarDial Evaluation Campaign 2023

    Aepli, Noëmi and Çöltekin, Çağrı and Van Der Goot, Rob and Jauhiainen, Tommi and Kazzaz, Mourhaf and Ljubešić, Nikola and North, Kai and Plank, Barbara and Scherrer, Yves and Zampieri, Marcos

    Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)


  57. may 2023
    Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages

    Blaschke, Verena and Schütze, Hinrich and Plank, Barbara

    Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)


  58. dec 2022
    CrossRE: A Cross-Domain Dataset for Relation Extraction

    Bassignana, Elisa and Plank, Barbara

    Findings of the Association for Computational Linguistics: EMNLP 2022


  59. dec 2022
    Experimental Standards for Deep Learning in Natural Language Processing Research

    Ulmer, Dennis and Bassignana, Elisa and Müller-Eberstein, Max and Varab, Daniel and Zhang, Mike and van der Goot, Rob and Hardmeier, Christian and Plank, Barbara

    Findings of the Association for Computational Linguistics: EMNLP 2022


  60. dec 2022
    On Language Spaces, Scales and Cross-Lingual Transfer of UD Parsers

    Samardžić, Tanja and Gutierrez-Vasques, Ximena and van der Goot, Rob and Müller-Eberstein, Max and Pelloni, Olga and Plank, Barbara

    Proceedings of the 26th Conference on Computational Natural Language Learning (CoNLL)


  61. dec 2022
    The “Problem” of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation

    Plank, Barbara

    Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing


  62. dec 2022
    Spectral Probing

    Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara

    Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing


  63. dec 2022
    Evidence > Intuition: Transferability Estimation for Encoder Selection

    Bassignana, Elisa and Müller-Eberstein, Max and Zhang, Mike and Plank, Barbara

    Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing


  64. dec 2022
    Stop Measuring Calibration When Humans Disagree

    Baan, Joris and Aziz, Wilker and Plank, Barbara and Fernandez, Raquel

    Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing


  65. oct 2022
    An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper

    Kucher, Kostiantyn and Sultanum, Nicole and Daza, Angel and Simaki, Vasiliki and Skeppstedt, Maria and Plank, Barbara and Fekete, Jean-Daniel and Mahyar, Narges

    2022 IEEE Evaluation and Beyond - Methodological Approaches for Visualization (BELIV)


  66. sep 2022
    Skill Extraction from Job Postings using Weak Supervision

    Zhang, Mike and Jensen, Kristian Nørgaard and van der Goot, Rob and Plank, Barbara

    Proceedings of the 2nd Workshop on Recommender Systems for Human Resources (RecSys-in-HR 2022)


  67. jul 2022
    SkillSpan: Hard and Soft Skill Extraction from English Job Postings

    Zhang, Mike and Jensen, Kristian and Sonniks, Sif and Plank, Barbara

    Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies


  68. jul 2022
    Sort by Structure: Language Model Ranking as Dependency Probing

    Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara

    Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies


  69. jul 2022
    Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget

    Plank, Barbara

    Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)


  70. jun 2022
    Fine-tuning vs From Scratch: Do Vision & Language Models Have Similar Capabilities on Out-of-Distribution Visual Question Answering?

    Jensen, Kristian Nørgaard and Plank, Barbara

    Proceedings of the Thirteenth Language Resources and Evaluation Conference


  71. jun 2022
    Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embeddings

    van der Goot, Rob and Müller-Eberstein, Max and Plank, Barbara

    Proceedings of the Thirteenth Language Resources and Evaluation Conference


  72. jun 2022
    Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning

    Zhang, Mike and Jensen, Kristian Nørgaard and Plank, Barbara

    Proceedings of the Thirteenth Language Resources and Evaluation Conference


  73. may 2022
    What Do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification

    Bassignana, Elisa and Plank, Barbara

    Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop


  74. may 2022
    Probing for Labeled Dependency Trees

    Müller-Eberstein, Max and van der Goot, Rob and Plank, Barbara

    Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)