All publications

2026

Heinrich, P., Blombach, A., Evert, S., & Schäfer, F. (2026). Narratives and Linguistic Features of Drivel. Datenbank-Spektrum. https://doi.org/10.1007/s13222-026-00528-w

2025

Adrian, A., Doan Dang, B.M., Evert, S., Heinrich, P., & Zilio, L. (2025). Führen Unterschiede in den Sprachfassungen der KI-VO zu unterschiedlichen technischen und juristischen Interpretationen? Eine Untersuchung anhand ausgewählter Tatbestandsmerkmale. In Der Mensch im Zentrum – KI, Ethik & Recht, Tagungsband des 28. Internationalen Rechtsinformatik Symposions IRIS. Wien, AT.
Adrian, A., Doan Dang, B.M., Evert, S., Heinrich, P., & Zilio, L. (2025). Führen Unterschiede in den Sprachfassungen der KI-VO zu unterschiedlichen technischen und juristischen Interpretationen? Eine Untersuchung anhand ausgewählter Tatbestandsmerkmale. Jusletter IT, März 2025. https://doi.org/10.38023/ee731001-8b00-4a44-9c9b-4ed3405625ab
Adrian, A., Evert, S., Doan Dang, B.M., Heinrich, P., Mantash, M., Odorfer, D.,... Werner, J. (2025). Robustheit und Domänenanpassung bei der automatischen Anonymisierung von Gerichtsentscheidungen. Künstliche Intelligenz und Recht, 2(2), 60-69.
Adrian, A., Evert, S., Gritz, M., Stürmer, V., Lindner, J., Blöcher, M.,... Rapp, M. (2025). Maschinelles Schliessen mit s(CASP) - Anmeldung eines neuen Geschäftsführers einer GmbH zum Handelsregister. Jusletter IT, 30 April 2025, 433-442. https://doi.org/10.38023/74311d25-cd34-4fd2-9add-3b19549c3edc
Blombach, A., Doan Dang, B.M., Evert, S., Fuchs, T.S., Heinrich, P., Kalashnikova, O., & Unjum, N. (2025). Narrlangen at SemEval-2025 Task 10: Comparing (mostly) simple multilingual approaches to narrative classification. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025) (pp. 2240-2248). Vienna, AT.
Daunicht, T.-M., Hofmann, F., Gläser-Zikuda, M., Kammerl, R., Evert, S., & Ganslmayer, C. (2025, September). [Lehren | Lernen] [mit | aus | über] KI: Evaluation des Projekts „Prompt Higher Learning“. Poster presentation at 89. Jahrestagung der Arbeitsgruppe Empirisch-Pädagogische Forschung (AEPF), Essen.
Evert, S., Ganslmayer, C., & Rink, C. (2025). KI-generierte Wörterbuchartikel bewerten. Ein Beitrag zur Methodik der Wörterbuchkritik. In Wiebke Blanck, Rufus H. Gouws, Anja Lobenstein-Reichmann (Hrg.), Lexikographisch-grammatische Perspektiven.Tradition, Veränderung und Vielfalt in Lexikographie und Wörterbuchforschung. Berlin/Boston: De Gruyter Brill.
Frenken, F., Evert, S., Schneider, G., & Neumann, S. (2025). How stable are multivariate findings about register variation across varieties of English? On the replicability of Geometric Multivariate Analysis. ICAME Journal, 49(1), 23--45. https://doi.org/10.2478/icame-2025-0003

2024

Adrian, A., Basaran, A., Dykes, N., Evert, S., Gritz, M., Humml, M.,... Stürmer, V. (2024, December). DIREGA – Building Decision Support for German Register Law. Poster presentation at JURIX 2024, Brno, CZ.
Adrian, A., Evert, S., Heinrich, P., & Keuchen, M. (2024). AUSLEGUNG DES KI-VO-E ZUR EVALUATION VON VERFAHREN DER KÜNSTLICHEN INTELLIGENZ AM BEISPIEL DER AUTOMATISCHEN ANONYMISIERUNG VON GERICHTSENTSCHEIDUNGEN. Jusletter IT, 215-226. https://doi.org/10.38023/e6faab9c-1802-46ff-8510-48dc8c349957
Adrian, A., Evert, S., Heinrich, P., & Keuchen, M. (2024). Auslegung des KI-VO-E zur Evaluation von Verfahren der Künstlichen Intelligenz am Beispiel der automatischen Anonymisierung von Gerichtsentscheidungen. In Erich Schweighofer, Stefan Eder, Federico Costantini, Felix Schmautzer, Jonas Pfister (Eds.), Sprachmodelle: Juristische Papageien oder mehr? -- Tagungsband des 27. Internationalen Rechtsinformatik Symposions IRIS 2024 (pp. 205 -- 215). Salzburg, Österreich: Salzburg, Austria.
Adrian, A., Evert, S., Heinrich, P., & Keuchen, M. (2024). Auslegung des KI-VO-E zur Evaluation von Verfahren der Künstlichen Intelligenz am Beispiel der automatischen Anonymisierung von Gerichtsentscheidungen. In Erich Schweighofer / Stefan Eder / Federico Costantini / Felix Schmautzer / Jonas Pfister (Hrg.), Sprachmodelle: Juristische Papageien oder mehr? – Tagungsband des 27. Internationalen Rechtsinformatik Symposions IRIS 2024. (S. 205 - 215).
Adrian, A., Keuchen, M., Rapp, M., & Steen, A. (2024). Auslegung des KI-VO-E zur Evaluation von Symbolischen Deduktionsverfahren der Künstlichen Intelligenz für juristische Anwendungen. Jusletter IT, 85-94. https://doi.org/10.38023/38a66708-701d-4947-bcbc-b50d1d104701
Blombach, A., & Lindner-Bornemann, B. (2024). Der possessive Dativ in Raum und Zeit. In Dagobert Höllein, Günter Koch, Alexander Werth (Hrg.), Regionale Sprachgeschichte(n). (S. 29-46). Berlin/Boston: De Gruyter.
Chiarcos, C., Ionov, M., Apostol, E.S., Gkirtzou, K., Kabashi, B., Khan, A.F., & Truică, C.O. (2024). Multiword expressions, collocations and the OntoLex vocabulary. In Voula Giouli (ed), Verginica Barbu Mititelu (ed) (Eds.), Multiword expressions in lexical resources: Linguistic, lexicographic, and computational perspectives. (pp. 187–227). Berlin: Language Science Press.
Dykes, N., Evert, S., Heinrich, P., Humml, M., & Schröder, L. (2024). Finding Argument Fragments on Social Media with Corpus Queries and LLMs. In Philipp Cimiano, Anette Frank, Michael Kohlhase, Benno Stein (Eds.), Robust Argumentation Machines (pp. 163-181). Bielefeld, DEU: Cham: Springer Science and Business Media Deutschland GmbH.
Dykes, N., Evert, S., Heinrich, P., Humml, M., & Schröder, L. (2024). Leveraging High-Precision Corpus Queries for Text Classification via Large Language Models. In Hautli-Janisz A, Lapesa G, Anastasiou L, Gold V, Liddo AD, Reed C (Eds.), Proceedings of the First Workshop on Language-driven Deliberation Technology (DELITE) @ LREC-COLING 2024 (pp. 52--57). Torino, Italy: Torino, Italy: ELRA and ICCL.
Evert, S., Ganslmayer, C., & Rink, C. (2024). Multi-level analysis as a systematic Approach to evaluating the quality of AI-generated dictionary entries. In Kristina Š. Despot, Ana Ostroški Anić, Ivana Brač (Eds.), Proceedings of the XXI EURALEX International Congress (pp. 298–315). Cavtat/Dubrovnik, HR.
Heinrich, P., Blombach, A., Doan Dang, B.M., Zilio, L., Havenstein, L., Dykes, N.,... Schäfer, F. (2024). Automatic Identification of COVID-19-Related Conspiracy Narratives in German Telegram Channels and Chats. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue (Eds.), Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) (pp. 1932-1943). Turin, IT.
Heinrich, P., Blombach, A., Doan Dang, B.M., Zilio, L., Havenstein, L., Dykes, N.,... Schäfer, F. (2024). Automatic Identification of COVID-19-related Narratives in German Telegram Channels and Chats. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue (Eds.), LREC-COLING 2024 - Main Conference Proceedings (pp. 1932-1943). Torino, IT: European Language Resources Association (ELRA).
Heinrich, P., Blombach, A., Dykes, N., Evert, S., Fuchs, T.S., Havenstein, L., & Schäfer, F. (2024). From Linguistic to Discursive Patterns: Introducing Discoursemes as a Basic Unit of Discourse Analysis. CADAAD Journal. Critical Approaches to Discourse Analysis across Disciplines, 16(2), 87-111. https://doi.org/10.21827/cadaad.16.2.42457
Heinrich, P., & Evert, S. (2024). Operationalising the Hermeneutic Grouping Process in Corpus-assisted Discourse Studies. In Christopher Klamm, Gabriella Lapesa, Gabriella Lapesa, Simone Paolo Ponzetto, Ines Rehbein, Indira Sen (Eds.), CPSS 2024 - 4th Workshop on Computational Linguistics for the Political and Social Sciences, Proceedings of the Workshop (pp. 33-44). Vienna, AUT: Association for Computational Linguistics (ACL).
Khan, A.F., Ionov, M., Chiarcos, C., Romary, L., Sérasset, G., & Kabashi, B. (2024). On Modelling Corpus Citations in Computational Lexical Resources. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue (Eds.), 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp. 12385-12394). Torino, ITA: Paris: European Language Resources Association (ELRA).
Rink, C., Ganslmayer, C., & Evert, S. (2024). Towards a comprehensive method for evaluating and utilizing AI-generated bilingual lexicographic data in language learning using the example of Chinese as a foreign language. In Ai Inoue, Naho Kawamoto, Makoto Sumiyoshi (Eds.), Asian Lexicography - Merging cutting-edge and established approaches (pp. 133–142). Toyo University, Tokyo, JP: Tokyo: 東洋大学 (Toyo University).
Wilkens, R., Zilio, L., & Villavicencio, A. (2024). Assessing linguistic generalisation in language models: a dataset for Brazilian Portuguese. Language Resources and Evaluation, 58(1), 175-201. https://doi.org/10.1007/s10579-023-09664-1
Zilio, L., & Kabashi, B. (2024). USING NEURAL MACHINE TRANSLATION FOR NORMALISING HISTORICAL DOCUMENTS. In Kristina Štrkalj Despot, Ana Ostroški Anić, Ivana Brač (Eds.), EURALEX Proceedings (pp. 783-795). Cavtat, HRV: European Association for Lexicography.
Zilio, L., Qian, S., Kanojia, D., & Orăsan, C. (2024). Using character-level models for efficient abbreviation and long-form detection. In Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue (Eds.), 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings (pp. 3028-3037). Torino, Hybrid, IT: European Language Resources Association (ELRA).

2023

Adrian, A., Dykes, N., Evert, S., Heinrich, P., & Keuchen, M. (2023). AUTOMATISCHE ANONYMISIERUNG VON GERICHTSURTEILEN – EINE VISION SCHEINT REALISIERBAR. Jusletter IT, March, 211-220. https://doi.org/10.38023/14A32D75-E299-40D4-9523-3AF8BD445F95
Adrian, A., Dykes, N., Evert, S., Heinrich, P., & Keuchen, M. (2023). Automatische Anonymisierung von Gerichtsurteilen – Eine Vision scheint realisierbar. In Erich Schweighofer / Jakob Zanol / Stefan Eder (Hrg.), (S. 211 - 220). Editions Weblaw.
Dykes, N., Wilson, A., & Uhrig, P. (2023). A Pipeline for the Creation of Multimodal Corpora from YouTube Videos. In Piush Aggarwal, Özge Alaçam, Carina Silberer, Sina Zarrieß, Torsten Zesch (Eds.), Proceedings of the 1st Workshop on Linguistic Insights from and for Multimodal Language Processing (LIMO 2023) (pp. 1-5). Ingolstadt, DE: Ingolstadt: Association for Computational Linguistics.
Lindner-Bornemann, B., & Blombach, A. (2023). „Ach [...] was wars so dunkel in dem Wolf seinem Leib!“ Zur diachronen Entwicklung des possessiven Dativs. In Alexander Lasch, Kerstin Roth, Dominik Hetjens (Hrg.), Historische (Morpho-)Syntax des Deutschen. (S. 298-316). Berlin/Boston: De Gruyter.
Malapally, A., Blombach, A., Heinrich, P., Schnepf, J., & Bruckmüller, S. (2023). Unequal Tweets: Black Disadvantage is (Re)tweeted More but Discussed Less Than White Privilege. Political Communication. https://doi.org/10.1080/10584609.2023.2257624
Patel, M., Garibyan, A., Winckel, E., & Evert, S. (2023). A reference constructicon as a database. Yearbook of the German Cognitive Linguistics Association, 11, 175-202. https://doi.org/10.1515/gcla-2023-0009
Uhrig, P., Payne, E., Pavlova, I., Burenko, I., Dykes, N., Baltazani, M.,... Wilson, A. (2023). Studying Time Conceptualisation via Speech, Prosody, and Hand Gesture: Interweaving Manual and Computational Methods of Analysis. In Wim Pouw, James Trujillo, Hans Rutger Bosker, Linda Drijvers, Marieke Hoetjes, Judith Holler, Sarka Kadava, Lieke Van Maastricht, Ezgi Mamus, Asli Ozyurek (Eds.), Gesture and Speech in Interaction. Nijmegen, NL.

2022

Adrian, A., Dykes, N., Evert, S., Heinrich, P., & Keuchen, M. (2022). Entwicklung und Evaluation automatischer Verfahren zur Anonymisierung von Gerichtsentscheidungen. LegalTech, 4, 233-238.
Adrian, A., Dykes, N., Evert, S., Heinrich, P., Keuchen, M., & Proisl, T. (2022). Manuelle und automatische Anonymisierung von Urteilen. In Adrian, Axel/Kohlhase, Michael/Evert, Stephanie/Zwickel, Martin (Hrg.), Digitalisierung von Zivilprozess und Rechtsdurchsetzung. (S. 173-197).
Blombach, A., Evert, S., Jannidis, F., Pielström, S., Konle, L., & Proisl, T. (2022). Exploring Lexical Diversities. In Digital Humanities 2022. Conference Abstracts (pp. 130-134). Tokyo, JP.
Chiarcos, C., Apostol, E.S., Kabashi, B., & Truica, C.O. (2022). Modelling Frequency, Attestation, and Corpus-Based Information with OntoLex-FrAC. In Nicoletta Calzolari, Chu-Ren Huang, Hansaem Kim, James Pustejovsky, Leo Wanner, Key-Sun Choi, Pum-Mo Ryu, Hsin-Hsi Chen, Lucia Donatelli, Heng Ji, Sadao Kurohashi, Patrizia Paggio, Patrizia Paggio, Nianwen Xue, Seokhwan Kim, Younggyun Hahm, Zhong He, Tony Kyungil Lee, Enrico Santus, Francis Bond, Seung-Hoon Na (Eds.), Proceedings - International Conference on Computational Linguistics, COLING (pp. 4018-4027). Gyeongju, KOR: Association for Computational Linguistics (ACL).
Chiarcos, C., Gkirtzou, K., Ionov, M., Kabashi, B., Khan, A.F., & Truica, C.-O. (2022). Modelling Collocations in OntoLex-FrAC. In Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference (pp. 10--18). Marseille, France: European Language Resources Association.
Diwersy, S., Dykes, N., Evert, S., Heinrich, P., & Luxardo, G. (2022). Eine korpuslinguistische Analyse der Corona-Berichterstattung in der deutschen und französischen Presse. In Tagungsband Mots et Discours de la Pandémie. Heidelberg, DE.
Dykes, N., Heinrich, P., & Evert, S. (2022). Retrieving Twitter argumentation with corpus queries and discourse analysis. In Susanne Flach, Martin Hilpert (Eds.), Broadening the Spectrum of Corpus Linguistics: New approaches to variability and change. (pp. 229-256). John Benjamins Publishing Company.
Gracia, J., Kabashi, B., & Kernerman, I. (2022). TIAD 2022: The Fifth Translation Inference Across Dictionaries Shared Task. In Proceedings of Globalex Workshop on Linked Lexicography within the 13th Language Resources and Evaluation Conference (pp. 19--25). Marseille, France: European Language Resources Association.
Nesset, T., Piperski, A., & Sokolova, S. (2022). Russian feminitives: what can corpus data tell us? Russian Linguistics, 46(2), 95-113. https://doi.org/10.1007/s11185-022-09253-w
Peters, J., & Dykes, N. (2022). Die Palliativmedizinische Fachkultur in Geschichte und Gegenwart – sprachwissenschaftliche Perspektiven. In Ilg, Yvonne, Schnedermann, Theresa, Iakushevich, Marina (Eds.), Linguistik und Medizin. (pp. 194-214). Berlin, New York: De Gruyter.
Peters, J., Dykes, N., Heckel, M., Ostgathe, C., & Habermann, M. (2022). Präsentation von Palliativstationen und SAPV-Teams im Internet - eine korpusbasierte Metaanalyse von Webseiten. Zeitschrift für Palliativmedizin, 23, 46-53. https://doi.org/10.1055/a-1689-7524
Proisl, T. (2022). Use words, not constructions! A new perspective on the unit of analysis in collostructional analysis. International Journal of Corpus Linguistics. https://dx.doi.org/10.1075/ijcl.20072.pro

2021

Adrian, A., Evert, S., Keuchen, M., Heinrich, P., & Dykes, N. (2021). Anonymisierung von Gerichtsurteilen – eine wesentliche Voraussetzung für E-justice. Jusletter IT, May, 137-147. https://doi.org/10.38023/8a6f3e93-06e9-4655-84ec-ecf2c55db3e1
Adrian, A., Evert, S., Keuchen, M., Heinrich, P., & Dykes, N. (2021). Anonymisierung von Gerichtsurteilen – Eine wesentliche Voraussetzung für E-Justice –. In Schweighofer E, Eder S, Hanke P, Kummer F, Saarenpää A (Hrg.), Cybergovernance - Tagungsband des 24. Internationalen Rechtsinformatik Symposions IRIS 2021. (S. 137 - 149). Editions Weblaw.
Dykes, N., Evert, S., Göttlinger, M., Heinrich, P., & Schröder, L. (2021). Argument parsing via corpus queries. it - Information Technology, 63, 31-44. https://doi.org/10.1515/itit-2020-0051
Evert, S., & Lapesa, G. (2021). FAST: A carefully sampled and cognitively motivated dataset for distributional semantic evaluation. In Arianna Bisazza, Omri Abend (Eds.), CoNLL 2021 - 25th Conference on Computational Natural Language Learning, Proceedings (pp. 588-595). Virtual, Online: Association for Computational Linguistics (ACL).
Gracia, J., Kabashi, B., & Kernerman, I. (2021). Results of the Translation Inference Across Dictionaries 2021 Shared Task. In Carvalho S, Souza RR (Eds.), The 4th Language, Data and Knowledge Conference (LDK 2021) Workshops and Tutorials (pp. 208--220). Zaragoza, Spain: CEUR-WS.org,.
Jansen, S., Higuera Del Moral, S., Barzen, J., Reimann, P., & Opolka, M.M. (2021). Demystifying Bilingualism. How Metaphor Guides Research towards Mythification. London: Palgrave Macmillan, Cham.
Pfaffenberger, F., & Heinrich, P. (2021). Die überschätzte Gefahr? Twitter-Bots im Europawahlkampf 2019. In Holtz-Bacha C (Eds.), Europawahlkampf 2019: Zur Rolle der Medien. (pp. 115 - 148). Wiesbaden: Springer.
Tayebi Arasteh, S., Monajem, M., Christlein, V., Heinrich, P., Nicolao, A., Boldaji, H.N.,... Evert, S. (2021). How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies. In IEEE (Eds.), 2021 IEEE 15th International Conference on Semantic Computing (ICSC) (pp. 370-373). Laguna Hills, CA, US.

2020

Adrian, C., Griebel, T., Heinrich, P., & Vollmann, E. (2020). Will the real populism (please) stand out? Eine interdisziplinäre Aufarbeitung populistischer Tendenzen in Brexit-Tweets im Kontext der Europawahl 2019. In Christina Holtz-Bacha (Eds.), Europawahlkampf 2019. (pp. 245-274). Wiesbaden: Springer VS.
Blombach, A., Dykes, N., Evert, S., Heinrich, P., Kabashi, B., & Proisl, T. (2020). A new German Reddit corpus. In Proceedings of the 15th Conference on Natural Language Processing, KONVENS 2019 (pp. 278-279). Erlangen-Nurnberg, DE: German Society for Computational Linguistics and Language Technology.
Blombach, A., Dykes, N., Heinrich, P., Kabashi, B., & Proisl, T. (2020). A Corpus of German Reddit Exchanges (GeRedE). In Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis (Eds.), LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp. 6310-6316). Marseille, FR: European Language Resources Association (ELRA).
Dykes, N., Evert, S., Göttlinger, M., Heinrich, P., & Schröder, L. (2020). Reconstructing Arguments from Noisy Text. Datenbank-Spektrum, 20, 123-129. https://doi.org/10.1007/s13222-020-00342-y
Dykes, N., Heinrich, P., & Blombach, A. (2020, February). Independent argumentation schemes? Transferring argument queries from Brexit to environment tweets. Paper presentation at ICAME41, Heidelberg, DE.
Dykes, N., & Peters, J. (2020). Reconstructing argumentation patterns in German newspaper articles on multidrug-resistant pathogens: a multi-measure keyword approach. Journal of Corpora and Discourse Studies, 3, 51-74. https://doi.org/10.18573/jcads.35
Evert, S., Harlamov, O., Heinrich, P., & Baski, P. (2020). Corpus query lingua franca part II: Ontology. In Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis (Eds.), LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp. 3346-3352). Marseille, FR: European Language Resources Association (ELRA).
Griebel, T., Evert, S., & Heinrich, P. (Eds.) (2020). Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom. London: Routledge.
Griebel, T., Evert, S., & Heinrich, P. (2020). Possibilities and Challenges of Corpus-Assisted Discourse Analyses of Austerity in the United Kingdom. In Griebel T, Evert S, Heinrich P (Eds.), Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom. (pp. 1 - 10). London: Routledge.
Griebel, T., & Heinrich, P. (2020). The Cultural Political Economy of Brexit in the Age of Austerity. In Griebel T, Evert S, Heinrich P (Eds.), Multimodal Approaches to Media Discourses: Reconstructing the Age of Austerity in the United Kingdom. (pp. 163 - 188). London: Routledge.
Peters, J., Dykes, N., Ostgathe, C., Habermann, M., & Heckel, M. (2020). Kompetenzdarstellung, Patientennähe und Argumentationsstrategien von Internetangeboten deutscher Hospize, Palliativstationen und SAPV-Teams-eine korpusbasierte Meta-Analyse. Zeitschrift für Palliativmedizin, 21(5), e34.
Piperski, A. (2020). Russian language and corpus diversity РУССКИЙ ЯЗЫК И КОРПУСНОЕ РАЗНООБРАЗИЕ. In Proceedings of the 2020 Annual International Conference on Computational Linguistics and Intellectual Technologies, Dialogue 2020 (pp. 615-627). ABBYY PRODUCTION LLC.
Proisl, T., Dykes, N., Heinrich, P., Kabashi, B., Blombach, A., & Evert, S. (2020). EmpiriST Corpus 2.0: Adding Manual Normalization, Lemmatization and Semantic Tagging to a German Web and CMC Corpus. In Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis (Eds.), LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings (pp. 6142-6148). Marseille, FR: European Language Resources Association (ELRA).
Proisl, T., & Lapesa, G. (2020). KLUMSy@KIPoS: Experiments on Part-of-Speech Tagging of Spoken Italian. In Basile V, Croce D, Di Maro M, Passaro L (Eds.), Proceedings of the 7th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian (EVALITA 2020). Online: CEUR-WS.org.

2019

Dimpel, F.M., & Proisl, T. (2019). Gute Wörter für Delta: Verbesserung der Autorschaftsattribution durch autorspezifische distinktive Wörter. In Patrick Sahle (Hrg.), DHd 2019. Digital Humanities: multimedial & multimodal. Konferenzabstracts. (S. 296–299).
Dykes, N., Heinrich, P., & Evert, S. (2019, June). Arguing Brexit on Twitter. A corpus linguistic study. Paper presentation at European Conference on Argumentation 2019, Groningen, NL.
Dykes, N., Heinrich, P., & Evert, S. (2019, June). Reconstructing Twitter arguments with corpus linguistics. Paper presentation at ICAME40: Language in Time, Time in Language, Neuchâtel, CH.
Evert, S., Heinrich, P., Henselmann, K., Rabenstein, U., Scherr, E., Schmitt, M., & Schröder, L. (2019). Combining Machine Learning and Semantic Features in the Classification of Corporate Disclosures. Journal of Logic, Language and Information, 309-330. https://doi.org/10.1007/s10849-019-09283-6
Fritsch, J., Wankerl, S., & Nöth, E. (2019). Automatic Diagnosis of Alzheimer's Disease Using Neural Network Language Models. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 5841-5845). Brighton, GBR: Institute of Electrical and Electronics Engineers Inc..
Gracia, J., Kabashi, B., Kernerman, I., Lanau-Coronas, M., & Lonke, D. (2019). Results of the translation inference across dictionaries 2019 shared task. In Jorge Gracia, Besim Kabashi, Besim Kabashi, Ilan Kernerman (Eds.), CEUR Workshop Proceedings (pp. 1-12). Leipzig, DE: CEUR-WS.
Kabashi, B. (2019). Collecting collocations for the Albanian language. In Iztok Kosem, Tanara Zingano Kuhn, Margarita Correia, Jose Pedro Ferreira, Maarten Jansen, Isabel Pereira, Jelena Kallas, Milos Jakubicek, Simon Krek, Carole Tiberius (Eds.), Proceedings of Electronic Lexicography in the 21st Century Conference (pp. 478-489). Sintra, PT: Lexical Computing CZ s.r.o..
Peters, J., Dykes, N., Habermann, M., Ostgathe, C., & Heckel, M. (2019). Metaphors for multidrug-resistant bacteria in German newspaper articles, 1995-2015. A computer-assisted qualitative study. Metaphor and the Social World, 9(2), 221-241.
Peters, J., Dykes, N., Heckel, M., Ostgathe, C., & Habermann, M. (2019). A Linguistic Model of Communication Types in Palliative Medicine: Effects of Multidrug-Resistant Organisms (MDRO) Colonization or Infection and Isolation Measures in End of Life on Family Caregivers’ Knowledge, Attitude and Practices. Journal of Palliative Medicine, 22(8). https://doi.org/10.1089/jpm.2019.0027
Pfaffenberger, F., Adrian, C., & Heinrich, P. (2019). Was bin ich – und wenn ja, wie viele? Identifikation und Analyse von Political Bots während des Bundestagswahlkampfs 2017 auf Twitter. In Holtz-Bacha, Christina (Eds.), Die (Massen-)Medien im Wahlkampf: Die Bundestagswahl 2017. (pp. 97 - 124). Wiesbaden: Springer.
Proisl, T. (2019). The cooccurrence of linguistic structures. Erlangen: FAU University Press.
Proisl, T., Uhrig, P., Heinrich, P., Blombach, A., Mammarella, S., Dykes, N., & Kabashi, B. (2019). The_Illiterati: Part-of-Speech Tagging for Magahi and Bhojpuri Without Even Knowing the Alphabet. In Proceedings of the First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) (pp. 73-79). Trento, IT: Association for Computational Linguistics.

2018

Evert, S., Dykes, N., & Peters, J. (2018). A quantitative evaluation of keyword measures for corpus-based discourse analysis.
Heinrich, P. (2018). Stylistic Features in Corporate Disclosures and their Predictive Power. In Yukio Tono & Hitoshi Isahara (Eds.), Proceedings of 4th Asia Pacific Corpus Linguistics Conference (APCLC2018) (pp. 129 - 134). Takamatsu, JP.
Heinrich, P., Adrian, C., Kalashnikova, O., Schäfer, F., & Evert, S. (2018). A Transnational Analysis of News and Tweets about Nuclear Phase-Out in the Aftermath of the Fukushima Incident. In Andreas Witt, Jana Diesner, Georg Rehm (Eds.), Proceedings of the LREC 2018 “Workshop on Computational Impact Detection from Text Data” (pp. 8 - 16). Miyazaki, JP: Paris: ELRA.
Heinrich, P., & Schäfer, F. (2018). Extending Corpus-Based Discourse Analysis for Exploring Japanese Social Media. In Yukio Tono & Hitoshi Isahara (Eds.), Proceedings of 4th Asia Pacific Corpus Linguistics Conference (APCLC2018) (pp. 135 - 140). Takamatsu, JP.
Kabashi, B., & Proisl, T. (2018). Albanian Part-of-Speech Tagging: Gold Standard and Evaluation. In Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (pp. 2593–2599). Miyazaki, JP: Miyazaki: European Language Resources Association.
Peters, J., & Dykes, N. (2018). From keywords to discourse - towards a keyword operationalisation model in discourse linguistics. In Corpora and Discourse International Conference. Lancaster.
Pfaffenberger, F., Adrian, C., & Heinrich, P. (2018). Political bots during the German federal election campaign 2017 on Twitter. In Proceedings of the 7. European Communication Conference (ECC) der European Communication Research and Education Association (ECREA). Lugano, CH.
Proisl, T. (2018). SoMeWeTa: A Part-of-Speech Tagger for German Social Media and Web Texts. In Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (pp. 665–670). Miyazaki, JP: Miyazaki: European Language Resources Association.
Proisl, T., Evert, S., Jannidis, F., Schöch, C., Konle, L., & Pielström, S. (2018). Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods. In Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (pp. 3309–3314). Miyazaki, JP: Miyazaki: European Language Resources Association.
Proisl, T., Heinrich, P., Kabashi, B., & Evert, S. (2018). EmotiKLUE at IEST 2018: Topic-Informed Classification of Implicit Emotions. In Balahur A, Mohammad SM, Hoste V, Klinger R (Eds.), Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (pp. 235–242). Brüssel, BE: Brussels: Association for Computational Linguistics.
Uhrig, P., Evert, S., & Proisl, T. (2018). Collocation Candidate Extraction from Dependency-Annotated Corpora: Exploring Differences across Parsers and Dependency Annotation Schemes. In Cantos-Gómez P, Almela-Sánchez M (Eds.), Lexical Collocation Analysis: Advances and Applications. (pp. 111–140). Cham: Springer International Publishing.

2017

Büttner, A., Dimpel, F.M., Evert, S., Jannidis, F., Pielström, S., Proisl, T.,... Vitt, T. (2017). »Delta« in der stilometrischen Autorschaftsattribution. Zeitschrift für digitale Geisteswissenschaften. https://doi.org/10.17175/2017_006
Evert, S., Heinrich, P., Henselmann, K., Rabenstein, U., Scherr, E., & Schröder, L. (2017). Combining Machine Learning and Semantic Features in the Classification of Corporate Disclosures. In Loukanova R, Liefke K (Eds.), Proceedings of the Workshop on Logic and Algorithms in Computational Linguistics 2017 (LACompLing2017) (pp. 47 - 62). Stockholm, SE: Stockholm: Stockholm University.
Evert, S., & Neumann, S. (2017). The impact of translation direction on characteristics of translated texts. A multivariate analysis for English and German. In De Sutter G, Lefer M, Delaere I (Eds.), Empirical Translation Studies. New Theoretical and Methodological Traditions. (pp. 47-80). Berlin: Mouton de Gruyter.
Evert, S., Proisl, T., Jannidis, F., Reger, I., Pielström, S., Schöch, C., & Vitt, T. (2017). Understanding and explaining Delta measures for authorship attribution. Digital Scholarship in the Humanities, 32(suppl_2), ii4–ii16. https://doi.org/10.1093/llc/fqx023
Evert, S., Uhrig, P., Bartsch, S., & Proisl, T. (2017). E-VIEW-Alation – a Large-Scale Evaluation Study of Association Measures for Collocation Identification. In Iztok K, Carole T, Miloš J, Jelena K, Simon K, and Vít B (Eds.), Electronic Lexicography in the 21st Century. Proceedings of the eLex 2017 Conference (pp. 531–549). Leiden, NL: Brno: Lexical Computing.
Evert, S., Wankerl, S., & Nöth, E. (2017). Reliable measures of syntactic and lexical complexity: The case of Iris Murdoch. Paper presentation, Birmingham, GB.
Lapesa, G., & Evert, S. (2017). Large-scale evaluation of dependency-based DSMs: Are they worth the effort? In Proceedings of the 15th Annual Meeting of the European Association for Computational Linguistics (EACL 2017): Volume 2, Short Papers (pp. 394-400). Valencia, Spain.
Proisl, T., Heinrich, P., Evert, S., & Kabashi, B. (2017). Translation Inference across Dictionaries via a Combination of Graph-based Methods and Co-occurrence Statistics. In McCrae J, Bond F, Buitelaar P, Cimiano P, Declerck T, Gracia J, Kernerman I, Ponsoda E, Ordan N, Piasecki M (Eds.), Proceedings of the LDK 2017 Workshops: 1st Workshop on the OntoLex Model (OntoLex-2017), Shared Task on Translation Inference Across Dictionaries & Challenges for Wordnets (pp. 94–102). Galway, IE: CEUR.
Schäfer, F., Evert, S., & Heinrich, P. (2017). Japan's 2014 General Election: Political Bots, Right-Wing Internet Activism and PM Abe Shinzō’s Hidden Nationalist Agenda. Big Data, 5(4), 1 - 16.

2016

Evert, S. (2016). CogALex-V Shared Task: Mach5 – A traditional DSM approach to semantic relatedness. In Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V) (pp. 92-97). Osaka, Japan.
Evert, S., Beißwenger, M., Bartsch, S., & Würzner, K.-M. (2016). EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora. In Proceedings of the 10th Web as Corpus Workshop (WAC-X) and the EmpiriST Shared Task (pp. 44-56). Berlin, DE: Berlin, Germany.
Evert, S., Greiner, P., Baigger, F., & Lang, B. (2016). A Distributional Approach to Open Questions in Market Research. Computers in Industry, 78, 16-28. https://doi.org/10.1016/j.compind.2015.10.008
Evert, S., Jannidis, F., Dimpel, F.M., Schöch, C., Pielström, S., Vitt, T.,... Proisl, T. (2016). „Delta“ in der stilometrischen Autorschaftsattribution. Paper presentation at DHd 2016, Leipzig, DE.
Kabashi, B., & Proisl, T. (2016). A Proposal for a Part-of-Speech Tagset for the Albanian Language. In Calzolari Nicoletta, Choukri Khalid, Declerck Thierry, Grobelnik Marko, Maegaard Bente, Mariani Joseph, Moreno Asuncion, Odijk Jan, Piperidis Stelios (Eds.), Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) (pp. 4305–4310). Portorož, SI: Paris: European Language Resources Association (ELRA).
Piperski, A., & Kukhto, A. (2016). Intra-speaker stress variation in Russian: A corpus-driven study of Russian poetry. In Proceedings of the 2016 International Conference on Computational Linguistics and Intellectual Technologies, Dialogue 2016 (pp. 540-550). Rossiiskii Gosudarstvennyi Gumanitarnyi Universitet.
Proisl, T., & Uhrig, P. (2016). SoMaJo: State-of-the-art tokenization for German web and social media texts. In Cook P, Evert S, Schäfer R, Stemle E (Eds.), Proceedings of the 10th Web as Corpus Workshop (WAC-X) and the EmpiriST Shared Task (pp. 57-62). Berlin, DE: Berlin: Association for Computational Linguistics (ACL).
Santus, E., Gladkova, A., Evert, S., & Lenci, A. (2016). The CogALex-V Shared Task on the Corpus-Based Identification of Semantic Relations. In Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex-V) (pp. 69-79). Osaka, Japan.
Wankerl, S., Nöth, E., & Evert, S. (2016). An Analysis of Perplexity to Reveal the Effects of Alzheimer's Disease on Language. In ITG-Fachbericht 267: Speech Communication (pp. 254-259). Paderborn, Germany.

2015

Evert, S., & Arppe, A. (2015). Some theoretical and experimental observations on naïve discriminative learning. In Proceedings of the 6th Conference on Quantitative Investigations in Theoretical Linguistics (QITL-6). Tübingen, Germany.
Evert, S., & Hardie, A. (2015). Ziggurat: A new data model and indexing format for large annotated text corpora. In Proceedings of the 3rd Workshop on the Challenges in the Management of Large Corpora (CMLC-3) (pp. 21--27). Lancaster, UK.
Evert, S., Proisl, T., Jannidis, F., Pielström, S., Schöch, C., & Vitt, T. (2015). Towards a better understanding of Burrows's Delta in literary authorship attribution. In Proceedings of the Fourth Workshop on Computational Linguistics for Literature (pp. 79--88). Denver, CO.
Kabashi, B. (2015). Automatische Verarbeitung der Morphologie des Albanischen. Erlangen: FAU University Press.
Plotnikova, N., Kohl, M., Volkert, K., Lerner, A., Dykes, N., Ermer, H., & Evert, S. (2015). KLUEless: Polarity Classification and Association. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) (pp. 619--625). Denver, Colorado.
Plotnikova, N., Lapesa, G., Proisl, T., & Evert, S. (2015). SemantiKLUE: Semantic Textual Similarity with Maximum Weight Matching. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) (pp. 111--116). Denver, Colorado.

2014

Bartsch, S., & Evert, S. (2014). Towards a Firthian Notion of Collocation. In Abel A, Lemnitzer L (Eds.), Vernetzungsstrategien, Zugriffsstrukturen und automatisch ermittelte Angaben in Internetwörterbüchern. (pp. 48–61). Mannheim: Institut für Deutsche Sprache.
Diwersy, S., Evert, S., & Neumann, S. (2014). A weakly supervised multivariate approach to the study of language variation. In Szmrecsanyi B, Wälchli B (Eds.), Aggregating Dialectology, Typology, and Register Analysis. Linguistic Variation in Text and Speech. (pp. 174–204). Berlin, Boston: De Gruyter.
Evert, S. (2014). Distributional Semantics in R with the wordspace Package. In Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: System Demonstrations (pp. 110–114). Dublin, Ireland.
Evert, S., Proisl, T., Greiner, P., & Kabashi, B. (2014). SentiKLUE: Updating a polarity classifier in 48 hours. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval-2014) (pp. 551–555). Dublin, Ireland.
Lapesa, G., & Evert, S. (2014). A Large Scale Evaluation of Distributional Semantic Models: Parameters, Interactions and Model Selection. Transactions of the Association for Computational Linguistics, 2, 531–545.
Lapesa, G., & Evert, S. (2014). NaDiR: Naive Distributional Response Generation. In Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex) (pp. 50–59). Dublin, Ireland.
Lapesa, G., Evert, S., & Schulte im Walde, S. (2014). Contrasting Syntagmatic and Paradigmatic Relations: Insights from Distributional Semantic Models. In Proceedings of the Third Joint Conference on Lexical and Computational Semantics (*SEM 2014) (pp. 160–170). Dublin, Ireland.
Proisl, T., Evert, S., Greiner, P., & Kabashi, B. (2014). SemantiKLUE: Robust semantic similarity at multiple levels using maximum weight matching. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval-2014) (pp. 532–540). Dublin, Ireland.
Schulze Wettendorf, C., Jegan, R., Körner, A., Zerche, J., Plotnikova, N., Moreth, J.,... Evert, S. (2014). SNAP: A Multi-Stage XML-Pipeline for Aspect Based Sentiment Analysis. In Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014) (pp. 578-584). Dublin, Ireland.

2013

Ansorge, U., Reynvoet, B., Hendler, J., Oettl, L., & Evert, S. (2013). Conditional automaticity in subliminal morphosyntactic priming. Psychological research, 77, 399–421.
Biemann, C., Bildhauer, F., Evert, S., Goldhahn, D., Quasthoff, U., Schäfer, R.,... Zesch, T. (2013). Scalable Construction of High-Quality Web Corpora. Journal for language technology and computational linguistics, 28(2), 23–59.
Evert, S. (2013). Tools for the acquisition of lexical combinatorics. In Gouws RH, Heid U, Schweickard W, Wiegand HE (Eds.), Dictionaries. An International Encyclopedia of Lexicography. Supplementary volume: Recent Developments with Focus on Electronic and Computational Lexicography (HSK 5.4). (pp. 1415–1432). Berlin, New York: Mouton de Gruyter.
Greiner, P., Proisl, T., Evert, S., & Kabashi, B. (2013). KLUE-CORE: A regression model of semantic textual similarity. In Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity (pp. 181–186). Atlanta, Georgia, USA: Association for Computational Linguistics.
Lapesa, G., & Evert, S. (2013). Evaluating Neighbor Rank and Distance Measures as Predictors of Semantic Priming. In Proceedings of the ACL Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2013) (pp. 66--74). Sofia, Bulgaria.
Proisl, T., Greiner, P., Evert, S., & Kabashi, B. (2013). KLUE: Simple and robust methods for polarity classification. In Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013) (pp. 395–401). Atlanta, GA: Association for Computational Linguistics.