Research Areas
I apply statistical methods to lexical-semantic phenomena, with a focus on the linguistic and cognitive plausibility of the computational approaches. My research topics include:
- Automatic induction of semantic classifications and semantic relations
- Compositionality and meaning shifts of multi-word expressions
- Synchronic and diachronic ambiguity and figurative language usage
- Creation of datasets with human judgements on meaning components and meaning relatedness
- Evaluation of corpus-based semantic knowledge
- Application of models to lexicography, machine translation, and terminology extraction
Projects and Grants
- 10/2015-06/2018: Principal Investigator, DFG SFB 732/D11 (taken over from Lonneke van der Plas), Collaborative Research Centre 732 "Incremental Specification in Context". Project: A Crosslingual Approach to the Analysis of Compound Nouns
- 07/2014-06/2018: Principal Investigator, DFG SFB 732/D12, Collaborative Research Centre 732 "Incremental Specification in Context". Project: Sense Discrimination and Regular Meaning Shifts of German Particle Verbs
- 07/2014-06/2018: Co-Director of the Integrated Research Training Group, DFG SFB 732/MGK, Collaborative Research Centre 732 "Incremental Specification in Context"
- 11/2011-02/2017: Principal Investigator, DFG Research Grant SCHU 2580/2. Project: Distributional Approaches to Semantic Relatedness
- 10/2011-06/2014: Principal Investigator, DFG SFB 732/D6 (taken over from Sebastian Padó), Collaborative Research Centre 732 "Incremental Specification in Context". Project: Lexical-Semantic Factors in Event Interpretation
- 10/2011-12/2016: DFG Heisenberg Fellowship SCHU 2580/1
- 08/2000-07/2003: DFG Doctoral Scholarship, Graduate School 609 "Linguistic Representations and their Interpretation"
Milestones
- Habilitation:
Theoretical Adequacy, Human Data and Classification Approaches in Modelling Word Properties, Word Relatedness and Word Classes
Philosophische Fakultät, Universität des Saarlandes, June 2009. [Habilitation homepage]
- PhD Thesis:
Experiments on the Automatic Induction of German Semantic Verb Classes
Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, June 2003.
Published as AIMS Report 9(2). [PhD homepage]
Collaborations (current and past)
- Felix Bildhauer (Institut für Deutsche Sprache, Mannheim) and Roland Schäfer (Fachbereich Philosophie und Geisteswissenschaften, Freie Universität Berlin): evaluation of COW corpora
- Gemma Boleda Torrent (Department of Translation and Language Sciences, Universitat Pompeu Fabra): automatic acquisition and evaluation of lexical classes
- Susanne Borgwaldt (Germanistisches Seminar, Universität Siegen): associations and compositionality of German compound nouns
- Miriam Butt and Daniela Briem (Fachbereich Sprachwissenschaft, Universität Konstanz): light verbs
- Katrin Erk (Linguistics Department, University of Texas at Austin) and Sebastian Padó (Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart): semantic verb classifications
- Alexander Fraser (Centrum für Informations- und Sprachverarbeitung, Universität München): integration of linguistic information into machine translation systems
- Diego Frassinelli (Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart): priming experiments on the directionality of complex verbs; contexts of abstract vs. concrete words
- Adam Kilgarriff (Lexical Computing Ltd): German Sketch Engine
- Tibor Kiss and Antje Müller (Theoretische Linguistik/Computerlinguistik, Ruhr-Universität Bochum): preposition senses
- Steffen Koch (Institut für Visualisierung und Interaktive Systeme, Universität Stuttgart): visualisation of ambiguous words
- Alessandro Lenci (Dipartimento di Linguistica, Università di Pisa): association norms, feature norms, and semantic relations
- Timm Lichte, Rafael Ehren, Fabienne Cap and Heike Zinsmeister (Universitäten Düsseldorf, Hamburg, Uppsala): VerbCompoCor - German corpus with compositionality judgements for verb-dependent pairs
- Alissa Melinger (School of Psychology, University of Dundee) and Andrea Weber (Englisches Seminar, Universität Tübingen): collection and properties of association norms and their usage for NLP
- Eva Smolka (Fachbereich Sprachwissenschaft, Universität Konstanz): processing and representation of noun compounds and particle verbs
- Lonneke van der Plas (Institute of Linguistics and Language Technology, L-Università ta' Malta): cross-lingual and distributional models of noun compounds