Types of resources:
- Empirical Lexical Resources
- Cognitive and Visual Resources
- Semantic Relation and Relatedness Datasets
- Semantic Change Datasets
- Datasets of Compositionality and Figurative Language Usage
- Plausibility Datasets
- Terminology Datasets
- Information and Networking Resources
- Software
Empirical Lexical Resources
- Empirical Lexical Information from Lexicalised PCFG Parses:
Based on manually developed grammars and statistical parsers, we acquired various types of lexical information from German and English corpora.
- Resources for German: lexical acquisition from the Huge German Corpus (HGC)
- Resources for English: lexical acquisition from the British National Corpus (BNC)
- Resources for German: lexical acquisition from the Huge German Corpus (HGC)
- Subcategorisation Database for German MATE Parses
- Analogies at the morpho-syntactic and syntactico-semantic interfaces:
- Deep Semantic Analogies: evaluation benchmarks regarding semantic analogies for English and for German
- Derivation Analogies for German Particle Verbs
- German Personal Name Compounds
- Deep Semantic Analogies: evaluation benchmarks regarding semantic analogies for English and for German
Cognitive and Visual Resources
- Association Norms for
German, Italian and French verbs, German noun compounds and their
constituents, and German name compounds
- Feature Norms for German noun compounds and their constituents
- Affective Norms:
human-generated and automatically extrapolated
concreteness/abstractness, arousal, imageability and valence ratings
for German, Modern Standard Arabic, English, Estonian, Romanian,
Slovene
- Bildernetle: a dataset of German noun-to-ImageNet mappings
- Image Generation for Complex Expressions:
- VO-ImageGen with
image generation for abstract and concrete verb-object events:
automatically generated images and human judgements on their event
description quality as well as metaphoricity of the events
- ComposiGen:
prompting strategies and image generation for English noun
compounds and their constituents
- VO-ImageGen with
image generation for abstract and concrete verb-object events:
automatically generated images and human judgements on their event
description quality as well as metaphoricity of the events
- AbsVis:
675 images annotated with 14,175 concept–explanation pairs from humans and two vision-language models
Semantic Relation and Relatedness Datasets
- Paradigmatic Semantic Relation Database compiled via human judgement experiments
- GermaNet-based Semantic Relation Networks
- Semantic Relatedness Datasets for Vietnamese
- DURel - Annotation of Diachronic Usage Relatedness
- SURel - Annotation of Synchronic Domain-specific Meaning Shifts
Diachronic Semantic Relatedness and Semantic Change Datasets
- German Diachronic Metaphor Annotation Dataset
- DURel - Annotation of Diachronic Usage Relatedness
- DiaWUG: Diatopic Word Usage Graphs for Spanish
- Dataset of Grammaticalisation for German Prepositions
- Clean Corpus of Historical American English (CCOHA)
- Dia-Neighbour-NN: Semantic Neighbourhoods of English Noun Compounds over Times
- Dia-Sem-NN:
Annotations of Meaning Changes for English and German Noun Compounds
- Compositionality
Ratings for German and English Noun Compounds and Particle Verbs
in Context
Compositionality and Figurative Language Usage
- Compositionality
Ratings for German and English noun compounds and particle
verbs, both present-day and diachronic, as well as on the type level
and in context
- Datasets for meaning components and meaning shifts of German particle verbs:
- Sentence Generation for German Particle Verb Neologisms
- Directionality in German Particle Verbs
- Source-Target Domains, Directionalities, Mappings and Metaphoricity
- Analogies in Meaning Shifts
- German Personal Name Compounds
- German Diachronic Metaphor Annotation Dataset
- Metaphor Datasets for English, German, Russian and Latin
- Figurative Language Ratings:
- (Non-)Literalness Ratings for German, French and Estonian complex verbs
- Metaphoricity ratings plus human-generated examples sentences for (subject-)verb-object events in English, German and Slovene
- (Non-)Literalness Ratings for German, French and Estonian complex verbs
- Synonymous Literal
and Metaphorical Expressions in Discourse
- VOLIMET: a
parallel corpus of literal and metaphorical verb-object pairs for
English-German and English-French
Plausibility Datasets
- Dataset for Plausibility of Vehicle Components
- PAP: Dataset for
Physical and Abstract Plausibility, accompanied by metaphoricity
ratings of (im)plausible events
- Adapters for Plausibility Prediction
Terminology Datasets
- Laypeople Study on Term Annotation
- SURel - Annotation of Synchronic Meaning Shifts
- Domain-Specific Cooking Compound Termhood Annotation
- Domain-Specific Compound Difficulty Ratings
Information and Networking Resources for Computational Linguistics (mainly in German)
- Studienbibliographie Computerlinguistik und Sprachtechnologie:
Referenz-Sammlung zu Grundlagen, Methoden/Ressourcen und Anwendungen
- Portal Computerlinguistik:
Informationen zur Computerlinguistik im deutschsprachigen Raum und zur deutschen Sprache
Software
- German Noun Chunker
- Predicate Argument Clustering (PAC)
- SubCat-Extractor: induction of verb subcategorisation from MATE dependency parses
- DURel: annotation tool for annotating sentence pairs and clustering and visualising sense relatedness