Annotation and automatic extraction of definitions:
- Annotation Guidelines (in German)
- Corpus of sentences (potential defintions) extracted from the German Wacky corpus (deWaC version 2008) and lexical-syntactic extraction patterns:
Lexical Chaining and Thematic Indexing: HyTex tools and resources
Named Entities: list of proper names (celebrities) extracted form Wikipedia (German version as of March 2007)
This and that: list of German stop words