Resources


Annotation and automatic extraction of definitions:
  • Annotation Guidelines (in German)
  • [coming soon] Corpus of sentences extracted from the German Wacky corpus (deWaC version 2008) using:
    • accurate patterns
    • moderately accurate patterns
    • inaccurate patterns
Lexical Chaining and Thematic Indexing: HyTex tools and resources

Named Entities: list of proper names (celebrities) extracted form Wikipedia (German version as of March 2007) 

This and that: list of German stop words


last modified by IMC 2010-06-23