Annotation and automatic extraction of definitions:

  • Annotation Guidelines (in German)
  • Corpus of sentences (potential definitions) extracted from the German Wacky corpus (deWaC version 2008) and lexical-syntactic extraction patterns: 3 ZIP archives and a Read Me file (.xls).

Lexical Chaining and Thematic Indexing: HyTex tools and resources

 


Named Entities: list of proper names (celebrities) extracted from Wikipedia (German version as of March 2007)

 

AnnotatingDefinitions_byIMC_2008.zip
Archive file in ZIP format [299.2 KB]
wackyExtractLexSynPatterns_Stand2011-02-[...]
Archive file in ZIP format [4.7 MB]
wackyExtractLexSynPatterns_Stand2011-02-[...]
Archive file in ZIP format [4.9 MB]
wackyExtractLexSynPatterns_Stand2011-02-[...]
Archive file in ZIP format [9.6 MB]
ListOfProperNames_ExtractedFromWikipedia[...]
Archive file in ZIP format [675.0 KB]
last modified by IMC 2017-02