TüBa-D/DP is a machine-annotated dependency treebank of German. Its goal is to offer high-quality syntactic annotations for a large amount of contemporary German text. To keep the annotations familiar, TüBa-D/DP follows the TüBa-D/Z annotation guidelines (Telljohann et al., 2006) as closely as possible. TüBa-D/DP currently consists of the following subcorpora:
|taz (1986-2009)||Newspaper||29.9M||393.7M||Contact us|
|Common Crawl (2019)||Webpages||1.4B||27.3B||Contact us|
Each subcorpus has the following annotation layers:
A description of the annotation format can be found in the stylebook.
For any questions, please contact us or create an issue on GitHub.