deu German compound splitting dataset
eng The compounds that were used in Ma et al (2016) paper entitled "Letter Sequence Labeling for Compound Splitting". It contains both two-constituent and multi-constituent compounds. As standard evaluation also involves non-compounds, the data also include non-compounds that we used. The data are organized into the exact same training/test/development split as in the paper.
2017-03-14
1
578545a9-0f5c-468c-b68c-1e51f89e252e
8cefa5dd-f5fb-4527-8acb-88cc6824eb48