The task is translation of IT texts from English into German, in line with the WMT'16 IT translation task: http://www.statmt.org/wmt16/it-translation-task.html
The data can alternatively be downloaded from OPUS; choose the En-De language pair. OPUS also contains additional datasets that could be useful.
There is no restriction on training data for the task: you can use the data provided for the WMT'16 IT translation task or any other datasets. The test data consists of novel texts in the IT domain.
We will evaluate submissions using the BLEU score.
- BLEU: the BLEU score measures how many words and n-grams (sequences of n consecutive words) a translation shares with a reference translation. The most commonly used version is BLEU-4, which considers unigrams, bigrams, trigrams, and 4-grams. It also applies a brevity penalty to translations that are too short.
For local validation you can use the reference implementation of BLEU from MOSES or its Python version from Google.
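To make the BLEU-4 description concrete, here is a minimal sketch of a corpus-level score in Python. It is an illustration only, not the official scorer: it assumes one reference per translation, whitespace tokenization, and no smoothing, and the function names `ngrams` and `corpus_bleu` are made up for this example. Scores from the MOSES script may differ because of tokenization and other details.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Simplified corpus-level BLEU (assumption: one reference per hypothesis).

    Both arguments are lists of strings; tokens are split on whitespace.
    Returns a score between 0 and 100.
    """
    matches = [0] * max_n  # clipped n-gram matches, per n-gram order
    totals = [0] * max_n   # n-grams found in the hypotheses, per order
    hyp_len = ref_len = 0

    for hyp, ref in zip(hypotheses, references):
        hyp_tok, ref_tok = hyp.split(), ref.split()
        hyp_len += len(hyp_tok)
        ref_len += len(ref_tok)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp_tok, n)
            ref_counts = ngrams(ref_tok, n)
            # Counter intersection clips each hypothesis n-gram count
            # by the number of times it occurs in the reference.
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())

    # Without smoothing, an order with zero matches gives a score of 0.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0

    # Geometric mean of the 1- to max_n-gram precisions.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Brevity penalty: applied only when the hypotheses are not longer
    # than the references.
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return 100 * brevity_penalty * math.exp(log_precision)


if __name__ == "__main__":
    # Hypothetical system output and reference, for illustration only.
    hyps = ["Klicken Sie auf die Schaltfläche Speichern"]
    refs = ["Klicken Sie auf die Schaltfläche Speichern , um fortzufahren"]
    print(round(corpus_bleu(hyps, refs), 2))
```

In the example at the bottom every n-gram of the output appears in the reference, so all four precisions are 1 and the score is lowered only by the brevity penalty. For reported numbers, use one of the reference implementations mentioned above.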