#6 Data acquisition - Big brother

Speaker:
Ondřej Bojar
Abstract:
The sixth talk on machine translation is devoted to data, the fuel of statistical approaches to all NLP tasks. We discuss various possible data sources, effects of the text domain, the greed for more data and the diminishing utility of new data additions.
Length:
00:11:16
Date:
24/02/2015
views: 1045

Images:
Attachments: (video, slides, etc.)
16 MB
683 downloads
106 MB
1046 downloads
62 MB
739 downloads
46 MB
801 downloads