In NLP we rely on manually annotated data, e.g.
treebanks. Such data is hard to come by, explaining recent
interests in semi-supervised NLP. However, our labeled data is
also (almost always) extremely biased. This talk presents bias
correction techniques and discusses their applicability in NLP.