Exploiting KonText for querying corpora from the Lindat repository

Speaker:
Natalia Kluyeva
Abstract:
In this presentation, I will describe how KonText – a corpus query interface for Czech National Corpus is adopted for handling various corpora from Lindat. The repository contains corpora with different types of annotation – like syntactic, shallow semantic, sentiment and other types. I will show how to search for this information within the KonText environment using CQL (Corpus Query Language) on the example of two corpora. First, I will focus on the Universal Dependencies – syntactically annotated treebanks for several languages. Secondly, I will demonstrate the queries over the Prague Dependency Treebank that are related both to syntactic (analytical) and deep syntactic (tectogrammatical) layers. I will also show several query examples from other Lindat corpora.
Length:
01:18:45
Date:
22/02/2016
views: 1223

Images:
Preview of 1.jpg
Image 1.jpg
Preview of 2.jpg
Image 2.jpg
Preview of 3.jpg
Image 3.jpg
Attachments: (video, slides, etc.)
109 MB
846 downloads
916 MB
879 downloads
471 MB
1224 downloads
221 MB
860 downloads
132 MB
906 downloads