Language Model in Bulgarian Treebank (BulTreeBank)

Speaker:
Petya Osenova
Abstract:
The talk will focus on the representation of linguistic knowledge in BulTreeBank. First, the syntactic corpus will be introduced, and then its underlying annotation principles will be discussed. The types of linguistic annotation will be covered in turn: morphological information, constituency and dependency relations, word order, and discontinuity. The boundary between lexical and phrasal elements will also be considered. Special attention will be paid to 'hard nut' phenomena such as coordination, ellipsis, pragmatic expressions, focalizers, coreference, and unexpressed elements. The conclusion will reflect on the integration of, and communication among, all levels and types of linguistic interpretation.
Length:
01:10:44
Date:
26/03/2007
