Language Model in Bulgarian Treebank (BulTreeBank)
Speaker:
Petya Osenova
Abstract:
The talk will focus on the representation of linguistic knowledge in BulTreeBank. First, the syntactic corpus will be introduced, and then its underlying annotation principles will be discussed. The types of linguistic annotation come in order: morphological information, constituency and dependency relations, word order and discontinuity. The boundary between lexical and phrasal elements will be also considered. Special attention will be put on the 'hard nut' phenomena, such as coordination, ellipsis, pragmatic expressions, foculizers, coreference and unexpressed elements. The conclusion will reflect the integration and communication among all levels and types of linguistic interpretation.
Length:
01:10:44
Date:
26/03/2007
Attachments: (video, slides, etc.)
56M
1782 downloads
419M
1807 downloads
135M
1731 downloads
852K
3523 downloads