Multilingual Spontaneous Speech Corpora: Compilation And Annotation

Speaker:
Antonio Moreno Sandoval
Abstract:
In this talk I will discuss issues on compilation and annotation of spoken corpora. The languages covered will be Spanish, Chinese and Japanese, and also Spanish child language. Although we use an unified methodology based on the C-ORAL-ROM project, specific strategies have been adopted for transcribing and tagging each corpus. This approach allows cross-lingual comparison while showing the distinctive features. I will focus on problems and how we handle them.
Length:
01:56:08
Date:
21/11/2011
views: 1910

Images:
Preview of img-007.jpg
Image img-007.jpg
Preview of img-011.jpg
Image img-011.jpg
Preview of img-041.jpg
Image img-041.jpg
Attachments: (video, slides, etc.)
53,1M
1550 downloads
152M
1911 downloads
574M
1219 downloads
2,3M
1010 downloads