Please use this identifier to cite or link to this item: http://inaoe.repositorioinstitucional.mx/jspui/handle/1009/1203
The Corpus DIMEx100: transcription and evaluation
Luis A. Pineda
Hayde Castellanos Vargas
Janet Juarez Escobar
Joaquim Llisterri
Luis Villaseñor Pineda
Open Access
Attribution-NonCommercial-NoDerivatives
Phonetic corpus
Phonetic transcription
Transcription granularity
Mexican Spanish
Acoustic models
This paper presents the transcription and evaluation of the DIMEx100 corpus for Mexican Spanish. We first describe the corpus and explain the linguistic and computational motivation for its design and collection process. We then present the phonetic antecedents and the alphabet adopted for the transcription task. The corpus has been transcribed at three granularity levels, each specified in detail, and corpus statistics are reported for each level. A set of phonetic rules describing phonetic contexts observed empirically in spontaneous conversation is also validated against the transcription. The corpus has been used to build acoustic models and a phonetic dictionary for a speech recognition system. Initial performance results suggest that the data can be used to train good-quality acoustic models.
Springer Science+Business Media
2009
Article
English
Students
Researchers
General public
Pineda, L.A., et al. (2009). The Corpus DIMEx100: transcription and evaluation. Lang Resources & Evaluation, (44): 347–370
COMPUTER SCIENCE
Accepted version
acceptedVersion - Accepted version
Appears in collections: Computer Science Articles

Files:
2009-VillasenorPinedaLuis-The Corpus DIMEx100.pdf (572.07 kB, Adobe PDF)