Prosodic features and formant modeling for an ivector-based language recognition system

Martinez, D.; Ortega, A.; Miguel, A.; Lleida, E.

doi:10.1109/ICASSP.2013.6638988

Prosodic features and formant modeling for an ivector-based language recognition system

Martinez, D. (Universidad de Zaragoza) ; Lleida, E. (Universidad de Zaragoza) ; Ortega, A. (Universidad de Zaragoza) ; Miguel, A. (Universidad de Zaragoza)

Resumen: The prosody of a language is encoded in syllable length, loudness and pitch. These attributes make humans perceive rhythm, stress and intonation in speech. Depending on the language, these speech properties vary, making language classification possible. On the other hand, formants are the resonance frequencies of the vocal tract, depend heavily on the position adopted by the articulatory organs, and are especially useful to disambiguate vowel sounds. In this paper prosodic and formant information are combined to build a generative language identification system based on Gaussian models fed with iVectors. The system is evaluated on the NIST LRE09 database and the inclusion of formant information gives about 50% relative improvement for the 30 s task over a prosodic system without it. The fusion with a state-of-the-art acoustic system based on shifted delta cepstral coefficients (SDC) shows the complementarity of both approaches.
Idioma: Inglés
DOI: 10.1109/ICASSP.2013.6638988
Año: 2013
Publicado en: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 2013 (2013), 6847-6851
ISSN: 1520-6149
Financiación: info:eu-repo/grantAgreement/ES/MINECO/TIN2011-28169-C05-02
Tipo y forma: Artículo (Versión definitiva)
Área (Departamento): Área Teoría Señal y Comunicac. (Dpto. Ingeniería Electrón.Com.)

Derechos reservados por el editor de la revista

Exportado de SIDERAL (2025-10-17-14:09:58)

Enlace permanente:

Visitas y descargas

Este artículo se encuentra en las siguientes colecciones:
Artículos > Artículos por área > Teoría de la Señal y Comunicaciones

Volver a la búsqueda

Registro creado el 2025-08-29, última modificación el 2025-10-17

Versión publicada:
PDF

Valore este documento:

(Sin ninguna reseña)

Añadir a una carpeta personal
Exportar como BibTeX, MARC, MARCXML, DC, EndNote, NLM, RefWorks

Repositorio Institucional de Documentos

Prosodic features and formant modeling for an ivector-based language recognition system