Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion
Resumen: Spoken term detection (STD) aims at retrieving data from a speech repository given a textual representation of the search term. Nowadays, it is receiving much interest due to the large volume of multimedia information. STD differs from automatic speech recognition (ASR) in that ASR is interested in all the terms/words that appear in the speech data, whereas STD focuses on a selected list of search terms that must be detected within the speech data. This paper presents the systems submitted to the STD ALBAYZIN 2014 evaluation, held as a part of the ALBAYZIN 2014 evaluation campaign within the context of the IberSPEECH 2014 conference. This is the first STD evaluation that deals with Spanish language. The evaluation consists of retrieving the speech files that contain the search terms, indicating their start and end times within the appropriate speech file, along with a score value that reflects the confidence given to the detection of the search term. The evaluation is conducted on a Spanish spontaneous speech database, which comprises a set of talks from workshops and amounts to about 7 h of speech. We present the database, the evaluation metrics, the systems submitted to the evaluation, the results, and a detailed discussion. Four different research groups took part in the evaluation. Evaluation results show reasonable performance for moderate out-of-vocabulary term rate. This paper compares the systems submitted to the evaluation and makes a deep analysis based on some search term properties (term length, in-vocabulary/out-of-vocabulary terms, single-word/multi-word terms, and in-language/foreign terms).
Idioma: Inglés
DOI: 10.1186/s13636-015-0063-8
Año: 2015
Publicado en: EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING 2015, 21 (2015), [27 pp]
ISSN: 1687-4714

Factor impacto JCR: 0.797 (2015)
Categ. JCR: ENGINEERING, ELECTRICAL & ELECTRONIC rank: 175 / 257 = 0.681 (2015) - Q3 - T3
Categ. JCR: ACOUSTICS rank: 21 / 30 = 0.7 (2015) - Q3 - T3

Factor impacto SCIMAGO: 0.326 - Electrical and Electronic Engineering (Q2) - Acoustics and Ultrasonics (Q3)

Financiación: info:eu-repo/grantAgreement/ES/MINECO/TEC2012-37585-C02-01
Tipo y forma: Artículo (Versión definitiva)
Área (Departamento): Área Teoría Señal y Comunicac. (Dpto. Ingeniería Electrón.Com.)

Creative Commons Debe reconocer adecuadamente la autoría, proporcionar un enlace a la licencia e indicar si se han realizado cambios. Puede hacerlo de cualquier manera razonable, pero no de una manera que sugiera que tiene el apoyo del licenciador o lo recibe por el uso que hace.


Exportado de SIDERAL (2021-01-21-11:18:51)


Visitas y descargas

Este artículo se encuentra en las siguientes colecciones:
Artículos



 Registro creado el 2017-11-09, última modificación el 2021-01-21


Versión publicada:
 PDF
Valore este documento:

Rate this document:
1
2
3
 
(Sin ninguna reseña)