Error correction in lightly supervised alignment of broadcast subtitles
Funding: FP7
Abstract: This paper presents a range of error correction techniques aimed at improving the accuracy of a lightly supervised alignment task for broadcast subtitles. Lightly supervised approaches are frequently used in the multimedia domain, either for subtitling purposes or for providing a more reliable source for training speech-based systems. The proposed methods focus on directly correcting the alignment output, using different techniques to infer word insertions and words with inaccurate time boundaries. The features used by the classification models are outputs of the alignment system, such as confidence measures and word or segment duration. Experiments in this paper are based on broadcast material provided by the BBC to the Multi-Genre Broadcast (MGB) challenge participants. Results show that the order alignment F measure improves by up to 2.6% absolute (15.8% relative) when combining insertion and word boundary correction.
Language: English
DOI: 10.21437/Interspeech.2016-56
Year: 2016
Published in: Interspeech (USB) (2016), 2110-2114
ISSN: 2308-457X

Originally available at: Full text from the journal

Funding: info:eu-repo/grantAgreement/EC/FP7/610986/EU/IRIS: Towards Natural Interaction and Communication/IRIS
Funding: info:eu-repo/grantAgreement/ES/MINECO/BES-2012-056894
Funding: info:eu-repo/grantAgreement/ES/MINECO/TIN2014-54288-C4-2-R
Type and form: Article (PostPrint)
Area (Department): Signal Theory and Communications Area (Dept. of Electronic Engineering and Communications)

Rights: All rights reserved by journal editor


Exported from SIDERAL (2019-05-29-11:43:30)



This article is in the following collections:
Articles



 Record created 2018-06-27, last modified 2019-05-29


Postprint:
 PDF