Audio segmentation-by-classification approach based on factor analysis in broadcast news domain

Castán, D.; Ortega, A.; Miguel, A.; Lleida, E.
doi:10.1186/s13636-014-0034-5
000063147 001__ 63147
000063147 005__ 20171109140606.0
000063147 0247_ $$2doi$$a10.1186/s13636-014-0034-5
000063147 0248_ $$2sideral$$a89130
000063147 037__ $$aART-2014-89130
000063147 041__ $$aeng
000063147 100__ $$aCastán, D.
000063147 245__ $$aAudio segmentation-by-classification approach based on factor analysis in broadcast news domain
000063147 260__ $$c2014
000063147 5060_ $$aAccess copy available to the general public$$fUnrestricted
000063147 5203_ $$aThis paper studies a novel audio segmentation-by-classification approach based on factor analysis. The proposed technique compensates the within-class variability by using class-dependent factor loading matrices and obtains the scores by computing the log-likelihood ratio for the class model to a non-class model over fixed-length windows. Afterwards, these scores are smoothed to yield longer contiguous segments of the same class by means of different back-end systems. Unlike previous solutions, our proposal does not make use of specific acoustic features and does not need a hierarchical structure. The proposed method is applied to segment and classify audios coming from TV shows into five different acoustic classes: speech, music, speech with music, speech with noise, and others. The technique is compared to a hierarchical system with specific acoustic features achieving a significant error reduction.
000063147 536__ $$9info:eu-repo/grantAgreement/ES/MINECO/TIN2011-28169-C05-02
000063147 540__ $$9info:eu-repo/semantics/openAccess$$aby$$uhttp://creativecommons.org/licenses/by/3.0/es/
000063147 590__ $$a0.386$$b2014
000063147 591__ $$aENGINEERING, ELECTRICAL & ELECTRONIC$$b213 / 247 = 0.862$$c2014$$dQ4$$eT3
000063147 591__ $$aACOUSTICS$$b28 / 30 = 0.933$$c2014$$dQ4$$eT3
000063147 655_4 $$ainfo:eu-repo/semantics/article$$vinfo:eu-repo/semantics/publishedVersion
000063147 700__ $$0(orcid)0000-0002-3886-7748$$aOrtega, A.$$uUniversidad de Zaragoza
000063147 700__ $$0(orcid)0000-0001-5803-4316$$aMiguel, A.$$uUniversidad de Zaragoza
000063147 700__ $$0(orcid)0000-0001-9137-4013$$aLleida, E.$$uUniversidad de Zaragoza
000063147 7102_ $$15008$$2800$$aUniversidad de Zaragoza$$bDepartamento de Ingeniería Electrónica y Comunicaciones$$cTeoría de la Señal y Comunicaciones
000063147 773__ $$g2014, 1 (2014), 1-13$$pEURASIP j. audio, speech music. process.$$tEURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING$$x1687-4714
000063147 8564_ $$s1289762$$uhttps://zaguan.unizar.es/record/63147/files/texto_completo.pdf$$yVersión publicada
000063147 8564_ $$s107219$$uhttps://zaguan.unizar.es/record/63147/files/texto_completo.jpg?subformat=icon$$xicon$$yVersión publicada
000063147 909CO $$ooai:zaguan.unizar.es:63147$$particulos$$pdriver
000063147 951__ $$a2017-11-09-11:57:42
000063147 980__ $$aARTICLE
Atlantis Institut des Sciences Fictives