Evento
A generalization of PLDA for joint modeling of speaker identity and multiple nuisance conditions
Tipo del evento:
Conferencia
Nombre del evento:
19th Annual Conference of the International Speech Communication Association: Speech research for emerging markets in multilingual societies
Fecha del evento:
02/09/2018
Institución Organizadora:
International Speech Communication Association;
Título del Libro:
Proc. Interspeech 2018
Título de la revista:
Proc. Interspeech 2018
Editorial:
International Speech Communication Association
Idioma:
Inglés
Clasificación temática:
Resumen
Probabilistic linear discriminant analysis (PLDA) is the leading method for computing scores in speaker recognition systems. The method models the vectors representing each audio sample as a sum of three terms: one that depends on the speaker identity, one that models the within-speaker variability, and one that models any remaining variability. The last two terms are assumed to be independent across samples. We recently proposed anextension of the PLDAmethod, whichwetermedJoint PLDA (JPLDA), where the second term is considered dependent on the type of nuisance condition present in the data (e.g., the language or channel). The proposed method led to significant gains for multilanguage speaker recognition when taking language as the nuisance condition. In this paper, we present a generalization of this approach that allows for multiple nuisance terms. We show results using language and several nuisance conditions describing the acoustic characteristics of the sample and demonstrate that jointly including all these factors in the model leads to better results than including only language or acoustic condition factors. Overall, we obtain relative improvements in detection cost function between 5% and 47% for various systems and test conditions with respect to standard PLDA approaches.
Palabras clave:
SPEAKER RECOGNITION
,
PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS
Archivos asociados
Licencia
Identificadores
Colecciones
Eventos(OCA CIUDAD UNIVERSITARIA)
Eventos de OFICINA DE COORDINACION ADMINISTRATIVA CIUDAD UNIVERSITARIA
Eventos de OFICINA DE COORDINACION ADMINISTRATIVA CIUDAD UNIVERSITARIA
Citación
A generalization of PLDA for joint modeling of speaker identity and multiple nuisance conditions; 19th Annual Conference of the International Speech Communication Association: Speech research for emerging markets in multilingual societies; Hyderabad; India; 2018; 82-86
Compartir
Altmétricas