Generative pairwise models for speaker recognition

Cumani, Sandro; Laface, Pietro

This paper proposes a simple model for speaker recognition based on i–vector pairs, and analyzes its similarity and differences with respect to the state–of–the–art Probabilistic Linear Discriminant Analysis (PLDA) and Pairwise Support Vector Machine (PSVM) models. Similar to the discriminative PSVM approach, we propose a generative model of i–vector pairs, rather than an usual i–vector based model. The model is based on two Gaussian distributions, one for the “same speakers” and the other for the “different speakers” i–vector pairs, and on the assumption that the i–vector pairs are independent. This independence assumption allows the distributions of the two classes to be independently estimated. The “Two–Gaussian” approach can be extended to the Heavy–Tailed distributions, still allowing a fast closed form solution to be obtained for testing i–vector pairs. We show that this model is closely related to PLDA and to PSVM models, and that tested on the female part of the tel–tel NIST SRE 2010 extended evaluation set, it is able to achieve comparable accuracy with respect to the other models, trained with different objective functions and training procedures.

Generative pairwise models for speaker recognition / Cumani, Sandro; Laface, Pietro. - ELETTRONICO. - 1:(2014), pp. 273-279. (Intervento presentato al convegno Odyssey 2014: The Speaker and Language Recognition Workshop tenutosi a Joensuu, Finland nel 16-19 June 2014).

Generative pairwise models for speaker recognition

CUMANI, SANDRO;LAFACE, Pietro

2014

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Anno del prodotto

2014

Appare nelle tipologie

4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
odyssey2014-v5.pdf accesso aperto Tipologia: 1. Preprint / submitted version [pre- review] Licenza: PUBBLICO - Tutti i diritti riservati Dimensione 1.31 MB Formato Adobe PDF Visualizza/Apri	1.31 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2551354

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

PORTO @ Archivio Istituzionale della Ricerca

Generative pairwise models for speaker recognition

CUMANI, SANDRO;LAFACE, Pietro

2014

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Attenzione

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)