Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification
Conference paper, Year: 2016

Abstract

This paper presents a feature learning approach for speaker identification based on nonnegative matrix factorisation. Recent studies have shown that, with such models, the dictionary atoms can represent speaker identity well. The approaches proposed so far have focused only on speaker variability and not on session variability. However, the latter is a crucial factor in the success of the I-vector approach, which is now the state of the art in speaker identification.

This paper proposes a method that relies on group nonnegative matrix factorisation and is inspired by the I-vector training procedure. In doing so, the proposed approach aims to capture both speaker variability and session variability. Results on a small corpus indicate that the proposed approach can be competitive with I-vectors.
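
For readers unfamiliar with the underlying model, the following is a minimal sketch of plain nonnegative matrix factorisation using the standard multiplicative updates for the Euclidean cost. It is not the group variant proposed in the paper, whose speaker and session constraints are not described in this abstract. The function name `nmf`, the matrix sizes, and the random input are illustrative assumptions.

```python
import numpy as np


def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Factorise a nonnegative matrix V (features x frames) as W @ H.

    W holds the dictionary atoms (one per column) and H the activations.
    Uses multiplicative updates for the squared Euclidean cost; this is
    generic NMF, not the group NMF of the paper.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    # Strictly positive random initialisation keeps the updates well defined.
    W = rng.random((n_features, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    for _ in range(n_iter):
        # Each update multiplies by a nonnegative ratio, so W and H stay nonnegative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Hypothetical input: a random nonnegative matrix standing in for a spectrogram.
rng = np.random.default_rng(1)
V = rng.random((20, 50))
W, H = nmf(V, rank=5)
rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
```

In a speaker identification setting, the columns of `V` would typically be nonnegative spectral frames, and the learned atoms or activations would serve as features for a classifier. A group formulation would additionally tie atoms or activations across signals that share a speaker or a session.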

No file deposited

Dates and versions

hal-02288453, version 1 (14-09-2019)

Identifiers

  • HAL Id: hal-02288453, version 1

Cite

Romain Serizel, Slim Essid, Gael Richard. Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification. ICASSP, Mar 2016, Shanghai, China. pp. 5470-5474. ⟨hal-02288453⟩