Distributed speech separation in spatially unconstrained microphone arrays

Nicolas Furnon; Romain Serizel; Irina Illina; Slim Essid

Communication Dans Un Congrès Année : 2021

Distributed speech separation in spatially unconstrained microphone arrays

(1) , (1) , (1) , (2, 3)

1
2
3

Nicolas Furnon

Fonction : Auteur
PersonId : 741857
IdHAL : nicolas-furnon

Speech Modeling for Facilitating Oral-Based Communication

Romain Serizel

Fonction : Auteur
PersonId : 10320
IdHAL : romain-serizel
IdRef : 223797391

Speech Modeling for Facilitating Oral-Based Communication

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Speech Modeling for Facilitating Oral-Based Communication

Slim Essid

Fonction : Auteur
PersonId : 181234
IdHAL : slimessid
ORCID : 0000-0002-0028-327X
IdRef : 11025130X

Télécom ParisTech

Laboratoire Traitement et Communication de l'Information

Résumé

Speech separation with several speakers is a challenging task because of the non-stationarity of the speech and the strong signal similarity between interferent sources. Current state-of-the-art solutions can separate well the different sources using sophisticated deep neural networks which are very tedious to train. When several microphones are available, spatial information can be exploited to design much simpler algorithms to discriminate speakers. We propose a distributed algorithm that can process spatial information in a spatially unconstrained microphone array. The algorithm relies on a convolutional recurrent neural network that can exploit the signal diversity from the distributed nodes. In a typical case of a meeting room, this algorithm can capture an estimate of each source in a first step and propagate it over the microphone array in order to increase the separation performance in a second step. We show that this approach performs even better when the number of sources and nodes increases. We also study the influence of a mismatch in the number of sources between the training and testing conditions.

Mots clés

Speech separation Microphone arrays Distributed processing

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

icassp2021.pdf (1.3 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Nicolas Furnon : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02985794

Soumis le : lundi 2 novembre 2020-14:50:11

Dernière modification le : lundi 9 octobre 2023-12:49:42

Dates et versions

hal-02985794 , version 1 (02-11-2020)

hal-02985794 , version 2 (08-02-2021)

hal-02985794 , version 3 (15-04-2021)

Identifiants

HAL Id : hal-02985794 , version 1
ARXIV : 2011.00982

Citer

Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid. Distributed speech separation in spatially unconstrained microphone arrays. ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto, Canada. ⟨hal-02985794v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

541 Consultations

217 Téléchargements

Distributed speech separation in spatially unconstrained microphone arrays

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Altmetric

Partager