Skip to Main content Skip to Navigation
Conference papers

Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation

Abstract : This paper describes a blind source separation method for multichannel audio signals, called NF-FastMNMF, based on the integration of the normalizing flow (NF) into the multichannel nonnegative matrix factorization with jointly-diagonalizable spatial covariance matrices, a.k.a. FastMNMF. Whereas the NF of flow-based independent vector analysis, called NF-IVA, acts as the demixing matrices to transform an M-channel mixture into M independent sources, the NF of NF-FastMNMF acts as the diagonalization matrices to transform an Mchannel mixture into a spatially-independent M-channel mixture represented as a weighted sum of N source images. This diagonalization enables the NF, which has been used only for determined separation because of its bijective nature, to be applicable to non-determined separation. NF-FastMNMF has time-varying diagonalization matrices that are potentially better at handling dynamical data variation than the time-invariant ones in FastMNMF. To have an NF with richer expression capability, the dimension-wise scalings using diagonal matrices originally used in NF-IVA are replaced with linear transformations using upper triangular matrices; in both cases, the diagonal and upper triangular matrices are estimated by neural networks. The evaluation shows that NF-FastMNMF performs well for both determined and non-determined separations of multiple speech utterances by stationary or non-stationary speakers from a noisy reverberant mixture.
Complete list of metadata
Contributor : Mathieu Fontaine Connect in order to contact the contributor
Submitted on : Monday, April 11, 2022 - 4:14:36 PM
Last modification on : Monday, May 2, 2022 - 8:54:09 AM
Long-term archiving on: : Tuesday, July 12, 2022 - 6:44:59 PM


Files produced by the author(s)


  • HAL Id : hal-03637425, version 1



Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii. Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), May 2022, Singapore, Singapore. ⟨hal-03637425⟩



Record views


Files downloads