E. Vincent, T. Virtanen, and S. Gannot, Audio Source Separation and Speech Enhancement, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01881431

E. Hänsler and G. Schmidt, Acoustic Echo and Noise Control: a Practical Approach, 2004.

J. S. Erkelens and R. Heusdens, Correlation-based and model-based blind single-channel late-reverberation suppression in noisy time-varying acoustical environments, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1746-1765, 2010.

I. Kodrasi and S. Doclo, Joint dereverberation and noise reduction based on acoustic multi-channel equalization, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.4, pp.680-693, 2016.

O. Schwartz, S. Gannot, and E. A. Habets, Multi-microphone speech dereverberation and noise reduction using relative early transfer functions, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.2, pp.240-251, 2015.

T. Dietzen, S. Doclo, M. Moonen, and T. Van-waterschoot, Joint multimicrophone speech dereverberation and noise reduction using integrated sidelobe cancellation and linear prediction, Proc. IWAENC, pp.221-225, 2018.

T. Yoshioka, T. Nakatani, M. Miyoshi, and H. G. Okuno, Blind separation and dereverberation of speech mixtures by joint optimization, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.1, pp.69-84, 2011.

H. Kagami, H. Kameoka, and M. Yukawa, Joint separation and dereverberation of reverberant mixtures with determined multichannel nonnegative matrix factorization, Proc. ICASSP, pp.31-35, 2018.

R. Le-bouquin-jeannès, P. Scalart, G. Faucon, and C. Beaugeant, Combined noise and echo reduction in hands-free systems: a survey, IEEE Transactions on Speech and Audio Processing, vol.9, issue.8, pp.808-820, 2001.

S. Gustafsson, R. Martin, P. Jax, and P. Vary, A psychoacoustic approach to combined acoustic echo cancellation and noise reduction, IEEE Transactions on Speech and Audio Processing, vol.10, issue.5, pp.245-256, 2002.

W. Herbordt, S. Nakamura, and W. Kellermann, Joint optimization of LCMV beamforming and acoustic echo cancellation for automatic speech recognition, Proc. ICASSP, 2005.

G. Reuven, S. Gannot, and I. Cohen, Joint noise reduction and acoustic echo cancellation using the transfer-function generalized sidelobe canceller, Speech Communication, vol.49, issue.7-8, pp.623-635, 2007.

M. Togami, Y. Kawaguchi, and R. Takashima, Frequency domain acoustic echo reduction based on Kalman smoother with time-varying noise covariance matrix, Proc. ICASSP, pp.5909-5913, 2014.

K. Nathwani, Joint acoustic echo and noise cancellation using spectral domain Kalman filtering in double talk scenario, Proc. IWAENC, pp.326-330, 2018.

R. Takeda, K. Nakadai, T. Takahashi, K. Komatani, T. Ogata et al., ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition, Proc. ICASSP, pp.3677-3680, 2009.

M. Togami and Y. Kawaguchi, Speech enhancement combined with dereverberation and acoustic echo reduction for time varying systems, Proc. SSP, pp.357-360, 2012.

E. A. Habets, S. Gannot, I. Cohen, and P. C. Sommen, Joint dereverberation and residual echo suppression of speech signals in noisy environments, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.8, pp.1433-1451, 2008.

M. Togami and Y. Kawaguchi, Simultaneous optimization of acoustic echo reduction, speech dereverberation, and noise reduction against mutual interference, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.11, pp.1612-1623, 2014.

D. S. Williamson and D. Wang, Speech dereverberation and denoising using complex ratio masks, Proc. ICASSP, pp.5590-5594, 2017.

Y. Zhao, Z. Wang, and D. Wang, A two-stage algorithm for noisy and reverberant speech enhancement, Proc. ICASSP, pp.5580-5584, 2017.

H. Seo, M. Lee, and J. Chang, Integrated acoustic echo and background noise suppression based on stacked deep neural networks, Applied Acoustics, vol.133, pp.194-201, 2018.

H. Zhang and D. Wang, Deep learning for acoustic echo cancellation in noisy and double-talk scenarios, in Interspeech, pp.3239-3243, 2018.

F. Yang, G. Enzner, and J. Yang, Statistical convergence analysis for optimal control of DFT-domain adaptive echo canceler, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.5, pp.1095-1106, 2017.

C. M. Lee, J. W. Shin, and N. S. Kim, DNN-based residual echo suppression, pp.316-320, 2015.

G. Carbajal, R. Serizel, E. Vincent, and E. Humbert, Multiple-input neural network-based residual echo suppression, Proc. ICASSP, pp.231-235, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01723630

G. Enzner and P. Vary, Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones, Signal Processing, vol.86, issue.6, pp.1140-1156, 2006.

M. Togami and K. Hori, Multichannel semi-blind source separation via local Gaussian modeling for acoustic echo reduction, Proc. EUSIPCO, pp.496-500, 2011.

T. Nakatani, T. Yoshioka, K. Kinoshita, M. Miyoshi, and B. H. Juang, Speech dereverberation based on variance-normalized delayed linear prediction, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1717-1731, 2010.

T. Yoshioka and T. Nakatani, Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.10, pp.2707-2720, 2012.

A. Jukic, T. Van-waterschoot, T. Gerkmann, and S. Doclo, Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.9, pp.1509-1520, 2015.

K. Kinoshita, M. Delcroix, H. Kwon, T. Mori, and T. Nakatani, Neural network-based spectrum estimation for online WPE dereverberation, pp.384-388, 2017.

K. Furuya and A. Kataoka, Robust speech dereverberation using multichannel blind deconvolution with spectral subtraction, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1579-1591, 2007.

M. Togami, Y. Kawaguchi, R. Takeda, Y. Obuchi, and N. Nukaga, Optimized speech dereverberation from probabilistic perspective for time varying acoustic transfer function, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, issue.7, pp.1369-1380, 2013.

A. Cohen, G. Stemmer, S. Ingalsuo, and S. Markovich-golan, Combined weighted prediction error and minimum variance distortionless response for dereverberation, Proc. ICASSP, pp.446-450, 2017.

S. Gannot, E. Vincent, S. Markovich-golan, and A. Ozerov, A consolidated perspective on multimicrophone speech enhancement and source separation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue.4, pp.692-730, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01414179

N. Q. Duong, E. Vincent, and R. Gribonval, Under-determined reverberant audio source separation using a full-rank spatial covariance model, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.7, pp.1830-1840, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00435807

A. Ozerov and C. Févotte, Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.3, pp.550-563, 2010.

A. A. Nugraha, A. Liutkus, and E. Vincent, Multichannel audio source separation with deep neural networks, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, issue.9, pp.1652-1664, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01163369

S. Leglaive, L. Girin, and R. Horaud, Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization, Proc. ICASSP, pp.101-105, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02005102

G. Carbajal, R. Serizel, E. Vincent, and E. Humbert, Joint DNN-based multichannel reduction of echo, reverberation and noise: Supporting document, Inria, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02372431

A. A. Nugraha, A. Liutkus, and E. Vincent, Multichannel music separation with deep neural networks, Proc. EUSIPCO, pp.1748-1752, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01334614

A. Liutkus, D. Fitzgerald, and Z. Rafii, Scalable audio separation with light kernel additive modelling, Proc. ICASSP, pp.76-80, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01114890

V. Panayotov, G. Chen, D. Povey, and S. Khudanpur, Librispeech: an ASR corpus based on public domain audio books, Proc. ICASSP, pp.5206-5210, 2015.

E. Vincent and D. R. Campbell, Roomsimove, 2008.

P. C. Loizou, Speech Enhancement: Theory and Practice, 2007.

J. L. Roux, S. Wisdom, H. Erdogan, and J. R. Hershey, SDR -halfbaked or well done, Proc. ICASSP, pp.626-630, 2019.

J. M. Valin, On adjusting the learning rate in frequency domain echo cancellation with double-talk, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, issue.3, pp.1030-1034, 2007.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, Proc. ICLR, 2015.