User-guided one-shot deep model adaptation for music source separation

Music source separation is the task of isolating the individual instruments that are mixed in a musical piece. The task is particularly challenging, and even state-of-the-art models struggle to generalize to unseen test data. Nevertheless, prior knowledge about the individual sources can be used to adapt a generic source separation model to the observed signal. In this work, we propose to exploit a temporal segmentation provided by the user, which indicates when each instrument is active, to fine-tune a pre-trained deep source separation model and adapt it to one specific mixture. This paradigm can be referred to as user-guided one-shot deep model adaptation for music source separation, since the adaptation acts on the target song instance only. Our results are promising and show that state-of-the-art source separation models leave considerable room for improvement, especially for instruments that are underrepresented in the training data.
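
The abstract does not say how the user's segmentation enters the fine-tuning objective. As a minimal sketch, assuming one plausible choice, the annotated segments can be converted into per-frame activity masks, and the energy a source estimate has in frames marked inactive can be penalized. The function names, the per-frame energy representation, and the loss itself are hypothetical illustrations, not the authors' method.

```python
def segments_to_mask(segments, n_frames, frame_rate):
    """Turn user-annotated (start_s, end_s) activity segments into a 0/1 frame mask."""
    mask = [0] * n_frames
    for start_s, end_s in segments:
        first = max(0, int(start_s * frame_rate))
        last = min(n_frames, int(end_s * frame_rate))
        for t in range(first, last):
            mask[t] = 1
    return mask


def silence_loss(estimates, masks):
    """Mean energy of the estimated sources over frames annotated as inactive.

    estimates: dict of source name -> per-frame energies (hypothetical model output)
    masks:     dict of source name -> 0/1 activity mask of the same length
    """
    total, count = 0.0, 0
    for name, energies in estimates.items():
        for energy, active in zip(energies, masks[name]):
            if not active:
                total += energy
                count += 1
    return total / count if count else 0.0


# Toy example (made-up numbers): 10 frames at 1 frame per second, two instruments.
annotations = {"vocals": [(2.0, 5.0)], "bass": [(0.0, 10.0)]}
masks = {
    name: segments_to_mask(segs, n_frames=10, frame_rate=1.0)
    for name, segs in annotations.items()
}
estimates = {
    "vocals": [0.5, 0.5, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    "bass": [1.0] * 10,
}
loss = silence_loss(estimates, masks)
```

In an actual adaptation loop, a differentiable version of such a term would be minimized with respect to the pre-trained model's parameters on the single target mixture; here it only illustrates how a temporal annotation can become a training signal.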

Keywords

Music Source Separation; One-shot Domain Adaptation; User-guided Source Separation

Domains

Signal and Image Processing [eess.SP]; Information Retrieval [cs.IR]

Main file

WASPAA2021_Hal.pdf (3.17 MB)

Origin: Files produced by the author(s)

Contributor: Giorgia Cantisani

https://telecom-paris.hal.science/hal-03219350

Submitted on: Thursday, May 6, 2021, 12:51:22

Last modified on: Monday, October 9, 2023, 12:49:43

Long-term archiving on: Saturday, August 7, 2021, 18:50:36

Dates and versions

hal-03219350, version 1 (06-05-2021)

hal-03219350, version 2 (02-06-2021)

hal-03219350, version 3 (29-07-2021)

Identifiers

HAL Id: hal-03219350, version 1

Cite

Giorgia Cantisani, Alexey Ozerov, Slim Essid, Gael Richard. User-guided one-shot deep model adaptation for music source separation. 2021. ⟨hal-03219350v1⟩

