SUPERVISED AND UNSUPERVISED SEQUENCE MODELLING FOR DRUM TRANSCRIPTION

Olivier Gillet, Gael Richard 1, 2
1 S2A - Signal, Statistique et Apprentissage
LTCI - Laboratoire Traitement et Communication de l'Information
Abstract : This paper discusses two post-processing methods for drum transcription systems, both of which aim to model typical properties of drum sequences. Both methods operate on a symbolic representation of the sequence, obtained by quantizing the onsets of drum strokes on an optimal tatum grid and by fusing the posterior probabilities produced by the drum transcription system. The first method is a generalization of the N-gram model; we discuss several training and recognition strategies (style-dependent models, local models) intended to maximize the reliability and specificity of the trained models. The second is a novel unsupervised algorithm based on a complexity criterion, which finds the most regular and well-structured sequence compatible with the acoustic scores produced by the transcription system. Both approaches are evaluated on a subset of the ENST-drums corpus and yield performance improvements.
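To make the symbolic representation concrete, the following is a minimal sketch and not the authors' implementation. It assumes the tatum period and grid offset are already known (the abstract says only that an optimal grid is used), works on hard stroke labels rather than fused posterior probabilities, and uses a plain bigram model with add-one smoothing in place of the generalized N-gram model. All function names and the `bd`/`sd`/`hh` labels are hypothetical.

```python
from collections import Counter
import math


def to_symbol_sequence(events, tatum_period, offset=0.0):
    """Quantize (onset_time, label) pairs onto a tatum grid.

    Returns one symbol per grid slot: '-' for an empty slot, and the
    sorted labels joined with '+' when strokes share a slot.
    Assumes onsets do not fall before the grid offset.
    """
    slots = {}
    for time, label in events:
        index = round((time - offset) / tatum_period)  # nearest grid point
        slots.setdefault(index, set()).add(label)
    if not slots:
        return []
    length = max(slots) + 1
    return ["+".join(sorted(slots[i])) if i in slots else "-"
            for i in range(length)]


def train_bigram(sequences):
    """Count symbol-to-symbol transitions over training sequences."""
    context_counts, bigram_counts, vocab = Counter(), Counter(), set()
    for seq in sequences:
        vocab.update(seq)
        for prev, cur in zip(seq, seq[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return context_counts, bigram_counts, vocab


def log_prob(seq, model):
    """Log-probability of the transitions in seq, with add-one smoothing."""
    context_counts, bigram_counts, vocab = model
    size = len(vocab)
    return sum(
        math.log((bigram_counts[(prev, cur)] + 1) / (context_counts[prev] + size))
        for prev, cur in zip(seq, seq[1:])
    )


# Hypothetical onsets in seconds, with a 0.25 s tatum period.
events = [(0.02, "bd"), (0.26, "hh"), (0.49, "sd"), (0.51, "hh"), (1.01, "bd")]
symbols = to_symbol_sequence(events, tatum_period=0.25)

model = train_bigram([["bd", "hh", "sd", "hh"] * 2])
regular = log_prob(["bd", "hh", "sd", "hh"], model)
irregular = log_prob(["bd", "bd", "sd", "sd"], model)
```

Under these assumptions, a sequence that follows the trained pattern scores higher than one that does not. A post-processing stage could use that kind of score to rerank candidate transcriptions.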
Document type : Conference papers

https://hal.telecom-paris.fr/hal-03153898
Contributor : Gaël Richard
Submitted on : Friday, February 26, 2021 - 6:52:27 PM
Last modification on : Tuesday, September 21, 2021 - 2:16:05 PM

Identifiers

  • HAL Id : hal-03153898, version 1

Citation

Olivier Gillet, Gael Richard. SUPERVISED AND UNSUPERVISED SEQUENCE MODELLING FOR DRUM TRANSCRIPTION. International Conference on Music Information Retrieval (ISMIR), 2007, Vienna, Austria. ⟨hal-03153898⟩
