Skip to Main content Skip to Navigation
Conference papers

Downbeat tracking with multiple features and deep neural networks

Simon Durand 1, 2 Juan P. Bello 1, 2 Bertrand David 1, 2 Gael Richard 1, 2 
1 S2A - Signal, Statistique et Apprentissage
LTCI - Laboratoire Traitement et Communication de l'Information
Abstract : In this paper, we introduce a novel method for the automatic estimation of downbeat positions from music signals. Our system relies on the computation of musically inspired features capturing important aspects of music such as timbre, harmony, rhythmic patterns, or local similarities in both timbre and harmony. It then uses several independent deep neural networks to learn higher-level representations. The downbeat sequences are finally obtained thanks to a temporal decoding step based on the Viterbi algorithm. The comparative evaluation conducted on varied datasets demonstrates the efficiency and robustness across different music styles of our approach.
Complete list of metadata
Contributor : TelecomParis HAL Connect in order to contact the contributor
Submitted on : Friday, September 13, 2019 - 4:27:45 PM
Last modification on : Tuesday, October 19, 2021 - 11:16:12 AM


  • HAL Id : hal-02287003, version 1


Simon Durand, Juan P. Bello, Bertrand David, Gael Richard. Downbeat tracking with multiple features and deep neural networks. ICASSP 2015, Apr 2015, Brisbane, Australia. ⟨hal-02287003⟩



Record views