Simon Durand

Estimation de la position des premiers temps dans un signal audio musical (Estimating downbeat positions in a musical audio signal)

We show the activation of each of the four adapted networks presented in our Ph.D. manuscript on a challenging example, as well as the mean activation of all four networks. In every case, the audio is provided with a click superimposed at the downbeat positions estimated from the mean activation of all four networks. We see that the combination of the networks produces a suitable downbeat detection function that leads to an appropriate downbeat sequence, while each network individually is less reliable.
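The combination step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `combine_activations` and `pick_peaks`, the threshold of 0.5, and the activation values are all hypothetical, and the naive peak picking merely stands in for whatever decoding the manuscript actually uses to turn the detection function into a downbeat sequence.

```python
from statistics import fmean


def combine_activations(activations):
    """Average several per-frame activation curves of equal length."""
    lengths = {len(a) for a in activations}
    if len(lengths) != 1:
        raise ValueError("all activation curves must have the same length")
    # zip(*activations) groups the values of all networks frame by frame.
    return [fmean(frame) for frame in zip(*activations)]


def pick_peaks(curve, threshold=0.5):
    """Return indices of local maxima at or above `threshold`.

    Hypothetical, naive stand-in for the decoding step; not the
    method from the manuscript.
    """
    return [
        i
        for i in range(1, len(curve) - 1)
        if curve[i] >= threshold
        and curve[i] > curve[i - 1]
        and curve[i] >= curve[i + 1]
    ]


# Toy activations, invented for illustration only (one value per frame).
harmonic = [0.1, 0.9, 0.2, 0.1, 0.3, 0.1]
rhythmic = [0.2, 0.7, 0.1, 0.6, 0.8, 0.2]
melodic = [0.1, 0.6, 0.3, 0.2, 0.7, 0.1]
bass = [0.0, 0.8, 0.2, 0.1, 0.6, 0.0]

mean_activation = combine_activations([harmonic, rhythmic, melodic, bass])
downbeat_frames = pick_peaks(mean_activation)
print(downbeat_frames)  # prints [1, 4]
```

Averaging is the simplest way to merge the four curves: a frame only scores high in the mean when several networks agree, which is one plausible reason the combined curve is more reliable than any single network.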

1. Harmonic Network

Audio signal	Network output

2. Rhythmic Network

Audio signal	Network output (after reduction)

3. Melodic Network

Audio signal	Network output

4. Bass Network

Audio signal	Network output (after reduction)

5. Network Combination

Audio signal	Network output

Simon Durand

Ph.D. student in Audio Signal Processing and Machine Learning

Télécom ParisTech, CNRS-LTCI