We show the inputs that led to the maximum activation of the networks. The audio is also provided, with a super-imposed clic at the estimated downbeat position. It should match the 21st or the 41st time frame depending on the input size. We display 5 inputs per network for two differents styles. The first style is pop/rock music, and the second one is classical music.
1. Harmonic Network
Audio signal | DNN input | |
---|---|---|
SH1: | ![]() |
|
SH2: | ![]() |
|
SH3: | ![]() |
|
SH4: | ![]() |
|
SH5: | ![]() |
|
SH6: | ![]() |
|
SH7: | ![]() |
|
SH8: | ![]() |
|
SH9: | ![]() |
|
SH10: | ![]() |
2. Rhythmic Network
Audio signal | DNN input | |
---|---|---|
SR1: | ![]() |
|
SR2: | ![]() |
|
SR3: | ![]() |
|
SR4: | ![]() |
|
SR5: | ![]() |
|
SR6: | ![]() |
|
SR7: | ![]() |
|
SR8: | ![]() |
|
SR9: | ![]() |
|
SR10: | ![]() |
3. Melodic Network
Audio signal | DNN input | |
---|---|---|
SM1: | ![]() |
|
SM2: | ![]() |
|
SM3: | ![]() |
|
SM4: | ![]() |
|
SM5: | ![]() |
|
SM6: | ![]() |
|
SM7: | ![]() |
|
SM8: | ![]() |
|
SM9: | ![]() |
|
SM10: | ![]() |
4. Bass Network
Audio signal | DNN input | |
---|---|---|
SB1: | ![]() |
|
SB2: | ![]() |
|
SB3: | ![]() |
|
SB4: | ![]() |
|
SB5: | ![]() |
|
SB6: | ![]() |
|
SB7: | ![]() |
|
SB8: | ![]() |
|
SB9: | ![]() |
|
SB10: | ![]() |