Drum translation for timbral and rhythmic transformation

Tomczak, Maciej and Drysdale, Jake and Hockman, Jason (2019) Drum translation for timbral and rhythmic transformation. In: Digital Audio Effects 2019, 3rd - 6th September 2019, Birmingham City University.

DAFx2019_paper_25.pdf - Published Version
Available under License Creative Commons Attribution.

Download (5MB)


Many recent approaches to creative transformations of musical audio have been motivated by the success of raw audio generation models such as WaveNet, in which audio samples are modeled by generative neural networks. This paper describes a generative audio synthesis model for multi-drum translation based on a WaveNet denosing autoencoder architecture. The timbre of an arbitrary source audio input is transformed to sound as if it were played by various percussive instruments while preserving its rhythmic structure. Two evaluations of the transformations are conducted based on the capacity of the model to preserve the rhythmic patterns of the input and the audio quality as it relates to timbre of the target drum domain. The first evaluation measures the rhythmic similarities between the source audio and the corresponding drum translations, and the second provides a numerical analysis of the quality of the synthesised audio. Additionally, a semi- and fully-automatic audio effect has been proposed, in which the user may assist the system by manually labelling source audio segments or use a state-of-the-art automatic drum transcription system prior to drum translation.

Item Type: Conference or Workshop Item (Paper)
1 June 2019Accepted
6 September 2019Published Online
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
CAH11 - computing > CAH11-01 - computing > CAH11-01-04 - software engineering
CAH11 - computing > CAH11-01 - computing > CAH11-01-05 - artificial intelligence
CAH25 - design, and creative and performing arts > CAH25-02 - performing arts > CAH25-02-02 - music
Divisions: Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Jason Hockman
Date Deposited: 01 Apr 2022 11:00
Last Modified: 22 Mar 2023 12:01
URI: https://www.open-access.bcu.ac.uk/id/eprint/13022

Actions (login required)

View Item View Item


In this section...