Audio style transfer with rhythmic constraints

Tomczak, Maciej and Southall, Carl and Hockman, Jason (2018) Audio style transfer with rhythmic constraints. In: International Conference on Digital Audio Effects, 4th - 8th September 2018, Aveiro, Portugal.

[img]
Preview
Text
DAFx2018_paper_48.pdf - Published Version

Download (2MB)

Abstract

In this transformation we present a rhythmically constrained audio style transfer technique for automatic mixing and mashing of two audio inputs. In this transformation the rhythmic and timbral features of both input signals are combined together through the use of an audio style transfer process that transforms the files so that they adhere to a larger metrical structure of the chosen input. This is accomplished by finding beat boundaries of both inputs and performing the transformation on beat-length audio segments. In order for the system to perform a mashup between two signals, we reformulate the previously used audio style transfer loss terms into three loss functions and enable them to be independent of the input. We measure and compare rhythmic similarities of the transformed and input audio signals using their rhythmic envelopes to investigate the influence of the tested transformation objectives.

Item Type: Conference or Workshop Item (Paper)
Dates:
DateEvent
1 June 2018Accepted
8 September 2018Published Online
Subjects: CAH11 - computing > CAH11-01 - computing > CAH11-01-01 - computer science
CAH11 - computing > CAH11-01 - computing > CAH11-01-04 - software engineering
CAH11 - computing > CAH11-01 - computing > CAH11-01-05 - artificial intelligence
CAH25 - design, and creative and performing arts > CAH25-02 - performing arts > CAH25-02-02 - music
Divisions: Faculty of Computing, Engineering and the Built Environment > School of Computing and Digital Technology
Depositing User: Jason Hockman
Date Deposited: 04 Apr 2022 15:21
Last Modified: 22 Mar 2023 12:01
URI: https://www.open-access.bcu.ac.uk/id/eprint/13028

Actions (login required)

View Item View Item

Research

In this section...