ClaviNet: Generate music with different musical styles

Classically, the style of the generated music by deep learning models is usually governed by the training dataset. In this article, we improved this by proposing the continuous style embedding ${z}_{s}$zs to the general formulation of variational autoencoder (VAE) to allow users to be able to condit...

Full description

Saved in:
Bibliographic Details
Main Authors: Lim, Yu-Quan, Chan, Chee Seng, Loo, Fung Ying
Format: Article
Published: IEEE Computer Soc 2021
Subjects:
Online Access:http://eprints.um.edu.my/27026/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.27026
record_format eprints
spelling my.um.eprints.270262022-04-04T07:12:47Z http://eprints.um.edu.my/27026/ ClaviNet: Generate music with different musical styles Lim, Yu-Quan Chan, Chee Seng Loo, Fung Ying M Music QA75 Electronic computers. Computer science Classically, the style of the generated music by deep learning models is usually governed by the training dataset. In this article, we improved this by proposing the continuous style embedding ${z}_{s}$zs to the general formulation of variational autoencoder (VAE) to allow users to be able to condition on the style of the generated music. For this purpose, we explored and compared two different methods to integrate z(s) into the VAE. In the literature of conditional generative modeling, disentanglement of attributes from the latent space is often associated with better generative performance. In our experiments, we find that this is not the case with our proposed model. Empirically and from a musical theory perspective, we show that our proposed model can generate better music samples than a baseline model that utilizes a discrete style label. The source code and generated samples are available at . IEEE Computer Soc 2021-01-01 Article PeerReviewed Lim, Yu-Quan and Chan, Chee Seng and Loo, Fung Ying (2021) ClaviNet: Generate music with different musical styles. IEEE Multimedia, 28 (1). pp. 83-93. ISSN 1070-986X, DOI https://doi.org/10.1109/MMUL.2020.3046491 <https://doi.org/10.1109/MMUL.2020.3046491>. 10.1109/MMUL.2020.3046491
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic M Music
QA75 Electronic computers. Computer science
spellingShingle M Music
QA75 Electronic computers. Computer science
Lim, Yu-Quan
Chan, Chee Seng
Loo, Fung Ying
ClaviNet: Generate music with different musical styles
description Classically, the style of the generated music by deep learning models is usually governed by the training dataset. In this article, we improved this by proposing the continuous style embedding ${z}_{s}$zs to the general formulation of variational autoencoder (VAE) to allow users to be able to condition on the style of the generated music. For this purpose, we explored and compared two different methods to integrate z(s) into the VAE. In the literature of conditional generative modeling, disentanglement of attributes from the latent space is often associated with better generative performance. In our experiments, we find that this is not the case with our proposed model. Empirically and from a musical theory perspective, we show that our proposed model can generate better music samples than a baseline model that utilizes a discrete style label. The source code and generated samples are available at .
format Article
author Lim, Yu-Quan
Chan, Chee Seng
Loo, Fung Ying
author_facet Lim, Yu-Quan
Chan, Chee Seng
Loo, Fung Ying
author_sort Lim, Yu-Quan
title ClaviNet: Generate music with different musical styles
title_short ClaviNet: Generate music with different musical styles
title_full ClaviNet: Generate music with different musical styles
title_fullStr ClaviNet: Generate music with different musical styles
title_full_unstemmed ClaviNet: Generate music with different musical styles
title_sort clavinet: generate music with different musical styles
publisher IEEE Computer Soc
publishDate 2021
url http://eprints.um.edu.my/27026/
_version_ 1735409489868226560
score 13.211869