double.pdf (236.76 kB)
Download file

Dilated Convolutional Model for Melody Extraction

Download (236.76 kB)
posted on 2022-02-03, 04:51 authored by Xian WangXian Wang, Lingqiao LiuLingqiao Liu, Javen ShiJaven Shi
Melody extraction is a challenging task in music information retrieval that enables many down-stream applications. In this paper we propose a simple dilated convolutional model for melody extraction. It takes variable-q transforms as inputs. It first uses consecutive layers of convolution to capture local temporal-frequency patterns. Afterward, it relies only a single layer of dilated convolution for capturing global frequency patterns formed by the pitches and harmonics of active notes. This model is effective in that it achieves the-state-of-the-art performance on most datasets, for both general and vocal melody extraction. In addition, it gets the best performance with the least training data.


Email Address of Submitting Author

ORCID of Submitting Author


Submitting Author's Institution

The University of Adelaide

Submitting Author's Country

  • Australia