Dilated Convolutional Model for Melody Extraction
preprint
posted on 2022-02-03, 04:51, authored by Xian Wang, Lingqiao Liu, Javen Shi

Melody extraction is a challenging task in music information retrieval that enables many downstream applications. In this paper, we propose a simple dilated convolutional model for melody extraction. The model takes variable-Q transforms as input. It first uses consecutive convolutional layers to capture local time-frequency patterns. It then relies on only a single layer of dilated convolution to capture global frequency patterns formed by the pitches and harmonics of active notes. The model is effective: it achieves state-of-the-art performance on most datasets, for both general and vocal melody extraction. In addition, it achieves the best performance while using the least training data.
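To illustrate why a single dilated convolution along the frequency axis can relate a pitch to its harmonics, here is a minimal pure-Python sketch. It is not the authors' model. It assumes a log-spaced frequency axis (as a variable-Q transform provides), where harmonic h lies roughly bins_per_octave * log2(h) bins above the fundamental. The function names, the 12-bins-per-octave resolution, the three-tap kernel, and the toy frame are all hypothetical choices for illustration; the abstract does not state the paper's kernel sizes or dilation rates.

```python
import math


def harmonic_offsets(bins_per_octave, num_harmonics):
    """Approximate bin offsets of harmonics 1..num_harmonics above the
    fundamental on a log-frequency axis (assumed resolution)."""
    return [round(bins_per_octave * math.log2(h))
            for h in range(1, num_harmonics + 1)]


def dilated_conv1d(signal, kernel, dilation):
    """'Valid' 1-D dilated cross-correlation along one axis:
    out[i] = sum_j kernel[j] * signal[i + j * dilation]."""
    span = (len(kernel) - 1) * dilation
    return [
        sum(w * signal[i + j * dilation] for j, w in enumerate(kernel))
        for i in range(len(signal) - span)
    ]


# Toy spectral frame (hypothetical data): 12 bins per octave, one note whose
# fundamental sits at bin 5, with energy one and two octaves above it.
bins_per_octave = 12
frame = [0.0] * 48
for offset in (0, 12, 24):
    frame[5 + offset] = 1.0

# Three taps spaced one octave apart cover the fundamental and both
# octave harmonics in a single layer, without stacking many small kernels.
response = dilated_conv1d(frame, kernel=[1.0, 1.0, 1.0],
                          dilation=bins_per_octave)
best_bin = max(range(len(response)), key=response.__getitem__)
```

In this toy frame the response peaks at the fundamental's bin, because only there do all three taps land on energy. A learned kernel in a real model would weight the taps differently and would typically operate on multi-channel feature maps rather than a raw frame.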
History
Email Address of Submitting Author
xian.wang01@adelaide.edu.au
ORCID of Submitting Author
0000-0003-2306-1503
Submitting Author's Institution
The University of Adelaide
Submitting Author's Country
- Australia