TechRxiv
Ensemble Learning Based HRTF Personalization Using Anthropometric Features.pdf (273.93 kB)

Ensemble Learning Based HRTF Personalization Using Anthropometric Features

Download (273.93 kB)
preprint
posted on 2022-09-06, 15:55 authored by YIH LIANG SHENYIH LIANG SHEN, TZU HSUAN KUO, TAI SHIH CHI

In this paper, we propose an ensemble learning based model to synthesize the logarithmic magnitude response of head-related transfer function (HRTF) using anthropometric features. We first cluster subjects based on relevant anthropometric features to reduce differences within each group, then we use the ensemble learning algorithm on clustered results to predict the log-magnitude HRTF. In the training phase, three deep neural networks (DNNs), each of which aims to predict log-magnitude HRTFs in a particular group, are trained using anthropometric and angle-related features. Afterward, another DNN is trained to integrate estimates from the three group-wise DNNs into log-magnitude HTRFs. The proposed model is compared with a baseline DNN model and our previously proposed model, which incorporates an auto-encoder for dimensionality reduction. Experimental results show that the proposed model performs the best in synthesizing log-magnitude HRTFs in terms of the log-spectral distortion (LSD) measure with great stability.

History

Email Address of Submitting Author

yihliang.eed02@g2.nctu.edu.tw

ORCID of Submitting Author

0000-0003-4789-6695

Submitting Author's Institution

National Yang Ming Chiao Tung University

Submitting Author's Country

  • Taiwan

Usage metrics

    Licence

    Exports