TechRxiv
TNNLS_2022___Beyond_Instance_Dependent_Noise__Generalizing_Noisy_Label_Modeling_to_Multiple_Labeler_Dependence.pdf (883.2 kB)

Beyond Instance-Dependent Noise: Generalizing Noisy Label Modeling to Multiple Labeler Dependence

Download (883.2 kB)
preprint
posted on 2022-09-20, 19:37 authored by Glenn DawsonGlenn Dawson, Robi Polikar

Most studies on learning from noisy labels rely on unrealistic models of i.i.d. label noise, such as class-conditional transition matrices. More recent work on instance-dependent noise models are more realistic; however, these models assume a single generative process for label noise across the entire dataset, which does not accurately capture the real-world labeling dynamics observed in modern dataset collection. We propose a new model of label noise, which we call "labeler-dependent noise (LDN)." LDN extends and generalizes instance-dependent noise to multiple labelers, based on the observation that modern datasets are typically constructed via distributed data gathering and labeling methods. Furthermore, inspired by recent research in social learning theory, we introduce a modular learning framework under our proposed labeler-dependent noise model called "labeler-aware learning", in which each labeler is estimated and compared against the others during the learning process. Our experimental results demonstrate how previous state-of-the-art approaches for learning from noisy labels are unable to handle the general model of label noise, while our labeler-aware learning framework remains robust even when large fractions of labelers are spammers.

Funding

P200A180055

History

Email Address of Submitting Author

dawsong@rowan.edu

ORCID of Submitting Author

0000-0002-5877-5605

Submitting Author's Institution

Rowan University

Submitting Author's Country

  • United States of America