TechRxiv
Null-Labelling_A_Generic_Approach_for_Learning_in_the_Presence_of_Class_Noise.pdf (686.63 kB)

Null-Labelling: A Generic Approach for Learning in the Presence of Class Noise

Download (686.63 kB)
preprint
posted on 2021-03-18, 01:24 authored by Benjamin DenhamBenjamin Denham, Russel Pears, M. Asif Naeem
Datasets containing class noise present significant challenges to accurate classification, thus requiring classifiers that can refuse to classify noisy instances. We demonstrate the inability of the popular confidence-thresholding rejection method to learn from relationships between input features and not-at-random class noise. To take advantage of these relationships, we propose a novel null-labelling scheme based on iterative re-training with relabelled datasets that uses a classifier to learn to reject instances that are likely to be misclassified. We demonstrate the ability of null-labelling to achieve a significantly better tradeoff between classification error and coverage than the confidence-thresholding method. Models generated by the null-labelling scheme have the added advantage of interpretability, in that they are able to identify features correlated with class noise. We also unify prior theories for combining and evaluating sets of rejecting classifiers.

Funding

Callaghan Innovation R&D Fellowship Grant (FPAP1902)

Auckland University of Technology Doctoral Fees Scholarship

History

Email Address of Submitting Author

ben.denham@aut.ac.nz

Submitting Author's Institution

Auckland University of Technology

Submitting Author's Country

  • New Zealand