Dataset_Shift_IEEE_TechRxiv.pdf (237.33 kB)
Download fileDetecting Distributional Shift Responsible for Predictive Model’s Failure*
preprint
posted on 2021-09-27, 15:35 authored by Dipanwita Sinha MukherjeeDipanwita Sinha Mukherjee, Divyanshu Bhandari, Naveen YeriAny predictive software deployed with this hypothesis that test data distribution will not differ from training data distribution. Real time scenario does not follow this rule, which results inconsistent and non-transferable observation in various cases. This makes the dataset shift, a growing concern. In this paper, we’ve explored the recent concept of Label shift detection and classifier correction with the help of Black Box shift detection(BBSD), Black Box shift estimation(BBSE) and Black Box shift correction(BBSC). Digits dataset from ”sklearn” and ”LogisticRegression” classifier have been used for this investigation. Knock out shift was clearly detected by applying Kolmogorov–Smirnov test for BBSD. Performance of the classifier got improved after applying BBSE and BBSC from 91% to 97% in terms of overall accuracy.
Funding
Wells Fargo International Solutions Private Limited, Bangalore, India
History
Email Address of Submitting Author
dipanwitas@alum.iisc.ac.inSubmitting Author's Institution
Artificial Intelligence - Center of Excellence, Wells Fargo International Solutions Private Limited, Bangalore, IndiaSubmitting Author's Country
- India