TechRxiv
nn_rule.2023.02.18.pdf (1.05 MB)

Extracting Classification Boolean Rules with Probabilities from Neural Networks

Download (1.05 MB)
preprint
posted on 2023-02-22, 16:52 authored by Chun-Kai HwangChun-Kai Hwang, Yen-Jen Oyang

For classification problems, neural networks are well known for high accuracy in comparison to traditional statistical methods such as logistic regression and discriminant analysis. It is even better than other algorithms such as decision trees and Bayesian networks. However, the knowledge learned by the neural networks is stored in the hierarchical functional mapping of the structures of neural networks and the weight and bias parameters. It is not easy for people to understand its black-box decision process. In this research, we extract probabilistic Boolean classification rules from neural networks. The ruleset model can be tuned to a specified sensitivity according to different thresholds. In addition, we can compute a weighted important factor for each attribute that composes the Boolean rules. The weighted important factor is a numeric number between 0 and 1. If the weighted important factor is 0, it means the corresponding attribute is a noise signal. Hence, the weighted important features can be filtered out with a given threshold.

From the linearly and nonlinearly separable simulation datasets, we find that the accuracy of PBCR1 and PBCR2 are better than neural networks even with a 1/10 training ratio. From UCI machine learning datasets, we find that the AUC of PBCR1 and PBCR2 will be a little lower than the AUC of neural networks. However, on the accuracy metric, from red wine and white wine datasets, PBCR1 and PBCR2 are almost the same with neural networks. The accuracies of PBCR1 and PBCR2 are superior to DT by a statistically significant margin. For the F1 score, PBCR1 and PBCR2 are statistically significantly better than DT on red wine, white wine, and PID datasets.

History

Email Address of Submitting Author

d08922034@csie.ntu.edu.tw

ORCID of Submitting Author

https://orcid.org/0000-0003-3844-7353

Submitting Author's Institution

Department of Computer Science and Information Engineering, National Taiwan University

Submitting Author's Country

  • Taiwan