TechRxiv

Filter method-based feature selection process for unattributed-identity multi-target regression problem

preprint
posted on 2023-01-09, 14:17 authored by Iker Garcia, Roberto Santana

This paper presents, for the first time, a feature selection (FS) problem for an unattributed-identity multi-target regression (UIMTR) problem. UIMTR is defined as a multi-target regression problem in which the sets of target and predictor variables are undetermined, i.e., the identity of the variables is unattributed. Two sequential forward-selection filter methods based on mutual information are proposed. Specifically, they are multi-objective adaptations of the classical Mutual Information Maximization (MIM) and Maximum Relevance Minimum Redundancy (mRMR) methods. The paper also introduces the concept of a "sentinel variable": any variable selected by the methods that, a posteriori, will act as a predictor variable (its real-time data will be used by the models to predict the values of the target variables). To highlight that problems of this type exist in industry, and thus the need for this approach, a current problem in low-voltage power grids is presented and modelled: selecting a subset of smart meters ("sentinel smart meters") that serve as predictors of a certain electrical measurement for the rest of the smart meters in the grid. The empirical approach is applied to the voltage curves of smart meters from six different transformer substations. The results are evaluated from three perspectives: (i) the quality of the predictions, (ii) the stability of the methods, and (iii) the execution time. In addition, the results are compared with three other methods: a purely empirical one proposed in the article, based on voltage patterns (VP), and two that are well known in the literature, (a) RReliefF (Relief for regression) and (b) Fisher Score.
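
To make the idea of forward selection with unattributed variable identities more concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a simple scoring rule that the abstract does not specify: at each step, every unselected variable is a candidate sentinel, its relevance is its mean mutual information with the other unselected variables (the would-be targets), and its redundancy is its mean mutual information with the sentinels already chosen (an mRMR-like penalty; setting `use_redundancy=False` gives a MIM-like variant). The equal-width histogram estimator, the function names, and the synthetic "meter" data are all hypothetical choices made for illustration.

```python
import math
import random
from collections import Counter


def discretize(values, bins=8):
    """Map continuous values to equal-width bin indices (assumed estimator choice)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]


def mutual_information(x, y):
    """Plug-in mutual information, in nats, between two discrete sequences."""
    n = len(x)
    px, py, pxy = Counter(x), Counter(y), Counter(zip(x, y))
    return sum((c / n) * math.log(c * n / (px[a] * py[b]))
               for (a, b), c in pxy.items())


def mean(values):
    """Arithmetic mean that returns 0.0 for an empty input."""
    values = list(values)
    return sum(values) / len(values) if values else 0.0


def select_sentinels(columns, k, use_redundancy=True):
    """Greedily pick k sentinel variables from a dict of name -> samples.

    The score is an assumption made for this sketch, not the paper's criterion:
    relevance (mean MI with the other unselected variables) minus
    redundancy (mean MI with the already selected sentinels).
    """
    names = list(columns)
    disc = {name: discretize(columns[name]) for name in names}

    # Pairwise MI is symmetric, so compute each pair once.
    mi = {}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            mi[a, b] = mi[b, a] = mutual_information(disc[a], disc[b])

    selected, remaining = [], list(names)
    while len(selected) < k and remaining:
        def score(c):
            relevance = mean(mi[c, o] for o in remaining if o != c)
            redundancy = mean(mi[c, s] for s in selected) if use_redundancy else 0.0
            return relevance - redundancy

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Synthetic data (hypothetical): two groups of correlated "meters" plus one noise series.
random.seed(0)
n = 500
base_a = [random.random() for _ in range(n)]
base_b = [random.random() for _ in range(n)]


def noisy(base):
    """Return a copy of base with small Gaussian noise added."""
    return [v + random.gauss(0, 0.02) for v in base]


meters = {
    "a1": noisy(base_a), "a2": noisy(base_a), "a3": noisy(base_a),
    "b1": noisy(base_b), "b2": noisy(base_b),
    "n1": [random.random() for _ in range(n)],
}
picked = select_sentinels(meters, k=2)
print(picked)
```

In this toy setup the redundancy term is what pushes the second sentinel away from the group already covered by the first one. A real application would likely need a better mutual information estimator than fixed-width binning.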

Funding

TIN2016-78365-R, Spanish Ministry of Economy, Industry and Competitiveness

PID2019-104966GB-I00, Spanish Ministry of Science and Innovation

IT1244-19, Basque Government Elkartek programs KIA, KK-2020/00049, SPRI-Basque Government

Email Address of Submitting Author

iri@ormazabal.com

ORCID of Submitting Author

0000-0002-8267-3377

Submitting Author's Institution

Ormazabal Corporate Technology

Submitting Author's Country

  • Spain
