Vertical Feature Selection
Introduction
Feature selection relies on the output of algorithm VerticalBinningWoeIV and VerticalPearson.
The operator consists of three stages:
Vertical Binning Woe IV
Vertical Pearson
Performing feature selection based on the results of the above two steps.
Parameters List
The parameters given below is only for feature selection.
identity: str The role of each participant in federated learning, should be label_trainer or trainer.
- model_info:
name:
strModel name, should be vertical_feature_selection.
- input:
- trainset:
type:
strTrain dataset type, currently supported is csv.path:
strThe folder path of train dataset.name:
strThe file name of train dataset.has_id:
boolWhether the dataset has id column.has_label:
boolWhether the dataset has label column.
- valset:
type:
strTrain dataset type, currently supported is csv.path:
strThe folder path of train dataset.name:
strThe file name of train dataset.has_id:
boolWhether the dataset has id column.has_label:
boolWhether the dataset has label column.
- iv_result:
path:
strFolder path of the iv_result from VerticalBinningWoeIV.name:
strModel file name.
- corr_result:
path:
strFolder path of the result from VerticalPearson.name:
strModel file name.
- output:
path:
strOutput folder path.- model:
name:
strFile name of output model.
- trainset:
name:
strFile name of train dataset after feature selection.
- valset:
name:
strFile name of validation dataset after feature selection.
- train_info:
- train_params:
- filter:
- common:
metrics:
strMetric type, currently supported is iv.filter_method:
strFeature filtering method, currently supported is threshold.threshold:
floatThreshold for filtering if the filter_method is threshold.
- correlation:
sort_metric:
strSorting (descending) metric before the correlated filtering, currently supported is iv.correlation_threshold:
floatThreshold for correlated filtering.