Local Feature Preprocess
Introduction
Local feature preprocess is a module that preprocess feature locally.
Parameter List
identity: str Federated identity of the party, should be one of label_trainer or trainer.
- model_info:
name:
strModel name, should be local_feature_preprocess.
- input:
- trainset:
type:
strTrain dataset type, support csv.path:
strIf type is csv, folder path of train dataset.name:
strIf type is csv, file name of train dataset.has_id:
boolIf type is csv, whether dataset has id column.has_label:
boolIf type is csv, whether dataset has label column.
- valset:
type:
strValidation dataset type, support csv.path:
strIf type is csv, folder path of validation dataset.name:
strIf type is csv, file name of validation dataset.has_id:
boolIf type is csv, whether dataset has id column.has_label:
boolIf type is csv, whether dataset has label column.
- output:
path:
strFolder path of output model.- model:
name:
strFile name of output model.
- trainset:
name:
strFile name of output trainset.
- valset:
name:
strFile name of output valset.
- train_info:
- train_params:
- missing:
missing_values:
intorfloatorstrorlistThe placeholder for the missing values.strategy:
strThe imputation strategy.fill_value:
strornumerical valueWhen strategy == “constant”, fill_value is used to replace all occurrences of missing_values.missing_features:
mapFine-grained configuration for column preprocess(axis = 0). The format is defined as: {column name: {“missing_values”: placeholder for the missing values, “strategy”: imputation strategy, “fill_value”: imputation value when strategy == “constant”}, …}.
- outlier:
outlier_values:
intorfloatorstrorlistThe placeholder for the outlier values.outlier_features:
mapFine-grained configuration for column preprocess(axis = 0). The format is defined as: {column name: {“outlier_values”: placeholder for the outlier values}, …}.
- onehot:
onehot_features:
mapFine-grained configuration for column preprocess(axis = 0). The format is defined as: {column name: {}, …, column name: {}}.