Vertical Sampler
Introduction
Two sample methods are provided: random sample and stratified sample. Downsample and upsample are supported in both methods.
Parameters List
identity: str Federated identity of the party, should be one of label_trainer or trainer.
- model_info:
name:
strModel name, should be vertical_sampler.
- input:
- dataset:
type:
strInput dataset type, support csv.path:
strIf type is csv, folder path of train dataset.name:
boolIf type is csv, file name of train dataset.has_id:
boolIf type is csv, whether dataset has id column.has_label:
boolIf type is csv, whether dataset has label column.
- output:
path:
strFolder path of output.- sample_id:
name:
strFile name of output id.
- dataset:
name:
boolFile name of output dataset.
- train_info:
- train_params:
method:
strSample method, support random or stratify.strategy:
strSample strategy, support downsample or upsample.random_seed:
intRandom seed.- fraction: support three keys: percentage or number or labeled_percentage.
percentage:
floatThreshold of percentage filter.number:
intThreshold of number filter.labeled_percentage:
listThreshold of labeled_percentage filter.