Experiment Settings¶
This section includes settings that can be used to customize the experiment, such as total runtime, reproducibility level, pipeline building, feature brain control, additional config.toml settings, and more.
max_runtime_minutes
¶
Max Runtime in Minutes Before Triggering the Finish Button
Specify the maximum runtime in minutes for an experiment. This is equivalent to pushing the Finish button once half of the specified time value has elapsed. Note that the overall enforced runtime is only an approximation.
This value defaults to 1440, which is the equivalent of a 24 hour approximate overall runtime. The Finish button will be automatically selected once 12 hours have elapsed, and Driverless AI will subsequently attempt to complete the overall experiment in the remaining 12 hours. Set this value to 0 to disable this setting.
Note that this setting applies per experiment, so when building n leaderboard models it applies to each experiment separately (i.e., the total allowed runtime is n×24 hours; this estimate assumes the experiments run sequentially, one at a time).
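The per-experiment budget multiplies across a leaderboard. A minimal sketch of that arithmetic (plain Python, not DAI code):

```python
# Sketch: estimated total runtime budget for a leaderboard when each
# experiment gets the full max_runtime_minutes budget. Assumes the
# experiments run sequentially, as noted above.
def total_leaderboard_minutes(n_experiments, max_runtime_minutes=1440):
    return n_experiments * max_runtime_minutes

print(total_leaderboard_minutes(3))  # 4320 minutes, i.e. 3 days at the default budget
```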
max_runtime_minutes_until_abort
¶
Max Runtime in Minutes Before Triggering the Abort Button
Specify the maximum runtime in minutes for an experiment before triggering the abort button. This option preserves experiment artifacts that have been generated for the summary and log zip files while continuing to generate additional artifacts. This value defaults to 10080 mins (7 days).
Note that this setting applies per experiment, so when building n leaderboard models it applies to each experiment separately (i.e., the total allowed runtime is n×7 days; this estimate assumes the experiments run sequentially, one at a time). Also see time_abort.
time_abort
¶
Time to Trigger the ‘Abort’ Button
If the experiment is not done by this time, push the abort button. Note that this applies to the leaderboard as well: if all leaderboard experiments are not done by this time, the entire leaderboard is aborted. Also see max_runtime_minutes_until_abort for control over per-experiment abort times.
This accepts a time in the format given by time_abort_format (defaults to %Y-%m-%d %H:%M:%S) and assumes the timezone set by time_abort_timezone in config.toml (defaults to UTC). You can also specify integer seconds since 1970-01-01 00:00:00 UTC.
This applies to the time on the DAI worker that runs the experiments. Similar to max_runtime_minutes_until_abort, time_abort preserves the experiment artifacts generated so far for the summary and log zip files. If you clone this experiment to rerun/refit/restart it, this absolute time also applies to the new experiment or set of leaderboard experiments.
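The two accepted forms of time_abort can be illustrated with Python's datetime module; the format string below is the documented default time_abort_format, and UTC is the documented default time_abort_timezone:

```python
from datetime import datetime, timezone

abort_str = "2024-07-01 00:00:00"  # example value in the default format
fmt = "%Y-%m-%d %H:%M:%S"          # default time_abort_format

# The equivalent integer seconds since 1970-01-01 00:00:00 UTC,
# which time_abort also accepts directly.
abort_epoch = int(datetime.strptime(abort_str, fmt)
                  .replace(tzinfo=timezone.utc)
                  .timestamp())
print(abort_epoch)  # 1719792000
```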
pipeline-building-recipe
¶
Pipeline Building Recipe
Specify the Pipeline Building recipe type (overrides GUI settings). Select from the following:
Auto: Specifies that all models and features are automatically determined by experiment settings, config.toml settings, and the feature engineering effort. (Default)
Compliant: Similar to Auto except for the following:
Interpretability is set to 10.
Only uses GLM or booster as ‘gblinear’.
Fixed ensemble level is set to 0.
Feature brain level is set to 0.
Max feature interaction depth is set to 1 (i.e., no interactions).
Target transformer is set to ‘identity’ for regression.
Does not use distribution shift detection.
monotonicity_constraints_correlation_threshold is set to 0.
monotonic_gbm: Similar to Auto except for the following:
Enables monotonicity constraints
Only uses LightGBM model.
Drops features that are not correlated with target by at least 0.01. See monotonicity-constraints-drop-low-correlation-features and monotonicity-constraints-correlation-threshold.
Does not build an ensemble model (i.e., sets fixed_ensemble_level=0).
No feature brain is used to ensure every restart is identical.
Interaction depth is set to 1 (i.e., no multi-feature interactions are performed, to avoid complexity).
No target transformations are applied for regression problems (i.e., sets target_transformer to ‘identity’). The equivalent config.toml parameter is recipe=['monotonic_gbm'].
The num_as_cat feature transformation is disabled.
List of included_transformers:
‘OriginalTransformer’,  # numeric (no clustering, no interactions, no num->cat)
‘CatOriginalTransformer’, ‘RawTransformer’, ‘CVTargetEncodeTransformer’, ‘FrequentTransformer’, ‘WeightOfEvidenceTransformer’, ‘OneHotEncodingTransformer’,  # categorical (but no num->cat)
‘CatTransformer’, ‘StringConcatTransformer’,  # big data only
‘DateOriginalTransformer’, ‘DateTimeOriginalTransformer’, ‘DatesTransformer’, ‘DateTimeDiffTransformer’, ‘IsHolidayTransformer’, ‘LagsTransformer’, ‘EwmaLagsTransformer’, ‘LagsInteractionTransformer’, ‘LagsAggregatesTransformer’,  # dates/time
‘TextOriginalTransformer’, ‘TextTransformer’, ‘StrFeatureTransformer’, ‘TextCNNTransformer’, ‘TextBiGRUTransformer’, ‘TextCharCNNTransformer’, ‘BERTTransformer’,  # text
‘ImageOriginalTransformer’, ‘ImageVectorizerTransformer’  # image
For reference also see Monotonicity Constraints in Driverless AI.
Kaggle: Similar to Auto except for the following:
Any external validation set is concatenated with the train set, with the target marked as missing.
The test set is concatenated with the train set, with the target marked as missing.
Transformers that do not use the target are allowed to fit_transform across the entirety of the train, validation, and test sets.
Several config.toml expert options are opened up to their limits.
nlp_model: Only enable NLP BERT models based on PyTorch to process pure text. To avoid slowdown when using this recipe, enabling one or more GPUs is strongly recommended. For more information, see NLP in Driverless AI.
included_models = [‘TextBERTModel’, ‘TextMultilingualBERTModel’, ‘TextXLNETModel’, ‘TextXLMModel’,’TextRoBERTaModel’, ‘TextDistilBERTModel’, ‘TextALBERTModel’, ‘TextCamemBERTModel’, ‘TextXLMRobertaModel’]
enable_pytorch_nlp_transformer = ‘off’
enable_pytorch_nlp_model = ‘on’
nlp_transformer: Only enable PyTorch based BERT transformers that process pure text. To avoid slowdown when using this recipe, enabling one or more GPUs is strongly recommended. For more information, see NLP in Driverless AI.
included_transformers = [‘BERTTransformer’]
excluded_models = [‘TextBERTModel’, ‘TextMultilingualBERTModel’, ‘TextXLNETModel’, ‘TextXLMModel’,’TextRoBERTaModel’, ‘TextDistilBERTModel’, ‘TextALBERTModel’, ‘TextCamemBERTModel’, ‘TextXLMRobertaModel’]
enable_pytorch_nlp_transformer = ‘on’
enable_pytorch_nlp_model = ‘off’
image_model: Only enable image models that process pure images (ImageAutoModel). To avoid slowdown when using this recipe, enabling one or more GPUs is strongly recommended. For more information, see Automatic Image Model.
Notes:
This option disables the Genetic Algorithm (GA).
Image insights are only available when this option is selected.
image_transformer: Only enable the ImageVectorizer transformer, which processes pure images. For more information, see Embeddings Transformer (Image Vectorizer).
unsupervised: Only enable unsupervised transformers, models and scorers. See for reference.
gpus_max: Maximize use of GPUs (e.g. use XGBoost, RAPIDS, Optuna hyperparameter search, etc. that run on GPUs).
Each pipeline building recipe mode can be chosen and then fine-tuned using expert settings. Changing the pipeline building recipe resets all pipeline building recipe options back to their defaults and then re-applies the specific rules for the new mode, which undoes any fine-tuning of expert options that are part of the pipeline building recipe rules.
If you choose to run a new/continued/refitted/retrained experiment from a parent experiment, the recipe rules are not re-applied and any fine-tuning is preserved. To reset the recipe behavior, switch between ‘auto’ and the desired mode; the new child experiment will then use the default settings for the chosen recipe.
enable_genetic_algorithm
¶
Enable Genetic Algorithm for Selection and Tuning of Features and Models
Specify whether to enable genetic algorithm for selection and hyper-parameter tuning of features and models:
auto: The default value. This is the same as ‘on’ unless it is a pure NLP or image experiment.
on: Driverless AI genetic algorithm is used for feature engineering and model tuning and selection.
Optuna: When ‘Optuna’ is selected, model hyperparameters are tuned with Optuna, and the Driverless AI genetic algorithm is used for feature engineering. In the Optuna case, the scores shown in the iteration panel are the best score and the trial scores. Optuna mode currently only uses Optuna for XGBoost, LightGBM, and CatBoost (custom recipe). If the pruner is enabled (the default), Optuna mode disables mutations of the evaluation metric (eval_metric) so that pruning uses the same metric across trials for comparison.
off: When set to ‘off’, the final pipeline is trained using the default feature engineering and feature selection.
The equivalent config.toml parameter is enable_genetic_algorithm.
tournament_style
¶
Tournament Model for Genetic Algorithm
Select a method to decide which models are best at each iteration. This is set to Auto by default. Choose from the following:
auto: Choose based upon accuracy and interpretability.
uniform: All individuals in the population compete to win as best (this can lead to all, e.g., LightGBM models in the final ensemble, which may not improve ensemble performance due to lack of diversity).
fullstack: Choose from optimal model and feature types.
feature: Individuals with similar feature types compete (good if target encoding, frequency encoding, and other feature sets lead to good results).
model: Individuals with the same model type compete (good if multiple models do well but some models that do not do as well still contribute to improving the ensemble).
For each case, a round robin approach is used to choose best scores among type of models to choose from.
If enable_genetic_algorithm==’Optuna’, then every individual is self-mutated without any tournament during the genetic algorithm. The tournament is only used to prune-down individuals for, e.g., tuning -> evolution and evolution -> final model.
make_python_scoring_pipeline
¶
Make Python Scoring Pipeline
Specify whether to automatically build a Python Scoring Pipeline for the experiment. Select On or Auto (default) to make the Python Scoring Pipeline immediately available for download when the experiment is finished. Select Off to disable the automatic creation of the Python Scoring Pipeline.
make_mojo_scoring_pipeline
¶
Make MOJO Scoring Pipeline
Specify whether to automatically build a MOJO (Java) Scoring Pipeline for the experiment. Select On to make the MOJO Scoring Pipeline immediately available for download when the experiment is finished. With this option, any capabilities that prevent the creation of the pipeline are dropped. Select Off to disable the automatic creation of the MOJO Scoring Pipeline. Select Auto (default) to attempt to create the MOJO Scoring Pipeline without dropping any capabilities.
mojo_for_predictions
¶
Allow Use of MOJO for Making Predictions
Specify whether to use MOJO for making fast, low-latency predictions after the experiment has finished. When this is set to Auto (default), the MOJO is only used if the number of rows is equal to or below the value specified by mojo_for_predictions_max_rows.
reduce_mojo_size
¶
Attempt to Reduce the Size of the MOJO (Small MOJO)
Specify whether to attempt to create a small MOJO scoring pipeline when the experiment is being built. A smaller MOJO has a smaller memory footprint during scoring. This setting attempts to reduce the MOJO size by limiting the experiment’s maximum interaction depth to 3, setting the ensemble level to 0 (i.e., no ensemble model for the final pipeline), and limiting the maximum number of features in the model to 200. Note that these settings can in some cases affect the overall model’s predictive accuracy, as they limit the complexity of the feature engineering and model building space.
This is disabled by default. The equivalent config.toml setting is reduce_mojo_size.
make_pipeline_visualization
¶
Make Pipeline Visualization
Specify whether to create a visualization of the scoring pipeline at the end of an experiment. This is set to Auto by default. Note that the Visualize Scoring Pipeline feature is experimental and is not available for deprecated models. Visualizations are available for all newly created experiments.
benchmark_mojo_latency
¶
Measure MOJO Scoring Latency
Specify whether to measure the MOJO scoring latency at the time of MOJO creation. This is set to Auto by default. In this case, MOJO scoring latency will be measured if the pipeline.mojo file size is less than 100 MB.
mojo_building_timeout
¶
Timeout in Seconds to Wait for MOJO Creation at End of Experiment
Specify the amount of time in seconds to wait for MOJO creation at the end of an experiment. If the MOJO creation process times out, a MOJO can still be made from the GUI or the R and Python clients (the timeout constraint is not applied to these). This value defaults to 1800 sec (30 minutes).
mojo_building_parallelism
¶
Number of Parallel Workers to Use During MOJO Creation
Specify the number of parallel workers to use during MOJO creation. Higher values can speed up MOJO creation but use more memory. Set this value to -1 (default) to use all physical cores.
kaggle_username
¶
Kaggle Username
Optionally specify your Kaggle username to enable automatic submission and scoring of test set predictions. If this option is specified, then you must also specify a value for the Kaggle Key option. If you don’t have a Kaggle account, you can sign up at https://www.kaggle.com.
kaggle_key
¶
Kaggle Key
Specify your Kaggle API key to enable automatic submission and scoring of test set predictions. If this option is specified, then you must also specify a value for the Kaggle Username option. For more information on obtaining Kaggle API credentials, see https://github.com/Kaggle/kaggle-api#api-credentials.
kaggle_timeout
¶
Kaggle Submission Timeout in Seconds
Specify the Kaggle submission timeout in seconds. This value defaults to 120 sec.
min_num_rows
¶
Min Number of Rows Needed to Run an Experiment
Specify the minimum number of rows that a dataset must contain in order to run an experiment. This value defaults to 100.
reproducibility_level
¶
Reproducibility Level
Specify one of the following levels of reproducibility. Note that this setting is only used when the Reproducible option is enabled in the experiment:
1 = Same experiment results for same O/S, same CPU(s), and same GPU(s) (Default)
2 = Same experiment results for same O/S, same CPU architecture, and same GPU architecture
3 = Same experiment results for same O/S, same CPU architecture (excludes GPUs)
4 = Same experiment results for same O/S (best approximation)
This value defaults to 1.
seed
¶
Random Seed
Specify a random seed for the experiment. When a seed is defined and the reproducible button is enabled (it is not by default), the algorithm behaves deterministically.
allow_different_classes_across_fold_splits
¶
Allow Different Sets of Classes Across All Train/Validation Fold Splits
(Note: Applicable for multiclass problems only.) Specify whether to allow different sets of classes across all train/validation fold splits. This is enabled by default.
save_validation_splits
¶
Store Internal Validation Split Row Indices
Specify whether to store internal validation split row indices. This includes pickles of (train_idx, valid_idx) tuples (numpy row indices for original training data) for all internal validation folds in the experiment summary ZIP file. Enable this setting for debugging purposes. This setting is disabled by default.
max_num_classes
¶
Max Number of Classes for Classification Problems
Specify the maximum number of classes to allow for a classification problem. A higher number of classes may make certain processes more time-consuming. Memory requirements also increase with a higher number of classes. This value defaults to 200.
max_num_classes_compute_roc
¶
Max Number of Classes to Compute ROC and Confusion Matrix for Classification Problems
Specify the maximum number of classes to use when computing the ROC and CM. When this value is exceeded, the reduction type specified by roc_reduce_type
is applied. This value defaults to 200 and cannot be lower than 2.
max_num_classes_client_and_gui
¶
Max Number of Classes to Show in GUI for Confusion Matrix
Specify the maximum number of classes to show in the GUI for CM, showing first max_num_classes_client_and_gui
labels. This value defaults to 10, but any value beyond 6 will result in visually truncated diagnostics. Note that if this value is changed in the config.toml and the server is restarted, then this setting will only modify client-GUI launched diagnostics. To control experiment plots, this value must be changed in the expert settings panel.
roc_reduce_type
¶
ROC/CM Reduction Technique for Large Class Counts
Specify the ROC confusion matrix reduction technique used for large class counts:
Rows (Default): Reduce by randomly sampling rows
Classes: Reduce by truncating classes to no more than the value specified by
max_num_classes_compute_roc
max_rows_cm_ga
¶
Maximum Number of Rows to Obtain Confusion Matrix Related Plots During Feature Evolution
Specify the maximum number of rows to obtain confusion matrix related plots during feature evolution. Note that this doesn’t limit final model calculation.
use_feature_brain_new_experiments
¶
Whether to Use Feature Brain for New Experiments
Specify whether to use feature_brain results even when running new experiments. The feature brain can be risky with some types of changes to the experiment setup, and even rescoring may be insufficient, so this is disabled (False) by default. For example, one experiment may accidentally use its external validation set as training data and get a high score; although feature_brain_reset_score='on' means the model will be rescored, it will already have seen the external validation data during training and leaked it into what it learned. When this is False, feature_brain_level only sets the possible models to use and logs/notifies, but does not use the cached feature brain models.
feature_brain_level
¶
Model/Feature Brain Level
Specify whether to use H2O.ai brain, which enables local caching and smart re-use (checkpointing) of prior experiments to generate useful features and models for new experiments. It can also be used to control checkpointing for experiments that have been paused or interrupted.
When enabled, this will use the H2O.ai brain cache if the cache file:
has any matching column names and types for a similar experiment type
has classes that match exactly
has class labels that match exactly
has basic time series choices that match
the interpretability of the cache is equal or lower
the main model (booster) is allowed by the new experiment
-1: Don’t use any brain cache (default)
0: Don’t use any brain cache but still write to cache. Use case: Want to save the model for later use, but we want the current model to be built without any brain models.
1: Smart checkpoint from the latest best individual model. Use case: Want to use the latest matching model. The match may not be precise, so use with caution.
2: Smart checkpoint if the experiment matches all column names, column types, classes, class labels, and time series options identically. Use case: Driverless AI scans through the H2O.ai brain cache for the best models to restart from.
3: Smart checkpoint like level #1 but for the entire population. Tune only if the brain population is of insufficient size. Note that this will re-score the entire population in a single iteration, so it appears to take longer to complete first iteration.
4: Smart checkpoint like level #2 but for the entire population. Tune only if the brain population is of insufficient size. Note that this will re-score the entire population in a single iteration, so it appears to take longer to complete first iteration.
5: Smart checkpoint like level #4 but will scan over the entire brain cache of populations to get the best scored individuals. Note that this can be slower due to brain cache scanning if the cache is large.
When enabled, the directory where the H2O.ai Brain meta model files are stored is H2O.ai_brain. In addition, the default maximum brain size is 20GB. Both the directory and the maximum size can be changed in the config.toml file. This value defaults to 2.
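The cache-eligibility conditions above can be sketched as a simple predicate. The dictionary keys here are hypothetical and only illustrate the documented checks; they are not DAI's internal representation:

```python
# Hypothetical sketch of the documented brain-cache eligibility checks.
def cache_usable(cache, exp):
    return (
        any(col in exp["columns"] for col in cache["columns"])    # some matching column names/types
        and cache["classes"] == exp["classes"]                    # classes match exactly
        and cache["class_labels"] == exp["class_labels"]          # class labels match exactly
        and cache["time_series"] == exp["time_series"]            # basic time series choices match
        and cache["interpretability"] <= exp["interpretability"]  # cache interpretability equal or lower
        and cache["booster"] in exp["allowed_boosters"]           # main model allowed by new experiment
    )

old = {"columns": ["age", "income"], "classes": 2, "class_labels": [0, 1],
       "time_series": None, "interpretability": 5, "booster": "lightgbm"}
new = {"columns": ["age", "income", "zip"], "classes": 2, "class_labels": [0, 1],
       "time_series": None, "interpretability": 7,
       "allowed_boosters": ["lightgbm", "xgboost"]}
print(cache_usable(old, new))  # True
```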
feature_brain2
¶
Feature Brain Save Every Which Iteration
Save feature brain iterations whenever iter_num % feature_brain_iterations_save_every_iteration == 0, so that you can restart/refit with which_iteration_brain >= 0. This is disabled (0) by default.
-1: Don’t use any brain cache.
0: Don’t use any brain cache but still write to cache.
1: Smart checkpoint if an old experiment_id is passed in (for example, via running “resume one like this” in the GUI).
2: Smart checkpoint if the experiment matches all column names, column types, classes, class labels, and time series options identically. (default)
3: Smart checkpoint like level #1 but for the entire population. Tune only if the brain population is of insufficient size.
4: Smart checkpoint like level #2 but for the entire population. Tune only if the brain population is of insufficient size.
5: Smart checkpoint like level #4 but will scan over the entire brain cache of populations (starting from resumed experiment if chosen) in order to get the best scored individuals.
When enabled, the directory where the H2O.ai Brain meta model files are stored is H2O.ai_brain. In addition, the default maximum brain size is 20GB. Both the directory and the maximum size can be changed in the config.toml file.
feature_brain3
¶
Feature Brain Restart from Which Iteration
When performing a restart or re-fit of type feature_brain_level with a resumed ID, specify which iteration to start from instead of only the last best; -1 means use the last best. Usage:
Run one experiment with feature_brain_iterations_save_every_iteration=1 (or some other number).
Identify which iteration brain dump you want to restart/refit from.
Restart/refit from the original experiment, setting which_iteration_brain to that number here in expert settings.
Note: If restarting from a tuning iteration, this will pull in the entire scored tuning population and use it for feature evolution. This value defaults to -1.
feature_brain4
¶
Feature Brain Refit Uses Same Best Individual
Specify whether to use the same best individual when performing a refit. Disabling this setting allows the order of best individuals to be rearranged, leading to a better final result. Enabling this setting lets you view the exact same model or feature with only one new feature added. This is disabled by default.
feature_brain5
¶
Feature Brain Adds Features with New Columns Even During Retraining of Final Model
Specify whether to add additional features from new columns to the pipeline, even when performing a retrain of the final model. Use this option if you want to keep the same pipeline regardless of new columns from a new dataset. New data may lead to new dropped features due to shift or leak detection. Disable this to avoid adding any columns as new features so that the pipeline is perfectly preserved when changing data. This is enabled by default.
force_model_restart_to_defaults
¶
Restart-Refit Use Default Model Settings If Model Switches
When restarting or refitting, specify whether to use the model class’s default settings if the original model class is no longer available. If this is disabled, the original hyperparameters will be used instead. (Note that this may result in errors.) This is enabled by default.
min_dai_iterations
¶
Min DAI Iterations
Specify the minimum number of Driverless AI iterations for an experiment. This can be used during restarting, when you want to continue for longer despite a score not improving. This value defaults to 0.
target_transformer
¶
Select Target Transformation of the Target for Regression Problems
Specify whether to automatically select target transformation for regression problems. Available options include:
auto
identity
identity_noclip
center
standardize
unit_box
log
log_noclip
square
sqrt
double_sqrt
inverse
logit
sigmoid
If set to auto (default), Driverless AI automatically picks the best target transformer when the Accuracy setting is at or above the value of the tune_target_transform_accuracy_switch configuration option (defaults to 5). Selecting identity_noclip turns off all target transformations. All transformers except center, standardize, identity_noclip, and log_noclip clip predictions to the domain of the target in the training data, so avoid them if you want to enable extrapolation.
The equivalent config.toml setting is target_transformer.
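The clipping behavior mentioned above can be illustrated with a small sketch (not DAI internals): a clipping target transformer constrains predictions to the range of targets seen in training, which prevents extrapolation beyond that range:

```python
# Sketch: clip predictions to the domain of the target seen in training.
def clip_to_train_domain(preds, y_train):
    lo, hi = min(y_train), max(y_train)
    return [min(max(p, lo), hi) for p in preds]

y_train = [1.0, 5.0, 10.0]
# 0.2 is raised to the training minimum, 42.0 is lowered to the maximum.
print(clip_to_train_domain([0.2, 7.0, 42.0], y_train))  # [1.0, 7.0, 10.0]
```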
fixed_num_folds_evolution
¶
Number of Cross-Validation Folds for Feature Evolution
Specify the fixed number of cross-validation folds (if >= 2) for feature evolution. Note that the actual number of allowed folds can be less than the specified value, and that the number of allowed folds is determined at the time an experiment is run. This value defaults to -1 (auto).
fixed_num_folds
¶
Number of Cross-Validation Folds for Final Model
Specify the fixed number of cross-validation folds (if >= 2) for the final model. Note that the actual number of allowed folds can be less than the specified value, and that the number of allowed folds is determined at the time an experiment is run. This value defaults to -1 (auto).
fixed_only_first_fold_model
¶
Force Only First Fold for Models
Specify whether to force only the first fold for models. Select from Auto (default), On, or Off. Set this to On to force only the first fold for models; this is useful for quick runs regardless of the data.
feature_evolution_data_size
¶
Max Number of Rows Times Number of Columns for Feature Evolution Data Splits
Specify the maximum number of rows allowed for feature evolution data splits (not for the final pipeline). This value defaults to 100,000,000.
final_pipeline_data_size
¶
Max Number of Rows Times Number of Columns for Reducing Training Dataset
Specify the upper limit on the number of rows times the number of columns for training the final pipeline. This value defaults to 500,000,000.
max_validation_to_training_size_ratio_for_final_ensemble
¶
Maximum Size of Validation Data Relative to Training Data
Specify the maximum size of the validation data relative to the training data. Smaller values can make the final pipeline model training process quicker. Note that final model predictions and scores will always be provided on the full dataset provided. This value defaults to 2.0.
force_stratified_splits_for_imbalanced_threshold_binary
¶
Perform Stratified Sampling for Binary Classification If the Target Is More Imbalanced Than This
For binary classification experiments, specify a threshold ratio of minority to majority class for the target column; if the target is more imbalanced than this ratio, stratified sampling is performed, otherwise random sampling is performed. This value defaults to 0.01. You can choose to always perform random sampling by setting this value to 0, or to always perform stratified sampling by setting this value to 1.
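A sketch of the documented decision rule (not DAI's implementation): stratify when the minority-to-majority ratio is at or below the threshold, which also reproduces the documented endpoints of 0 (always random) and 1 (always stratified):

```python
# Sketch of the documented threshold rule for binary targets.
def sampling_strategy(n_minority, n_majority, threshold=0.01):
    ratio = n_minority / n_majority
    return "stratified" if ratio <= threshold else "random"

print(sampling_strategy(50, 100_000))     # heavily imbalanced -> "stratified"
print(sampling_strategy(40_000, 60_000))  # mild imbalance -> "random"
```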
config_overrides
¶
Add to config.toml via TOML String
Specify any additional configuration overrides from the config.toml file that you want to include in the experiment. (Refer to the Sample config.toml File section to view options that can be overridden during an experiment.) Setting this will override all other settings. Separate multiple config overrides with \n. For example, the following enables the Poisson distribution for LightGBM and disables target transformer tuning. Note that in this example the double quotes are escaped (\" \").
params_lightgbm=\"{'objective':'poisson'}\" \n target_transformer=identity
Or you can specify config overrides similar to the following without having to escape double quotes:
""enable_glm="off" \n enable_xgboost_gbm="off" \n enable_lightgbm="off" \n enable_tensorflow="on"""
""max_cores=10 \n data_precision="float32" \n max_rows_feature_evolution=50000000000 \n ensemble_accuracy_switch=11 \n feature_engineering_effort=1 \n target_transformer="identity" \n tournament_feature_style_accuracy_switch=5 \n params_tensorflow="{'layers': [100, 100, 100, 100, 100, 100]}"""
When running the Python client, config overrides would be set as follows:
model = h2o.start_experiment_sync(
    dataset_key=train.key,
    target_col='target',
    is_classification=True,
    accuracy=7,
    time=5,
    interpretability=1,
    config_overrides="""
    feature_brain_level=0
    enable_lightgbm="off"
    enable_xgboost_gbm="off"
    enable_ftrl="off"
    """
)
last_recipe
¶
last_recipe
Internal helper that remembers whether the recipe was changed.
feature_brain_reset_score
¶
Whether to re-score models from brain cache
Specify whether to smartly keep scores to avoid re-munging/re-training/re-scoring steps for brain models (‘auto’), to always force all steps for all brain imports (‘on’), or to never rescore (‘off’). ‘auto’ only re-scores if a difference between the current and prior experiment warrants re-scoring, such as column or metric changes. ‘on’ is useful when the smart similarity checking is not reliable enough. ‘off’ is useful when you know you want to keep exactly the same features and model for the final model refit, despite changes in seed or other behaviors in features that might change the outcome if re-scored before reaching the final model. If set to ‘off’, no limits are applied to features during brain ingestion; you can also set brain_add_features_for_new_columns to false if you want to ignore any new columns in the data, and set refit_same_best_individual to true if you want exactly the same best individual (the highest-scored model+features) to be used regardless of any scoring changes.
feature_brain_save_every_iteration
¶
Feature Brain Save every which iteration
Specify whether to save feature brain iterations whenever iter_num % feature_brain_iterations_save_every_iteration == 0, so that you can restart/refit with which_iteration_brain >= 0. Set this to 0 to disable saving.
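The modulo condition above means the brain state is written at every multiple of the configured interval; a sketch:

```python
# Sketch: iterations at which the brain state is saved under the documented
# rule iter_num % feature_brain_iterations_save_every_iteration == 0.
def saved_iterations(total_iters, save_every):
    if save_every == 0:  # 0 disables saving
        return []
    return [i for i in range(total_iters) if i % save_every == 0]

print(saved_iterations(10, 3))  # [0, 3, 6, 9]
```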
which_iteration_brain
¶
Feature Brain Restart from which iteration
When performing a restart or re-fit of type feature_brain_level with resumed_experiment_id, choose which iteration to start from instead of only the last best; -1 means use the last best.
Usage:
Run one experiment with feature_brain_iterations_save_every_iteration=1 (or some other number).
Identify which iteration brain dump you want to restart/refit from.
Restart/refit from the original experiment, setting which_iteration_brain to that number in expert settings.
Note: If restarting from a tuning iteration, this will pull in the entire scored tuning population and use it for feature evolution.
refit_same_best_individual
¶
Feature Brain refit uses same best individual
When doing a re-fit from the feature brain, if you change columns or features, the population of individuals used to refit may change the order of which individual was best, leading to a better chosen result (the False case). But sometimes you want to see exactly the same model/features with only one feature added, in which case you need to set this to True. That is, if you refit with just one extra column and have interpretability=1, the final model will use the same features, with one more engineered feature applied to that new original feature.
restart_refit_redo_origfs_shift_leak
¶
For restart-refit, select which steps to do
When doing a restart or re-fit of an experiment from the feature brain, you might sometimes change the data significantly, which warrants redoing the reduction of original features by feature selection, shift detection, and leakage detection. In other cases, if the data and all options are nearly (or exactly) identical, these steps might change the features slightly (e.g., due to the random seed if not running in reproducible mode), leading to changes in the features and the refitted model. By default, restart and refit skip these steps, assuming the data and experiment setup have not changed significantly. If check_distribution_shift is forced to on (instead of auto), this option is ignored. To ensure that the exact same final pipeline is fitted, you should also set:
brain_add_features_for_new_columns false
refit_same_best_individual true
feature_brain_reset_score ‘off’
force_model_restart_to_defaults false
The score will still be reset if the chosen experiment metric changes, but the scored model and features will otherwise be held more firmly in place.
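The four settings listed above can be combined in config.toml as, for example:

```toml
# Keep the refitted pipeline as close as possible to the original
brain_add_features_for_new_columns = false
refit_same_best_individual = true
feature_brain_reset_score = "off"
force_model_restart_to_defaults = false
```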
brain_add_features_for_new_columns
¶
Feature Brain adds features with new columns even during retraining final model
Whether to take any new columns and add additional features to the pipeline, even when retraining the final model. In some cases, one might have a new dataset but want to keep the same pipeline regardless of new columns; in that case, set this to False. For example, new data might lead to new dropped features due to shift or leakage detection. To avoid changing the feature set, one can disable all dropping of columns and set this to False to avoid adding any columns as new features, so the pipeline is perfectly preserved when changing data.
force_model_restart_to_defaults
¶
Restart-refit use default model settings if model switches
If, on restart/refit, the original model class is no longer available, be conservative and fall back to the defaults for that model class. If False, try to keep the original hyperparameters, which can fail to work in general.
dump_modelparams_every_scored_indiv
¶
Enable detailed scored model info
Whether to dump every scored individual's model parameters to a CSV/tabulated/JSON file. This produces files such as: individual_scored.params.[txt, csv, json]
fast_approx_num_trees
¶
Max number of trees to use for fast approximation
When fast_approx=True, specify the maximum number of trees to use. By default, this value is 250.
Note
By default, fast_approx is enabled for MLI and AutoDoc and disabled for Experiment predictions.
fast_approx_do_one_fold
¶
Whether to use only one fold for fast approximation
When fast_approx=True, specify whether to speed up fast approximation further by using only one fold out of all cross-validation folds. By default, this setting is enabled.
Note
By default, fast_approx is enabled for MLI and AutoDoc and disabled for Experiment predictions.
fast_approx_do_one_model
¶
Whether to use only one model for fast approximation
When fast_approx=True, specify whether to speed up fast approximation further by using only one model out of all ensemble models. By default, this setting is disabled.
Note
By default, fast_approx is enabled for MLI and AutoDoc and disabled for Experiment predictions.
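Taken together, the three fast_approx settings above can be sketched in config.toml with their stated defaults:

```toml
# Fast approximation settings (defaults as documented above)
fast_approx_num_trees = 250       # cap on trees used for fast approximation
fast_approx_do_one_fold = true    # use only one cross-validation fold
fast_approx_do_one_model = false  # use all ensemble models
```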
fast_approx_contribs_num_trees
¶
Maximum number of trees to use for fast approximation when making Shapley predictions
When fast_approx_contribs=True, specify the maximum number of trees to use for 'Fast Approximation' in the GUI when making Shapley predictions and for AutoDoc/MLI. By default, this value is 50.
Note
By default, fast_approx_contribs is enabled for MLI and AutoDoc.
fast_approx_contribs_do_one_fold
¶
Whether to use only one fold for fast approximation when making Shapley predictions
When fast_approx_contribs=True, specify whether to speed up fast_approx_contribs further by using only one fold out of all cross-validation folds for 'Fast Approximation' in the GUI when making Shapley predictions and for AutoDoc/MLI. By default, this setting is enabled.
Note
By default, fast_approx_contribs is enabled for MLI and AutoDoc.
fast_approx_contribs_do_one_model
¶
Whether to use only one model for fast approximation when making Shapley predictions
When fast_approx_contribs=True, specify whether to speed up fast_approx_contribs further by using only one model out of all ensemble models for 'Fast Approximation' in the GUI when making Shapley predictions and for AutoDoc/MLI. By default, this setting is enabled.
Note
By default, fast_approx_contribs is enabled for MLI and AutoDoc.
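The three fast_approx_contribs settings above can likewise be sketched in config.toml with their stated defaults:

```toml
# Fast approximation for Shapley predictions (defaults as documented above)
fast_approx_contribs_num_trees = 50      # cap on trees for Shapley fast approximation
fast_approx_contribs_do_one_fold = true  # use only one cross-validation fold
fast_approx_contribs_do_one_model = true # use only one ensemble model
```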
autoviz_recommended_transformation
¶
Autoviz Recommended Transformations
Key-value pairs of column names and the transformations that Autoviz recommended for them. Also see Autoviz Recommendation Transformer.
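As a hypothetical sketch (the column name and transformation below are placeholders, and the exact dictionary syntax may vary by release):

```toml
# Placeholder example: map a column name to an Autoviz-recommended transformation
autoviz_recommended_transformation = "{'income': 'log'}"
```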