Configuration and Authentication
Driverless AI Configuration Options
Image configuration
Edit on GitHub

Image configuration

`enable_image_transformer`

Enable Image V2 Transformer for processing of image data (String) (Expert Setting)

Default value 'auto'

Whether to use pretrained deep learning models for processing of image data as part of the feature engineering pipeline. A column of URIs to images (jpg, png, etc.) will be converted to a numeric representation using ImageNet-pretrained deep learning models. If no GPUs are found, then must be set to 〈on〉 to enable.

`image_transformer_pretrained_models`

Supported ImageNet pretrained architectures for Image V2 Transformer (List) (Expert Setting)

Default value ['levit']

Supported ImageNet pretrained architectures for Image V2 Transformer. Non-default ones will require internet access to download pretrained models from H2O S3 buckets (To get all models, download https://artifacts.cdn.h2o.ai/releases/ai/h2o/pretrained/dai_image_models_2_3_0.zip and unzip inside image_pretrained_models_dir).

`image_transformer_vectorization_output_dimension`

Dimensionality of feature space created by Image V2 Transformer (List) (Expert Setting)

Default value [100]

Dimensionality of feature (embedding) space created by Image V2 Transformer. If more than one is selected, multiple transformers can be active at the same time.

`image_transformer_fine_tune`

Enable fine-tuning of pretrained models used for Image V2 Transformer (Boolean) (Expert Setting)

Default value False

Enable fine-tuning of the ImageNet pretrained models used for the Image V2 Transformer. Enabling this will slow down training, but should increase accuracy.

`image_transformer_fine_tuning_num_epochs`

Number of epochs for fine-tuning used for Image V2 Transformer (Number) (Expert Setting)

Default value 2

Number of epochs for fine-tuning of ImageNet pretrained models used for the Image V2 Transformer.

`default_image_augmentations`

List of augmentations for fine-tuning used for Image V2 Transformer (List) (Expert Setting)

Default value ['HorizontalFlip']

The list of possible image augmentations to apply while fine-tuning the ImageNet pretrained models used for the Image V2 Transformer. Details about individual augmentations could be found here: https://albumentations.ai/docs/. Note: Does not apply to tf_efficientnetv2 as the recommended transformers from huggingface will be used.

`image_transformer_batch_size`

Batch size for Image V2 Transformer. Automatic: -1 (Number) (Expert Setting)

Default value -1

Batch size for Image V2 Transformer. Larger architectures and larger batch sizes will use more memory.Note. Driverless will automatically find the most appropriate batch size if set to -1 (or non-positive).

`image_pretrained_models_dir`

Path to pretrained Image models. It is used to load the pretrained models if there is no Internet access. (String) (Expert Setting)

Default value './pretrained/image/'

Path to pretrained Image models. To get all models, download https://artifacts.cdn.h2o.ai/releases/ai/h2o/pretrained/dai_image_models_2_3_0.zip, then extract it in a directory on the instance where Driverless AI is installed.

`image_download_timeout`

Image download timeout in seconds (Number) (Expert Setting)

Default value 60

Max. number of seconds to wait for image download if images are provided by URL

`string_col_as_image_max_missing_fraction`

Max allowed fraction of missing values for image column (Float) (Expert Setting)

Default value 0.1

Maximum fraction of missing elements in a string column for it to be considered as possible image paths (URIs)

`string_col_as_image_min_valid_types_fraction`

Min. fraction of images that need to be of valid types for image column to be used (Float) (Expert Setting)

Default value 0.8

Fraction of (unique) image URIs that need to have valid endings (as defined by string_col_as_image_valid_types) for a string column to be considered as image data

`image_transformer_use_gpu`

Enable GPU(s) for faster transformations of Image V2 Transformer. (Boolean) (Expert Setting)

Default value True

Whether to use GPU(s), if available, to transform images into embeddings with Image V2 Transformer. Can lead to significant speedups.

`params_image_auto_search_space`

Search parameter overrides for image auto (Dict) (Expert Setting)

Default value {}

Nominally, the time dial controls the search space, with higher time trying more options, but any keys present in this dictionary will override the automatic choices. e.g. params_image_auto_search_space="{'augmentation': ['safe'], 'crop_strategy': ['Resize'], 'optimizer': ['AdamW'], 'dropout': [0.1], 'epochs_per_stage': [5], 'warmup_epochs': [0], 'mixup': [0.0], 'cutmix': [0.0], 'global_pool': ['avg'], 'learning_rate': [3e-4]}" Options, e.g. used for time>=8 # Overfit Protection Options: 〈augmentation〉: ["safe", "semi_safe", "hard"] 〈crop_strategy〉: ["Resize", "RandomResizedCropSoft", "RandomResizedCropHard"] 〈dropout〉: [0.1, 0.3, 0.5] # Global Pool Options:

avgmax – sum of AVG and MAX poolings catavgmax – concatenation of AVG and MAX poolings https://github.com/rwightman/pytorch-image-models/blob/master/timm/models/layers/adaptive_avgmax_pool.py 'global_pool': ['avg', 'avgmax', 'catavgmax']

# Regression: No MixUp and CutMix:: 'mixup': [0.0] 'cutmix': [0.0]
# Classification: Beta distribution coeff to generate weights for MixUp:: 'mixup': [0.0, 0.4, 1.0, 3.0] 'cutmix': [0.0, 0.4, 1.0, 3.0]

# Optimization Options: 'epochs_per_stage': [5, 10, 15] # from 40 to 135 epochs 'warmup_epochs': [0, 0.5, 1] 'optimizer': ["AdamW", "SGD"] 'learning_rate': [1e-3, 3e-4, 1e-4]

`image_auto_arch`

Architectures for image auto (List) (Expert Setting)

Default value []

Nominally, the accuracy dial controls the architectures considered if this is left empty,: but one can choose specific ones. The options in the list are ordered by complexity.

`image_auto_min_shape`

Minimum image size (Number) (Expert Setting)

Default value 64

Any images smaller are upscaled to the minimum. Default is 64, but can be as small as 32 given the pooling layers used.

`image_auto_num_final_models`

Number of models in final ensemble (Number) (Expert Setting)

Default value 0

0 means automatic based upon time dial of min(1, time//2).

`image_auto_num_models`

Number of models in search space (Number) (Expert Setting)

Default value 0

0 means automatic based upon time dial of max(4 * (time - 1), 2).

`image_auto_num_stages`

Number of stages for hyperparameter search (Number) (Expert Setting)

Default value 0

0 means automatic based upon time dial of time + 1 if time < 6 else time - 1.

`image_auto_iterations`

Number of iterations for successive halving (Number) (Expert Setting)

Default value 0

0 means automatic based upon time dial or number of models and stages: set by image_auto_num_models and image_auto_num_stages.

`image_auto_shape_factor`

Image downscale ratio to use for training (Float) (Expert Setting)

Default value 0.0

0.0 means automatic based upon the current stage, where stage 0 uses half, stage 1 uses 3/4, and stage 2 uses full image.: One can pass 1.0 to override and always use full image. 0.5 would mean use half.

`max_image_auto_ddp_cores`

Maximum number of cores to use for image auto model parallel data management (Number) (Expert Setting)

Default value 10

Control maximum number of cores to use for image auto model parallel data management. 0 will disable mp: https://pytorch-lightning.readthedocs.io/en/latest/guides/speed.html

Previous Next

Built with Sphinx using a theme provided by Read the Docs.

Image configuration

enable_image_transformer

image_transformer_pretrained_models

image_transformer_vectorization_output_dimension

image_transformer_fine_tune

image_transformer_fine_tuning_num_epochs

default_image_augmentations

image_transformer_batch_size

image_pretrained_models_dir

image_download_timeout

string_col_as_image_max_missing_fraction

string_col_as_image_min_valid_types_fraction

image_transformer_use_gpu

params_image_auto_search_space

image_auto_arch

image_auto_min_shape

image_auto_num_final_models

image_auto_num_models

image_auto_num_stages

image_auto_iterations

image_auto_shape_factor

max_image_auto_ddp_cores

`enable_image_transformer`

`image_transformer_pretrained_models`

`image_transformer_vectorization_output_dimension`

`image_transformer_fine_tune`

`image_transformer_fine_tuning_num_epochs`

`default_image_augmentations`

`image_transformer_batch_size`

`image_pretrained_models_dir`

`image_download_timeout`

`string_col_as_image_max_missing_fraction`

`string_col_as_image_min_valid_types_fraction`

`image_transformer_use_gpu`

`params_image_auto_search_space`

`image_auto_arch`

`image_auto_min_shape`

`image_auto_num_final_models`

`image_auto_num_models`

`image_auto_num_stages`

`image_auto_iterations`

`image_auto_shape_factor`

`max_image_auto_ddp_cores`