Vertical Pod Autoscaler (VPA) support
Vertical Pod Autoscaler (VPA) is supported in the Deployer. VPA allows dynamic scaling of CPU and memory resources based on application usage, improving resource efficiency and optimizing costs.
For more information, see the official VPA GitHub README.
Note: For a list of known limitations, see the Known limitations section of the VPA GitHub README.
Configurations
The following configurations provide control over resource usage based on VPA settings.
vpa:
# Whether to enable the VPA.
enabled: false
# The CPU max threshold for the VPA.
cpuMaxThreshold: 0
# The CPU unit for the VPA.
# Available units for CPU:
# - CORES: For CPU cores
# - MILLICORES: For CPU millicores
cpuMaxThresholdUnit: "MILLICORES"
# The memory max threshold for the VPA.
memoryMaxThreshold: 0
# The memory unit for the VPA.
# Available units for memory:
# - MIB: For memory in MiB
# - GIB: For memory in GiB
memoryMaxThresholdUnit: "MIB"
Feedback
- Submit and view feedback for this page
- Send feedback about H2O MLOps to cloud-feedback@h2o.ai