Skip to main content
Version: v1.0.0

Understand deployments

In H2O MLOps, deployments are created when model version(s) are served for scoring. Model endpoint security, artifact type, runtime, and Kubernetes options can be configured when deploying a model.

H2O MLOps supports different deployment modes:

  • Real-time deployments: Make a model available as a live REST endpoint that returns predictions immediately when given input data. Types of real-time deployments include:

    • Single model deployments: Serve one model version at a time.
    • A/B test deployments: Compare the performance of two or more models in production.
    • Champion/Challenger deployments: Continuously compare a Champion model against one or more Challenger models to promote the best performer.
  • Batch scoring deployments: Run model scoring jobs on batches of data instead of serving predictions in real time.

You can create and manage deployments using:

To learn more about deployments, refer to the following pages:

To learn more about deployments using the Python client, see:


Feedback