Understand models
H2O MLOps lets you register individual experiments and group them as versions of a registered model to organize a collection of experiments efficiently.
Before deploying an experiment, you must register it in H2O MLOps. You can either register it as a new model or add it as a new version under an existing model. Choosing the second option creates a new version of the existing model.
A registered model is a collection of individual model versions. Registered models group related model versions that solve a specific problem. You can register new experiments and iterations as updated versions of the model.
A model version has a one-to-one relationship with an experiment within a given Workspace. When you're ready to serve your best experiment, register it as a model version.
- When you register an experiment as a model, all data and metadata lineage are maintained.
- Model versions can be served in multiple deployments. There is no limitation on the number of deployments a single model version can be a part of.
- In any given workspace, an experiment can only be registered as one model version. This allows for a one-to-one mapping between an experiment and the model version.
Model schema
Each model can be described by its input and output column names and their types. Knowing the model schema is essential for monitoring purposes. Currently, only models using the known schema can be deployed by MLOps.
Model schema is represented by the experiment metadata attached to the experiment. Deployer expects the model schema to be stored in the json_value
of the input_schema
and output_schema
keys.
Natively supported Driverless AI MOJO2 and H2O-3 MOJO2 models are not required to contain the schema as the schema is an integral part of the MOJO2 artifacts.
Schema format
The following example shows how model schema is formatted:
[{"name": "<column name>", "type": "<column type>"}, ... ]
Column types
The following is a list of supported column types:
- Boolean
- Time64
- Float32
- Float64
- Int32
- Int64
- String
Column type names are not case-sensitive.