H2O Driverless AI Release Notes¶

H2O Driverless AI is a high-performance, GPU-enabled, client-server application for the rapid development and deployment of state-of-the-art predictive analytics models. It reads tabular data from various sources and automates data visualization, grand-master level automatic feature engineering, model validation (overfitting and leakage prevention), model parameter tuning, model interpretability and model deployment. H2O Driverless AI is currently targeting common regression, binomial classification, and multinomial classification applications including loss-given-default, probability of default, customer churn, campaign response, fraud detection, anti-money-laundering, and predictive asset maintenance models. It also handles time-series problems for individual or grouped time-series such as weekly sales predictions per store and department, with time-causal feature engineering and validation schemes. The ability to model unstructured data is coming soon.

High-level capabilities:

Client/server application for rapid experimentation and deployment of state-of-the-art supervised machine learning models
Automatically creates machine learning modeling pipelines for highest predictive accuracy
Automatically creates stand-alone scoring pipeline for in-process scoring or client/server scoring via http or tcp protocols, in Python and Java (low-latency scoring).
Python API or GUI (Java API coming soon)
Multi-GPU and multi-CPU support for powerful workstations and NVidia DGX supercomputers
Machine Learning model interpretation module with global and local model interpretation
Automatic Visualization module
Multi-user support
Backward compatibility

Problem types supported:

Regression (continuous target variable, for age, income, house price, loss prediction, time-series forecasting)
Binary classification (0/1 or “N”/”Y”, for fraud prediction, churn prediction, failure prediction, etc.)
Multinomial classification (0/1/2/3 or “A”/”B”/”C”/”D” for categorical target variables, for prediction of membership type, next-action, product recommendation, etc.)

Data types supported:

Tabular structured data, rows are observations, columns are fields/features/variables
i.i.d. (identically and independently distributed) data
Numeric, categorical and textual fields
Missing values are allowed
Time-series data with a single time-series (time flows across the entire dataset, not per block of data)
Grouped time-series (e.g., sales per store per department per week, all in one file, with 3 columns for store, dept, week)
Time-series problems with a gap between training and testing (i.e., the time to deploy), and a known forecast horizon (after which model has to be retrained)

Data types NOT supported:

Image/video/audio

Data sources supported:

Local file system or NFS
File upload from browser or Python client
Hadoop (HDFS)
S3 (Amazon)
Azure Blob storage
Blue Data Tap
Google big query
Google cloud storage
kdb+
Minio
Snowflake

File formats supported:

Plain text formats of columnar data (.csv, .tsv, .txt)
Compressed archives (.zip, .gz, .bz2)
Excel
Parquet
Feather
Python datatable (.nff, .jay)

Architecture¶

DAI architecture

Roadmap¶

DAI roadmap

Change Log¶

Version 1.6.5 LTS (Oct 21, 2019)¶

Added startup check for Terraform configuration
Fixed ‘Unable to cast str32 into bool8’ bug in scoring
Fixed MLI scoring pipeline imports
Documentation updates:
- Improved documentation for security and authentication (OpenID, PAM, LDAP)
- Improved documentation for Parquet files
- Improved documentation for feature engineering transformers
Various bug fixes

H2O Driverless AI Release Notes¶

Architecture¶

Roadmap¶

Change Log¶

Version 1.6.5 LTS (Oct 21, 2019)¶

Version 1.6.4 LTS (Aug 19, 2019)¶

Version 1.6.3 LTS (June 14, 2019)¶

Version 1.6.2 LTS (May 10, 2019)¶

Version 1.6.1.1 LTS (Apr 24, 2019)¶

Version 1.6.1 LTS (Apr 18, 2019)¶

Version 1.6.0 LTS (Apr 5, 2019)¶

Version 1.5.4 (Feb 24, 2019)¶

Version 1.5.3 (Feb 8, 2019)¶

Version 1.5.2 (Feb 2, 2019)¶

Version 1.5.1 (Jan 22, 2019)¶

Version 1.5.0 (Jan 18, 2019)¶

Version 1.4.2 (Dec 3, 2018)¶

Version 1.4.1 (Nov 11, 2018)¶

Version 1.4.0 (Oct 27, 2018)¶

Version 1.3.1 (Sep 12, 2018)¶

Version 1.3.0 (Sep 4, 2018)¶

Version 1.2.2 (July 5, 2018)¶

Version 1.2.1 (June 26, 2018)¶

Version 1.2.0 (June 11, 2018)¶

Version 1.1.6 (May 29, 2018)¶

Version 1.1.4 (May 17, 2018)¶

Version 1.1.3 (May 16, 2018)¶

Version 1.1.2 (May 8, 2018)¶

Version 1.1.1 (April 23, 2018)¶

Version 1.1.0 (April 19, 2018)¶

Version 1.0.30 (April 5, 2018)¶

Version 1.0.29 (April 4, 2018)¶

Version 1.0.28 (April 3, 2018)¶

Version 1.0.27 (March 31, 2018)¶

Version 1.0.26 (March 28, 2018)¶

Version 1.0.25 (March 22, 2018)¶

Version 1.0.24 (March 8, 2018)¶

Version 1.0.23 (March 7, 2018)¶

Version 1.0.22 (Feb 23, 2018)¶

Version 1.0.21 (Feb 21, 2018)¶

Version 1.0.20 (Feb 17, 2018)¶

Version 1.0.19 (Jan 28, 2018)¶

Version 1.0.18 (Jan 24, 2018)¶

Version 1.0.17 (Jan 23, 2018)¶

Version 1.0.16 (Jan 22, 2018)¶

Version 1.0.15 (Jan 11, 2018)¶

Version 1.0.14 (Jan 11, 2018)¶

Version 1.0.13 (Jan 10, 2018)¶

Version 1.0.11 (Dec 12, 2017)¶

Version 1.0.10 (Dec 4, 2017)¶

Version 1.0.9 (Nov 29, 2017)¶

Version 1.0.8 (Nov 21, 2017)¶

Version 1.0.7 (Nov 17, 2017)¶

Version 1.0.5 (Oct 24, 2017)¶

Version 1.0.4 (Oct 19, 2017)¶

Version 1.0.3 (Oct 9, 2017)¶

Version 1.0.2 (Oct 5, 2017)¶

Version 1.0.1 (Oct 4, 2017)¶

Version 1.0.0 (Sep 24, 2017)¶