Models
Overviewβ
The Models page allows you to explore the supported large language models (LLMs) and perform self-tests on the LLMs used throughout Enterprise h2oGPTe.
Supported LLMsβ
Enterprise h2oGPTe supports the following LLMs:
Major Providers
Research Labs
Enterprise & Other
Major Providers
Commercial AI providers and their flagship models
Metaβ
- meta-llama/Meta-Llama-3.1-8B-Instruct
- meta-llama/Meta-Llama-3.1-70B-Instruct
- meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
- meta-llama/Llama-3.3-70B-Instruct
- meta-llama/Llama-Guard-3-8B
- meta-llama/Llama-3-8b-chat-hf
- meta-llama/Llama-3-70b-chat-hf
- meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
- meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
- meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
- meta-llama/Llama-3.2-3B-Instruct-Turbo
- meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo
- meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo
- meta-llama/Llama-4-Scout-17B-16E-Instruct
- meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
OpenAIβ
- gpt-4o
- gpt-4o-mini
- gpt-4.1
- gpt-4.1-mini
- gpt-4.1-nano
- gpt-4.5-preview
- gpt-5
- gpt-5-mini
- gpt-5-nano
- gpt-5-chat
- gpt-5-codex
- o1
- o1-mini
- o1-preview
- o3
- o3-mini
- o4-mini
Googleβ
- google/gemma-2-27b-it
- gemini-1.5-pro-latest
- gemini-2.0-flash
- gemini-2.0-flash-thinking-exp-01-21
- gemini-2.0-pro-exp-02-05
- gemini-2.5-flash-preview-05-20
- gemini-2.5-pro-preview-06-05
- gemini-2.5-pro-exp-05-06
- gemini-2.5-pro
- gemini-2.5-pro (Vertex AI)
- gemini-2.5-flash
Anthropicβ
- claude-3-5-haiku-20241022
- claude-3-5-sonnet-20241022
- claude-3-7-sonnet-20250219
- claude-sonnet-4-20250514
- claude-opus-4-20250514
Mistral AIβ
Research Labs
Models from AI research organizations
Qwen (Alibaba Cloud)β
- Qwen/Qwen1.5-72B-Chat
- Qwen/Qwen2-72B-Instruct
- Qwen/Qwen2-VL-7B-Instruct
- Qwen/Qwen2.5-72B-Instruct
- Qwen/Qwen2.5-VL-72B-Instruct
- Qwen/QwQ-32B
- Qwen/Qwen3-235B-A22B-FP8
- Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
DeepSeek AIβ
- deepseek-ai/DeepSeek-V3
- deepseek-ai/DeepSeek-R1
- deepseek-ai/deepseek-llm-67b-chat
- deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
- deepseek-ai/DeepSeek-R1-0528
- deepseek-chat
- deepseek-reasoner
NVIDIAβ
Enterprise & Other
H2O.ai models and others
Other Modelsβ
- h2oai/h2o-danube2-1.8b-chat
- h2oai/h2o-danube3-4b-chat
- h2oai/h2ovl-mississippi-2b
- microsoft/Phi-3-mini-128k-instruct
- microsoft/Phi-3-medium-128k-instruct
- microsoft/Phi-3-vision-128k-instruct
- MiniMax-Text-01
- NousResearch/Nous-Capybara-34B
- NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
- openchat/openchat-3.5-1210
- OpenGVLab/InternVL-Chat-V1-5
- OpenGVLab/InternVL2-Llama3-76B
- perplexity-ai/r1-1776
- upstage/SOLAR-10.7B-Instruct-v1.0
- openai/gpt-oss-120b
- grok-4
and many more. Our latest RAG benchmark results lists all tested models: RAG benchmark results.
note
The table inside the LLMs tab renders the supported LLMs.