Version: v1.6.8 🚧

List vision capable LLMs

Overview

Users can list the names of vision-capable multi-modal large language models (LLMs) available in the environment that can natively process images as input.

Example

from h2ogpte import H2OGPTE

client = H2OGPTE(
    address="https://h2ogpte.genai.h2o.ai",
    api_key='sk-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX',
)

vision_capable_llm_names = client.get_vision_capable_llm_names()

for llm in vision_capable_llm_names:
    print(llm)

auto
Qwen/Qwen2-VL-72B-Instruct
mistralai/Pixtral-12B-2409
meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo
meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo
Qwen/Qwen2-VL-7B-Instruct
gemini-1.5-pro-latest
gemini-1.5-flash-latest
claude-3-haiku-20240307
claude-3-sonnet-20240229
claude-3-5-sonnet-20240620
gpt-4o
gpt-4o-mini

Feedback

Submit and view feedback for this page
Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai

Overview​

Example​

Overview

Example