Version: v1.6.1 🚧

List LLMs and their compatible vision model names

Overview

For any given large language model (LLM), you can look up the vision model that is compatible with it. Knowing this pairing matters because combining the two models lets an application handle both text and visual information.

Using both models together, developers can build applications that interpret, analyze, and generate content from multimodal inputs.

Example

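from h2ogpte import H2OGPTE
from tabulate import tabulate

# Connect to the h2oGPTe server; replace the placeholder with your own API key.
client = H2OGPTE(
    address="https://h2ogpte.genai.h2o.ai",
    api_key='sk-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX',
)

# Build one row per LLM: [LLM name, compatible vision model name].
table = [[key, value] for key, value in client.get_llm_and_auto_vision_llm_names().items()]

# Print the mapping as a formatted table.
print(tabulate(table, headers=["LLM", "Vision model"], tablefmt="pretty"))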
The code prints a table similar to the following:

+------------------------------------------------+------------------------------------------------+
| LLM | Vision model |
+------------------------------------------------+------------------------------------------------+
| h2oai/h2o-danube3-4b-chat | mistralai/Pixtral-12B-2409 |
| meta-llama/Meta-Llama-3.1-70B-Instruct | mistralai/Pixtral-12B-2409 |
| meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 | mistralai/Pixtral-12B-2409 |
| Qwen/Qwen2.5-72B-Instruct | mistralai/Pixtral-12B-2409 |
| Qwen/Qwen2-VL-72B-Instruct | Qwen/Qwen2-VL-72B-Instruct |
| mistralai/Pixtral-12B-2409 | mistralai/Pixtral-12B-2409 |
| mistralai/Mixtral-8x7B-Instruct-v0.1 | mistralai/Pixtral-12B-2409 |
| meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | mistralai/Pixtral-12B-2409 |
| meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | mistralai/Pixtral-12B-2409 |
| upstage/SOLAR-10.7B-Instruct-v1.0 | mistralai/Pixtral-12B-2409 |
| mistralai/Mistral-7B-Instruct-v0.3 | mistralai/Pixtral-12B-2409 |
| google/gemma-2-27b-it | mistralai/Pixtral-12B-2409 |
| meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo | meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo |
| meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo | meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo |
| meta-llama/Llama-3.2-3B-Instruct-Turbo | mistralai/Pixtral-12B-2409 |
| mistral-tiny | mistralai/Pixtral-12B-2409 |
| mistral-small-latest | mistralai/Pixtral-12B-2409 |
| mistral-medium | mistralai/Pixtral-12B-2409 |
| mistral-large-latest | mistralai/Pixtral-12B-2409 |
| gemini-1.5-pro-latest | gemini-1.5-pro-latest |
| gemini-1.5-flash-latest | gemini-1.5-flash-latest |
| claude-3-haiku-20240307 | claude-3-haiku-20240307 |
| claude-3-sonnet-20240229 | claude-3-sonnet-20240229 |
| claude-3-5-sonnet-20240620 | claude-3-5-sonnet-20240620 |
| gpt-4o | gpt-4o |
| gpt-4o-mini | gpt-4o-mini |
+------------------------------------------------+------------------------------------------------+
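
Once the mapping is available, it can be queried like any dictionary. The following is a minimal sketch, assuming `get_llm_and_auto_vision_llm_names()` returns a dict-like object that maps each LLM name to its vision model name (the example above calls `.items()` on it). To stay runnable without a server, it uses a hand-copied sample of four rows from the table above. The helper names `vision_model_for`, `self_paired_llms`, and `group_by_vision_model` are hypothetical and not part of the h2ogpte client.

```python
from collections import defaultdict

# Hand-copied sample of the table above. In practice, this would be the
# result of client.get_llm_and_auto_vision_llm_names() (assumed dict-like).
llm_to_vision = {
    "h2oai/h2o-danube3-4b-chat": "mistralai/Pixtral-12B-2409",
    "Qwen/Qwen2-VL-72B-Instruct": "Qwen/Qwen2-VL-72B-Instruct",
    "mistral-large-latest": "mistralai/Pixtral-12B-2409",
    "gpt-4o": "gpt-4o",
}


def vision_model_for(llm, mapping):
    """Return the vision model paired with `llm`, or None if it is not listed."""
    return mapping.get(llm)


def self_paired_llms(mapping):
    """Return, in sorted order, the LLMs that are listed as their own vision model."""
    return sorted(llm for llm, vision in mapping.items() if llm == vision)


def group_by_vision_model(mapping):
    """Invert the mapping: vision model name -> list of LLMs paired with it."""
    grouped = defaultdict(list)
    for llm, vision in mapping.items():
        grouped[vision].append(llm)
    return dict(grouped)


print(vision_model_for("mistral-large-latest", llm_to_vision))
print(self_paired_llms(llm_to_vision))
print(group_by_vision_model(llm_to_vision)["mistralai/Pixtral-12B-2409"])
```

Rows where both columns match (for example `gpt-4o`) are the ones `self_paired_llms` returns; every other LLM in the sample is paired with a separate vision model.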
