Skip to main content
Version: v1.7.0

Agents

What's the H2O AI super agent?​

Enterprise h2oGPTe agents are general-purpose, code-first AI agents that analyze your input, decide which tools to use, write and execute code, read error messages, revise their approach, and iterate until they solve the problem. This differs fundamentally from specific-purpose agents built in frameworks like LangGraph or CrewAI, where each agent handles a single workflow.

H2O agents can do anything. Given the same request twice, an agent might solve it differently each time—adapting its strategy based on context, available tools, and intermediate results.

In h2oGPTe, the agent toggle moves the platform from chat mode (question-and-answer) to agent mode (autonomous task execution). When you enable the agent, the LLM stops simply responding and starts doing.

Agent Marketecture

What agents can do out of the box​

With no configuration, agents have access to a broad set of built-in tools:

Data and AI
Run AutoML experiments, analyze datasets, generate charts, execute Python scripts, and build machine learning models.
Web and research
Search across Google, Bing, Wikipedia, academic journals, and internal sources.
Content and media
Generate images, create presentations, transcribe audio and video, and capture webpage screenshots.
Code and automation
Write, test, and debug code, run shell scripts, integrate with GitHub, and orchestrate multi-step workflows.
Documents
Ask questions about uploaded files, extract insights, summarize content, and auto-tag documents by type and metadata.
External services
Connect to third-party APIs and databases through MCP servers.

See Agent tool configuration for the full list of built-in tools and their configuration parameters.

Agent flavors​

The General Agent is the core H2O AI Super Agent. Through settings and prompting, it can handle virtually any task. To simplify common workflows, h2oGPTe includes purpose-built flavors that work out of the box:

General
Research, content creation, automation
Broadest tool access, maximum flexibility for multi-step tasks.
Data Science
Data analysis, ML experiments
Data-focused tools with H2O Driverless AI integration.
Deep Research
In-depth investigation
Extended iterations and thorough source gathering across many sources.
Coder
Software development, debugging
Code execution tools and development workflow optimization.

To select an agent flavor:

  1. From the h2oGPTe dashboard, click + New chat.
  2. Enable the Agent toggle.
  3. In the agent dropdown, select the flavor that matches your task.
tip

Start with General if you're unsure. It can do everything the specialized flavors do—the flavors just get there faster for their target tasks.

Accuracy levels​

Each agent flavor supports accuracy levels that control how much effort the agent puts into solving your request:

Quick
10 turns · 30sFast answers and simple lookups
Basic
20 turns · 60sStraightforward, single-step tasks
Standard
40 turns · 120sMost tasks (default)
Maximum
80 turns · 240sComplex analysis, highest accuracy

Higher accuracy means the agent can take more steps and spend more time per step, which improves results for complex tasks but takes longer.

Customize agents with collections​

Collections turn a general-purpose agent into a domain-specific one. When you connect an agent to a collection, it can search and reason over your documents alongside its built-in tools.

RAG modes​

How the agent interacts with your documents depends on the RAG (retrieval-augmented generation) mode:

ModeHow it worksBest for
Agentic RAGAgent decides when to query documents using a tool callComplex questions that combine document knowledge with other tools
RLM RAGRetrieval-language model approach, limited to two retrieval iterationsFocused document questions where you want concise, grounded answers
Fast Agentic RAGDocument chunks load directly into the agent's contextSpeed-sensitive queries over smaller document sets

Custom tools​

Extend what agents can do by uploading your own code as tools:

  • Accept structured inputs (for example, a company name)
  • Execute custom business logic (for example, look up brand colors)
  • Return results the agent uses in its workflow (for example, apply colors to a chart)

Add custom tools to collections and configure them through prompt settings. See Tool calling for details.

Custom agents​

For workflows that require a specific orchestration pattern, you can build and upload custom agents using popular frameworks:

  • LangGraph—graph-based agent workflows
  • CrewAI—multi-agent collaboration
  • OpenAI Agents SDK—OpenAI-compatible agents
  • Claude—Anthropic Claude-based agents

You can package custom agents as ZIP files and manage them from the Agents tab. You can create them directly in h2oGPTe using natural language by describing what you want, and the Agents Builder will generate the code.

How the pieces connect​

An agent in h2oGPTe brings together these platform capabilities:

Models — Agents use the LLM configured for your chat session (Claude, GPT, local models, or custom models). The model handles reasoning and tool selection.
Tools — Built-in tools provide core capabilities. Custom tools and MCP servers extend the agent with your own code and external integrations.
Files — Agents can read uploaded files, generate new files (scripts, charts, documents, presentations), and make them available for download.
Collections and RAG — Connect agents to document collections for domain-specific knowledge. Choose a RAG mode based on your accuracy and speed requirements.
Access control — Admins control which tools and models agents can access, and who can create or manage custom agents and API keys.
Evaluation — Use Eval Studio to assess agent response quality, compare configurations, and track performance over time.

Review agent behavior​

Understand how an agent completes your request:

  1. Monitor in real time — While the agent runs, expand the row in the chat view. Watch as it generates and executes code step by step.
  2. Enable Show thoughts — Turn on the Show thoughts toggle to see the agent's reasoning—how it chooses tools and plans its approach.
  3. View agentic analysis — After the task completes, open the Agentic analysis tab for a detailed breakdown of every step, including generated code, tool invocations, and execution results.
  4. Review agent planning — Review the agent's planned approach before execution begins. Approve or modify the plan to ensure it meets your requirements.
  5. Download outputs — Go to the Downloadable files tab to access scripts, charts, documents, and other generated files.

Tutorials​


Feedback