Changelog | Enterprise h2oGPTe

What's new in h2oGPTe v1.6.32

June 18, 2025 · 4 min read

We are excited to announce the release of h2oGPTe 1.6.32! This release brings important improvements, bug fixes, and new features to enhance your experience with h2oGPTe.

Document and collection management

Added a delete option to the documents grid view, allowing for easier file management.
Improved space management and visual layout on the Collections page to enhance the user experience.
Introduced a Lite ingest mode for faster, more streamlined document processing.

User interface and experience

Enhanced the filter bar on the Documents page for improved usability on mobile devices.
Improved the cosmetic appearance of the UI in private mode.
Moved model failure notification to an alert in the sidebar to decrease interruptions to the user experience.
The auto-logout logic has been improved to prevent premature session terminations.
Added a prompt template for British English to better support international users.

System administration and configuration

Added support for accepting older H2O.ai public keys during license checks to improve flexibility.
Exposed WebSocket ping timeouts to be configurable, allowing administrators to fine-tune network settings for chat session connections.
Exposed S3 connection limits to allow for tuning during high-load periods in document ingestion workflows.
Made the web crawl functionality optional, giving administrators control over whether external website crawling is available as a RAG ingestion method.

Performance and scalability

Addressed and resolved slow ingestion speeds for certain PDF documents, ensuring timely processing.
Optimized chat queries for faster and more efficient performance.
Ensured unique processing paths are used for per-page PDFs to improve ingestion reliability.
Increased the memory allocated to the crawler in Kubernetes environments to handle larger workloads.
Updated the system to use the latest version of Chromium for improved performance and security.

Agents and AI

Enhanced the prompt query to ensure the LLM recognizes when it is performing a RAG task.
Deduplicated document metadata sent to the LLM to improve efficiency and response quality.
Updated the PII (Personally Identifiable Information) model and its detection threshold.

System stability and robustness

Implemented a mechanism to identify and mark stale jobs that do not have an active worker.
Enabled sub-services to refresh and register their state independently for better system-wide awareness.
Forced a chat pod to restart if it is unable to consume new user tasks, ensuring service availability.
Disallowed multiple self-tests for the same LLM from running simultaneously to prevent conflicts.

AI developers and API enhancements

Optimized the logic for the automatic chat naming feature by checking the setting before pulling chat history.

Application and UI bugs

Corrected a bug where sending a new agent chat message in the same session would fail after stopping a previous message.
Fixed issue where agent chat with a collection would fail to include the RAG context.
Fixed issue where collection configuration settings were not applied to a new chat until the page was refreshed.
Addressed a security vulnerability related to gemini affecting its accuracy for RAG benchmark.
The final response for non-streamed REST API calls is now always returned as expected.
Addressed a potential security vulnerability related to unsafe quoting in code.
Fixed an issue that prevented non-owners from sharing prompt templates.
Fixed issue to ensure the most up-to-date document name is used in references.
Fixed issue where the page would not resynchronize after redaction.
Fixed issue with the back button to ensure it consistently aligns with the page heading across all relevant pages.
The thumbnail picker container now has a consistent appearance with the thumbnail card container.
The correct parsing algorithm is now used for highlighting evaluated chat messages.
Fixed the visibility condition for the user pairing link.

Backend and API bugs

Corrected the Go package name to resolve build issues.
Reverted a type change to restore correct file type detection with libmagic.

Testing and CI/CD

Addressed several flaky tests to improve the reliability of the test suite.
Tests will no longer fail due to repeated LLM timeouts.
Agent key-related tests are now run serially to prevent conflicts.
Fixed an issue with Vex self-registration in the CI pipeline.

Support

For technical support and questions about this release, please refer to our documentation or contact our support team at support@h2o.ai.

Next steps

We recommend upgrading to v1.6.32 to take advantage of these improvements. The upgrade process is straightforward and maintains all existing data and configurations.

Feedback

Submit and view feedback for this page
Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai

What's new in h2oGPTe v1.6.33

June 18, 2025 · 3 min read

We are excited to announce the release of h2oGPTe 1.6.33! This release brings important improvements, bug fixes, and new features to enhance your experience with h2oGPTe.

Document and collection management

Fixed an issue with the collection count display when filtering collections through search. The count now reflects accurate results, improving clarity during collection management.

User interface and experience

Improved error messages for administrators who lose access to private Collections, providing clearer feedback on permission changes.
Enhanced the display of usernames in card views, replacing badges with cleaner text formatting and improving responsive design for mobile devices.
Fixed layout issues in collection and document cards to provide better visual consistency across different screen sizes.
Improved mobile navigation with better menu handling and responsive design for smaller screens, reducing navigation friction on portable devices.
Enhanced prompt code block rendering with syntax highlighting headers and improved code formatting for better readability and developer experience.

AI developers and API enhancements

Added comprehensive async API examples to the documentation covering authentication, document management, and chat operations to help developers integrate with h2oGPTe more effectively.
Removed requirement to manually close async client connections, simplifying API usage for developers and preventing resource leaks.
Enhanced agent tool detection for Python and shell code blocks, improving code execution capabilities and expanding supported programming languages.

System stability and robustness

Fixed an InvalidStateError that occurred during React component lifecycle events, improving UI reliability.
Improved error handling for agent chat sessions to reduce failure rates when restarting messages.

Application and UI bugs

Fixed cursor navigation issues in code blocks that affected keyboard users.
Resolved mobile button list rendering errors that could affect user interface stability.
Improved error handling for Collection sharing to provide clearer permission feedback.

Backend and API bugs

Improved persistence of Collection configuration settings, eliminating the need to refresh the page.
Fixed non-streamed REST API response handling for better reliability.

Testing and CI/CD

Enhanced PII detection and redaction capabilities with improved accuracy and expanded data type coverage for better data privacy protection in document processing workflows.
Improved arXiv document downloader reliability for more consistent document processing.

Support

For technical support and questions about this release, please refer to our documentation or contact our support team at support@h2o.ai.

Next steps

We recommend upgrading to v1.6.33 to take advantage of these improvements. The upgrade process is straightforward and maintains all existing data and configurations.

Feedback

Submit and view feedback for this page
Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai

We are excited to announce the release of h2oGPTe 1.6! Read on to learn about the new features in the 1.6.x series which will improve your ability to find answers and generate new content based on your private data.

Agent

The main new feature is the h2oGPTe Agent, which introduces powerful autonomous tool use capabilities through code execution, while using LLMs for code generation and reasoning.

The h2oGPTe Agent placed #1 on the GAIA leaderboard, which measures the usefulness of General AI assistants.

It is a highly capable deep research assistant, with full transparency into its inner workings.

For every agentic chat conversation, the output now includes:

Agentic Analysis
- Utilized Tools: shows the subset of tools used by the agent
- Summary View: all the steps taken by the agent
Agentic Internal Chat
- A transcript of all agentic internal conversations, available in summary or detail view
Downloadable files
- All newly created files by the agent, including final artifacts such as PDF/Excel/PowerPoint but also all code snippets and logs used to create those documents
- All files available to the agent, especially relevant for existing collection documents and/or for enabled chat history

Controlling the Agent

The agent can be enabled or disabled inside of the chat message input form for convenience. The agent is acting autonomously based on the user prompts and the prompt templates used. There are 4 presets for agent accuracy, each has a default value for the number of agentic conversation turns and the maximum time each turn can take. Both are controllable by the user.

Quick
Basic (default)
Standard
Maximum

Agent Tools that can be enabled/disabled for every agentic chat query:

Aider Code Generation
Mermaid Chart-Diagram Renderer
Image Generation
Ask Question About Image
Audio-Video Transcription
Convert Document to Text
Download Web Video
Screenshot Webpage
Google Search
Browser Navigation
Scholar Papers Search
Wolfram Alpha Math Science Search
H2O Driverless AI Data Science
Bing Search
Web Image Search
Ask Question About Documents
Wikipedia Articles Search
Wayback Machine Search
Advanced Reasoning
Evaluate Answer
Shell Scripting
Python Coding
RAG Vision
RAG Text
Internet Access
Intranet Access

Each of these tools can be enabled/disabled by the admin.

Roles and Permissions

This version introduces a list of roles defined in the system, and every role is associated with a set of permissions. Users and groups configured by the federated authentication and identity providers (LDAP) are reflected in the system and admins can modify the roles and permissions for each user and group.

Role-based access controls (RBAC) include:

Delete chats
Submit chat feedback
Add collections
Delete collections
Edit collections
Make collection public
Share collections
Show admin center
Allow device pairing when configured
Show extractors
Show live logs
Show models page
Show private button
Add documents
Delete documents
Delete prompt templates
Edit prompt templates
Share prompt templates
Manage roles
Display system notifications
Display developer settings

Custom GPT via collection settings

Collection + Collection Settings + Default Chat Settings = Custom GPT

Each collection now has a set of default chat settings that will be applied for each new chat with this collection. The default chat settings can be applied from any chat session by clicking the 'Apply current settings as collection defaults' button or via API. The collection settings page shows the current set of default settings.

REST API

A new REST API has been implemented, in addition to the existing Python RPC client. It conforms to the OpenAPI standard and it is exposed via built-in Swagger UI.

Auto-generated bindings are available on the API page.

Python REST API
JavaScript REST API
Go REST API

Reasoning models

The models page shows which models support reasoning capabilities and which reasoning models are enabled for which non-reasoning models, as a supporting model, similar to vision capabilities. Reasoning models can be used for chat and RAG use cases and for agentic use cases.

Improved Vision capabilities

Various improvements to Vision model capabilities have been made.

Improved Document Parsing capabilities

Various improvements to parsing:

layout detection
chunking
image captioning
text conversion
document highlighting
Excel document handling (large tables are summarized, but still available in full to the agent for data science)

Improved Handwriting support

The H2O Mississippi model is now used by default for transcription of handwriting to text. It is shipped out of the box.

Support of the latest LLMs

We support all widely available proprietary and open-source models. Some noteworthy new models include:

Claude 3.5 (Bedrock)
OpenAI o1 (Azure)
OpenAI o1-mini (Azure)
DeepSeek V3
DeepSeek R1
Gemini 2.0 Flash
Gemini 2.0 Flash Thinking
MiniMaxAI
Qwen/Qwen2.5
Qwen/Qwen2-VL
Qwen/QwQ
Llama-3.3-70B
Llama-3.2-11B-Vision
Llama-3.2-90B-Vision
H2O Mississippi

Improved Scalability and Speed

Models Service: The backend has been redesigned to allow horizontal scaling for faster speed and higher throughput for document ingestion and chat via a dedicated models service that chat/crawl/core services are sharing.
Optional auto-scaling of the models service via KEDA
Several improvements to improve the responsiveness of the application have been made
The Vex Docker image has been reduced in size
Parallelized Vex DB operations
Text to PDF conversion has been sped up

UX improvements

Guardrails and PII settings can be separately enabled/disabled
Custom guardrails can now be entered from the GUI
PDF display has been improved
Thumbnails for collections
Improved models page
Improved scrolling and pagination
Syntax highlighting for markdown and code blocks
Improved job cancellation
Speed up automatic RAG type detection

Improved Models Self-Test

More functional self-tests, now does multimodal RAG with guided JSON
More likely to expose flaws in configuration of model endpoints

White Labeling

Custom logo, colors, greeting message, and personality in prompt templates

Topic Model

For every collection, a topic model visualization can be created by the click of a button. It shows clusters of similar phrases and concepts found in the documents, and enables a quick overview of the content and potentially spots of low information density for quick visual debugging of the collection.

Code for chat messages

Every chat message now shows the client code needed to run the same query from the Python client.

Collection Expiration and Size limits

Admins can now let collections expire after a certain amount of time. Collection size limits can be set as well.

Public sharing of chats has been sped up.

Security Vulnerability Fixes

No critical or high CVEs at the time of release.

Feedback

Submit and view feedback for this page
Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai

What's new in h2oGPTe v1.5

August 28, 2024 · 5 min read

We are excited to announce the release of h2oGPTe 1.5! Read on to learn about the new features in the 1.5.x series which will improve your ability to find answers and generate new content based on your private data.

Chat First

The GUI has been revamped to lead with chat first. You can start chatting immediately, and add documents or collections to the chat.

New Connectors

New connectors in v1.5:

Amazon S3
Google Cloud Storage
Azure BLOB
Sharepoint
Upload Plain Text (automatically triggered if large text is copy & pasted into the chat)

Choice of OCR models

The v1.5 release brings support for more languages by introducing a new set of OCR model choices (for conversion of documents to text), including the auto-detection of the language for each page of every document.

Automatic (default)
Tesseract (over 60 different languages)
DocTR
PaddleOCR
PaddleOCR-Chinese
PaddleOCR-Arabic
PaddleOCR-Japanese
PaddleOCR-Korean
PaddleOCR-Cyrillic
PaddleOCR-Devanagari
PaddleOCR-Telugu
PaddleOCR-Kannada
PaddleOCR-Tamil

Model Comparison Page

A new models page offers easy comparison between all LLMs

Tabular view of all metrics such as cost, accuracy, speed, latency, context lengths, vision capabilities, guided generation features and chat template
Graphical scatter plot to compare models across 2 dimensions, with optional log-scale
Usage and performance stats are now shown as a tab on the models page
A self-test button shows green or red lights for each LLM within secons to confirm that all LLMs are operational with "quick" and "rag" benchmark modes exposed to all users that test chat and RAG modes.
Admins have access to "full" and "stress" tests as well, to make sure LLMs are configured to handle large contexts properly.

Model Routing with Cost Controls

Automatically chooses the best LLM for the task given cost constraints such as:

Max cost per LLM call
Willingness to pay for extra accuracy (how much to pay for +10% accuracy for this LLM call?)
Willingness to wait for extra accuracy (how long to wait for +10% accuracy for this LLM call?)
Max cost per million tokens for LLMs to be considered
Fixed list of models to choose from

Any of these cost controls can be combined. The GUI exposes the first 3 cost constraints.

Guardrails

Fully customizable Guardrails:

Prompt Guard (fine-tuned DeBERTa v3 model), Jailbreak and Prompt Injection
Llama Guard (fine-tuned LLM), 14 classes of unsafe content
Custom Guardrails, arbitrary LLM and prompting

Guardrails are applied to:

All user prompts

If unsafe content is detected, the following action is performed:

fail

Redaction of PII or regular expressions

These PII detection methods are combined for maximum precision and recall:

regular expressions
Presidio model: 5 languages (en, es, fr, de, zh), 36 different PII entities
Custom PII model: 59 different PII entities

Personally identifiable information (PII) is checked for in these places:

Parsing of documents
LLM input
LLM output

If PII is detected in any of the above places, one of the following actions is performed:

allow
redact
fail

You have full control over the list of entities to flag, via JSON spec, controllable per collection.

Document Metadata

You can now choose what information from the document is provided to the LLM.

Filename
Page Number
Document Text
Document Age
Last Modification Date
Retrieval Score
Ingestion Method
URI

Multimodal Vision Capabilities

v1.5.x brings support for multimodal vision capabilities, including state-of-the-art open-source vision models. This allows processing of flowcharts, images, diagrams and more.

GPT-4o/GPT-4o-mini
Gemini-1.5-Pro/Flash
Claude-3/Claude-3.5
InternVL-Chat
InternVL2-26B/76B

Support for upcoming LLMs via Chat Templates

v1.5.x can support yet unreleased future LLMs using Hugging Face chat templates.

Guided Generation

A powerful new feature in v1.5 is the guided generation. For example, the LLM can be instructed to create perfect JSON that adheres to a provided schema. Or it can be instructed to create output that matches a regular expression, or follows a certain grammar, or contains only output from a provided list of choices.

All these powerful options are exposed in the API:

guided_json
guided_regex
guided_choice
guided_grammar
guided_whitespace_pattern

Note that guided generation also works for vision models. For most (proprietary) models not hosted by vLLM (such as OpenAI, Claude, etc.), only guided_json is supported for now.

Document AI: Summarize, Extract, Process

The document summarization API was generalized to full document processing using the map/reduce paradigm for LLMs. In combination with the new connectors, custom OCR models, document metadata, PII redaction, guided generation, multimodal vision models, prompt templates, powerful Document AI workflows are now possible.

Example use cases:

Custom summaries
Convert flow charts to custom JSON
Extract all financial information
Classify documents or images with custom labels

Tagging Documents and Chatting with a subset of the Collection

You can now tag documents (via the Python client), and provide a list of tags to include when chatting with a collection.

Out of the box prompt templates

Multiple new prompt templates were added for convenience.

Improved Scalability and Speed

Several improvements to improve the responsiveness of the application have been made.

Eval Studio integration

H2O Eval Studio is now integrated into h2oGPTe.

Prompt templates can now be shared with selected users.

Improved Cloud integration

Minio backend for storage can be replaced with S3. GCS/Azure storage backend is upcoming.

Security Vulnerability Fixes

No critical or high CVEs at the time of release.

Live logs for admins

Real-time logs for core/crawl/chat services for administrator users.

Feedback

Submit and view feedback for this page
Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai

What's new in h2oGPTe v1.4.13

May 8, 2024 · 8 min read

We are excited to announce the release of h2oGPTe 1.4.13! Read on to learn about the new features in the 1.4.x series which will improve your ability to find answers and generate new content based on your private data.

Create non-English embeddings

Your data isn't always in English. In fact, your documents, audio files, and images may span many languages, and now, h2oGPTe can help you answer questions on any language.

v1.4.x brings support for a new embedding model, bge-m3. This embedding model is best in class for multi-lingual data and supports more than 100 languages.

We recommend using bge-large-en-v1.5 for English use cases and this is the default embedding model used in the environment.

Customize the Embedding Model per Collection

You may want to customize the embedding model used for each collection of documents or use case, and now you can when creating a new collection.

All documents added to this collection will be embedded using that model, and all queries to this collection will use that embedding model. Please note that you cannot change the emedding model of a collection after the fact, it is only editable while creating the collection.

Embedding Model options

The generative AI space is moving fast and there are new technologies every week. H2O.ai is regularly adding support for new embedding and language models. Today, you can enable the following embedding models in your environment:

bge-large-en-v1.5
bge-m3
instructor-large
bge-large-zh-v1.5
multilingual-e5-large
instructor-xl

Support for new LLMs

The v1.4 release series brings support for many new LLMs including H2O.ai's small language models H2O Danube. Working with Southeast Asia? You may want to use SeaLLM-7B-v2 or sea-lion-7b-instruct.

The full list has 18+ types of LLMs supported with the latest and greatest regularly being added.

Introducing the Prompt Catalog

Gone are the days of having a collection of really good System Prompts saved in a file on your desktop! The new Prompt Catalog comes with 18+ out of the box prompts for common tasks like Question Answering (in many languages) and Expert Summarizing and tones like Funny or Simple.

You can explore all of the available prompts and create your own on the new Prompts page.

When chatting, click the Settings button down near where you chat to chate the prompt template you're using.

Out of the Box Language Prompts

The following languages are supported for RAG out of the box by selecting the right prompt template, remember to also use the multi-language embedding model:

Chinese
Turkish
Russian
Portuguese
Malay
Japanese
Indonesian
Hindi
French
Persian
Spanish Castilian

Out of the Box English Use Cases

While users can create their own prompt templates, there are also 9 common use cases supported out of the box.

Retrieval Augmented Generation (this is the default)
Expert Summarizer: Generates structured summaries
Plain and Simple: Creates simple answers that are easily understood
Funny and Sarcastic: Creates a humorous personality that injects sarcasm into responses
Citations: Provide citations and references to end users
Keynote Presentation: Generates TED-quality keynote presentations
Tech Impact Analyzer: Generates impact analysis of technology projects
Value Determinator: Determines the value provided to the reader
Engineering Wizard: Designs and evaluates different proposals for solving engineering problems

When creating your own prompt template, you can choose to use any of these as the starting point if you would like to slightly alter them.

Example: Citation Prompt Template

As an example, we have uploaded the h2oGPTe documentation into the h2oGPTe product and asked "How can I add a new document into a collection?". With the default prompt template, which is optimized for Retrieval Augmented Generation, we get the following helpful answer:

To add a new document into a collection, follow these steps according to the information provided in the document context:

Navigate to the Enterprise h2oGPTe navigation menu.
Click on 'Collections'.
Select the name of the Collection you want to add a document to.
Click on '+ Add documents'.
Choose a method to import the document, such as Upload documents or Import from system.
Browse for the document you want to add.
Click 'Add'.

Please note that the exact wording of the navigation menu items and buttons might vary slightly depending on the specific version of Enterprise h2oGPTe you are using.

This helps our end users, but maybe would would like to also provide citation to users so they can go read the full documentation themselves. We will click Settings and select Citation from the dropdown list of Prompt Templates. Asking the same question, we get the same answer, except now we have a list of the exact documentation pages that the answer came from.

To add a new document into a collection, follow these steps according to the document sources provided:

1. Navigate to the Enterprise h2oGPTe interface.
2. Click on 'Collections' in the navigation menu.
3. Select the specific collection where you want to add a document.
4. Click on '+ Add documents'.
5. Choose a method to import the document, such as 'Upload documents' or 'Import from system'.
6. Browse and select the document you wish to add.
7. Click 'Add'.

Remember, you can only select an embedding model once when creating a new collection. Also, you can toggle the 'Create short document summaries' button to auto-generate a summary of your document, and the 'Create sample questions for documents' button to receive auto-suggested sample questions based on your document.

[1] Guide - Collections - Add a Document(s) to a Collection - Version: v1.4.11 - Enterprise h2oGPTe documentation
[2] Create a Collection - Version: v1.3.11 - Enterprise h2oGPTe documentation
[3] Add a Document(s) to a Collection - Version: v1.3.11 - Enterprise h2oGPTe documentation
[4] Add a Document(s) to a Collection - Version: v1.1.7 - Enterprise h2oGPTe documentation
[5] Add a Document(s) to a Collection - Version: v1.4.9 - Enterprise h2oGPTe documentation

We can see that we have access to multiple version of the documentation. As a user, we might then ask the same question but clarify which versions we are using.

Chat redesign

There are many changes to the feel and functionality of the Chat sessions in the 1.4 release:

Settings can now be found in the chat tool bar
- Customize the LLM tempurature to make more creative or deterministic answers
- Set the maximum lenght of responses
- Set the number of neighbor chunks for RAG+ to add additional context from the source documents
New controls for each part of the conversation can be found to the right of the user's message
- Copy the response
- Provide feedback if the response was good or bad
- View the entire prompt and context sent to the LLM
- View usage and cost information about the LLM interaction
- Delete this Q&A
Ask questions with audio using the Listen function of the chat toolbar
Easily start chatting with LLMs from the UI without using a collection of data using the New Chat button from the Chat Sessions page

H2O AI Cloud integration

Users of the H2O AI Cloud can now authenticate to their h2oGPTe environment using the Platform Token improving the end-to-end Predictive and Generative workflow.

This is especially helpful when building custom UIs on top of h2oGPTe using Wave. The below code can be used to authenticate to h2oGPTe in your Wave app deployed in the App Store making it so that all users who use your app are logging in to h2oGPTe as themselves.

from h2ogpte import H2OGPTE
import h2o_authn

token_provider = h2o_authn.TokenProvider(
	refresh_token=q.auth.refresh_token,
	token_endpoint_url=f"{os.getenv('H2O_WAVE_OIDC_PROVIDER_URL')}/protocol/openid-connect/token",
	client_id=os.getenv("H2O_WAVE_OIDC_CLIENT_ID"),
	client_secret=os.getenv("H2O_WAVE_OIDC_CLIENT_SECRET"),
)
client = H2OGPTE(address=os.getenv("H2OGPTE_URL"), token_provider=token_provider)

Enhanced Jobs experience

When doing document analytics and chat, many of the steps can take some time, such as ingesting a large website or deleting old files. Long running tasks, or Jobs, can be found by clicking the server icon in the top right hand corner. This will open a queue of any running tasks including the ability to easily read error messages if anything went wrong.

General Improvements

Search and filter documents by name
View the retrieval and LLM response name for each query in the Chat Session Usage
Improved quality of generated example questions
Less steps needed to customize LLM parameters from the Python API
Chat sharing is now available for air-gapped installs

Feedback

Submit and view feedback for this page
Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai

Return to docs

Document and collection management​

User interface and experience​

System administration and configuration​

Performance and scalability​

Agents and AI​

System stability and robustness​

AI developers and API enhancements​

Application and UI bugs​

Backend and API bugs​

Testing and CI/CD​

Support​

Next steps​

Document and collection management​

User interface and experience​

AI developers and API enhancements​

System stability and robustness​

Application and UI bugs​

Backend and API bugs​

Testing and CI/CD​

Support​

Next steps​

Agent​

Controlling the Agent​

Roles and Permissions​

Custom GPT via collection settings​

REST API​

Reasoning models​

Improved Vision capabilities​

Improved Document Parsing capabilities​

Improved Handwriting support​

Support of the latest LLMs​

Improved Scalability and Speed​

UX improvements​

Improved Models Self-Test​

White Labeling​

Topic Model​

Code for chat messages​

Collection Expiration and Size limits​

Faster chat sharing​

Security Vulnerability Fixes​

Chat First​

New Connectors​

Choice of OCR models​

Model Comparison Page​

Model Routing with Cost Controls​

Guardrails​

Redaction of PII or regular expressions​

Document Metadata​

Multimodal Vision Capabilities​

Support for upcoming LLMs via Chat Templates​

Guided Generation​

Document AI: Summarize, Extract, Process​

Tagging Documents and Chatting with a subset of the Collection​

Out of the box prompt templates​

Improved Scalability and Speed​

Eval Studio integration​

Sharing of Prompt Templates​

Improved Cloud integration​

Security Vulnerability Fixes​

Live logs for admins​

Create non-English embeddings​

Customize the Embedding Model per Collection​

Embedding Model options​

Support for new LLMs​

Introducing the Prompt Catalog​

Out of the Box Language Prompts​

Out of the Box English Use Cases​

Example: Citation Prompt Template​

Chat redesign​

H2O AI Cloud integration​

Enhanced Jobs experience​

General Improvements​

Document and collection management

User interface and experience

System administration and configuration

Performance and scalability

Agents and AI

System stability and robustness

AI developers and API enhancements

Application and UI bugs

Backend and API bugs

Testing and CI/CD

Support

Next steps

Document and collection management

User interface and experience

AI developers and API enhancements

System stability and robustness

Application and UI bugs

Backend and API bugs

Testing and CI/CD

Support

Next steps

Agent

Controlling the Agent

Roles and Permissions

Custom GPT via collection settings

REST API

Reasoning models

Improved Vision capabilities

Improved Document Parsing capabilities

Improved Handwriting support

Support of the latest LLMs

Improved Scalability and Speed

UX improvements

Improved Models Self-Test

White Labeling

Topic Model

Code for chat messages

Collection Expiration and Size limits

Faster chat sharing

Security Vulnerability Fixes

Chat First

New Connectors

Choice of OCR models

Model Comparison Page

Model Routing with Cost Controls

Guardrails

Redaction of PII or regular expressions

Document Metadata

Multimodal Vision Capabilities

Support for upcoming LLMs via Chat Templates

Guided Generation

Document AI: Summarize, Extract, Process

Tagging Documents and Chatting with a subset of the Collection

Out of the box prompt templates

Improved Scalability and Speed

Eval Studio integration

Sharing of Prompt Templates

Improved Cloud integration

Security Vulnerability Fixes

Live logs for admins

Create non-English embeddings

Customize the Embedding Model per Collection

Embedding Model options

Support for new LLMs

Introducing the Prompt Catalog

Out of the Box Language Prompts

Out of the Box English Use Cases

Example: Citation Prompt Template

Chat redesign

H2O AI Cloud integration

Enhanced Jobs experience

General Improvements