What's new in h2oGPTe v1.6.x

We are excited to announce the release of h2oGPTe 1.6! Read on to learn about the new features and improvements in the h2oGPTe 1.6.x series that help you find answers and generate content based on your private data.

Overview of the 1.6.x Release

The h2oGPTe 1.6.x series delivers significant capabilities for autonomous AI assistance, enhanced APIs, and improved document processing:

Major features

  • Agent functionality: Autonomous tool use with code execution, placing #1 on the GAIA leaderboard for General AI assistants
  • Role-Based Access Control (RBAC): Comprehensive permission system for users and groups
  • REST API: OpenAPI-compliant interface with auto-generated bindings for Python, JavaScript, and Go
  • Custom GPT creation: Collection-based configuration system for tailored AI assistants

Key improvements across releases

  • Exception handling and resilience: Improved exception handling and increased resilience for document ingestion and old parsing issues (v1.6.35)
  • Internationalization: Added RTL language support to improve UI compatibility with right-to-left languages (v1.6.34)
  • Enhanced mobile experience: Better navigation, responsive design, and improved card layouts (v1.6.33)
  • Developer experience: Async API examples, simplified client connections, and enhanced code execution (v1.6.33)
  • System reliability: Better error handling, improved self-tests, and enhanced stability (v1.6.32, v1.6.33)
  • Performance optimizations: Faster document ingestion, improved chat queries, and streamlined processing (v1.6.32)

Recent patch releases

h2oGPTe v1.6.36

Improvements

User interface and experience

  • Prompt templates: Added prompt template group sharing functionality
  • Document management: Fixed manual document tag creation
  • Document browsing: Resolved pagination limit issues when browsing documents
  • Desktop filters: Adjusted document filters minimum width to prevent layout overflow
  • File upload: Improved ingestion dialog appearance and user experience
  • Banner support: Added configurable banner functionality for system administrators

Security and authentication

  • User permissions: Fixed an issue where users could assign shared keys belonging to other users; assignment is now limited to admin shared keys
  • Guest authentication: Improved handling of the guest user fingerprint as the subject identifier
  • Token exchange: Enhanced token exchange for JavaScript RPC client with performance optimizations

System administration and configuration

  • Settings management: Improved settings with automatic updates and better performance

Performance and scalability

Processing optimizations

  • Concurrent processing: Improved handling of concurrent requests for better throughput
  • Image processing: Added support for non-A4 format pictures with proper fitting

Backend improvements

  • MCP enhancements: Upgraded Model Context Protocol (MCP) with various improvements
  • Browser automation: Updated Puppeteer for improved browser-use support

Bug fixes

Agent functionality

  • Environment variables: Improved agent environment variable handling for better reliability

User interface

  • Document tags: Resolved manual document tag creation problems
  • Collection sharing: Enhanced collection sharing functionality with proper access controls

Documentation improvements

  • System administration: Enhanced documentation for system administration and configuration features

Model and integration updates

  • h2ogpt updates: Multiple h2ogpt version updates with various improvements and fixes

h2oGPTe v1.6.0

Released: January 31, 2025

Agent features

Agent overview

The h2oGPTe Agent enables autonomous tool use through code execution. It uses large language models (LLMs) for code generation and reasoning. The agent achieved the #1 ranking on the GAIA leaderboard, which measures the usefulness of general AI assistants.

Key features include:

  • Deep Research assistance: Provides autonomous analysis with full transparency into decision-making processes
  • Comprehensive output: Delivers analysis summaries, internal chat transcripts, and downloadable artifacts for each conversation
  • File management: Gives you access to newly created files (PDF, Excel, PowerPoint) and all code snippets used in document creation

Agent control

You can enable or disable the agent through the chat input interface. The agent operates autonomously based on your prompts and configured prompt templates.

Accuracy Presets control conversation depth and processing time:

  • Quick
  • Basic (default)
  • Standard
  • Maximum

Each preset defines the number of conversation turns and maximum processing time per turn.

Agent tools

Administrators can enable or disable these agent tools for chat queries:

Code and Development:

  • Aider Code Generation
  • Shell Scripting
  • Python Coding

Data Visualization:

  • Mermaid Chart-Diagram Renderer
  • Image Generation

Content Processing:

  • Ask Question About Image
  • Audio-Video Transcription
  • Convert Document to Text
  • Screenshot Webpage

Research and Search:

  • Google Search
  • Bing Search
  • Scholar Papers Search
  • Wolfram Alpha Math Science Search
  • Wikipedia Articles Search
  • Wayback Machine Search
  • Web Image Search

Document Analysis:

  • Ask Question About Documents
  • RAG (Retrieval-Augmented Generation) Vision
  • RAG Text

System Integration:

  • H2O Driverless AI Data Science
  • Browser Navigation
  • Download Web Video
  • Advanced Reasoning
  • Evaluate Answer

Network Access:

  • Internet Access
  • Intranet Access

Access control and permissions

Role-based access control (RBAC)

This version introduces a comprehensive role and permission system. Each role contains specific permissions, and administrators can assign roles to users and groups from federated authentication providers like LDAP.

Available permissions:

Chat management:

  • Delete chats
  • Submit chat feedback

Collection management:

  • Add collections
  • Delete collections
  • Edit collections
  • Make collection public
  • Share collections

Document management:

  • Add documents
  • Delete documents

Template management:

  • Delete prompt templates
  • Edit prompt templates
  • Share prompt templates

System administration:

  • Show admin center
  • Allow device pairing when configured
  • Show extractors
  • Show live logs
  • Show models page
  • Show private button
  • Manage roles
  • Display system notifications
  • Display developer settings
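
The role and permission model described in this section can be sketched in a few lines of Python. This is a minimal illustration, not h2oGPTe's implementation: the class names (`Role`, `Principal`, `AccessPolicy`), the permission strings, and the group name are all hypothetical, and the sketch assumes that a user's effective permissions are the union of the permissions of every role assigned to the user directly or through one of the user's groups.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class Role:
    """A named bundle of permissions (names here are hypothetical)."""
    name: str
    permissions: frozenset[str]


@dataclass
class Principal:
    """A user plus the groups reported by the identity provider (e.g. LDAP)."""
    username: str
    groups: set[str] = field(default_factory=set)


class AccessPolicy:
    """Hypothetical in-memory policy: roles are assigned to users and groups."""

    def __init__(self) -> None:
        self._roles: dict[str, Role] = {}
        self._user_roles: dict[str, set[str]] = {}
        self._group_roles: dict[str, set[str]] = {}

    def define_role(self, role: Role) -> None:
        self._roles[role.name] = role

    def assign_to_user(self, username: str, role_name: str) -> None:
        self._require_role(role_name)
        self._user_roles.setdefault(username, set()).add(role_name)

    def assign_to_group(self, group: str, role_name: str) -> None:
        self._require_role(role_name)
        self._group_roles.setdefault(group, set()).add(role_name)

    def permissions_for(self, principal: Principal) -> set[str]:
        # Collect roles assigned directly and through group membership.
        role_names = set(self._user_roles.get(principal.username, set()))
        for group in principal.groups:
            role_names |= self._group_roles.get(group, set())
        # Assumption: effective permissions are the union over all roles.
        permissions: set[str] = set()
        for name in role_names:
            permissions |= self._roles[name].permissions
        return permissions

    def is_allowed(self, principal: Principal, permission: str) -> bool:
        return permission in self.permissions_for(principal)

    def _require_role(self, role_name: str) -> None:
        if role_name not in self._roles:
            raise KeyError(f"unknown role: {role_name}")
```

In this sketch, assigning a role to a group grants its permissions to every member of that group, so an administrator can manage access for a whole directory group with a single assignment.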

API and developer tools

REST API

A new REST API complements the existing Python RPC client. The API conforms to the OpenAPI standard and provides built-in Swagger UI documentation.

Auto-generated bindings:

  • Python REST API
  • JavaScript REST API
  • Go REST API

Custom GPT creation

Create custom AI assistants using the formula: Collection + Collection Settings + Default Chat Settings = Custom GPT.

Each collection contains default chat settings that apply to new conversations. You can save the current settings as the collection defaults with the "Apply current settings as collection defaults" button or through the API.
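
The formula can be pictured as a small data structure. The sketch below is illustrative only and does not use the h2oGPTe client: the `Collection` class, its method names, and the setting keys (`chunk_size`, `llm`, `temperature`) are hypothetical. It shows a collection carrying its own settings plus default chat settings, with each new conversation starting from a copy of those defaults.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Any


@dataclass
class Collection:
    """Hypothetical stand-in for a collection that acts as a custom GPT."""
    name: str
    collection_settings: dict[str, Any] = field(default_factory=dict)
    default_chat_settings: dict[str, Any] = field(default_factory=dict)

    def apply_current_settings_as_defaults(self, current: dict[str, Any]) -> None:
        # Store a copy so later changes to `current` do not alter the defaults.
        self.default_chat_settings = dict(current)

    def new_chat_settings(self, overrides: dict[str, Any] | None = None) -> dict[str, Any]:
        # A new conversation starts from the collection defaults;
        # per-chat overrides win without modifying the stored defaults.
        merged = dict(self.default_chat_settings)
        merged.update(overrides or {})
        return merged


# Usage: configure the collection once, then start chats from its defaults.
contracts = Collection("contracts", collection_settings={"chunk_size": 512})
contracts.apply_current_settings_as_defaults({"llm": "example-model", "temperature": 0.2})
chat_settings = contracts.new_chat_settings({"temperature": 0.7})
```

Keeping the defaults on the collection, rather than on each chat, is what lets one configured collection behave as a reusable assistant for every new conversation.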

Code generation for chat messages

Each chat message displays the equivalent Python client code, enabling developers to replicate queries programmatically.

Model and processing improvements

Reasoning model support

The models page displays reasoning capabilities and shows which reasoning models support non-reasoning models, similar to vision model relationships. Reasoning models work with chat, RAG (Retrieval-Augmented Generation), and agent use cases.

Vision capabilities

Enhanced vision model functionality across the platform.

Document processing

Parsing improvements:

  • Layout detection
  • Chunking algorithms
  • Image captioning
  • Text conversion
  • Document highlighting
  • Excel handling (large tables are summarized while remaining fully accessible to agents for data analysis)

Handwriting recognition

The H2O Mississippi model provides default handwriting-to-text transcription and ships with the platform.

Supported LLMs

Support for proprietary and open-source models includes:

Cloud providers:

  • Claude 3.5 (Bedrock)
  • OpenAI o1 (Azure)
  • OpenAI o1-mini (Azure)
  • Gemini 2.0 Flash
  • Gemini 2.0 Flash Thinking

Open-Source Models:

  • DeepSeek V3
  • DeepSeek R1
  • MiniMaxAI
  • Qwen/Qwen2.5
  • Qwen/Qwen2-VL
  • Qwen/QwQ
  • Llama-3.3-70B
  • Llama-3.2-11B-Vision
  • Llama-3.2-90B-Vision

H2O Models:

  • H2O Mississippi

Performance and scalability

Architecture improvements

  • Models Service: Redesigned backend enables horizontal scaling for document ingestion and chat through a dedicated service shared by chat, crawl, and core services
  • Auto-Scaling: Optional KEDA-based auto-scaling for the models service
  • Database Operations: Parallelized Vex database operations
  • Conversion Speed: Accelerated text-to-PDF conversion

User experience enhancements

Interface improvements:

  • Separate guardrails and PII (Personally Identifiable Information) settings
  • GUI-based custom guardrails configuration
  • Enhanced PDF display
  • Collection thumbnails
  • Improved models page layout
  • Better scrolling and pagination
  • Syntax highlighting for markdown and code blocks
  • Enhanced job cancellation
  • Faster automatic RAG type detection

Model testing

Self-Test enhancements:

  • Functional self-tests with multimodal RAG and guided JSON
  • Better detection of model endpoint configuration issues

Administrative features

White labeling

Customization options include:

  • Custom logos
  • Color schemes
  • Greeting messages
  • Personality configuration in prompt templates

Topic modeling

Generate topic model visualizations for any collection with a single click. Visualizations show clusters of similar phrases and concepts, providing content overviews and identifying areas for content optimization.

Collection management

Lifecycle controls:

  • Configurable collection expiration times
  • Collection size limits

Performance optimizations

  • Faster public chat sharing

Security

No critical or high CVEs at the time of release.

Support and resources

Upgrade information

We recommend upgrading to the latest version of h2oGPTe 1.6.x to access these improvements. The upgrade process preserves all your existing data and configurations.

