What's new in h2oGPTe v1.6.x
We are excited to announce the release of h2oGPTe 1.6! Read on to learn about the new features and improvements in h2oGPTe 1.6.x series that help you find answers and generate content based on your private data.
Overview of the 1.6.x Releaseโ
The h2oGPTe 1.6.x series delivers significant capabilities for autonomous AI assistance, enhanced APIs, and improved document processing:
Major featuresโ
- Agent functionality: Autonomous tool use with code execution, placing #1 on the GAIA leaderboard for General AI assistants
- Role-Based Access Control (RBAC): Comprehensive permission system for users and groups
- REST API: OpenAPI-compliant interface with auto-generated bindings for Python, JavaScript, and Go
- Custom GPT creation: Collection-based configuration system for tailored AI assistants
Key improvements across releasesโ
- Exception handling and resilience: Improved exception handling and increased resilience for document ingestion and old parsing issues
(v1.6.35)
- Internationalization: Added RTL language support to improve UI compatibility with right-to-left languages
(v1.6.34)
- Enhanced mobile experience: Better navigation, responsive design, and improved card layouts
(v1.6.33)
- Developer experience: Async API examples, simplified client connections, and enhanced code execution
(v1.6.33)
- System reliability: Better error handling, improved self-tests, and enhanced stability
(v1.6.32, v1.6.33)
- Performance optimizations: Faster document ingestion, improved chat queries, and streamlined processing
(v1.6.32)
Recent patch releases
- Patch v1.6.36 (July 2025)
- Patch v1.6.33 (June 2025)
- Patch v1.6.32 (June 2025)
h2oGPTe v1.6.36โ
Improvementsโ
User interface and experienceโ
- Prompt templates: Added prompt template group sharing functionality
- Document management: Fixed manual document tag creation
- Document browsing: Resolved pagination limit issues when browsing documents
- Desktop filters: Adjusted document filters minimum width to prevent layout overflow
- File upload: Improved ingestion dialog appearance and user experience
- Banner support: Added configurable banner functionality for system administrators
Security and authenticationโ
- User permissions: Fixed issue where users could assign shared keys from other users, now properly limited to admin shared keys
- Guest authentication: Improved guest user fingerprint handling as subject identifier
- Token exchange: Enhanced token exchange for JavaScript RPC client with performance optimizations
System administration and configurationโ
- Settings management: Improved settings with automatic updates and better performance
Performance and scalabilityโ
Processing optimizationsโ
- Concurrent processing: Improved handling of concurrent requests for better throughput
- Image processing: Added support for non-A4 format pictures with proper fitting
Backend improvementsโ
- MCP enhancements: Upgraded Model Context Protocol (MCP) with various improvements
- Browser automation: Updated Puppeteer for improved browser-use support
Bug fixesโ
Agent functionalityโ
- Environment variables: Improved agent environment variable handling for better reliability
User interfaceโ
- Document tags: Resolved manual document tag creation problems
- Collection sharing: Enhanced collection sharing functionality with proper access controls
Documentation improvementsโ
- System administration: Enhanced documentation for system administration and configuration features
Model and integration updatesโ
h2ogpt updates: Multiple h2ogpt version updates with various improvements and fixes
h2oGPTe v1.6.33โ
Improvementsโ
User interface and mobile experienceโ
- Collection management: Fixed collection count display when filtering through search, ensuring accurate result counts
- Error messaging: Enhanced error messages for administrators who lose access to private collections
- Card layouts: Improved username display in card views with cleaner text formatting and responsive design for mobile devices
- Mobile navigation: Better menu handling and responsive design for smaller screens
- Code rendering: Enhanced prompt code block rendering with syntax highlighting headers and improved formatting
API and developer experienceโ
- Documentation: Added comprehensive async API examples covering authentication, document management, and chat operations
- Code execution: Enhanced agent tool detection for Python and shell code blocks, expanding supported programming languages
Bug fixesโ
System stabilityโ
- Error handling: Improved UI reliability and stability
- Agent sessions: Enhanced error handling for agent chat sessions to reduce failure rates when restarting messages
User interfaceโ
- Fixed cursor navigation issues in code blocks for keyboard users
- Resolved mobile button list rendering errors affecting interface stability
- Improved error handling for collection sharing with clearer permission feedback
Backend and APIโ
- Enhanced persistence of collection configuration settings, eliminating page refresh requirements
- Fixed non-streamed REST API response handling for better reliability
Performance improvementsโ
- Enhanced PII (Personally Identifiable Information) detection and redaction with improved accuracy and expanded data type coverage for document processing workflows
- Improved arXiv document downloader reliability for consistent document processing
New model supportโ
New model support in this version:
- Claude 3.5 (Bedrock)
- OpenAI o1 (Azure)
- OpenAI o1-mini (Azure)
- Gemini 2.0 Flash
- Gemini 2.0 Flash Thinking
- DeepSeek V3
- DeepSeek R1
- Qwen/Qwen2.5
- Qwen/Qwen2-VL
- Qwen/QwQ
h2oGPTe v1.6.32โ
Improvementsโ
Document and collection managementโ
- File management: Added delete option to documents grid view for easier file management
- Collections layout: Enhanced space management and visual layout on Collections page
- Processing speed: Introduced Lite ingest mode for faster, streamlined document processing
User interface and experienceโ
- Mobile usability: Enhanced filter bar on Documents page for improved mobile device usability
- Private mode: Improved UI appearance in private mode
- Notifications: Moved model failure notifications to sidebar alerts to reduce user experience interruptions
- Session management: Enhanced auto-logout logic to prevent premature session terminations
- Localization: Added prompt template for British English to support international users
System administration and configurationโ
- License flexibility: Added support for older H2O.ai public keys during license checks
- Network configuration: Exposed configurable WebSocket ping timeouts for administrators to fine-tune chat session connections
- Storage optimization: Exposed S3 connection limits for tuning during high-load document ingestion periods
- Feature control: Made web crawl functionality optional, giving administrators control over external website crawling as a RAG (Retrieval-Augmented Generation) ingestion method
Performance and scalabilityโ
Processing optimizationsโ
- PDF handling: Resolved slow ingestion speeds for certain PDF documents
- Chat performance: Optimized chat queries for faster, more efficient performance
- Processing reliability: Ensured unique processing paths for per-page PDFs to improve ingestion reliability
- Browser updates: Updated to latest Chromium version for improved performance and security
AI and agent enhancementsโ
- RAG optimization: Enhanced prompt queries to ensure LLMs recognize RAG task performance
- Metadata efficiency: Deduplicated document metadata sent to LLMs for improved efficiency and response quality
- PII detection: Updated PII (Personally Identifiable Information) model and detection threshold
System reliabilityโ
Stability improvementsโ
- Service coordination: Enabled sub-services to refresh and register state independently for better system awareness
API and developer enhancementsโ
- Performance optimization: Optimized automatic chat naming feature logic by checking settings before pulling chat history
Bug fixesโ
User interface and applicationโ
- Agent messaging: Fixed bug where new agent chat messages would fail after stopping previous messages in the same session
- Collection integration: Fixed agent chat with collections failing to include RAG context
- Configuration persistence: Fixed collection configuration settings not applying to new chats until page refresh
- API responses: Ensured final responses for non-streamed REST API calls are always returned
- Template sharing: Fixed issue preventing non-owners from sharing prompt templates
- Document references: Ensured most up-to-date document names are used in references
- Page synchronization: Fixed page resynchronization issues after redaction
- Navigation consistency: Fixed back button alignment with page headings across all relevant pages
- UI consistency: Aligned thumbnail picker container appearance with thumbnail card containers
- Message highlighting: Applied correct parsing algorithm for highlighting evaluated chat messages
- User interface: Fixed visibility condition for user pairing link
Backend and APIโ
- API reliability: Improved file type detection for better document processing
h2oGPTe v1.6.0
Released: January 31, 2025
Agent featuresโ
Agent overviewโ
The h2oGPTe Agent enables autonomous tool use through code execution. It uses large language models (LLMs) for code generation and reasoning. The agent achieved #1 ranking on the GAIA leaderboard, which measures General AI assistant usefulness.
Key features include:
- Deep Research assistance: Provides autonomous analysis with full transparency into decision-making processes
- Comprehensive output: Delivers analysis summaries, internal chat transcripts, and downloadable artifacts for each conversation
- File management: Gives you access to newly created files (PDF, Excel, PowerPoint) and all code snippets used in document creation
Agent controlโ
You can enable or disable the agent through the chat input interface. The agent operates autonomously based on your prompts and configured prompt templates.
Accuracy Presets control conversation depth and processing time:
- Quick
- Basic (default)
- Standard
- Maximum
Each preset defines the number of conversation turns and maximum processing time per turn.
Agent toolsโ
Administrators can enable or disable these agent tools for chat queries:
Code and Development:
- Aider Code Generation
- Shell Scripting
- Python Coding
Data Visualization:
- Mermaid Chart-Diagram Renderer
- Image Generation
Content Processing:
- Ask Question About Image
- Audio-Video Transcription
- Convert Document to Text
- Screenshot Webpage
Research and Search:
- Google Search
- Bing Search
- Scholar Papers Search
- Wolfram Alpha Math Science Search
- Wikipedia Articles Search
- Wayback Machine Search
- Web Image Search
Document Analysis:
- Ask Question About Documents
- RAG (Retrieval-Augmented Generation) Vision
- RAG Text
System Integration:
- H2O Driverless AI Data Science
- Browser Navigation
- Download Web Video
- Advanced Reasoning
- Evaluate Answer
Network Access:
- Internet Access
- Intranet Access
Access control and permissionsโ
Role-based access control (RBAC)โ
This version introduces a comprehensive role and permission system. Each role contains specific permissions, and administrators can assign roles to users and groups from federated authentication providers like LDAP.
Available permissions:
Chat management:
- Delete chats
- Submit chat feedback
Collection management:
- Add collections
- Delete collections
- Edit collections
- Make collection public
- Share collections
Document management:
- Add documents
- Delete documents
Template management:
- Delete prompt templates
- Edit prompt templates
- Share prompt templates
System administration:
- Show admin center
- Allow device pairing when configured
- Show extractors
- Show live logs
- Show models page
- Show private button
- Manage roles
- Display system notifications
- Display developer settings
API and developer toolsโ
REST APIโ
A new REST API complements the existing Python RPC client. The API conforms to the OpenAPI standard and provides built-in Swagger UI documentation.
Auto-Generated bindings:
- Python REST API
- JavaScript REST API
- Go REST API
Custom GPT creationโ
Create custom AI assistants using the formula: Collection + Collection Settings + Default Chat Settings = Custom GPT.
Each collection contains default chat settings that apply to new conversations. You can apply current settings as collection defaults through the Apply current settings as collection defaults button or via API.
Code generation for chat messagesโ
Each chat message displays the equivalent Python client code, enabling developers to replicate queries programmatically.
Model and processing improvementsโ
Reasoning model supportโ
The models page displays reasoning capabilities and shows which reasoning models support non-reasoning models, similar to vision model relationships. Reasoning models work with chat, RAG (Retrieval-Augmented Generation), and agent use cases.
Vision capabilitiesโ
Enhanced vision model functionality across the platform.
Document processingโ
Parsing improvements:
- Layout detection
- Chunking algorithms
- Image captioning
- Text conversion
- Document highlighting
- Excel handling (large tables are summarized while remaining fully accessible to agents for data analysis)
Handwriting recognitionโ
The H2O Mississippi model provides default handwriting-to-text transcription and ships with the platform.
Supported LLMsโ
Support for proprietary and open-source models includes:
Cloud providers:
- Claude 3.5 (Bedrock)
- OpenAI o1 (Azure)
- OpenAI o1-mini (Azure)
- Gemini 2.0 Flash
- Gemini 2.0 Flash Thinking
Open-Source Models:
- DeepSeek V3
- DeepSeek R1
- MiniMaxAI
- Qwen/Qwen2.5
- Qwen/Qwen2-VL
- Qwen/QwQ
- Llama-3.3-70B
- Llama-3.2-11B-Vision
- Llama-3.2-90B-Vision
H2O Models:
- H2O Mississippi
Performance and scalabilityโ
Architecture improvementsโ
- Models Service: Redesigned backend enables horizontal scaling for document ingestion and chat through a dedicated service shared by chat, crawl, and core services
- Auto-Scaling: Optional KEDA-based auto-scaling for the models service
- Database Operations: Parallelized Vex database operations
- Conversion Speed: Accelerated text-to-PDF conversion
User experience enhancementsโ
Interface improvements:
- Separate guardrails and PII (Personally Identifiable Information) settings
- GUI-based custom guardrails configuration
- Enhanced PDF display
- Collection thumbnails
- Improved models page layout
- Better scrolling and pagination
- Syntax highlighting for markdown and code blocks
- Enhanced job cancellation
- Faster automatic RAG type detection
Model testingโ
Self-Test enhancements:
- Functional self-tests with multimodal RAG and guided JSON
- Better detection of model endpoint configuration issues
Administrative featuresโ
White labelingโ
Customization options include:
- Custom logos
- Color schemes
- Greeting messages
- Personality configuration in prompt templates
Topic modelingโ
Generate topic model visualizations for any collection with a single click. Visualizations show clusters of similar phrases and concepts, providing content overviews and identifying areas for content optimization.
Collection managementโ
Lifecycle controls:
- Configurable collection expiration times
- Collection size limits
Performance optimizationsโ
- Faster public chat sharing
Securityโ
No critical or high CVEs at the time of release.
Support and resourcesโ
For technical support and questions about this release:
Additional resourcesโ
Upgrade informationโ
We recommend upgrading to the latest version of h2oGPTe 1.6.x to access these improvements. The upgrade process preserves all your existing data and configurations.
- Submit and view feedback for this page
- Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai