Architecture
Enterprise h2oGPTe is a retrieval-augmented generation (RAG) and agentic AI platform that runs as a set of cooperating microservices. This section explains how those services fit together, how documents and queries flow through the system, and where your data lives.
How the platform fits together
At a high level, requests enter through a Go API gateway, fan out to Python AI services, and read from and write to shared data stores:
- React frontend (UI)—the TypeScript and Tailwind web interface for chat, document viewing, collection management, and administration. It streams responses over WebSockets.
- Go API gateway (Mux)—the single entry point for the UI and API clients. It handles authentication (OIDC via Keycloak), authorization, request routing to backend services, and file operations.
- Python AI services—a set of cooperating services that handle orchestration, retrieval, ingestion, and generation:
  - Core—the orchestrator that coordinates the AI services.
  - Chat—the RAG pipeline and session management.
  - Crawl and Parse—document ingestion and extraction.
  - VEX