Skip to main content
Version: v1.7.3-14 🚧

Workers

Overview​

A worker is a background process that handles a specific type of task, such as crawling documents, running chat requests, or serving LLM inference. Enterprise h2oGPTe runs eight service types distributed across one or more nodes (machines).

The Workers page gives you a real-time view of every active worker, the nodes they run on, and the task queues they serve. Use it to verify that all required services are running, identify resource constraints, spot overloaded queues, and confirm that new nodes have registered after scaling.

note

The Workers page requires the Show worker status permission. Only an administrator can assign this permission. If you do not have access, contact your administrator.

Access the Workers page​

  1. In Enterprise h2oGPTe, click Account Circle.
  2. Select System Dashboard.
  3. In the System overview section, click Workers.

The Workers page opens and displays system stats and a visual worker grid.

The Workers page showing system stats cards and a color-coded worker grid organized by node

System stats​

Four summary cards appear at the top of the page. Click any card to open a dialog with a detailed JSON breakdown.

CardDescription
WorkersTotal number of active workers across all nodes. Click to view the full worker list with hostname, queue, PID (process ID), GPU count, and CPU count for each worker.
Nodes & servicesNumber of unique node-service combinations. Click to see the worker count for each node-service pair.
Total queuesNumber of distinct task queues that have at least one worker assigned. Click to see the worker count per queue.
Pending tasksTotal number of tasks waiting in all queues. Click to see the per-queue breakdown of pending tasks. A sustained increase may indicate that workers are not keeping up with demand.

Worker types​

Enterprise h2oGPTe runs eight service types, each displayed with a distinct color in the worker grid:

ServiceColorDescription
chatRoseChat request handling, LLM prompt execution, and response streaming.
corePurpleCore platform operations, including embedding generation and document processing.
crawlGreenDocument ingestion, web crawling, and file processing.
modelsSky blueModel server management and model lifecycle operations.
vexGrayVector store operations, including similarity search and index management.
h2ogptPinkAgent execution and h2oGPT-based agentic workflows.
llmYellowLLM inference serving.
fileserverIndigoFile storage and retrieval operations.

All eight services must have at least one active worker for the platform to operate normally. If a service has no workers, the system raises a degraded-service warning in the sidebar's system-health indicator.

Worker grid​

Below the stat cards, the worker grid displays each node as a bordered card. Workers within a node appear as color-coded squares matching their service type. A color legend appears above the grid.

Each node card shows:

  • Node label: The hostname, service type, worker count, and whether the node has GPU resources.
  • Worker squares: One square per worker. Hover over a square to view the full worker details, including hostname, IP address, PID, queue assignment, GPU count, and CPU count.

Nodes are sorted by worker count, with the largest nodes first.

If no workers are registered, the grid displays No workers to display. When workers are detected, they appear here.

Real-time updates​

The Workers page refreshes automatically every 30 seconds. The system also pushes updates to the page immediately when a worker registers or expires, so you see node changes without waiting for the next refresh interval.

If a worker stops responding, the system removes it from the dashboard after 4 minutes and flags any dependent service as degraded.

Interpret common patterns​

Use the following table to diagnose common patterns on the Workers page:

What you seeWhat it meansRecommended action
All eight services have active workersThe platform is operating normally.No action required.
A service type has no workersThat service is degraded. Dependent features are unavailable.Check the node running that service. Review Live logs for error details.
Pending tasks count is growingWorkers are not processing tasks fast enough.Check whether workers are healthy. Consider scaling the affected service.
A node card shows fewer workers than expectedA worker process may have crashed or an operator may have stopped it.Review logs on that node. The worker restarts automatically when managed by an orchestrator.
Pending tasks count is zeroAll queues are empty and workers are idle.Normal during low-traffic periods.
  • Live logs - Monitor real-time platform events and troubleshoot issues
  • Jobs - View and manage background processing jobs
  • System settings - Configure platform settings including service-level parameters

Feedback