Version: v1.6.37-dev1 🚧

Concepts

Enterprise h2oGPTe uses several key terms across its documentation, and each, in turn, is explained in the following sections.

LLM

A Large Language Model (LLM) is a type of AI model that uses deep learning techniques and uses massive datasets to analyze and generate human-like language. For example, many AI chatbots or AI search engines are powered by LLMs.

Generally speaking, LLMs can be characterized by the following parameters:

Size of the training dataset
Cost of training (computational power)
Size of the model (parameters)
Performance after training (or how well the model is able to respond to a particular question)

GPT

GPT, short for Generative Pre-Trained Transformer, is an advanced open-source language model that utilizes transformer architectures to generate human-like text. It is trained on vast amounts of unlabeled text data from the internet, enabling it to understand and generate coherent and contextually relevant text. Unlike rule-based systems, GPT learns patterns and structures in text data to generate human-like responses.

For more information, see GPT (Generative Pre-Trained Transformer).

RAG

Retrieval-augmented generation (RAG) is an AI framework for improving the quality of responses generated by Large Language Models (LLMs) by grounding the model on external sources of knowledge. RAG-equipped chatbots absorb their information from a variety of sources, including databases, documents, and the internet, to provide accurate and contextually relevant responses. This is particularly useful when users have complex or multi-step queries. Using a RAG system contributes significantly towards making the business more agile, especially if the company has a customer-facing chatbot.

For more information, see Boosting LLMs to New Heights with Retrieval Augmented Generation.

LLM Prompt

A Large Language Model (LLM) Prompt is a question or request you send to an LLM to generate a desired response. This can be a question you want the LLM to answer or a request for the LLM to complete. The goal of using an LLM Prompt is to elicit a specific response from the model, whether it be a piece of information, a summary, or a creative work.

Transformer Neural Networks

Neural networks are an efficient way to solve machine learning problems and can be used in various situations. Neural networks offer precision and accuracy. Finding the correct neural network for each project can increase efficiency. Recurrent neural networks (RNNs) remember previously learned predictions to help make future predictions with accuracy. Unlike RNNs, Transformer Neural Networks do not have a concept of timestamps. This enables them to pass through multiple inputs at once, making them a more efficient way to process data.

For more information, see Transformer Architecture.

Fine-Tuning

Fine-Tuning refers to the process of taking a pre-trained language model and further training it on a specific task or domain to improve its performance on that task. It is an important technique used to adapt Large Language Models (LLMs) to specific tasks and domains.

Self-Reflection

In Enterprise h2oGPTe, Self-Reflection asks another Large Language Model (LLM) to reflect on the answer given to the question based on the provided context. Self-reflection can be used to evaluate the LLM’s performance.

Agents

Agents in Enterprise h2oGPTe are autonomous LLM-powered entities that use tools to complete multi-step tasks such as answering questions, generating summaries, querying APIs, and running scripts.

They follow a structured reasoning process to select tools, break down tasks, and improve over time. You can extend their capabilities with custom tools, such as one that applies brand colors to plots based on company names.

For more information, see Tutorial 9: Creating and using a custom agent tool.

Agentic AI vs generative AI

While generative AI focuses on producing text, images, or other content based on prompts (for example, answering a question or writing an email), agentic AI goes a step further. Agentic AI enables the model to reason, plan, and take actions toward achieving a goal, often over multiple steps and with tool usage.

Generative AI is reactive and prompt-based. Agentic AI is proactive and task-oriented.

Tools

Tools are external functions or APIs that an Agent can invoke to complete a task. In Enterprise h2oGPTe, tools extend the capability of LLMs by allowing them to perform actions beyond natural language generation. These actions may include searching databases, calling REST APIs, sending notifications, or retrieving data from knowledge sources.

Tools are essential for enabling agentic workflows, where the LLM must interact with external systems to provide useful outcomes.

Feedback

Submit and view feedback for this page
Send feedback about Enterprise h2oGPTe to cloud-feedback@h2o.ai

LLM​

GPT​

RAG​

LLM Prompt​

Transformer Neural Networks​

Fine-Tuning​

Self-Reflection​

Agents​

Agentic AI vs generative AI​

Tools​

LLM

GPT

RAG