Tutorials
Learn about H2O LLM DataStudio a no-code application and toolkit designed to streamline data curation, preparation, and augmentation tasks for large language models (LLMs).
Learning path
The H2O LLM DataStudio tutorials are available for all the supported workflows. The workflows include:
Question and Answer
- Tutorial: Preparation of a dataset for the problem type of Question Answering
This tutorial describes the process of preparing a dataset that consists of contextual information, questions, and corresponding answers.
Text Summarization
- Tutorial: Preparation of a dataset for the problem type of Text Summarization
This tutorial describes the process of preparing a dataset that consists of articles and their associated summaries.
Instruct Tuning
- Tutorial: Preparation of a dataset for the problem type of Instruct Tuning
This tutorial describes the process of preparing a dataset that consists of prompts and their respective responses.
Human - Bot Conversations
- Tutorial: Preparation of a dataset for the problem type of Human - Bot Conversations
This tutorial describes the process of preparing a dataset comprising multiple dialogues between human users and chatbots.
Continued PreTraining
- Tutorial: Preparation of a dataset for the problem type of Continued PreTraining
This tutorial describes the process of preparing datasets with extensive texts for further pretraining of language models.
- Submit and view feedback for this page
- Send feedback about H2O LLM DataStudio | Docs to cloud-feedback@h2o.ai