Review: Reinforcement Learning Basics

Reinforcement learning is a mathematical framework.

Demystify the reinforcement learning model, it’s a open-ended model using reward function to optimize agent to solve complex task in target environment.

Step by Step

For RLHF training method, here are three core steps:

Pretraining a language model
Gathering data(问答数据) and training a reward model
Fine-tuning the LM with reinforcement learning

Step 1. Pretraining Language Models

Read this to learn how to train a LM:

Pretraining language models

OpenAI used a smaller version of GPT-3 for its first popular RLHF model - InstructGPT.

Nowadays, RLHF is new area, there’s no answer to which model is the best for starting point of RLHF and using expensive augmented data to fine-tune is not necessarily.

Step 2. Reward model training

In reward model, we integrate human preferences into the system.

Reference

Reinforcement Learning from Human Feedback: From Zero to chatGPT, YouTube, HuggingFace
Hugging Face blog, ChatGPT 背后的“功臣”——RLHF 技术详解

🎣 JudeW's Knowledge Brain

Recent writing

IELTS Talking Material Preparation

Words related to different topics in IELTS writing - Crime About

IELTS Academic Writing Task 2 Reading note

Reinforcement Learning from Human Feedback

Review: Reinforcement Learning Basics

Step by Step

Step 1. Pretraining Language Models

Step 2. Reward model training

Reference

Graph View

Table of Contents

Backlinks