Training

Preference dataset

A preference dataset contains ranked or paired responses.

Quick definition

A preference dataset contains ranked or paired responses.

It trains reward or preference models. In training workflows, preference dataset often shapes model adaptation.

Training adapts models through fine-tuning or preference optimization. It uses curated datasets and evaluation loops.

Training methods tailor models to your domain and use case.

Human rankings of two answers.

Low-quality data can degrade performance. Keep datasets clean, representative, and well-labeled.

In BoltAI, this is referenced when discussing model customization.