Training

Distillation

Quick definition

Distillation transfers knowledge from a large model to a smaller one.

  • Category: Training
  • Focus: knowledge transfer and model compression
  • Used in: Producing a smaller, faster model that approximates a larger one.

What it means

Distillation trains a smaller "student" model to reproduce the behavior of a larger "teacher" model. The goal is to keep as much of the teacher's performance as possible while making inference smaller, faster, and cheaper.

How it works

A teacher model produces targets for a training set: either its generated responses or its output probability distributions ("soft labels"). The student is then trained to match those targets, typically by minimizing a loss that compares the student's predictions to the teacher's, sometimes blended with a standard loss on ground-truth labels.
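
A minimal sketch of the classic soft-target formulation (assuming PyTorch; the function and tensor names are illustrative):

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target loss (match the teacher) with a hard-label loss."""
    # Soften both distributions with a temperature, then compare via KL divergence.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_loss = F.kl_div(log_student, soft_targets, reduction="batchmean") * temperature ** 2

    # Standard cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)

    return alpha * soft_loss + (1 - alpha) * hard_loss

# Toy usage: a batch of 8 examples over 10 classes.
student_logits = torch.randn(8, 10, requires_grad=True)
teacher_logits = torch.randn(8, 10)
labels = torch.randint(0, 10, (8,))
distillation_loss(student_logits, teacher_logits, labels).backward()
```

The temperature softens the teacher's distribution so the student also learns from the relative probabilities of the wrong answers, and alpha balances imitating the teacher against fitting the ground-truth labels.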

Why it matters

Distillation lets you serve a smaller, cheaper, faster model that retains much of a larger model's capability, which matters for latency, cost, and on-device deployment.

Common use cases

  • Compressing a large model into a smaller one for cheaper, lower-latency inference.
  • Training a specialized small model on a larger model's outputs for a narrow task.
  • Producing models small enough to run on-device or at the edge.

Example

Train a small "student" model on outputs generated by a larger "teacher" model, so the student learns to imitate the teacher's responses.
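
A rough sketch of that workflow; teacher_generate and train_student are hypothetical placeholders rather than a specific API:

```python
# Sequence-level distillation: build a training set from a teacher's outputs.

prompts = [
    "Summarize the release notes in one sentence.",
    "Rewrite this email in a friendlier tone.",
]

def teacher_generate(prompt: str) -> str:
    # In practice: call the large model (a hosted API or a local checkpoint).
    return f"(teacher's answer to: {prompt})"

# 1. Collect the teacher's responses as synthetic training targets.
pairs = [(prompt, teacher_generate(prompt)) for prompt in prompts]

# 2. Fine-tune the small student on the (prompt, teacher output) pairs.
def train_student(examples):
    for prompt, target in examples:
        ...  # a standard supervised fine-tuning step on the student model

train_student(pairs)
```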

Pitfalls and tips

The student inherits the teacher's errors and biases, and low-quality or unrepresentative distillation data degrades it further. Keep the teacher's outputs and the dataset clean, representative, and well-labeled.

In BoltAI

In BoltAI, distillation is referenced when discussing model customization.