Generation

Temperature scaling

Temperature scaling adjusts the sharpness of token probabilities.

Quick definition

Temperature scaling adjusts the sharpness of token probabilities.

Lower temperature makes outputs more deterministic. In generation workflows, temperature scaling often shapes output style and randomness.

Generation settings control how the model samples tokens. They trade off creativity, determinism, and safety.

Generation settings trade off creativity, determinism, and safety.

Use 0.2 for precise answers.

High randomness can reduce accuracy while low randomness can be repetitive. Tune per task and evaluate results.

In BoltAI, this appears in model settings that shape responses.