Safety

Hallucination

A hallucination is a confident but incorrect model output.

Quick definition

A hallucination is a confident but incorrect model output.

It often occurs when the model lacks reliable context. In safety workflows, hallucination often shapes risk reduction.

Safety systems combine policy rules, classifiers, and human feedback to reduce harmful outputs.

Safety concepts reduce harmful outputs and protect users.

Inventing a citation that does not exist.

Over-blocking can frustrate users while under-blocking increases risk. Balance safety with usability.

In BoltAI, this relates to safe outputs and content handling.