Models

Attention head

An attention head is one parallel attention mechanism inside a transformer layer.

Quick definition

An attention head is one parallel attention mechanism inside a transformer layer.

Heads specialize in patterns such as locality or long-range links. In models workflows, attention head often shapes model capability and fit.

Model architecture and scale determine capability. Context length, parameter count, and modality support vary across models.

Model architecture affects capability, context length, and speed.

A head may connect opening and closing brackets.

Bigger is not always better. Match the model to the task and evaluate in production.

In BoltAI, this shows up in model selection and configuration.