Models

Self-attention

Self-attention lets a model weigh relationships between tokens in a sequence.

Quick definition

Self-attention lets a model weigh relationships between tokens in a sequence.

Each token attends to others to build contextual representations. In models workflows, self-attention often shapes model capability and fit.

Model architecture and scale determine capability. Context length, parameter count, and modality support vary across models.

Model architecture affects capability, context length, and speed.

Pronouns attend to the nouns they refer to.

Bigger is not always better. Match the model to the task and evaluate in production.

In BoltAI, this shows up in model selection and configuration.