Models

Transformer

Transformer is the neural network architecture behind most modern LLMs.

Quick definition

Transformer is the neural network architecture behind most modern LLMs.

It relies on self-attention to model relationships across tokens in parallel. In models workflows, transformer often shapes model capability and fit.

Model architecture and scale determine capability. Context length, parameter count, and modality support vary across models.

Model architecture affects capability, context length, and speed.

GPT-style models are transformers.

Bigger is not always better. Match the model to the task and evaluate in production.

In BoltAI, this shows up in model selection and configuration.