Models

Mixture of experts

Mixture of experts (MoE) is a model architecture that routes each input token to a small set of specialized sub-networks (experts), so only part of the model runs per token.

Quick definition

Mixture of experts (MoE) routes inputs to specialized sub-models.

  • Category: Models
  • Focus: model capability and fit
  • Used in: Choosing a model that fits latency and cost constraints.

What it means

A mixture-of-experts model replaces a single large feed-forward block with many smaller expert networks plus a learned router that activates only a few of them per input. This can improve quality while keeping compute manageable, which is why MoE shapes how you judge a model's capability and fit.

How it works

A learned router (gating network) scores every expert for each token and dispatches the token to the top-k experts, typically one or two. The chosen experts process the token and their outputs are combined, weighted by the router's scores. Only the active experts' parameters are used, so per-token compute stays small even though the model's total parameter count can be very large.
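
The sketch below shows that routing step in a few lines of PyTorch. The layer sizes, expert count, and top-2 routing are illustrative assumptions, not any particular model's configuration.

```python
# A minimal sketch of a top-k mixture-of-experts layer (illustrative only).
# Each "expert" is a small feed-forward network; a learned router picks the
# top-k experts per token and mixes their outputs by the router's softmax weights.
import torch
import torch.nn as nn

class MoELayer(nn.Module):
    def __init__(self, d_model: int, d_hidden: int, num_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, num_experts)  # scores each expert per token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(), nn.Linear(d_hidden, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:            # x: (tokens, d_model)
        scores = self.router(x)                                     # (tokens, num_experts)
        weights, chosen = torch.topk(scores, self.top_k, dim=-1)    # keep only the best k experts
        weights = torch.softmax(weights, dim=-1)                    # normalize over the chosen experts
        out = torch.zeros_like(x)
        for slot in range(self.top_k):                              # dispatch tokens to their experts
            for e, expert in enumerate(self.experts):
                mask = chosen[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

tokens = torch.randn(4, 64)               # 4 tokens with a 64-dim hidden state
layer = MoELayer(d_model=64, d_hidden=256)
print(layer(tokens).shape)                # torch.Size([4, 64]); only 2 of 8 experts ran per token
```

Production implementations batch the dispatch instead of looping over experts and usually add a load-balancing loss during training, but the routing logic is the same.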

Why it matters

Mixture of experts decouples total parameter count from per-token compute: quality can scale by adding experts while latency and serving cost stay close to those of a much smaller dense model.
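
As a rough illustration of that decoupling, here is the back-of-the-envelope arithmetic. All numbers are hypothetical, not taken from any specific model.

```python
# Hypothetical numbers: compare the total parameter count of an 8-expert MoE
# feed-forward stack with the parameters actually used per token under top-2 routing.
num_experts, top_k = 8, 2
params_per_expert = 7e9          # assume ~7B parameters per expert block
shared_params = 5e9              # assume ~5B shared (attention, embeddings, router)

total = shared_params + num_experts * params_per_expert
active = shared_params + top_k * params_per_expert

print(f"total parameters:  {total / 1e9:.0f}B")   # ~61B stored in memory
print(f"active per token:  {active / 1e9:.0f}B")  # ~19B of compute per token
```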

Common use cases

  • Choosing a model that fits latency and cost constraints.
  • Selecting longer context for document-heavy workflows.
  • Using specialized models for code, vision, or speech.

Example

For each token, the router scores all experts and forwards the token to the top two; their outputs are combined using the router's weights, so only a fraction of the model's parameters runs for that token.

Pitfalls and tips

Bigger is not always better. An MoE model's headline parameter count overstates its per-token compute but understates its memory footprint, since every expert must stay loaded. Match the model to the task and evaluate in production.

In BoltAI

In BoltAI, this shows up in model selection and configuration.