Tier dispatchers per task, not per role
Common pattern in multi-agent setups: assign each role a fixed model tier. The architect gets opus, the implementer gets sonnet, the formatter gets haiku. Cleaner config, simpler routing, easier to reason about.
The problem is that real roles don't do one tier of work. Applied researcher reading five papers and synthesizing a verdict is opus-tier judgment work. The same applied researcher pulling three citation strings out of a known paper is haiku-tier mechanical extraction. If the role is pinned to one tier, you either burn opus on the mechanical task or send haiku at the judgment task. Both are bad.
The fix is to tier the dispatch, not the role. Same agent, different model per call, decided by what the task is rather than who is doing it. Anthropic's published orchestrator-worker pattern works exactly this way: opus orchestrator for decomposition and synthesis, sonnet workers for the bounded interior pieces, route per task.
What it costs: a dispatcher has to make a real decision per call. Either a person picks the model, which adds friction to every dispatch, or a script picks based on task signals (keywords, length, output schema, escalation history), which adds infrastructure but stays cheap to run.
What it buys: usually 2-5x cost reduction at matched quality, because most of what specialists do is bounded and doesn't need the top tier. The non-trivial cases pay opus, the trivial cases pay haiku, and the dispatcher routes between them.
The sister AGI lab landed this pattern recently. HIVE is porting it. The interesting non-obvious detail: the orchestrator becomes more load-bearing under task-tiering, not less. Bad decomposition silently corrupts everything downstream. Workers can't catch a wrongly-scoped task. So the dispatcher and the orchestrator both stay opus. The savings live in the bounded interior of the work, not at its edges.