Reach out

Models & web search

Brigade is model-agnostic. Connect several providers at once, switch with /model, and Brigade carries your context across the switch. Web search is pluggable and auto-selected by what you have configured.

Providers out of the box#

Anthropic, OpenAI, Google Gemini, OpenRouter, Groq, Cerebras, xAI, DeepSeek, Mistral, Ollama (local), and Custom OpenAI-compatible endpoints — Together, Fireworks, vLLM, LM Studio, on-prem gateways, anything that speaks /v1/chat/completions.

terminal
$ brigade onboard                                    # add a provider interactively$ brigade config set agents.defaults.provider openrouter# in-chat: /provider to add one mid-session, /model to switch

Switching mid-conversation#

Switching models keeps the thread. When you change model or provider, Brigade carries the full transcript onto the new model — it is the same session — and:

  • Re-anchors your thinking level to what the target supports: preserved when it can reason, forced off for a non-reasoning model, bumped for a reasoning-only one.
  • Sanitizes provider-specific reasoning blocks the next provider would otherwise reject.
  • Works mid-turn (abort the in-flight run and replay your last message on the new model) or next-turn.

This is what makes /model and /provider switches seamless.

The web_search tool is pluggable and auto-selects a provider based on what you have configured:

  • Keyed — Tavily, Brave, Exa, Perplexity, Firecrawl, SearXNG.
  • Keyless — DuckDuckGo, Wikipedia, Hacker News, arXiv, GitHub, npm, and local Ollama.

Web tools (web_search, fetch_url, browser) are open to any caller — see Tools.

Local-only is a first-class option

With Ollama or a custom endpoint, requests stay on your network and the keyless search providers keep you off the hosted ones — so you can run an entirely local crew.