The underlying AI models that power our agents, each with different capabilities.
Automatic model routing through the Vercel AI Gateway. Picks the best available model for each request and falls back to alternates if the primary is overloaded or unavailable.
Automatically picks the best available model with seamless fallback if any provider is overloaded.
auto
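The fallback behavior described for auto can be sketched roughly as follows. This is an illustration only, not how the Vercel AI Gateway is actually implemented: route_with_fallback, ProviderUnavailable, and fake_call are hypothetical names, and the candidate order is an arbitrary assumption.

```python
from typing import Callable, Sequence


class ProviderUnavailable(Exception):
    """Hypothetical error: the provider is overloaded or unavailable."""


def route_with_fallback(
    prompt: str,
    candidates: Sequence[str],
    call_model: Callable[[str, str], str],
) -> tuple[str, str]:
    """Try each candidate in order; return (model_id, response) from the first success."""
    errors: list[str] = []
    for model_id in candidates:
        try:
            return model_id, call_model(model_id, prompt)
        except ProviderUnavailable as exc:
            # Record the failure and move on to the next alternate.
            errors.append(f"{model_id}: {exc}")
    raise RuntimeError("all candidate models failed: " + "; ".join(errors))


def fake_call(model_id: str, prompt: str) -> str:
    # Stand-in for a real API call: pretend the primary model is overloaded.
    if model_id == "claude-sonnet-4-6":
        raise ProviderUnavailable("overloaded")
    return f"[{model_id}] reply to: {prompt}"


chosen, reply = route_with_fallback(
    "Summarize this ticket",
    ["claude-sonnet-4-6", "gpt-5.4", "mistral-medium-latest"],
    fake_call,
)
print(chosen)  # prints "gpt-5.4", the first alternate that responded
```

A real router would also weigh cost, latency, and plan availability when ordering candidates; the sketch only covers the "fall back when the primary is unavailable" part.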
Anthropic is an AI safety company and public benefit corporation that builds the Claude family of large language models. Founded in 2021, Anthropic focuses on developing reliable, steerable AI designed to be helpful, harmless, and honest.
Fast and intelligent model for quick tasks
claude-haiku-4-5
The best combination of speed and intelligence, with adaptive thinking and long context support
claude-sonnet-4-6
Previous-generation Sonnet with long context and extended thinking
claude-sonnet-4-5
Most capable Claude model with a step-change in agentic coding and complex reasoning
claude-opus-4-8
Previous-generation Opus with a step-change in agentic coding and complex reasoning
claude-opus-4-7
Previous-generation Opus with adaptive thinking for complex reasoning and agentic workflows
claude-opus-4-6
Google DeepMind's Gemini models offer multimodal understanding with large context windows, strong reasoning, and efficient performance across tasks.
Near-Pro reasoning and coding at Flash-tier cost and speed. Dynamic thinking is on by default.
google/gemini-3.5-flash
Google's most advanced Gemini 3.1 model with powerful agentic capabilities, multimodal understanding, and state-of-the-art reasoning.
google/gemini-3.1-pro-preview
Mistral AI builds efficient, open-weight language models. Known for strong multilingual support, fast inference, and competitive performance at lower cost.
Hybrid model optimized for general chat, coding, agentic tasks, and complex reasoning with text and image input support
mistral-small-latest
Balanced model for most tasks with good performance and cost efficiency
mistral-medium-latest
Mistral's most capable model with strong multilingual support and advanced reasoning for complex tasks
mistral-large-latest
OpenAI's GPT models deliver strong general-purpose intelligence with broad knowledge, creative writing, and code generation capabilities.
OpenAI's frontier model, built on the first fully retrained base since GPT-4.5, with a 1M context window and strong general reasoning. Priced like Claude Opus; available on the Pro plan and above.
gpt-5.5
GPT-5.4 with a 1M context window, built-in computer-use capabilities, and optional reasoning. Sonnet-equivalent OpenAI option for Basic+ users.
gpt-5.4
GPT-5.4 mini is an optimized version of GPT-5.4: fast and efficient for everyday tasks.
gpt-5.4-mini
The last version of GPT-4
gpt-4.1
OpenRouter provides unified access to hundreds of AI models from different providers through a single API endpoint.
Z-AI's fast and efficient general-purpose model with strong multilingual capabilities.
openrouter/z-ai/glm-5-turbo
Moonshot AI's advanced reasoning model with strong performance on complex tasks.
openrouter/moonshotai/kimi-k2.5
xAI's fast model with an unprecedented 2 million token context window.
openrouter/x-ai/grok-4.1-fast
DeepSeek's efficient reasoning model with strong coding and math capabilities.
openrouter/deepseek/deepseek-v3.2
MiniMax's latest general-purpose model with strong reasoning and large context support.
openrouter/minimax/minimax-m2.7
OpenAI's open-source 120B parameter model available through OpenRouter.
openrouter/openai/gpt-oss-120b
Qwen's flagship model with state-of-the-art reasoning and a 1 million token context window.
openrouter/qwen/qwen3.7-max
xAI's advanced reasoning model with strong coding and agentic capabilities and a 1 million token context window.
openrouter/x-ai/grok-4.3
DeepSeek's fast and efficient V4 model optimized for speed with a 1 million token context window.
openrouter/deepseek/deepseek-v4-flash
DeepSeek's flagship V4 reasoning model with strong coding and math capabilities and a 1 million token context window.
openrouter/deepseek/deepseek-v4-pro
Qwen's high-performance model with 1 million token context window.
openrouter/qwen/qwen3.5-plus-02-15
Qwen's fast and efficient model optimized for speed with 1M context.
openrouter/qwen/qwen3.5-flash-02-23
openrouter/qwen/qwen3.5-flash-02-23
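The model IDs listed here appear to follow three shapes: a bare name (auto, gpt-5.5), a vendor/name pair (google/gemini-3.5-flash), and an openrouter/vendor/name triple. A minimal sketch of splitting an ID into those parts, assuming that convention holds (it is inferred from this list, not from any documented specification; ModelRef and parse_model_id are hypothetical names):

```python
from typing import NamedTuple, Optional


class ModelRef(NamedTuple):
    via: Optional[str]     # routing layer such as "openrouter", or None
    vendor: Optional[str]  # vendor prefix such as "deepseek", or None if absent
    name: str              # the model name itself


def parse_model_id(model_id: str) -> ModelRef:
    """Split a model ID into (via, vendor, name) under the assumed naming convention."""
    if not model_id:
        raise ValueError("empty model ID")
    parts = model_id.split("/")
    if len(parts) == 3 and parts[0] == "openrouter":
        return ModelRef("openrouter", parts[1], parts[2])
    if len(parts) == 2:
        return ModelRef(None, parts[0], parts[1])
    if len(parts) == 1:
        return ModelRef(None, None, parts[0])
    raise ValueError(f"unrecognized model ID shape: {model_id!r}")


for mid in ("auto", "google/gemini-3.5-flash", "openrouter/x-ai/grok-4.3"):
    print(parse_model_id(mid))
```

Bare IDs carry no vendor prefix, so the vendor field is left as None rather than guessed from the name.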