海量在线大模型 兼容OpenAI API

全部大模型

349个模型 · 2025-12-17 更新
Amazon: Nova 2 Lite
$0.0012/1k
$0.010/1k
amazon/nova-2-lite-v1
Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing documents, extracting information from videos, generating code, providing accurate grounded answers, and automating multi-step agentic workflows.
2025-12-03 1,000,000 text+image->text Nova
Cohere: Command R7B (12-2024)
$0.0001/1k
$0.0006/1k
cohere/command-r7b-12-2024
Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning and multiple steps. Use of this model is subject to Cohere's Usage Policy and SaaS Agreement.
2024-12-14 128,000 text->text Cohere
Cohere: Command R+ (08-2024)
$0.010/1k
$0.040/1k
cohere/command-r-plus-08-2024
command-r-plus-08-2024 is an update of the Command R+ with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint the same. Read the launch post here. Use of this model is subject to Cohere's Usage Policy and SaaS Agreement.
2024-08-30 128,000 text->text Cohere
Cohere: Command R (08-2024)
$0.0006/1k
$0.0024/1k
cohere/command-r-08-2024
command-r-08-2024 is an update of the Command R with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and is competitive with the previous version of the larger Command R+ model. Read the launch post here. Use of this model is subject to Cohere's Usage Policy and SaaS Agreement.
2024-08-30 128,000 text->text Cohere
openrouter/bodybuilder
Transform your natural language requests into structured OpenRouter API request objects. Describe what you want to accomplish with AI models, and Body Builder will construct the appropriate API calls. Example: "count to 10 using gemini and opus." This is useful for creating multi-model requests, custom model routers, or programmatic generation of API calls from human descriptions. BETA NOTICE: Body Builder is in beta, and currently free. Pricing and functionality may change in the future.
2025-12-05 128,000 text->text Router
Auto Router
免费使用
openrouter/auto
Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used, visit Activity, or read the model attribute of the response. Your response will be priced at the same rate as the routed model. The meta-model is powered by Not Diamond. Learn more in our docs. Requests will be routed to the following models: - openai/gpt-5 - openai/gpt-5-mini - openai/gpt-5-nano - openai/gpt-4.1-nano - openai/gpt-4.1 - openai/gpt-4.1-mini - openai/gpt-4.1 - openai/gpt-4o-mini - openai/chatgpt-4o-latest - anthropic/claude-3.5-haiku - anthropic/claude-opus-4-1 - anthropic/claude-sonnet-4-0 - anthropic/claude-3-7-sonnet-latest - google/gemini-2.5-pro - google/gemini-2.5-flash - mistral/mistral-large-latest - mistral/mistral-medium-latest - mistral/mistral-small-latest - mistralai/mistral-nemo - x-ai/grok-3 - x-ai/grok-3-mini - x-ai/grok-4 - deepseek/deepseek-r1 - meta-llama/llama-3.1-70b-instruct - meta-llama/llama-3.1-405b-instruct - mistralai/mixtral-8x22b-instruct - perplexity/sonar - cohere/command-r-plus - cohere/command-r
2023-11-08 2,000,000 text->text Router
Z.AI: GLM 4.6V
$0.0012/1k
$0.0036/1k
z-ai/glm-4.6v
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts and charts directly as visual inputs, and integrates native multimodal function calling to connect perception with downstream tool execution. The model also enables interleaved image-text generation and UI reconstruction workflows, including screenshot-to-HTML synthesis and iterative visual editing.
2025-12-08 131,072 text+image->text Other
Z.AI: GLM 4.6 (exacto)
$0.0018/1k
$0.0070/1k
z-ai/glm-4.6:exacto
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.
2025-09-30 204,800 text->text Other
Z.AI: GLM 4.6
$0.0016/1k
$0.0076/1k
z-ai/glm-4.6
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.
2025-09-30 204,800 text->text Other
Z.AI: GLM 4.5V
$0.0019/1k
$0.0058/1k
z-ai/glm-4.5v
GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding, image Q&A, OCR, and document parsing, with strong gains in front-end web coding, grounding, and spatial reasoning. It offers a hybrid inference mode: a "thinking mode" for deep reasoning and a "non-thinking mode" for fast responses. Reasoning behavior can be toggled via the reasoning enabled boolean. Learn more in our docs
2025-08-11 65,536 text+image->text Other
z-ai/glm-4.5-air:free
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs
2025-07-26 131,072 text->text Other
Z.AI: GLM 4.5 Air
$0.0004/1k
$0.0027/1k
z-ai/glm-4.5-air
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs
2025-07-26 131,072 text->text Other