海量在线大模型 兼容OpenAI API

全部大模型

350个模型 · 2026-04-03 更新
Meta: Llama 3.1 8B Instruct
$0.0001/1k
$0.0002/1k
meta-llama/llama-3.1-8b-instruct
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
2024-07-23 16,384 text->text Llama3
Meta: Llama 3.1 70B Instruct
$0.0016/1k
$0.0016/1k
meta-llama/llama-3.1-70b-instruct
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
2024-07-23 131,072 text->text Llama3
Meta: Llama 3 8B Instruct
$0.0001/1k
$0.0002/1k
meta-llama/llama-3-8b-instruct
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
2024-04-18 8,192 text->text Llama3
Meta: Llama 3 70B Instruct
$0.0020/1k
$0.0030/1k
meta-llama/llama-3-70b-instruct
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
2024-04-18 8,192 text->text Llama3
Llama Guard 3 8B
$0.0001/1k
$0.0002/1k
meta-llama/llama-guard-3-8b
Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM – it generates text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated. Llama Guard 3 was aligned to safeguard against the MLCommons standardized hazards taxonomy and designed to support Llama 3.1 capabilities. Specifically, it provides content moderation in 8 languages, and was optimized to support safety and security for search and code interpreter tool calls.
2025-02-13 131,072 text->text Llama3
DeepSeek: R1 Distill Llama 70B
$0.0028/1k
$0.0032/1k
deepseek/deepseek-r1-distill-llama-70b
DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including: AIME 2024 pass@1: 70.0 MATH-500 pass@1: 94.5 CodeForces Rating: 1633 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.
2025-01-24 131,072 text->text Llama3
xAI: Grok Code Fast 1
$0.0008/1k
$0.0060/1k
x-ai/grok-code-fast-1
Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality work flows.
2025-08-27 256,000 text->text Grok
xAI: Grok 4.20 Multi-Agent
$0.0080/1k
$0.024/1k
x-ai/grok-4.20-multi-agent
Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information across complex tasks. Reasoning effort behavior: - low / medium: 4 agents - high / xhigh: 16 agents
2026-04-01 2,000,000 text+image+file->text Grok
xAI: Grok 4.20
$0.0080/1k
$0.024/1k
x-ai/grok-4.20
Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering consistently precise and truthful responses. Reasoning can be enabled/disabled using the reasoning enabled parameter in the API. Learn more in our docs
2026-04-01 2,000,000 text+image->text Grok
xAI: Grok 4.1 Fast
$0.0008/1k
$0.0020/1k
x-ai/grok-4.1-fast
Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using the reasoning enabled parameter in the API. Learn more in our docs
2025-11-20 2,000,000 text+image+file->text Grok
xAI: Grok 4 Fast
$0.0008/1k
$0.0020/1k
x-ai/grok-4-fast
Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's news post. Reasoning can be enabled/disabled using the reasoning enabled parameter in the API. Learn more in our docs
2025-09-19 2,000,000 text+image+file->text Grok
xAI: Grok 4
$0.012/1k
$0.060/1k
x-ai/grok-4
Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not exposed, reasoning cannot be disabled, and the reasoning effort cannot be specified. Pricing increases once the total tokens in a given request is greater than 128k tokens. See more details on the xAI docs
2025-07-10 256,000 text+image+file->text Grok