海量在线大模型 兼容OpenAI API

全部大模型

228个模型 · 2025-02-09 更新
OpenAI: GPT-3.5 Turbo
$0.0020/1k
$0.0060/1k
openai/gpt-3.5-turbo
GPT-3.5 Turbo is OpenAI’s fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
2023-05-28 16,385 text->text GPT
OpenAI: GPT-4
$0.12/1k
$0.24/1k
openai/gpt-4
OpenAI’s flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning capabilities. Training data: up to Sep 2021.
2023-05-28 8,191 text->text GPT
OpenAI: o1
$0.060/1k
$0.24/1k
openai/o1
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology. Learn more in the launch announcement.
2024-12-18 200,000 text+image->text GPT
DeepSeek: DeepSeek V3
$0.0020/1k
$0.0036/1k
deepseek/deepseek-chat
DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations reveal that the model outperforms other open-source models and rivals leading closed-source models. For model details, please visit the DeepSeek-V3 repo for more information, or see the launch announcement.
2024-12-27 32,768 text->text DeepSeek
DeepSeek: R1
$0.0032/1k
$0.0096/1k
deepseek/deepseek-r1
DeepSeek R1 is here: Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens. It’s 671B parameters in size, with 37B active in an inference pass. Fully open-source model & technical report. MIT licensed: Distill & commercialize freely!
2025-01-20 128,000 text->text DeepSeek
Anthropic: Claude v2
$0.032/1k
$0.096/1k
anthropic/claude-2
Claude 2 delivers advancements in key capabilities for enterprises—including an industry-leading 200K token context window, significant reductions in rates of model hallucination, system prompts and a new beta feature: tool use.
2023-11-22 200,000 text->text Claude
Anthropic: Claude 3.5 Sonnet
$0.012/1k
$0.060/1k
anthropic/claude-3.5-sonnet
New Claude 3.5 Sonnet delivers better-than-Opus capabilities, faster-than-Sonnet speeds, at the same Sonnet prices. Sonnet is particularly good at: Coding: Scores ~49% on SWE-Bench Verified, higher than the last best score, and without any fancy prompt scaffolding Data science: Augments human data science expertise; navigates unstructured data while using multiple tools for insights Visual processing: excelling at interpreting charts, graphs, and images, accurately transcribing text to derive insights beyond just the text alone Agentic tasks: exceptional tool use, making it great at agentic tasks (i.e. complex, multi-step problem solving tasks that require engaging with other systems) multimodal
2024-10-22 200,000 text+image->text Claude
Google: Gemini Flash 1.5
$0.0003/1k
$0.0012/1k
google/gemini-flash-1.5
Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and video. It’s adept at processing visual and text inputs such as photographs, documents, infographics, and screenshots. Gemini 1.5 Flash is designed for high-volume, high-frequency tasks where cost and latency matter. On most common tasks, Flash achieves comparable quality to other Gemini Pro models at a significantly reduced cost. Flash is well-suited for applications like chat assistants and on-demand content generation where speed and scale matter. Usage of Gemini is subject to Google’s Gemini Terms of Use. multimodal
2024-05-14 1,000,000 text+image->text Gemini
google/gemini-2.0-flash-exp:free
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
2024-12-12 1,048,576 text+image->text Gemini
Meta: Llama 2 13B Chat
$0.0009/1k
$0.0009/1k
meta-llama/llama-2-13b-chat
A 13 billion parameter language model from Meta, fine tuned for chat completions
2023-06-20 4,096 text->text Llama2
Meta: Llama 3.1 405B (base)
$0.0080/1k
$0.0080/1k
meta-llama/llama-3.1-405b
Meta’s latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This is the base 405B pre-trained version. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta’s Acceptable Use Policy.
2024-08-02 32,768 text->text Llama3
openai/o1-preview-2024-09-12
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology. Learn more in the launch announcement. Note: This model is currently experimental and not suitable for production use-cases, and may be heavily rate-limited.
2024-09-12 128,000 text->text GPT