海量在线大模型 兼容OpenAI API

全部大模型

320个模型 · 2025-07-23 更新
OpenAI: GPT-4.1 Mini
$0.0016/1k
$0.0064/1k
openai/gpt-4.1-mini
GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on MultiChallenge, and 84.1% on IFEval. Mini also shows strong coding ability (e.g., 31.6% on Aider’s polyglot diff benchmark) and vision understanding, making it suitable for interactive applications with tight performance constraints.
2025-04-15 1,047,576 text+image->text GPT
OpenAI: GPT-4.1
$0.0080/1k
$0.032/1k
openai/gpt-4.1
GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.
2025-04-15 1,047,576 text+image->text GPT
openai/gpt-4-turbo-preview
The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. Note: heavily rate limited by OpenAI while in preview.
2024-01-25 128,000 text->text GPT
openai/gpt-4-1106-preview
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to April 2023.
2023-11-06 128,000 text->text GPT
OpenAI: GPT-4 Turbo
$0.040/1k
$0.12/1k
openai/gpt-4-turbo
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.
2024-04-09 128,000 text+image->text GPT
openai/gpt-4-0314
GPT-4-0314 is the first version of GPT-4 released, with a context length of 8,192 tokens, and was supported until June 14. Training data: up to Sep 2021.
2023-05-28 8,191 text->text GPT
OpenAI: GPT-3.5 Turbo Instruct
$0.0060/1k
$0.0080/1k
openai/gpt-3.5-turbo-instruct
This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.
2023-09-28 4,095 text->text GPT
OpenAI: GPT-3.5 Turbo 16k
$0.012/1k
$0.016/1k
openai/gpt-3.5-turbo-16k
This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up to Sep 2021.
2023-08-28 16,385 text->text GPT
openai/gpt-3.5-turbo-0613
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
2024-01-25 4,095 text->text GPT
OpenAI: Codex Mini
$0.0060/1k
$0.024/1k
openai/codex-mini
codex-mini-latest is a fine-tuned version of o4-mini specifically for use in Codex CLI. For direct use in the API, we recommend starting with gpt-4.1.
2025-05-16 200,000 text+image->text GPT
OpenAI: ChatGPT-4o
$0.020/1k
$0.060/1k
openai/chatgpt-4o-latest
OpenAI ChatGPT 4o is continually updated by OpenAI to point to the current version of GPT-4o used by ChatGPT. It therefore differs slightly from the API version of GPT-4o in that it has additional RLHF. It is intended for research and evaluation. OpenAI notes that this model is not suited for production use-cases as it may be removed or redirected to another model in the future.
2024-08-14 128,000 text+image->text GPT
tngtech/deepseek-r1t2-chimera:free
DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671 B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20 % faster than the original R1 and more than 2× faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60 k tokens in standard use (tested to ~130 k) and maintains consistent token behaviour, making it suitable for long-context analysis, dialogue and other open-ended generation tasks.
2025-07-08 163,840 text->text DeepSeek