ModelsHub 模型仓库

OpenAI: GPT-3.5 Turbo Instruct

$0.0060/1k

$0.0080/1k

openai/gpt-3.5-turbo-instruct

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.

2023-09-28 4,095 text->text GPT

立即聊天

OpenAI: GPT-3.5 Turbo 16k

$0.012/1k

$0.016/1k

openai/gpt-3.5-turbo-16k

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up to Sep 2021.

2023-08-28 16,385 text->text GPT

立即聊天

OpenAI: GPT-3.5 Turbo (older v0613)

$0.0040/1k

$0.0080/1k

openai/gpt-3.5-turbo-0613

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

2024-01-25 4,095 text->text GPT

立即聊天

OpenAI: GPT Audio Mini

$0.0024/1k

$0.0096/1k

openai/gpt-audio-mini

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million tokens and output is priced at $2.40 per million tokens.

2026-01-20 128,000 text+audio->text+audio GPT

立即聊天

OpenAI: GPT Audio

$0.010/1k

$0.040/1k

openai/gpt-audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens.

2026-01-20 128,000 text+audio->text+audio GPT

立即聊天

TNG: DeepSeek R1T2 Chimera

$0.0012/1k

$0.0044/1k

tngtech/deepseek-r1t2-chimera

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671 B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20 % faster than the original R1 and more than 2× faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60 k tokens in standard use (tested to ~130 k) and maintains consistent token behaviour, making it suitable for long-context analysis, dialogue and other open-ended generation tasks.

2025-07-08 163,840 text->text DeepSeek

立即聊天

Nex AGI: DeepSeek V3.1 Nex N1

$0.0005/1k

$0.0020/1k

nex-agi/deepseek-v3.1-nex-n1

DeepSeek V3.1 Nex-N1 is the flagship release of the Nex-N1 series — a post-trained model designed to highlight agent autonomy, tool use, and real-world productivity. Nex-N1 demonstrates competitive performance across all evaluation scenarios, showing particularly strong results in practical coding and HTML generation tasks.

2025-12-08 131,072 text->text DeepSeek

立即聊天

DeepSeek: R1 0528

$0.0018/1k

$0.0086/1k

deepseek/deepseek-r1-0528

May 28th update to the original DeepSeek R1 Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass. Fully open-source model.

2025-05-29 163,840 text->text DeepSeek

立即聊天

DeepSeek: DeepSeek V3.2 Speciale

$0.0016/1k

$0.0048/1k

deepseek/deepseek-v3.2-speciale

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance. It builds on DeepSeek Sparse Attention (DSA) for efficient long-context processing, then scales post-training reinforcement learning to push capability beyond the base model. Reported evaluations place Speciale ahead of GPT-5 on difficult reasoning workloads, with proficiency comparable to Gemini-3.0-Pro, while retaining strong coding and tool-use reliability. Like V3.2, it benefits from a large-scale agentic task synthesis pipeline that improves compliance and generalization in interactive environments.

2025-12-01 163,840 text->text DeepSeek

立即聊天

DeepSeek: DeepSeek V3.2 Exp

$0.0011/1k

$0.0016/1k

deepseek/deepseek-v3.2-exp

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism designed to improve training and inference efficiency in long-context scenarios while maintaining output quality. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs The model was trained under conditions aligned with V3.1-Terminus to enable direct comparison. Benchmarking shows performance roughly on par with V3.1 across reasoning, coding, and agentic tool-use tasks, with minor tradeoffs and gains depending on the domain. This release focuses on validating architectural optimizations for extended context lengths rather than advancing raw task accuracy, making it primarily a research-oriented model for exploring efficient transformer designs.

2025-09-29 163,840 text->text DeepSeek

立即聊天

DeepSeek: DeepSeek V3.2

$0.0010/1k

$0.0015/1k

deepseek/deepseek-v3.2

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism that reduces training and inference cost while preserving quality in long-context scenarios. A scalable reinforcement learning post-training framework further improves reasoning, with reported performance in the GPT-5 class, and the model has demonstrated gold-medal results on the 2025 IMO and IOI. V3.2 also uses a large-scale agentic task synthesis pipeline to better integrate reasoning into tool-use settings, boosting compliance and generalization in interactive environments. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

2025-12-01 163,840 text->text DeepSeek

立即聊天

DeepSeek: DeepSeek V3.1 Terminus

$0.0008/1k

$0.0032/1k

deepseek/deepseek-v3.1-terminus

DeepSeek-V3.1 Terminus is an update to DeepSeek V3.1 that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's performance in coding and search agents. It is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes. It extends the DeepSeek-V3 base with a two-phase long-context training process, reaching up to 128K tokens, and uses FP8 microscaling for efficient inference. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows.

2025-09-22 163,840 text->text DeepSeek

立即聊天

全部大模型