海量在线大模型 兼容OpenAI API

全部大模型

228个模型 · 2025-02-09 更新
google/gemini-2.0-pro-exp-02-05:free
Gemini 2.0 Pro Experimental is a bleeding-edge version of the Gemini 2.0 Pro model. Because it’s currently experimental, it will be heavily rate-limited by Google. Usage of Gemini is subject to Google’s Gemini Terms of Use. multimodal
2025-02-05 2,000,000 text+image->text Gemini
Google: Gemini Pro 1.5
$0.0050/1k
$0.020/1k
google/gemini-pro-1.5
Google’s latest multimodal model, supports image and video[0] in text or chat prompts. Optimized for language tasks including: Code generation Text generation Text editing Problem solving Recommendations Information extraction Data extraction or generation AI agents Usage of Gemini is subject to Google’s Gemini Terms of Use. [0]: Video input is not available through OpenRouter at this time.
2024-04-09 2,000,000 text+image->text Gemini
Google: Gemini Pro 1.0
$0.0020/1k
$0.0060/1k
google/gemini-pro
Google’s flagship text generation model. Designed to handle natural language tasks, multiturn text and code chat, and code generation. See the benchmarks and prompting guidelines from Deepmind. Usage of Gemini is subject to Google’s Gemini Terms of Use.
2023-12-13 32,760 text->text Gemini
google/gemini-2.0-flash-lite-preview-02-05:free
Gemini Flash Lite 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. Because it’s currently in preview, it will be heavily rate-limited by Google. This model will move from free to paid pending a general rollout on February 24th, at $0.075 / $0.30 per million input / ouput tokens respectively.
2025-02-05 1,000,000 text+image->text Gemini
Google: Gemini Flash 2.0
$0.0004/1k
$0.0016/1k
google/gemini-2.0-flash-001
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
2025-02-05 1,000,000 text+image->text Gemini
google/gemini-flash-1.5-8b-exp
Gemini Flash 1.5 8B Experimental is an experimental, 8B parameter version of the Gemini Flash 1.5 model. Usage of Gemini is subject to Google’s Gemini Terms of Use. multimodal Note: This model is currently experimental and not suitable for production use-cases, and may be heavily rate-limited.
2024-08-28 1,000,000 text+image->text Gemini
Google: Gemini Flash 1.5 8B
$0.0001/1k
$0.0006/1k
google/gemini-flash-1.5-8b
Gemini Flash 1.5 8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective for real-time and large-scale operations. This model focuses on cost-effective solutions while maintaining high-quality results. Click here to learn more about this model. Usage of Gemini is subject to Google’s Gemini Terms of Use.
2024-10-03 1,000,000 text+image->text Gemini
google/gemini-exp-1206:free
Experimental release (December 6, 2024) of Gemini.
2024-12-07 2,097,152 text+image->text Gemini
google/gemini-2.0-flash-thinking-exp:free
Gemini 2.0 Flash Thinking Experimental (01-21) is a snapshot of Gemini 2.0 Flash Thinking Experimental. Gemini 2.0 Flash Thinking Mode is an experimental model that’s trained to generate the “thinking process” the model goes through as part of its response. As a result, Thinking Mode is capable of stronger reasoning capabilities in its responses than the base Gemini 2.0 Flash model.
2025-01-22 1,048,576 text+image->text Gemini
google/gemini-2.0-flash-thinking-exp-1219:free
Gemini 2.0 Flash Thinking Mode is an experimental model that’s trained to generate the “thinking process” the model goes through as part of its response. As a result, Thinking Mode is capable of stronger reasoning capabilities in its responses than the base Gemini 2.0 Flash model.
2024-12-20 40,000 text+image->text Gemini
Xwin 70B
$0.015/1k
$0.015/1k
xwin-lm/xwin-lm-70b
Xwin-LM aims to develop and open-source alignment tech for LLMs. Our first release, built-upon on the Llama2 base models, ranked TOP-1 on AlpacaEval. Notably, it’s the first to surpass GPT-4 on this benchmark. The project will be continuously updated.
2023-10-15 8,192 text->text Llama2
sophosympatheia/rogue-rose-103b-v0.2:free
Rogue Rose demonstrates strong capabilities in roleplaying and storytelling applications, potentially surpassing other models in the 103-120B parameter range. While it occasionally exhibits inconsistencies with scene logic, the overall interaction quality represents an advancement in natural language processing for creative applications. It is a 120-layer frankenmerge model combining two custom 70B architectures from November 2023, derived from the xwin-stellarbright-erp-70b-v2 base.
2025-01-18 4,096 text->text Llama2