海量在线大模型 兼容OpenAI API

全部大模型

320个模型 · 2025-07-23 更新
Google: Gemma 2 27B
$0.0032/1k
$0.0032/1k
google/gemma-2-27b-it
Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models. Gemma models are well-suited for a variety of text generation tasks, including question answering, summarization, and reasoning. See the launch announcement for more details. Usage of Gemma is subject to Google's Gemma Terms of Use.
2024-07-13 8,192 text->text Gemini
google/gemini-2.5-pro-preview
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.
2025-06-05 1,048,576 text+image->text Gemini
google/gemini-2.5-pro-preview-05-06
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.
2025-05-07 1,048,576 text+image->text Gemini
google/gemini-2.5-pro-exp-03-25
This model has been deprecated by Google in favor of the (paid Preview model)[google/gemini-2.5-pro-preview] Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.
2025-03-26 1,048,576 text+image->text Gemini
Google: Gemini 2.5 Pro
$0.0050/1k
$0.040/1k
google/gemini-2.5-pro
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.
2025-06-17 1,048,576 text+image->text Gemini
google/gemini-2.5-flash-lite-preview-06-17
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.
2025-06-17 1,048,576 text+image->text Gemini
Google: Gemini 2.5 Flash Lite
$0.0004/1k
$0.0016/1k
google/gemini-2.5-flash-lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.
2025-07-23 1,048,576 text+image->text Gemini
Google: Gemini 2.5 Flash
$0.0012/1k
$0.010/1k
google/gemini-2.5-flash
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).
2025-06-17 1,048,576 text+image->text Gemini
Google: Gemini 2.0 Flash Lite
$0.0003/1k
$0.0012/1k
google/gemini-2.0-flash-lite-001
Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5, all at extremely economical token prices.
2025-02-26 1,048,576 text+image->text Gemini
Google: Gemini 2.0 Flash
$0.0004/1k
$0.0016/1k
google/gemini-2.0-flash-001
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
2025-02-05 1,048,576 text+image->text Gemini
Google: Gemini 1.5 Pro
$0.0050/1k
$0.020/1k
google/gemini-pro-1.5
Google's latest multimodal model, supports image and video[0] in text or chat prompts. Optimized for language tasks including: Code generation Text generation Text editing Problem solving Recommendations Information extraction Data extraction or generation AI agents Usage of Gemini is subject to Google's Gemini Terms of Use. [0]: Video input is not available through OpenRouter at this time.
2024-04-09 2,000,000 text+image->text Gemini
Google: Gemini 1.5 Flash 8B
$0.0001/1k
$0.0006/1k
google/gemini-flash-1.5-8b
Gemini Flash 1.5 8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective for real-time and large-scale operations. This model focuses on cost-effective solutions while maintaining high-quality results. Click here to learn more about this model. Usage of Gemini is subject to Google's Gemini Terms of Use.
2024-10-03 1,000,000 text+image->text Gemini