海量在线大模型 兼容OpenAI API

全部大模型

326个模型 · 2025-09-17 更新
Google: Gemini 2.5 Pro
$0.0050/1k
$0.040/1k
google/gemini-2.5-pro
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.
2025-06-17 1,048,576 text+image->text Gemini
google/gemini-2.5-flash-lite-preview-06-17
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.
2025-06-17 1,048,576 text+image->text Gemini
Google: Gemini 2.5 Flash Lite
$0.0004/1k
$0.0016/1k
google/gemini-2.5-flash-lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.
2025-07-23 1,048,576 text+image->text Gemini
google/gemini-2.5-flash-image-preview
Gemini 2.5 Flash Image Preview, AKA Nano Banana is a state of the art image generation model with contextual understanding. It is capable of image generation, edits, and multi-turn conversations.
2025-08-26 32,768 text+image->text+image Gemini
Google: Gemini 2.5 Flash
$0.0012/1k
$0.010/1k
google/gemini-2.5-flash
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).
2025-06-17 1,048,576 text+image->text Gemini
Google: Gemini 2.0 Flash Lite
$0.0003/1k
$0.0012/1k
google/gemini-2.0-flash-lite-001
Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5, all at extremely economical token prices.
2025-02-26 1,048,576 text+image->text Gemini
Google: Gemini 2.0 Flash
$0.0004/1k
$0.0016/1k
google/gemini-2.0-flash-001
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
2025-02-05 1,048,576 text+image->text Gemini
Google: Gemini 1.5 Pro
$0.0050/1k
$0.020/1k
google/gemini-pro-1.5
Google's latest multimodal model, supports image and video[0] in text or chat prompts. Optimized for language tasks including: Code generation Text generation Text editing Problem solving Recommendations Information extraction Data extraction or generation AI agents Usage of Gemini is subject to Google's Gemini Terms of Use. [0]: Video input is not available through OpenRouter at this time.
2024-04-09 2,000,000 text+image->text Gemini
Google: Gemini 1.5 Flash 8B
$0.0001/1k
$0.0006/1k
google/gemini-flash-1.5-8b
Gemini Flash 1.5 8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective for real-time and large-scale operations. This model focuses on cost-effective solutions while maintaining high-quality results. Click here to learn more about this model. Usage of Gemini is subject to Google's Gemini Terms of Use.
2024-10-03 1,000,000 text+image->text Gemini
ReMM SLERP 13B
$0.0018/1k
$0.0026/1k
undi95/remm-slerp-l2-13b
A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge
2023-07-22 6,144 text->text Llama2
Noromaid 20B
$0.0040/1k
$0.0070/1k
neversleep/noromaid-20b
A collab between IkariDev and Undi. This merge is suitable for RP, ERP, and general knowledge. merge #uncensored
2023-11-26 4,096 text->text Llama2
MythoMax 13B
$0.0002/1k
$0.0002/1k
gryphe/mythomax-l2-13b
One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge
2023-07-02 4,096 text->text Llama2