海量在线大模型 兼容OpenAI API

Meta: Llama 3.2 90B Vision Instruct

$0.0032/1k
$0.0064/1k
开始对话
meta-llama/llama-3.2-90b-vision-instruct
上下文长度: 4,096 text+image->text Llama3 2024-09-25 更新
The Llama 90B Vision model is a top-tier, 90-billion-parameter multimodal model designed for the most challenging visual reasoning and language tasks. It offers unparalleled accuracy in image captioning, visual question answering, and advanced image-text comprehension. Pre-trained on vast multimodal datasets and fine-tuned with human feedback, the Llama 90B Vision is engineered to handle the most demanding image-based AI tasks. This model is perfect for industries requiring cutting-edge multimodal AI capabilities, particularly those dealing with complex, real-time visual and textual analysis. Click here for the original model card. Usage of this model is subject to Meta’s Acceptable Use Policy.

模型参数

架构信息

模态: text+image->text
Tokenizer: Llama3
指令类型: llama3

限制信息

上下文长度: 4,096
最大回复长度: 2,048