Meta: Llama 3.2 90B Vision Instruct

$0.0014/1k

$0.0016/1k

meta-llama/llama-3.2-90b-vision-instruct

上下文长度: 32,768 text+image->text Llama3 2024-09-25 更新

The Llama 90B Vision model is a top-tier, 90-billion-parameter multimodal model designed for the most challenging visual reasoning and language tasks. It offers unparalleled accuracy in image captioning, visual question answering, and advanced image-text comprehension. Pre-trained on vast multimodal datasets and fine-tuned with human feedback, the Llama 90B Vision is engineered to handle the most demanding image-based AI tasks. This model is perfect for industries requiring cutting-edge multimodal AI capabilities, particularly those dealing with complex, real-time visual and textual analysis. Click here for the original model card. Usage of this model is subject to Meta's Acceptable Use Policy.

模型参数

架构信息

模态: text+image->text

Tokenizer: Llama3

指令类型: llama3

限制信息

上下文长度: 32,768

最大回复长度: 16,384

Meta: Llama 3.2 90B Vision Instruct

模型参数

架构信息

限制信息

相关模型

Meta: Llama 3.1 405B (base)

Sao10k: Llama 3 Euryale 70B v2.1

Sao10K: Llama 3.3 Euryale 70B

Sao10K: Llama 3.1 Euryale 70B v2.2

Sao10K: Llama 3.1 70B Hanami x1

Sao10K: Llama 3 8B Lunaris