Llama 3.2 90B Vision
Open Source
About
Advanced multimodal model with image reasoning, visual question answering, and document analysis
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution
Providers (1)
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type |
|---|---|---|---|
| AWS Bedrock | $1.35 | $1.80 | Serverless |
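As a rough illustration of how the listed rates translate into per-request cost, here is a minimal sketch. It assumes the "per 1M" figures are USD per one million tokens and that input and output tokens are billed linearly at those rates. The function name `estimate_cost` is hypothetical, and the sketch does not model how image inputs are converted to tokens.

```python
# Rates copied from the provider table (AWS Bedrock, serverless).
# Assumption: both are USD per 1,000,000 tokens.
INPUT_PRICE_PER_M = 1.35
OUTPUT_PRICE_PER_M = 1.80

TOKENS_PER_MILLION = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request under linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / TOKENS_PER_MILLION * INPUT_PRICE_PER_M
    output_cost = output_tokens / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_M
    return input_cost + output_cost
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens would cost about $0.0036 (0.0027 for input plus 0.0009 for output).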
Benchmark Scores (1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Massive Multi-discipline Multimodal Understanding (MMMU) | 60.3 | — | https://mmmu-benchmark.github.io/ |