Llama 3.2 11B Vision
Open Source
About
Multimodal 11B parameter model balancing capability and computational efficiency
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution
Providers(1)
| Provider | Input (per 1M) | Output (per 1M) | Type | |
|---|---|---|---|---|
| AWS Bedrock | $0.2 | $0.27 | Serverless |
Benchmark Scores(2)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Massive Multi-discipline Multimodal Understanding | 50.7 | — | https://mmmu-benchmark.github.io/ |
| MMLU PRO | 46.4 | — | https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro |