LLM Reference

Llama 3.2 90B Vision

Open Source

About

Advanced multimodal model with image reasoning, visual question answering, and document analysis

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution

Providers (1)

Provider       Input (per 1M)    Output (per 1M)    Type
AWS Bedrock    $1.35             $1.80              Serverless
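
As a rough illustration of how the per-1M rates above turn into a per-request cost, here is a minimal Python sketch. It assumes the rates are US dollars per one million tokens, which the table does not state explicitly. The function name `estimate_cost_usd` and the token counts are hypothetical, and any separate pricing for image inputs is not modeled.

```python
# Rates copied from the Providers table for AWS Bedrock.
# Assumption: the unit is USD per 1,000,000 tokens.
INPUT_USD_PER_MILLION = 1.35
OUTPUT_USD_PER_MILLION = 1.80


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request from its token counts."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

Under these assumptions, output tokens cost about a third more than input tokens, so long generations dominate the bill faster than long prompts do.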

Benchmark Scores (1)

Benchmark                                            Score    Version    Source
Massive Multi-discipline Multimodal Understanding    60.3     -          https://mmmu-benchmark.github.io/

Specifications

Family: Llama 3.2
Released: 2024-09-25
Parameters: 88.8B
Context: 128K
Architecture: Decoder Only
Knowledge cutoff: 2024-03
Specialization: general
Training: finetuning

Created by

Large-scale open-source AI for social technologies.

Menlo Park, California, United States
Founded 2013
