LLM ReferenceLLM Reference

Llama 3.2 NV EmbedQA 1B v1

llama-3.2-nv-embedqa-1b-v1

About

NVIDIA multilingual embedding model for question-answering retrieval, based on Llama 3.2. Outputs 2048-dimensional embeddings and supports 26 languages with 512-token context. Superseded by v2, which extends context to 4K tokens.

Llama 3.2 NV EmbedQA 1B v1 has a 512-token context window.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
NVIDIA NIMServerless

Rankings

Specifications

FamilyNV-Embed
Released2024-10-08
Parameters1B
Context512
Architectureencoder
Specializationembedding
Trainingpretrained

Created by

Accelerated AI for enterprise solutions

Santa Clara, California, United States
Founded 2015
Website

Providers(1)