Llama 3.2 NV EmbedQA 1B v1 on NVIDIA NIM

NV-Embed · NVIDIA AI

Serverless

Pricing

Type            Rate
GPU Hour Rate   $1.00/GPU·hr
GPU Config      1xH100
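
As a rough illustration of the rate above, the following sketch estimates cost from GPU-hours. It assumes billing is linear in GPU-hours with no minimum charge or rounding, which this page does not confirm; the function name `estimate_cost` is hypothetical.

```python
# Listed rate on this page: $1.00 per GPU-hour on a 1xH100 configuration.
RATE_PER_GPU_HOUR = 1.00
GPUS_IN_CONFIG = 1


def estimate_cost(hours: float, gpus: int = GPUS_IN_CONFIG,
                  rate: float = RATE_PER_GPU_HOUR) -> float:
    """Return an estimated cost in USD, assuming purely linear billing."""
    if hours < 0 or gpus < 1:
        raise ValueError("hours must be >= 0 and gpus must be >= 1")
    return hours * gpus * rate


if __name__ == "__main__":
    # Example: 2.5 hours on the listed 1xH100 configuration.
    print(f"${estimate_cost(2.5):.2f}")
```

Actual invoices may differ if the provider applies minimum durations or per-second rounding.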

Capabilities

Vision · Multimodal · Reasoning · Function Calling · Tool Use · Structured Outputs · Code Execution

About Llama 3.2 NV EmbedQA 1B v1

NVIDIA's multilingual embedding model for question-answering retrieval, based on Llama 3.2. It outputs 2048-dimensional embeddings and supports 26 languages with a 512-token context. It has been superseded by v2, which extends the context to 4K tokens.
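
To make the retrieval use case concrete, here is a minimal sketch of ranking passages against a query by cosine similarity over 2048-dimensional vectors. The request body built by `build_request` is an assumption: the model identifier and the `input_type` field follow an OpenAI-style embeddings request and should be checked against the NIM documentation before use. All function names are hypothetical, and the vectors in the usage example are toy data, not model output.

```python
import math
from typing import Dict, List, Sequence, Tuple

EMBED_DIM = 2048   # embedding width stated on this page
MAX_TOKENS = 512   # context length stated on this page

# Assumption: identifier and field names are illustrative, not confirmed here.
ASSUMED_MODEL_ID = "nvidia/llama-3.2-nv-embedqa-1b-v1"


def build_request(texts: Sequence[str], input_type: str) -> Dict[str, object]:
    """Build a hypothetical embeddings request body for queries or passages."""
    if input_type not in ("query", "passage"):
        raise ValueError("input_type must be 'query' or 'passage'")
    return {"model": ASSUMED_MODEL_ID, "input": list(texts),
            "input_type": input_type}


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_passages(query_vec: Sequence[float],
                  passage_vecs: Sequence[Sequence[float]]
                  ) -> List[Tuple[int, float]]:
    """Return (passage_index, score) pairs sorted by descending similarity."""
    scores = [(i, cosine(query_vec, p)) for i, p in enumerate(passage_vecs)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy vectors standing in for real embeddings.
    query = [1.0] + [0.0] * (EMBED_DIM - 1)
    orthogonal = [0.0, 1.0] + [0.0] * (EMBED_DIM - 2)
    aligned = [2.0] + [0.0] * (EMBED_DIM - 1)
    print(rank_passages(query, [orthogonal, aligned]))
```

In a real pipeline, passages would be embedded once with the passage input type and stored, while each incoming question is embedded with the query input type at request time. Inputs longer than the 512-token context would need to be chunked or truncated first.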

Model Specs

Released       2024-10-08
Parameters     1B
Context        512 tokens
Architecture   encoder