Breeze 7B on NVIDIA NIM

Name: Breeze 7B on NVIDIA NIM
Brand: MediaTek-Research
SKU: breeze-7b-nvidia-nim

Breeze · MediaTek-Research

ProvisionedOpen Weights

Last refreshed 2026-05-19. Next refresh: weekly.

Why use Breeze 7B on NVIDIA NIM?

NVIDIA NIM offers Breeze 7B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Input / 1M

Output / 1M

Cache

Not sourced

Batch

Not sourced

Setup recipe

Docs fallback

Install

Use the provider REST API or SDK

Auth

Create a provider API key

Call

model: breeze-7b

Model ID

breeze-7b

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID breeze-7b.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Pricing

Type	Rate
GPU Hour Rate	$1.00/GPU·hr
GPU Config	1xH100

Capabilities

No model capability flags are currently sourced.

About Breeze 7B

Breeze-7B is an open-source large language model from MediaTek Research, engineered upon the Mistral-7B architecture. It excels in processing Traditional Chinese while also offering strong performance in English. Its 62,000-token vocabulary enhances comprehension and generation capabilities in Traditional Chinese, resulting in roughly twice the inference speed compared to similar models like Mistral-7B and Llama 7B. Breeze-7B includes multiple variants, such as a base model and instruction-tuned versions for tasks like question answering and summarization. Although a variant with a 64k-token context length was created, it was later removed due to performance issues. The model is competitive in benchmarks, notably those emphasizing Traditional Chinese.

FAQ

What is the context window for Breeze 7B on NVIDIA NIM?

Breeze 7B supports a 32k token context window on NVIDIA NIM.

Who created Breeze 7B?

Breeze 7B was created by MediaTek-Research as part of the Breeze model family.

Is Breeze 7B open source?

Breeze 7B has open weights under Llama 2 Community according to the seed data, but that does not necessarily mean an OSI-approved open-source license.

Get Started

Model Card Docs Portal Pricing

Model Specs

Released2023-11-10

Parameters7B

Context32k

ArchitectureDecoder Only

Provider

NVIDIA NIM

NVIDIA

All models on NVIDIA NIM →Provider setup guide →