LLM Reference

Llama 2 (Korean) Models by Minds And Company

1 model | 2023 | Up to 4k context | From $1.8/1M input tokens

About

The Llama 2 (Korean) large language model family includes several models of varying sizes and training data, all tailored for generating Korean text. These models use the Llama 2 architecture, which is built on an optimized transformer design [3][4][6]. A significant feature of this family is an expanded vocabulary that incorporates Korean words and phrases, which improves the models' ability to produce natural-sounding Korean output [4][5][7]. The family comprises both pretrained models and models fine-tuned for particular tasks, such as chat applications [3][6][7]. A standout model in the family, Llama-2-Ko-7b, has 7 billion parameters and was trained extensively on a Korean text corpus [4][7][12]. Differences in parameter count and training data across the models lead to differences in performance and capabilities [4][5][6]. Researchers and developers can access these models through Hugging Face and other platforms [4][5][7].

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Llama 2 13B (Korean): use when the workload fits within a 4k context window and calls for a 13B-parameter model.

2023-09 | 4k context | 13B parameters

Release Timeline

1 release group
2023-09
1 current
Llama 2 13B (Korean)
4k context | 13B parameters
Current

Specifications (1 model)

Llama 2 (Korean) model specifications comparison
Model | Released | Context | Parameters
Llama 2 13B (Korean) | 2023-09 | 4k | 13B

Available From (1 provider)

Pricing

Llama 2 (Korean) model pricing by provider
Model | Provider | Input / 1M tokens | Output / 1M tokens | Type
Llama 2 13B (Korean) | IBM watsonx | $1.8 | $1.8 | Serverless
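
The listed rates can be turned into a rough per-request cost estimate. The following is a minimal sketch, assuming billing is strictly linear in token counts at the $1.8 per 1M input and output rates shown above; the function name `estimate_cost_usd` and the example token counts are hypothetical, and actual provider billing may round or meter differently.

```python
from decimal import Decimal

# Listed IBM watsonx rates for Llama 2 13B (Korean), in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("1.8")
OUTPUT_USD_PER_MILLION = Decimal("1.8")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


# Hypothetical request: 3,000 prompt tokens and 1,000 generated tokens,
# which together fit within the 4k context listed for this model.
print(estimate_cost_usd(3_000, 1_000))
```

`Decimal` is used instead of floats so that small per-request amounts add up without binary rounding drift when summed over many requests.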

Frequently Asked Questions

What is Llama 2 (Korean) used for?
Llama 2 (Korean) is used for generating Korean text. The family includes several models with varying sizes and training data, covering both pretrained models and models fine-tuned for tasks such as chat applications.
How does Llama 2 (Korean) compare to Claude 3?
Llama 2 (Korean) by Minds And Company is tailored for Korean text generation, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Llama 2 (Korean) has 1 listed variant with up to 4k context, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
Which Llama 2 (Korean) model should I use?
Only one variant is currently listed: Llama 2 13B (Korean), with 4k context and 13B parameters. It is available through IBM watsonx at $1.8/1M input tokens, which is therefore also the lowest listed input price.

Models (1)