Llama 2 (Korean) Models by Minds And Company
About
The Llama 2 (Korean) family is a set of large language models of varying sizes and training data, all adapted for generating Korean text. The models are built on the Llama 2 architecture, an optimized transformer design. A key feature of the family is an expanded vocabulary that adds Korean words and phrases, which improves the models' ability to produce natural-sounding Korean output. The family includes both pretrained models and models fine-tuned for specific tasks such as chat. One notable member, Llama-2-Ko-7b, has 7 billion parameters and was trained extensively on a Korean text corpus. Because parameter counts and training data differ across models, performance and capabilities differ as well. The models are available to researchers and developers through Hugging Face and other platforms.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Llama 2 13B (Korean) | Use when the workload needs 4k context and 13B parameters. | 2023-09 | 4k context, 13B parameters | Current |
Release Timeline
1 release group
Specifications (1 model)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Llama 2 13B (Korean) | 2023-09 | 4k | 13B |
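The 4k context figure above limits how many prompt and generated tokens fit in one request. The following is a minimal sketch of a budget check, assuming "4k" means 4,096 tokens and that prompt and output tokens share the same window. The helper names `fits_context` and `max_new_tokens_allowed` are hypothetical and not part of any provider API.

```python
# Assumption: "4k" in the specifications table means 4,096 tokens.
CONTEXT_WINDOW = 4096


def fits_context(prompt_tokens: int, max_new_tokens: int,
                 context_window: int = CONTEXT_WINDOW) -> bool:
    """Return True if prompt plus requested output fits in the window."""
    if prompt_tokens < 0 or max_new_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_new_tokens <= context_window


def max_new_tokens_allowed(prompt_tokens: int,
                           context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for generation after the prompt."""
    if prompt_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return max(0, context_window - prompt_tokens)


if __name__ == "__main__":
    print(fits_context(3000, 1000))
    print(max_new_tokens_allowed(4000))
```

Token counts depend on the tokenizer, so the prompt should be measured with the model's own tokenizer before applying a check like this.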
Available From (1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Llama 2 13B (Korean) | IBM watsonx | $1.8 | $1.8 | Serverless |
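The per-million-token prices above translate into a request or workload cost with simple arithmetic. The following is a rough sketch, assuming the listed $1.8 rates apply linearly with no minimums, tiers, or rounding. The function name `estimate_cost_usd` is hypothetical.

```python
def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      input_price_per_m: float = 1.8,
                      output_price_per_m: float = 1.8) -> float:
    """Estimate cost in USD from token counts and per-1M-token prices.

    Defaults are the listed IBM watsonx prices for Llama 2 13B (Korean).
    Assumption: billing is strictly proportional to token count.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500k output tokens.
    print(round(estimate_cost_usd(2_000_000, 500_000), 2))
```

Actual billing may differ from this estimate, so the provider's pricing page remains the reference.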
Frequently Asked Questions
- What is Llama 2 (Korean) used for?
- Llama 2 (Korean) is used for generating Korean text. The family includes several models with varying sizes and training data, all tailored to that purpose.
- How does Llama 2 (Korean) compare to Claude 3?
- Llama 2 (Korean) by Minds And Company is aimed at Korean text generation, while Claude 3 by Anthropic is the closest related family to consider for vision and multimodal work. Llama 2 (Korean) has 1 listed variant with up to 4k context; Claude 3 reaches up to 200k context. Compare the specifications and pricing tables before choosing a production model.
- Which Llama 2 (Korean) model should I use?
- Llama 2 13B (Korean) is the only listed variant. It is available through IBM watsonx at $1.8 per 1M input tokens, which is the lowest listed input price, and it is also the most capable and most recent listed option, with 4k context.
