Fugaku-LLM Models by Fujitsu
About
The Fugaku-LLM family comprises large language models developed using the Fugaku supercomputer, one of the most powerful computational systems in Japan. These models, including the Fugaku-LLM-13B, are distinguished by their 13 billion parameters, making them significantly larger than many other Japanese models, which typically have fewer than 7 billion parameters. Fugaku-LLM models are trained from scratch using proprietary Japanese data, ensuring high transparency and safety. They excel in Japanese language tasks, achieving top scores in benchmarks like the Japanese MT-Bench, particularly in humanities and social sciences. The models are designed for both research and commercial applications, leveraging advanced distributed parallel learning techniques to maximize the supercomputer's performance.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs 4k context and 13B parameters.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Fugaku-LLM 13B | Use when the workload needs 4k context and 13B parameters. | 2024-01 | 4k context13B parameters | Current |
Release Timeline
1 release groupSpecifications(1 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Fugaku-LLM 13B | 2024-01 | 4k | 13B |
Available From(1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Fugaku-LLM 13B | Microsoft Foundry | $0.81 | $0.94 | Provisioned |
Frequently Asked Questions
- What is Fugaku-LLM used for?
- The Fugaku-LLM family comprises large language models developed using the Fugaku supercomputer, one of the most powerful computational systems in Japan.
- How does Fugaku-LLM compare to Claude 3?
- Fugaku-LLM by Fujitsu is strongest where you need its listed use cases, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Fugaku-LLM has 1 listed variant and reaches up to 4k context, while Claude 3 reaches up to 200k context, so compare the specs and pricing tables before choosing a production model.
- Which Fugaku-LLM model should I use?
- For the lowest listed input price, start with Fugaku-LLM 13B through Microsoft Foundry at $0.81/1M input tokens. For the most capable/latest local choice, evaluate Fugaku-LLM 13B with 4k context.
