Sora Models by OpenAI
OpenAIProprietary
3 models2024–2025
Details
ResearcherOpenAI
LicenseProprietary
Commercial useCommercial use: conditional
Models3
Released2024–2025
Capabilities
MultimodalAll models
Links
WebsiteAbout
Sora is OpenAI's video generation model family, capable of generating high-quality video from text and image prompts.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
1 in view2 retired
SoraCurrent
Use when the workload needs video generation and multimodal inputs.
2024-12video generationmultimodal inputs
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Sora | Use when the workload needs video generation and multimodal inputs. | 2024-12 | video generationmultimodal inputs | Current |
Release Timeline
2 release groups2025-09
2 retired
Sora 2
Archivedvideo generationmultimodal inputs
Sora 2 Pro
Archivedvideo generationmultimodal inputs
2024-12
1 current
Sora
Currentvideo generationmultimodal inputs
Specifications(3 models)
| Model | Released | Multimodal |
|---|---|---|
| Sora | 2024-12 | Yes |
Available From(2 providers)
Frequently Asked Questions
- What is Sora used for?
- Sora is used for video generation and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
- How does Sora compare to GPT Realtime 2?
- Sora by OpenAI is strongest where you need video generation, while GPT Realtime 2 by OpenAI is the closest related family to check for realtime voice. Sora has 3 listed variants, while GPT Realtime 2 reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
- Which Sora model should I use?
- If price is the main constraint, use the pricing table first because Sora does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Sora with multimodal inputs.






