Cosmos 3 Nano Policy DROID
Last refreshed 2026-06-01. Next refresh: weekly.
Cosmos 3 Nano Policy DROID has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Decision context: Vision task fit, 0 tracked provider routes, and research from 2026-06-01.
Use it for
- Teams evaluating vision and json / tool use
- Workloads that can use a 4k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Teams that need a tracked hosted API route today
Cheapest output
-
No tracked output price
Provider routes
0
No provider route in seed
Quality / dollar
Unknown
No task benchmark coverage yet
Freshness
2026-06-01
Researched today
Top use-case fit
Vision
Included by capability and metadata signals in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Benchmark peer barsfor Vision
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
About
Cosmos 3 Nano Policy DROID is a 16B-parameter robotics policy model fine-tuned from Cosmos 3 Nano on the DROID dataset. Given natural language instructions and visual observations from a robot camera (image or video), it generates robot action trajectories (JSON 1D list) for manipulation and control tasks. Compatible with multiple robot embodiments including Franka Panda (single/dual), UR, Google robot, WidowX 250, UMI, and Agibot. Supports 16-400 frame action sequences in various DoF configurations (9D-57D). Intended as a reference implementation for post-training Cosmos 3 Nano on specific robot platforms. The action output modality is represented in prose because the current model schema only has text, vision, video, audio, and related capability flags.
Cosmos 3 Nano Policy DROID has a 4k-token context window.
Capabilities
API Versions
cosmos-3-nano-policy-droid