Cosmos 3 Nano Policy DROID is a 16B-parameter robotics policy model fine-tuned from Cosmos 3 Nano on the DROID dataset. Given natural language instructions and visual observations from a robot camera (image or video), it generates robot action trajectories (JSON 1D list) for manipulation and control tasks. Compatible with multiple robot embodiments including Franka Panda (single/dual), UR, Google robot, WidowX 250, UMI, and Agibot. Supports 16-400 frame action sequences in various DoF configurations (9D-57D). Intended as a reference implementation for post-training Cosmos 3 Nano on specific robot platforms. The action output modality is represented in prose because the current model schema only has text, vision, video, audio, and related capability flags.
2026-05-31
Researched 24d ago
4k
4,000 tokens
No tracked provider route