LLM Reference

Step-1.5V on StepFun API

Step · StepFun

Serverless

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Step-1.5V

Step-1.5V is StepFun's multimodal language model with vision capabilities, building on Step-1 with image understanding.

Get Started

Model Specs

Released2024-06-01
Context128K
ArchitectureDecoder Only

Related Models on StepFun API