Yi VL 34B
About
Vision-language model from the Yi VL family that supports multimodal understanding.
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution
Providers (1)
| Provider | Input (per 1M) | Output (per 1M) | Type |
|---|---|---|---|
| Replicate API | $0.20 | $1.00 | Serverless |
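
The per-1M prices above can be turned into a rough per-request cost estimate. The sketch below assumes the "per 1M" unit means tokens and that input and output are billed linearly at the listed rates. The function name `estimate_cost` and the token counts are hypothetical, chosen for illustration. How image inputs are counted is not stated here, so the sketch works with token counts only.

```python
# Prices taken from the Providers table (USD per 1M, assumed to mean tokens).
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 1.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear billing with no minimum charge or rounding.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative request: 10,000 input tokens and 2,000 output tokens.
    cost = estimate_cost(10_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Because output is priced at five times the input rate, output length dominates the cost of longer responses under these assumptions.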
Specifications
Family: Yi VL
Released: 2024-10-26
Parameters: 34B
Context: 131K
Architecture: Decoder Only
Specialization: general
Training: finetuning