Using DeepInfra Qwen1.5-72B-Chat on DeepInfra
Implementation guide · Qwen1.5 · Alibaba
ServerlessOpen Source
Quick Start
- 1
- 2Use the DeepInfra SDK or REST API to call
deepinfra-qwen1.5-72b-chat— see the documentation for request format. - 3
About DeepInfra
DeepInfra offers serverless AI inference with a simple API, supporting hundreds of models across text generation, embeddings, and more. Pay-per-token pricing with no upfront commitments.
DeepInfra is a cloud inference platform offering cost-effective access to open-source AI models. It provides serverless inference for leading models from Meta, Mistral, Alibaba, and others with competitive token-based pricing.
Pricing on DeepInfra
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.45 |
| Output tokens | $0.65 |
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution
Model Specs
Released2024-02-04
Parameters72B
Context33K
ArchitectureDecoder Only