LLM Reference

Qwen1.5-110B

About

Qwen1.5-110B is a large language model from Alibaba Cloud and the largest member of the Qwen1.5 series. It is a transformer-based, decoder-only model with 110 billion parameters, using SwiGLU activation and Grouped Query Attention (GQA) for efficient inference. Pretrained on an extensive dataset, it supports a 32K-token context length and is multilingual, covering English, Chinese, and a range of other languages. The model performs well on text generation and dialogue tasks, is noted for competitive benchmark results and an advanced tokenizer, and is applicable across many NLP tasks. Quantized versions are available to accommodate different hardware specifications.
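
For local inference, the model loads through Hugging Face Transformers (Qwen1.5 uses the Qwen2 model class added in transformers 4.37). The sketch below assumes the Hub id Qwen/Qwen1.5-110B-Chat and hardware with enough GPU memory for the checkpoint; the quantized GPTQ/AWQ variants mentioned above fit smaller setups.

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-110B-Chat"  # assumed Hub id; the base model is Qwen/Qwen1.5-110B

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",  # keep the checkpoint's native dtype
    device_map="auto",   # shard across available GPUs
)

# Format a conversation with the model's built-in chat template.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize Grouped Query Attention in one sentence."},
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

inputs = tokenizer([text], return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=128)

# Decode only the newly generated tokens, not the prompt.
reply = tokenizer.decode(output_ids[0][inputs.input_ids.shape[1]:], skip_special_tokens=True)
print(reply)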

Capabilities

Multimodal, Function Calling, Tool Use, JSON Mode
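
JSON mode and tool use are typically exercised through a provider's OpenAI-compatible Chat Completions endpoint rather than a model-specific API. The sketch below is illustrative only: the base URL and environment variable match Together AI's OpenAI-compatible interface, but the model id and response_format support for this model are assumptions to verify against the provider's documentation.

import json
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.together.xyz/v1",  # Together's OpenAI-compatible endpoint
    api_key=os.environ["TOGETHER_API_KEY"],  # assumed environment variable
)

resp = client.chat.completions.create(
    model="Qwen/Qwen1.5-110B-Chat",  # assumed provider-side model id
    messages=[
        {"role": "system", "content": "Reply with a single JSON object only."},
        {"role": "user", "content": "Give the capital and official language of France."},
    ],
    response_format={"type": "json_object"},  # JSON mode, where the provider supports it
)

data = json.loads(resp.choices[0].message.content)
print(data)

Function calling works the same way: pass a tools list of JSON-schema tool definitions to the same create call and read any tool_calls back from the response message.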

Providers (2)

Provider           Input (per 1M)   Output (per 1M)   Type
Azure OpenAI       n/a              n/a               Provisioned
Together AI API    $1.80            $1.80             Serverless
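
At the listed serverless rate of $1.80 per million tokens for both input and output, per-request cost is simple arithmetic; the helper below is a hypothetical illustration of that calculation.

def request_cost_usd(input_tokens: int, output_tokens: int,
                     input_rate: float = 1.80, output_rate: float = 1.80) -> float:
    # Rates are USD per one million tokens.
    return (input_tokens / 1_000_000) * input_rate + (output_tokens / 1_000_000) * output_rate

# Example: a 2,000-token prompt with an 800-token completion costs
# (2000 / 1e6) * 1.80 + (800 / 1e6) * 1.80 = $0.00504.
print(f"${request_cost_usd(2_000, 800):.5f}")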

Specifications

Family: Qwen1.5
Released: 2024-04-25
Architecture: Decoder Only
Specialization: General