Together AI API
Together AI
Platform
Together AI's platform offers a comprehensive suite of tools for building, training, and deploying generative AI models. At its core is the Together API, providing a user-friendly interface for fine-tuning models and running low-latency inference. The platform's GPU Clusters enable distributed training on custom datasets, significantly reducing costs compared to traditional cloud services. Scalability is a key feature, with automatic capacity adjustments based on API request volume, and dedicated endpoints for custom AI models ensure flexible and efficient deployment. The platform emphasizes collaboration and transparency in AI development, supporting open-source initiatives that allow users to leverage pre-trained models while maintaining control over their data and model updates. Together AI's tools facilitate seamless integration of AI features into applications, with expert assistance available for dataset preparation and model training. This approach empowers developers to innovate and optimize their AI solutions effectively, making it an ideal choice for organizations looking to harness the full potential of generative AI.
About Together AI
Together AI is a research-driven artificial intelligence company that provides a comprehensive platform for building, training, and deploying generative AI models. Their platform offers developers and researchers access to some of the fastest and most cost-efficient tools and infrastructure in the industry. Key features of their AI platform include: 1. Open-source contributions: Together AI actively contributes leading research, models, and datasets to advance AI technology. 2. Decentralized cloud services: The platform empowers organizations of all sizes to train, fine-tune, and deploy generative AI models using distributed computing resources. 3. Collaborative AI development: Together AI's expert team works closely with clients to ensure their success in implementing AI solutions. 4. Focus on transparency: The company believes in open and transparent AI systems to drive innovation and create the best outcomes for society. 5. Cutting-edge technology: Together AI leverages advanced technologies like cloud computing, LLMs (Large Language Models), and open-source frameworks to provide state-of-the-art AI capabilities. Together AI's platform is designed to support a wide range of AI applications, from research projects to enterprise-scale deployments, making it a versatile solution for organizations looking to harness the power of generative AI.
Available Models(66)
| Model | Input (per 1M) | Output (per 1M) | Type |
|---|---|---|---|
| Yi 34B | $0.8 | $0.8 | Serverless |
| OLMo 7B | $0.2 | $0.2 | Serverless |
| OLMo 7B Twin-2T | $0.2 | $0.2 | Serverless |
| Chronos Hermes 13B V2 | $0.3 | $0.3 | Serverless |
| Dolphin 2.5 Mixtral 8x7B | $0.6 | $0.6 | Serverless |
| Arctic | $2.4 | $2.4 | Serverless |
| DBRX Instruct | $1.2 | $1.2 | Serverless |
| DeepSeek Coder 33B | $0.8 | $0.8 | Serverless |
| DeepSeek 67B Chat | $0.9 | $0.9 | Serverless |
| Platypus2 70B | $0.9 | $0.9 | Serverless |
| Gemma 7B Instruct | $0.2 | $0.2 | Serverless |
| Gemma 2B Instruct | $0.1 | $0.1 | Serverless |
| MythoMax L2 13B | $0.3 | $0.3 | Serverless |
| Vicuna 13B V1.5 | $0.3 | $0.3 | Serverless |
| Vicuna 7B V1.5 | $0.2 | $0.2 | Serverless |
| OpenHermes 2 Mistral 7B | $0.2 | $0.2 | Serverless |
| OpenHermes 2.5 Mistral 7B | $0.2 | $0.2 | Serverless |
| CodeLlama 70B | $0.9 | $0.9 | Serverless |
| CodeLlama 34B | $0.8 | $0.8 | Serverless |
| CodeLlama 13B | $0.3 | $0.3 | Serverless |
| CodeLlama 7B | $0.2 | $0.2 | Serverless |
| Toppy M 7B | $0.2 | $0.2 | Serverless |
| WizardLM 13B V1.0 | $0.3 | $0.3 | Serverless |
| SOLAR 10.7B | $0.3 | $0.3 | Serverless |
| Alpaca 7B | $0.2 | $0.2 | Serverless |
| StripedHyena Nous 7B | $0.2 | $0.2 | Serverless |
| Llama 3 70B Instruct | $0.88 | $0.88 | Serverless |
| Llama 3 8B Instruct | $0.18 | $0.18 | Serverless |
| Llama 2 70B Chat | $0.9 | $0.9 | Serverless |
| Llama 2 13B Chat | $0.3 | $0.3 | Serverless |
| Llama 2 7B Chat | $0.2 | $0.2 | Serverless |
| Llama 2 7B 32K | $0.2 | $0.2 | Serverless |
| ReMM SLERP L2 13B | $0.3 | $0.3 | Serverless |
| Mistral 7B v0.1 | $0.2 | $0.2 | Serverless |
| Mixtral 8x22B v0.1 | $1.2 | $1.2 | Serverless |
| Nous Capybara 7B V1.9 | $0.2 | $0.2 | Serverless |
| Nous Hermes 2 Mistral 7B | $0.2 | $0.2 | Serverless |
| Nous Hermes 2 Mixtral 8x7B | $0.6 | $0.6 | Serverless |
| Nous Hermes Llama 2 13B | $0.3 | $0.3 | Serverless |
| Nous Hermes Llama 2 7B | $0.2 | $0.2 | Serverless |
| Nous Hermes 2 Yi 34B | $0.8 | $0.8 | Serverless |
| OpenChat 3.5 (0106) | $0.2 | $0.2 | Serverless |
| Mistral 7B OpenOrca | $0.2 | $0.2 | Serverless |
| Qwen1.5-0.5B | $0.1 | $0.1 | Serverless |
| Qwen1.5-1.8B | $0.1 | $0.1 | Serverless |
| Qwen1.5-4B | $0.1 | $0.1 | Serverless |
| Qwen1.5-7B | $0.2 | $0.2 | Serverless |
| Qwen1.5-32B | $0.8 | $0.8 | Serverless |
| Qwen1.5-72B | $0.9 | $0.9 | Serverless |
| Qwen1.5-110B | $1.8 | $1.8 | Serverless |
| Qwen2 72B | $0.9 | $0.9 | Serverless |
| Snorkel Mistral PairRM | $0.2 | $0.2 | Serverless |
| Phi-2 | $0.1 | $0.1 | Serverless |
| NexusRaven-V2 13B | $0.3 | $0.3 | Serverless |
| GPT-JT Moderation 6B | $0.2 | $0.2 | Serverless |
| StripedHyena Hessian 7B | $0.2 | $0.2 | Serverless |
| CodeLlama 70B Python | $0.9 | $0.9 | Serverless |
| CodeLlama 34B Python | $0.8 | $0.8 | Serverless |
| CodeLlama 13B Python | $0.3 | $0.3 | Serverless |
| CodeLlama 7B Python | $0.2 | $0.2 | Serverless |
| Phind CodeLlama 34B V2 | $0.8 | $0.8 | Serverless |
| WizardCoder Python 34B | $0.8 | $0.8 | Serverless |
| Llama Guard 7B | $0.2 | $0.2 | Serverless |
| Llama 3.1 405B Instruct | $5 | $15 | Serverless |
| Llama 3.1 70B Instruct | $0.88 | $0.88 | Serverless |
| Llama 3.1 8B Instruct | $0.18 | $0.18 | Serverless |