LLM Reference

Modal Labs

Modal Labs

AI

Platform

Modal is a serverless cloud platform designed for developers and ML engineers. It allows users to run Python code on scalable GPU infrastructure without managing servers or containers. Modal supports LLM inference, fine-tuning, and batch processing with per-second billing and easy deployment from local development environments.

About Modal Labs

Modal Labs provides serverless cloud infrastructure optimized for Python applications and AI/ML workloads. The platform offers GPU-accelerated inference endpoints and model deployment with automatic scaling.

Company Info

Founded2021
San Francisco, California, United States