MAI-Image-2e
Proprietary, Multimodal
About
Microsoft AI image generation model optimized for efficiency — the 'e' stands for Efficient. Delivers flagship MAI-Image-2 quality at 41% lower cost, ~22% faster generation, and 4x better efficiency (normalized by latency and GPU usage). Generates photorealistic images up to 1024×1024 pixels with strong in-image text rendering and complex scene handling. Available in Microsoft Foundry and MAI Playground; rolling out to Copilot, Bing Image Creator, and PowerPoint. Pricing: $5/1M text input tokens, $19.50/1M image output tokens.
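As a rough illustration of how the two listed rates combine, the sketch below estimates the cost of a single request. The per-million-token prices come from the description above; the function name and the token counts in the usage example are hypothetical, since the listing does not say how many output tokens one image consumes.

```python
# Prices from the listing above, in USD per 1M tokens.
TEXT_INPUT_PRICE_PER_M = 5.00
IMAGE_OUTPUT_PRICE_PER_M = 19.50


def estimate_cost_usd(text_input_tokens: int, image_output_tokens: int) -> float:
    """Estimate request cost in USD from token counts (hypothetical helper)."""
    text_cost = text_input_tokens / 1_000_000 * TEXT_INPUT_PRICE_PER_M
    image_cost = image_output_tokens / 1_000_000 * IMAGE_OUTPUT_PRICE_PER_M
    return text_cost + image_cost


if __name__ == "__main__":
    # Token counts below are assumptions for illustration only,
    # not published figures for MAI-Image-2e.
    cost = estimate_cost_usd(text_input_tokens=200, image_output_tokens=4_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Actual billing depends on the token counts reported by the service for each request, so this should be read as back-of-the-envelope arithmetic rather than a billing calculator.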
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution