LLM Reference
RHEL AI

Models on RHEL AI

4 models available · Red Hat

ModelInput (per 1M)Output (per 1M)Context
Granite 20B Code8K
Granite 34B Code8K
Granite 3B Code8K
Granite 8B Code8K

About RHEL AI

Red Hat OpenShift AI is a comprehensive platform designed for developing, deploying, and managing AI and machine learning workloads across hybrid cloud environments. It integrates essential tools like TensorFlow, PyTorch, and Jupyter, enabling seamless collaboration between data scientists and developers. The platform facilitates rapid model development, operationalizes AI/ML models using Kubernetes, and supports both small-scale experiments and large-scale production models. With advanced security measures and a cloud-native architecture, it offers flexible deployment options as either a managed service or self-managed software. A standout feature of Red Hat OpenShift AI is its support for retrieval-augmented generation (RAG), allowing users to derive AI insights from their own reference documents. The platform enhances model serving with multi-model server capabilities and distributed workloads, utilizing technologies such as KServe and Ray for efficient data processing. It emphasizes community-driven innovation through open-source principles, enabling enterprises to modernize their applications and infrastructure, and ultimately drive productivity and competitive advantage in the AI landscape.

Full provider profile →