How it schedules

API: modelplane.ai/v1alpha1 · ModelDeployment

When an ML team creates a ModelDeployment, the fleet scheduler decides which cluster each replica runs on and which node pool each engine uses. Platform teams don’t drive it directly, but what they publish, the clusters, their labels, and each pool’s InferenceClass, is exactly what the scheduler matches against. This page explains how it places work and where it deliberately stops short, so you can reason about why a deployment landed where it did.

Architecture on Modelplane Docs

How it schedules