Instances
Placement previews, launch flows, instance lookup, and lifecycle management for running models.
📄️ Create an instance from a fully specified placement
Create an instance from an already computed placement object when you want exact control instead of Skulk picking the placement for you.
📄️ Quick-launch a model placement
Place and launch a model with Skulk choosing a valid concrete placement from the requested sharding, instance metadata, and minimum-node constraints. The placement is validated against the current cluster state before the command is forwarded: an impossible placement returns 400 with the specific reason (no connected cycle, exclusions removed every candidate, a node cannot fit its shard with runtime headroom, ...). If node memory info is still being gathered (cluster just formed), the request waits up to 15 seconds for it before returning 503 — retry shortly in that case.
📄️ Compute a concrete placement for one requested combination
Return the exact instance shape Skulk would create for one requested model, sharding mode, instance metadata, and node-count combination.
📄️ Preview valid placements for a model
Return candidate placements for a model before launch. This is the best first step when you want to see what Skulk can place on the current node or cluster. Pass `excluded_node_ids` (repeatable) to mirror the `excluded_nodes` field on POST /place_instance and preview against the post-exclusion topology.
📄️ Get one running instance
Get one running instance
📄️ Delete a running instance
Delete a running instance