AcceleratorMetrics

One accelerator's live readings, normalized across collectors.

The collector-agnostic GPU/accelerator expression: any platform's collector (mactop on Apple Silicon, rocm-smi/sysfs on AMD, nvidia-smi on CUDA) fills the same shape, so the planner and dashboard reason about a heterogeneous fleet uniformly. A field a given collector cannot measure stays None (never a fake zero), so a reader can tell "0%" apart from "not reported". Units are fixed here so collectors normalize at their boundary: utilization_ratio is a 0..1 fraction, power is watts, temperature is degrees Celsius.

vendorVendor (string)

Possible values: [apple, amd, nvidia, intel, cpu, unknown]

Default value: unknown

nameName (string)

Default value: Unknown

utilizationRatio object

anyOf

number
null

number

vramTotalBytes object

anyOf

integer
null

integer

vramUsedBytes object

anyOf

integer
null

integer

gttTotalBytes object

GPU-mappable host (GTT) memory, for unified-memory APUs (e.g. AMD Strix Halo). On such a node the GPU addresses system RAM beyond the BIOS VRAM carve-out through GTT, so the usable GPU pool is far larger than vram_total_bytes (placement uses this to admit big models on a UMA node). None on discrete GPUs / collectors that do not report it.

anyOf

integer
null

integer

powerWatts object

anyOf

number
null

number

temperatureCelsius object

anyOf

number
null

number

clockMhz object

anyOf

integer
null

integer

AcceleratorMetrics
{
  "vendor": "unknown",
  "name": "Unknown",
  "utilizationRatio": 0,
  "vramTotalBytes": 0,
  "vramUsedBytes": 0,
  "gttTotalBytes": 0,
  "powerWatts": 0,
  "temperatureCelsius": 0,
  "clockMhz": 0
}