AcceleratorMetrics
One accelerator's live readings, normalized across collectors.
The collector-agnostic GPU/accelerator expression: any platform's collector
(mactop on Apple Silicon, rocm-smi/sysfs on AMD, nvidia-smi on CUDA) fills
the same shape, so the planner and dashboard reason about a heterogeneous
fleet uniformly. A field a given collector cannot measure stays None
(never a fake zero), so a reader can tell "0%" apart from "not reported".
Units are fixed here so collectors normalize at their boundary:
utilization_ratio is a 0..1 fraction, power is watts, temperature is
degrees Celsius.
Possible values: [apple, amd, nvidia, intel, cpu, unknown]
unknownUnknownutilizationRatio object
- number
- null
vramTotalBytes object
- integer
- null
vramUsedBytes object
- integer
- null
gttTotalBytes object
GPU-mappable host (GTT) memory, for unified-memory APUs (e.g. AMD Strix
Halo). On such a node the GPU addresses system RAM beyond the BIOS VRAM
carve-out through GTT, so the usable GPU pool is far larger than
vram_total_bytes (placement uses this to admit big models on a UMA node).
None on discrete GPUs / collectors that do not report it.
- integer
- null
powerWatts object
- number
- null
temperatureCelsius object
- number
- null
clockMhz object
- integer
- null
{
"vendor": "unknown",
"name": "Unknown",
"utilizationRatio": 0,
"vramTotalBytes": 0,
"vramUsedBytes": 0,
"gttTotalBytes": 0,
"powerWatts": 0,
"temperatureCelsius": 0,
"clockMhz": 0
}