Strix Halo / Ryzen AI Max+
The reference deployment: an APU with an XDNA NPU and a single unified memory pool. The iGPU carveout is BIOS-tunable up to ~96 GB on 128 GB SKUs, and the slot lifecycle and FLM provider were written against this hardware first.
Verified on Ryzen AI Max iGPU + Vulkan: Qwen 0.5B 217–413 tok/s; Phi-3 Mini Q4 ~71 tok/s, ~280 ms round-trip; concurrent primary + embed ~258 tok/s, <200 ms dispatch.
- Ryzen AI Max+ 395 (128 GB)
- Ryzen AI Max 385 / 390 (64 GB)
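
The carveout and throughput figures above lend themselves to quick back-of-the-envelope checks. The following is a minimal sketch, not part of the project: the function names, the 10% headroom, and the example model and KV-cache sizes are all assumptions made for illustration. It shows how a ~96 GB carveout on a 128 GB SKU splits the unified pool, whether a hypothetical model would fit, and how a tok/s figure converts to per-token latency.

```python
# Illustrative helpers only; none of these names come from the project.

def carveout_budget(total_gb, carveout_gb):
    """Split a unified memory pool into the iGPU carveout and the remainder."""
    if carveout_gb > total_gb:
        raise ValueError("carveout cannot exceed total memory")
    return {"igpu_gb": carveout_gb, "system_gb": total_gb - carveout_gb}


def fits_in_carveout(model_gb, kv_cache_gb, carveout_gb, headroom=0.10):
    """Check whether weights plus KV cache fit, keeping some headroom free.

    The 10% default headroom is an assumed safety margin, not a measured value.
    """
    usable_gb = carveout_gb * (1.0 - headroom)
    return model_gb + kv_cache_gb <= usable_gb


def ms_per_token(tokens_per_second):
    """Convert a throughput figure (tok/s) into average milliseconds per token."""
    if tokens_per_second <= 0:
        raise ValueError("throughput must be positive")
    return 1000.0 / tokens_per_second


if __name__ == "__main__":
    # 128 GB SKU with the carveout set to the ~96 GB maximum mentioned above.
    print(carveout_budget(128, 96))
    # Hypothetical 40 GB model with an 8 GB KV cache.
    print(fits_in_carveout(40, 8, 96))
    # Phi-3 Mini Q4 at ~71 tok/s from the verified figures.
    print(round(ms_per_token(71), 1))
```

With the carveout at its maximum, roughly 32 GB of the 128 GB pool remains for the OS and CPU-side work, and ~71 tok/s corresponds to about 14 ms per generated token on average.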