https://blog.marcoinacio.com/posts/my-first-impressions-rocm-strix-halo/
Article
-
Author runs LLMs on AMD Strix Halo (unified memory APU) via ROCm and llama.cpp
-
Covers setup, BIOS update requirement, GGUF quantization, and PyTorch GPU detection
-
Strix Halo’s unified memory avoids PCIe bottleneck for inference workloads
Discussion
-
Commenters recommend Unsloth/Bartowski imatrix quants over DIY GGUF conversion
-
AMD’s official Lemonade project cited as better starting point with Strix Halo support
-
Criticism: no benchmarks or token/sec numbers makes writeup hard to evaluate
Discuss on HN