MI300X ROCm llama.cpp Verified

MI300X ROCm llama.cpp Verified | Generated by AI

Home 2026.05

All done. Here’s the full summary:

SSH’d to root@165.245.137.138 — Ubuntu 24.04, 235 GB RAM, 611 GB free disk

GPU detected: AMD MI300X VF (device 0x74b5, gfx942)

llama.cpp setup:

Cloned https://github.com/ggerganov/llama.cpp.git to /root/llama.cpp/
Installed cmake (was missing)
Built with -DGGML_HIP=ON -DAMDGPU_TARGETS=gfx942 — HIP backend detected and compiled
Downloaded Qwen2.5-0.5B Q4_K_M (469 MB) to /root/llama.cpp/models/

Smoke test result — inference on MI300X via ROCm:

Of course! I am Qwen, a large language model created by Alibaba Cloud...

[ Prompt: 748.7 t/s  |  Generation: 484.2 t/s ]

Confirmed: HIP/ROCm backend works end-to-end. The MI300X is fully operational for inference with llama.cpp.

Back Donate