Setting Up NanoGPT on AMD MI300X

Setting Up NanoGPT on AMD MI300X | Generated by AI

Home 2026.06

Here’s what we did:

SSH’d into root@129.212.187.36 (AMD MI300X, Ubuntu)
Installed HF CLI
- pip3 install -U --break-system-packages 'huggingface_hub[cli]'
- huggingface_hub 1.17.0
Downloaded lzwjava/zz from HuggingFace
- hf download lzwjava/zz --local-dir /root/zz
- 37 files, 424K (non-git download)
Generated SSH key for GitHub
- ed25519, label lzwjava-amd-devcloud
- Added to GitHub
Cloned nanoGPT with full history
- git clone git@github.com:lzwjava/nanoGPT.git
- 255 commits at /root/nanoGPT
Recloned zz as proper git repo from HF
- Deleted non-git hf download version
- git clone https://huggingface.co/lzwjava/zz
- Set up remotes: origin → GitHub (archived), hf → HuggingFace (active)
- 18 commits, full history
Opened Zed via zed ssh://root@129.212.187.36//root/zz
Smoke tested download script
- plan_and_download_fineweb_gpt3_amd.py — works, 267 MB/s
User ran full download in tmux
- 18 shards, 39 GB, ~10.4B tokens of FineWeb-Edu
- Done
Installed tokenize deps
- tiktoken, numpy, pyarrow, tqdm
Smoke tested tokenize script
- tokenize_fineweb_gpt3.py — works, ~30-50M tok/s
- Output: uint16, GPT-2 BPE, EOT prefix ✓

Current state: /root/nanoGPT — nanoGPT repo (255 commits) /root/zz — dataset repo (18 commits, 39GB parquet data) Next: — tokenize into .npy shards, then train

Back Donate