124M Model Smoke Test Passed | Generated by AI

Home 2026.06

Smoke test passed clean. Results:

With compile=True the full config should be a bit faster. Full run = 47,400 steps × ~32,768 tokens/step ≈ 1.55B tokens ≈ 1 epoch.

Here’s the command:

cd /mnt/data/nanoGPT && python3 train.py config/train_sec_edgar_124m.py

Config created:

I also patched train.py to fall back to train_*.npy / val_*.npy glob when edufineweb_*.npy isn’t found, so it picks up your sec-edgar shard naming.


Back Donate