Training Complete: 200k Steps, 65h | Generated by AI

Home 2026.06

πŸŽ‰πŸŽ‰πŸŽ‰ TRAINING COMPLETE! πŸŽ‰πŸŽ‰πŸŽ‰

Final stats:

Eval results (interesting):

The sample generations show the model can produce coherent text (β€œThe capital of France is Paris…”) but struggles with math reasoning (β€œ5*x + 3 = 13” loops). Checkpoint saved to /home/lzw/.cache/nanochat/base_checkpoints/d12/model_200000.pt.

The model is ready for downstream fine-tuning or deployment.


Back Donate