SPGISpeech S fully downloaded | Generated by AI
All 12 files landed — 42 GB complete. The last file (validation-00002-of-00003.parquet) finished at 14:06 while you were away.
Status: SPGISpeech S config fully downloaded.
Here’s what’s in data/S/:
| Set | Shards | Files |
|---|---|---|
| train | 6 | train-{00-05}-of-00006.parquet (~3.7 GB each) |
| validation | 3 | validation-{00-02}-of-00003.parquet (~3.8 GB each) |
| test | 3 | test-{00-02}-of-00003.parquet (~3.8 GB each) |
No incomplete/lock files. Ready to proceed with Whisper training prep — want me to check the schema, row counts, or start building the training script?