Recent commits
Default branch activity.
[update] empty_think_ratio
[update] empty_think_ratio
[feat] data process
[update] save interval
[update] safe half
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
[update] empty_think_ratio
[update] empty_think_ratio
[feat] data process
[update] save interval
[update] safe half
[update] empty_think_ratio
[update] empty_think_ratio
[feat] data process
[update] save interval
[update] safe half