Recent commits
Default branch activity.
[update] empty_think_ratio
[update] empty_think_ratio
[feat] data process
[update] save interval
[update] safe half
๐๐ ใๅคงๆจกๅใ2ๅฐๆถๅฎๅ จไป0่ฎญ็ป26M็ๅฐๅๆฐGPT๏ผ๐ Train a 26M-parameter GPT from scratch in just 2h!
[update] empty_think_ratio
[update] empty_think_ratio
[feat] data process
[update] save interval
[update] safe half
[update] empty_think_ratio
[update] empty_think_ratio
[feat] data process
[update] save interval
[update] safe half