Jiayi-Pan/TinyZero
TinyZero: reproduction of DeepSeek R1-Zero for countdown and multiplication tasks using reinforcement learning

View on index · View in 3D Map
// SURVEILLANCE FEED
Discovered repositories from the open source frontier
TinyZero: reproduction of DeepSeek R1-Zero for countdown and multiplication tasks using reinforcement learning

View on index · View in 3D Map