A curated collection of interesting GitHub repositories
View the Project on GitHub
TinyZero: reproduction of DeepSeek R1-Zero for countdown and multiplication tasks using reinforcement learning
View on index