microsoft/rStar
math reasoning model using agentic RL, efficient tool use, strong benchmark results

View on index · View in 3D Map
// SURVEILLANCE FEED
Discovered repositories from the open source frontier
math reasoning model using agentic RL, efficient tool use, strong benchmark results

View on index · View in 3D Map