Implementing DeepSeek R1's GRPO algorithm from scratch

Article URL: https://github.com/policy-gradient/GRPO-Zero Comments URL: https://news.ycombinator.com/item?id=43674825 Points: 4 # Comments: 0

Avr 13, 2025 - 21:55
 0
Implementing DeepSeek R1's GRPO algorithm from scratch

Article URL: https://github.com/policy-gradient/GRPO-Zero

Comments URL: https://news.ycombinator.com/item?id=43674825

Points: 4

# Comments: 0