Hacker Next
Implementing DeepSeek R1's GRPO algorithm from scratch (github.com)
192 points by xcodevn 3 days ago | 3 comments