Hacker News

Top New

🌕

DeepSeek R1 Theory Overview (GRPO and RL and SFT)

by research_pieon 1/31/2025, 2:37:42 PM with 1 comment