Hacker News

desideratum

joined 7/24/2019, 9:24:19 PM; has 84 karma

Recent Posts

Training LLMs with GRPO and Interpreter Feedback Using WebAssembly
by desideratum on 4/6/2025, 1:42:34 PM with 0 comments
Training Large Language Models with Interpreter Feedback Using WebAssembly
by desideratum on 4/3/2025, 4:39:21 PM with 0 comments
DeepSeek-V3-0324
by desideratum on 3/24/2025, 8:39:25 PM with 2 comments
Training Process Reward Models in Axolotl
by desideratum on 2/26/2025, 9:01:58 AM with 0 comments
Torchtune – a native PyTorch library for fine-tuning LLMs
by desideratum on 10/8/2024, 6:11:07 PM with 0 comments
(Deep Learning Based) Opportunistic Screening to Improve Statin Rates
by desideratum on 4/15/2024, 10:22:37 AM with 0 comments
The theory of Proximal Policy Optimisation implementations
by desideratum on 4/11/2024, 11:16:32 AM with 0 comments
Ask HN: Feel like I'm being lowballed by founders. Where do I go from here?
by desideratum on 12/31/2020, 7:08:25 PM with 6 comments