Top
New
🌕
desideratum
joined
7/24/2019, 9:24:19 PM
has
84
karma
Recent Posts
Training LLMs with GRPO and Interpreter Feedback Using WebAssembly
by
desideratum
on 4/6/2025, 1:42:34 PM with
0
comments
Training Large Language Models with Interpreter Feedback Using WebAssembly
by
desideratum
on 4/3/2025, 4:39:21 PM with
0
comments
DeepSeek-V3-0324
by
desideratum
on 3/24/2025, 8:39:25 PM with
2
comments
Training Process Reward Models in Axolotl
by
desideratum
on 2/26/2025, 9:01:58 AM with
0
comments
Torchtune – a native PyTorch library for fine-tuning LLMs
by
desideratum
on 10/8/2024, 6:11:07 PM with
0
comments
(Deep Learning Based) Opportunistic Screening to Improve Statin Rates
by
desideratum
on 4/15/2024, 10:22:37 AM with
0
comments
The theory of Proximal Policy Optimisation implementations
by
desideratum
on 4/11/2024, 11:16:32 AM with
0
comments
Ask HN: Feel like I'm being lowballed by founders. Where do I go from here?
by
desideratum
on 12/31/2020, 7:08:25 PM with
6
comments