This idea has essentially already been implemented (even before this post was published) in a construction called mixture of experts (MoE). It makes the model easier to train while still keeping it somewhat interconnected: a gating network routes each input to a small subset of expert sub-networks, so only a fraction of the parameters is active per input, yet all experts share the same interface and are trained jointly.
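For concreteness, here is a rough sketch of what such a layer can look like. This is a minimal illustrative example with top-k routing; the expert sizes, parameter names, and routing details are my own assumptions, not any particular published implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixtureOfExperts(nn.Module):
    """Minimal sparse mixture-of-experts layer: a gating network sends each
    input to its top-k experts and combines their outputs by the gate weights."""

    def __init__(self, dim: int, num_experts: int = 4, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        # Each expert is a small independent feed-forward network (sizes are arbitrary here).
        self.experts = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, 4 * dim), nn.ReLU(), nn.Linear(4 * dim, dim))
             for _ in range(num_experts)]
        )
        # The gate decides which experts each input is routed to.
        self.gate = nn.Linear(dim, num_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, dim). Score all experts, keep only the top-k per input.
        scores = self.gate(x)                                # (batch, num_experts)
        top_vals, top_idx = scores.topk(self.top_k, dim=-1)  # (batch, top_k)
        weights = F.softmax(top_vals, dim=-1)                # renormalise over the chosen experts

        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = top_idx[:, slot] == e                 # inputs routed to expert e in this slot
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

# Usage: route a batch of 8 vectors of width 16 through 4 experts, 2 experts per input.
layer = MixtureOfExperts(dim=16, num_experts=4, top_k=2)
print(layer(torch.randn(8, 16)).shape)  # torch.Size([8, 16])
```

The point of the construction is visible in the loop: each input only pays for `top_k` experts rather than all of them, while the shared gate keeps the experts coupled enough to be trained as one model.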