Hacker News

Arrows of Time for Large Language Models

by tianlongon 2/2/2024, 3:33:39 PM with 2 comments

by nyoncoreon 2/2/2024, 3:47:31 PM
Isn't it obvious that since LLM are trained to predict the next word they do better than to predict the previous one?
by tianlongon 2/2/2024, 4:31:56 PM
There is a link with entropy creation?