Very interesting notes and as commented by one of the sibling comments it is really a rumination on abstraction rather on than implementation off llm.c itself.
It will be very interesting to implement this small llm in D language and use the new tiny autograd library in D [1]. Since now D compiler does support C, this should results in less line of codes excluding the small autograd library in D [2]. Given that the D code will be able to inline C code, and even CTFE it, if done correctly the implementation will be faster as well.
[1] A tiny autograd library:
https://code.dlang.org/packages/tiny-autodiff
[2] Adding ANSI C11 C compiler to D so it can import and compile C files directly:
Actual link: https://twitter.com/karpathy/status/1778153659106533806
It's not a thread, no nitter needed.
This does not explain llm.c. It is more like a rumination on abstraction.
I'm curious who the target audience of this tweet is. Are there a lot of people who know all about tensors, deep learning, and PyTorch but have no idea what C is?