Previous post (but different): https://news.ycombinator.com/item?id=35780921
:( I didn't have the patience to watch the entire video with full attention. I know you can use rounding errors as a substitute for a non-linearity. Can somebody summarize what exactly he does for gradient descent here, and how well it works?
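For anyone unfamiliar with the rounding-error idea mentioned above, here is a minimal sketch of it. This is only my own illustration and not necessarily what the video does: the function `f` and the constant `BIG` are made up for the example, and it uses ordinary 64-bit Python floats. On paper `f` is the identity, so it is linear, but the intermediate sum gets rounded, so the computed function behaves like a step function.

```python
# Illustration only; f and BIG are hypothetical names, not taken from the video.
BIG = 2.0 ** 52  # at this magnitude, adjacent float64 values are exactly 1.0 apart


def f(x: float) -> float:
    """Mathematically the identity, but x + BIG is rounded to a whole number first."""
    return (x + BIG) - BIG


a = f(0.3)  # 0.3 is rounded away: 0.0
b = f(0.6)  # 0.6 is rounded up: 1.0
print(a, b)

# A truly linear function would satisfy f(0.3) + f(0.3) == f(0.6).
print(a + a == b)  # False
```

So a chain of "linear" floating-point operations can still give you a non-linear function, which is roughly what an activation function is there to provide.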
tom7 is always the right mix of genius, madness and fun