The Random Transformer

“Understand how transformers work by demystifying all the math behind them.” Omar Sanseviero from Hugging Face meticulously implements the transformer architecture behind LLMs from scratch using Python and NumPy. There’s a lot to take in here, but it’s all very clearly explained.

Posted 10th January 2024 at 5:09 am

Simon Willison’s Weblog
