Simon Willison’s Weblog

The Random Transformer: “Understand how transformers work by demystifying all the math behind them.” Omar Sanseviero from Hugging Face meticulously implements the transformer architecture behind LLMs from scratch using Python and numpy. There’s a lot to take in here, but it’s all very clearly explained.

Posted 10th January 2024 at 5:09 am

Tags: python, transformers, ai, numpy, generativeai, llms