Simon Willison on annotated-talks

30 posts tagged “annotated-talks”

Annotated versions of talks I have given, with extensive notes and additional links. Here's how I make these.

2025

Living dangerously with Claude

I gave a talk last night at Claude Code Anonymous in San Francisco, the unofficial meetup for coding agent enthusiasts. I decided to talk about a dichotomy I’ve been struggling with recently. On the one hand I’m getting enormous value from running coding agents with as few restrictions as possible. On the other hand I’m deeply concerned by the risks that accompany that freedom.

[... 2,208 words]

12:20 pm / 22nd October 2025 / sandboxing, security, ai, webassembly, prompt-injection, generative-ai, llms, anthropic, claude, annotated-talks, ai-agents, coding-agents, claude-code, lethal-trifecta, async-coding-agents

My Lethal Trifecta talk at the Bay Area AI Security Meetup

I gave a talk on Wednesday at the Bay Area AI Security Meetup about prompt injection, the lethal trifecta and the challenges of securing systems that use MCP. It wasn’t recorded but I’ve created an annotated presentation with my slides and detailed notes on everything I talked about.

[... 2,843 words]

4:30 am / 9th August 2025 / security, my-talks, ai, prompt-injection, generative-ai, llms, annotated-talks, exfiltration-attacks, model-context-protocol, lethal-trifecta

Happy 20th birthday Django! Here’s my talk on Django Origins from Django’s 10th

Today is the 20th anniversary of the first commit to the public Django repository!

[... 8,994 words]

6:47 pm / 13th July 2025 / adrian-holovaty, devfort, django, history, jacob-kaplan-moss, lawrence, lawrence-com, lawrence-journal-world, python, my-talks, the-guardian, annotated-talks

The last six months in LLMs, illustrated by pelicans on bicycles

I presented an invited keynote at the AI Engineer World’s Fair in San Francisco this week. This is my third time speaking at the event—here are my talks from October 2023 and June 2024. My topic this time was “The last six months in LLMs”—originally planned as the last year, but so much has happened that I had to reduce my scope!

[... 6,077 words]

8:42 pm / 6th June 2025 / speaking, my-talks, ai, openai, generative-ai, llms, anthropic, annotated-talks, mistral, gemini, pelican-riding-a-bicycle, deepseek, lethal-trifecta, ai-in-china

Annotated Presentation Creator. I've released a new version of my tool for creating annotated presentations. I use this to turn slides from my talks into posts like this one - here are a bunch more examples.

I wrote the first version in August 2023 making extensive use of ChatGPT and GPT-4. That older version can still be seen here.

This new edition is a design refresh using Claude 3.7 Sonnet (thinking). I ran this command:

llm \
  -f https://til.simonwillison.net/tools/annotated-presentations \
  -s 'Improve this tool by making it respnonsive for mobile, improving the styling' \
  -m claude-3.7-sonnet -o thinking 1

That uses -f to fetch the original HTML (which has embedded CSS and JavaScript in a single page, convenient for working with LLMs) as a prompt fragment, then applies the system prompt instructions "Improve this tool by making it respnonsive for mobile, improving the styling" (typo included).

Here's the full transcript (generated using llm logs -cue) and a diff illustrating the changes. Total cost 10.7781 cents.

There was one visual glitch: the slides were distorted like this:

The slide is distorted by being too high for its width

I decided to try o4-mini to see if it could spot the problem (after fixing this LLM bug):

llm o4-mini \
  -a bug.png \
  -f https://tools.simonwillison.net/annotated-presentations \
  -s 'Suggest a minimal fix for this distorted image'

It suggested adding align-items: flex-start; to my .bundle class (it quoted the @media (min-width: 768px) bit but the solution was to add it to .bundle at the top level), which fixed the bug.

# 15th May 2025, 2:41 pm / css, tools, ai, openai, generative-ai, llms, ai-assisted-programming, claude, annotated-talks, vibe-coding

Building software on top of Large Language Models

I presented a three hour workshop at PyCon US yesterday titled Building software on top of Large Language Models. The goal of the workshop was to give participants everything they needed to get started writing code that makes use of LLMs.

[... 3,726 words]

12:25 pm / 15th May 2025 / pycon, speaking, my-talks, ai, openai, generative-ai, local-llms, llms, embeddings, llm, anthropic, annotated-talks, gemini, vision-llms, llm-tool-use, llm-pricing, llm-reasoning, long-context

What’s new in the world of LLMs, for NICAR 2025

I presented two sessions at the NICAR 2025 data journalism conference this year. The first was this one based on my review of LLMs in 2024, extended by several months to cover everything that’s happened in 2025 so far. The second was a workshop on Cutting-edge web scraping techniques, which I’ve written up separately.

[... 2,797 words]

11:19 pm / 8th March 2025 / data-journalism, speaking, my-talks, ai, generative-ai, local-llms, llms, annotated-talks, gemini, nicar, vision-llms, chatbot-arena

2024

Imitation Intelligence, my keynote for PyCon US 2024

I gave an invited keynote at PyCon US 2024 in Pittsburgh this year. My goal was to say some interesting things about AI—specifically about Large Language Models—both to help catch people up who may not have been paying close attention, but also to give people who were paying close attention some new things to think about.

[... 10,624 words]

4:59 am / 14th July 2024 / definitions, pycon, python, my-talks, ai, generative-ai, llms, annotated-talks, chatbot-arena

Open challenges for AI engineering

I gave the opening keynote at the AI Engineer World’s Fair yesterday. I was a late addition to the schedule: OpenAI pulled out of their slot at the last minute, and I was invited to put together a 20 minute talk with just under 24 hours notice!

[... 5,640 words]

4:35 pm / 27th June 2024 / speaking, my-talks, dropbox, ai, slack, prompt-injection, generative-ai, llms, annotated-talks, slop, exfiltration-attacks, chatbot-arena

Building search-based RAG using Claude, Datasette and Val Town

Retrieval Augmented Generation (RAG) is a technique for adding extra “knowledge” to systems built on LLMs, allowing them to answer questions against custom information not included in their training data. A common way to implement this is to take a question from a user, translate that into a set of search queries, run those against a search engine and then feed the results back into the LLM to generate an answer.

[... 3,372 words]

8:44 pm / 21st June 2024 / projects, my-talks, ai, datasette, prompt-engineering, generative-ai, llms, ai-assisted-programming, anthropic, claude, annotated-talks, val-town, rag, claude-artifacts, claude-3-5-sonnet, steve-krouse, ai-assisted-search, prompt-to-app

Language models on the command-line

I gave a talk about accessing Large Language Models from the command-line last week as part of the Mastering LLMs: A Conference For Developers & Data Scientists six week long online conference. The talk focused on my LLM Python command-line utility and ways you can use it (and its plugins) to explore LLMs and use them for useful tasks.

[... 4,992 words]

4:44 pm / 17th June 2024 / cli, projects, my-talks, ai, datasette, openai, generative-ai, local-llms, llms, llm, anthropic, annotated-talks, llamafile, ollama, files-to-prompt, macwhisper

AI for Data Journalism: demonstrating what we can do with this stuff right now

I gave a talk last month at the Story Discovery at Scale data journalism conference hosted at Stanford by Big Local News. My brief was to go deep into the things we can use Large Language Models for right now, illustrated by a flurry of demos to help provide starting points for further conversations at the conference.

[... 6,081 words]

9:04 pm / 17th April 2024 / data-journalism, journalism, projects, my-talks, ai, datasette, datasette-cloud, generative-ai, llms, llm, annotated-talks, code-interpreter, enrichments, vision-llms, structured-extraction, coding-agents, macwhisper

2023

Financial sustainability for open source projects at GitHub Universe

I presented a ten minute segment at GitHub Universe on Wednesday, ambitiously titled Financial sustainability for open source projects.

[... 2,485 words]

10:48 pm / 10th November 2023 / github, open-source, my-talks, datasette, datasette-cloud, annotated-talks

Embeddings: What they are and why they matter

Embeddings are a really neat trick that often come wrapped in a pile of intimidating jargon.

[... 5,835 words]

1:36 pm / 23rd October 2023 / my-talks, ai, generative-ai, vector-search, embeddings, llm, annotated-talks, rag, clip

Open questions for AI engineering

Last week I gave the closing keynote at the AI Engineer Summit in San Francisco. I was asked by the organizers to both summarize the conference, summarize the last year of activity in the space and give the audience something to think about by posing some open questions for them to take home.

[... 6,928 words]

2:18 pm / 17th October 2023 / my-talks, ai, generative-ai, llms, llm, annotated-talks, code-interpreter, coding-agents

Making Large Language Models work for you

I gave an invited keynote at WordCamp 2023 in National Harbor, Maryland on Friday.

[... 14,189 words]

2:35 pm / 27th August 2023 / speaking, my-talks, wordpress, ai, generative-ai, llms, llm, annotated-talks, code-interpreter, rag, coding-agents

How I make annotated presentations

Giving a talk is a lot of work. I go by a rule of thumb I learned from Damian Conway: a minimum of ten hours of preparation for every one hour spent on stage.

[... 2,128 words]

5:15 pm / 6th August 2023 / alt-text, ocr, projects, speaking, my-talks, tools, ai, generative-ai, llms, ai-assisted-programming, anthropic, claude, annotated-talks

Catching up on the weird world of LLMs

I gave a talk on Sunday at North Bay Python where I attempted to summarize the last few years of development in the space of LLMs—Large Language Models, the technology behind tools like ChatGPT, Google Bard and Llama 2.

[... 10,489 words]

2:51 pm / 3rd August 2023 / ethics, python, my-talks, ai, openai, generative-ai, chatgpt, llms, llm, anthropic, claude, annotated-talks, code-interpreter, ai-ethics, coding-agents

Prompt injection explained, with video, slides, and a transcript

I participated in a webinar this morning about prompt injection, organized by LangChain and hosted by Harrison Chase, with Willem Pienaar, Kojin Oshiba (Robust Intelligence), and Jonathan Cohen and Christopher Parisien (Nvidia Research).

[... 3,120 words]

8:22 pm / 2nd May 2023 / security, my-talks, ai, prompt-engineering, prompt-injection, generative-ai, llms, annotated-talks, exfiltration-attacks

2022

Coping strategies for the serial project hoarder

I gave a talk at DjangoCon US 2022 in San Diego last month about productivity on personal projects, titled “Massively increase your productivity on personal projects with comprehensive documentation and automated tests”.

[... 3,865 words]

3:47 pm / 26th November 2022 / djangocon, documentation, productivity, my-talks, testing, annotated-talks, github-issues

2021

How to build, test and publish an open source Python library

At PyGotham this year I presented a ten minute workshop on how to package up a new open source Python library and publish it to the Python Package Index. Here is the video and accompanying notes, which should make sense even without watching the talk.

[... 2,055 words]

10:02 pm / 4th November 2021 / github, open-source, pypi, python, my-talks, github-actions, annotated-talks

Datasette—an ecosystem of tools for working with small data

This is the transcript and video from a talk I gave at PyGotham 2020 about using SQLite, Datasette and Dogsheep to work with small data.

[... 4,655 words]

6:13 pm / 22nd July 2021 / sqlite, my-talks, datasette, dogsheep, small-data, annotated-talks

Git scraping, the five minute lightning talk

I prepared a lightning talk about Git scraping for the NICAR 2021 data journalism conference. In the talk I explain the idea of running scheduled scrapers in GitHub Actions, show some examples and then live code a new scraper for the CDC’s vaccination data using the GitHub web interface. Here’s the video.

[... 289 words]

12:44 am / 5th March 2021 / data-journalism, scraping, my-talks, github-actions, git-scraping, annotated-talks, nicar

Video introduction to Datasette and sqlite-utils

I put together a 17 minute video introduction to Datasette and sqlite-utils for FOSDEM 2021, showing how you can use Datasette to explore data, and demonstrating using the sqlite-utils command-line tool to convert a CSV file into a SQLite database, and then publish it using datasette publish. Here’s the video, plus annotated screen captures with further links and commentary.

[... 1,969 words]

9 pm / 7th February 2021 / my-talks, datasette, sqlite-utils, annotated-talks

2020

Personal Data Warehouses: Reclaiming Your Data

I gave a talk yesterday about personal data warehouses for GitHub’s OCTO Speaker Series, focusing on my Datasette and Dogsheep projects. The video of the talk is now available, and I’m presenting that here along with an annotated summary of the talk, including links to demos and further information.

[... 5,166 words]

3:53 am / 14th November 2020 / github, speaking, my-talks, datasette, dogsheep, weeknotes, sqlite-utils, annotated-talks

2018

How to Instantly Publish Data to the Internet with Datasette

I presented a session about Datasette at the PyBay 2018 conference in San Francisco. I talked about the project itself and demonstrated ways of creating and publishing databases using csvs-to-sqlite, Datasette Publish and my new sqlite-utils library.

[... 2,043 words]

11:23 pm / 19th August 2018 / my-talks, datasette, sqlite-utils, annotated-talks

2010

Comprehensive notes from my three hour Redis tutorial

Last week I presented two talks at the inaugural NoSQL Europe conference in London. The first was presented with Matthew Wall and covered the ways in which we have been exploring NoSQL at the Guardian. The second was a three hour workshop on Redis, my favourite piece of software to have the NoSQL label applied to it.

[... 263 words]

10:36 pm / 25th April 2010 / brightonmarathon, guardian, marathon, nosql, redis, running, my-talks, highlights, annotated-talks

2009

Node.js is genuinely exciting

I gave a talk on Friday at Full Frontal, a new one day JavaScript conference in my home town of Brighton. I ended up throwing away my intended topic (JSONP, APIs and cross-domain security) three days before the event in favour of a technology which first crossed my radar less than two weeks ago.

[... 2,025 words]

12:50 pm / 23rd November 2009 / async, comet, couchdb, eventio, http, javascript, nodejs, nosql, redis, ryan-dahl, my-talks, tornado, twisted, v8, highlights, annotated-talks

2007

Comet works, and it’s easier than you think

I gave a talk this morning at the Yahoo! Web Developer Summit on Comet, cometd and Bayeux.

[... 1,314 words]

4:22 pm / 5th December 2007 / bayeux, comet, cometd, java, javascript, jetty, speaking, my-talks, annotated-talks

Doing Local Right

“Doing Local Right” was the title of my talk at this year’s @media Europe. Patrick had asked me if I could put together a case study, and I jumped at the chance to share some of the work of my former colleagues at the Lawrence Journal-World newspaper in Lawrence, Kansas. I had the privilege of working at the newspaper for a year in late 2003-2004.

[... 735 words]

10:46 pm / 11th June 2007 / atmedia, atmedia07, atmedia2007, kansas, lawrence, ljworld, local, newspapers, speaking, my-talks, annotated-talks