Tags with descriptions

Owned by simonw, visibility: Public

SQL query
WITH tag_counts AS (
  SELECT 
    tag, 
    description,
    (SELECT count(DISTINCT entry_id) 
     FROM blog_entry_tags 
     WHERE blog_entry_tags.tag_id = blog_tag.id) AS entry_count,
    (SELECT count(DISTINCT blogmark_id) 
     FROM blog_blogmark_tags 
     WHERE blog_blogmark_tags.tag_id = blog_tag.id) AS blogmark_count,
    (SELECT count(DISTINCT quotation_id) 
     FROM blog_quotation_tags 
     WHERE blog_quotation_tags.tag_id = blog_tag.id) AS quotation_count
  FROM blog_tag
  WHERE description IS NOT NULL AND description != ''
)
SELECT 
  tag,
  'https://simonwillison.net/tags/' || tag || '/' as url,
  description,
  (entry_count + blogmark_count + quotation_count) AS post_count,
  entry_count,
  blogmark_count,
  quotation_count
FROM tag_counts

113 rows

tag url description post_count entry_count blogmark_count quotation_count
clip https://simonwillison.net/tags/clip/ [CLIP](https://github.com/openai/CLIP) is an embedding model by OpenAI that supports comparisons between images and text. 8 2 6 0
free-software-foundation https://simonwillison.net/tags/free-software-foundation/ The FSF is the [Free Software Foundation](https://www.fsf.org/). 3 0 3 0
annotated-release-notes https://simonwillison.net/tags/annotated-release-notes/ I sometimes publish extended annotated versions of the release notes for my projects. 33 26 7 0
evals https://simonwillison.net/tags/evals/ Evaluations and benchmarks of AI systems, for example to compare different models and prompting strategies. 23 3 14 6
marimo https://simonwillison.net/tags/marimo/ [Marimo](https://marimo.io/) is an open-source reactive notebook for Python. 4 0 4 0
weeknotes https://simonwillison.net/tags/weeknotes/ I publish weeknotes once every two or three weeks documenting what I've been working on recently. 193 193 0 0
datasette-cloud https://simonwillison.net/tags/datasette-cloud/ The SaaS hosted version of Datasette, at [datasette.cloud](https://www.datasette.cloud/) 46 33 13 0
s3-credentials https://simonwillison.net/tags/s3-credentials/ [s3-credentials](https://s3-credentials.readthedocs.io/) is a CLI tool for creating and managing credentials for S3 buckets. 9 7 2 0
llm-tool-use https://simonwillison.net/tags/llm-tool-use/ Tool use is when an LLM is instructed to occasionally request that an external tool be run on its behalf, with the result passed back to the model for further processing. Sometimes also known as function calling, and one of several ideas that might be referred to as "agents". 32 5 26 1
ham-radio https://simonwillison.net/tags/ham-radio/ The [amateur radio](https://en.wikipedia.org/wiki/Amateur_radio) hobby. 3 0 3 0
phi https://simonwillison.net/tags/phi/ Microsoft's [Phi family](https://azure.microsoft.com/en-us/products/phi) of small Large Language Models. 10 2 8 0
llama https://simonwillison.net/tags/llama/ The [Llama](https://www.llama.com/) family of open weight LLMs from Meta/Facebook AI research. 74 16 45 13
llm https://simonwillison.net/tags/llm/ [LLM](https://llm.datasette.io/) is my command-line tool for running prompts against Large Language Models. 180 65 113 2
recall https://simonwillison.net/tags/recall/ Microsoft Recall is a feature planned for Windows 11 that allows users to search through automated screenshots taken of their previous activity on their PC. 4 0 3 1
rag https://simonwillison.net/tags/rag/ RAG stands for **Retrieval Augmented Generation**. It's a trick where you find additional context relevant to the user's request using other means (such as full-text or vector search) and populate that context as part of the prompt to a Large Language Model. 35 9 21 5
projects https://simonwillison.net/tags/projects/ Posts about projects I have worked on. 482 261 221 0
datasette https://simonwillison.net/tags/datasette/ [Datasette](https://datasette.io/) is an open source tool for exploring and publishing data. 433 238 192 3
claude https://simonwillison.net/tags/claude/ [Claude](https://www.anthropic.com/claude) is Anthropic's family of Large Language Models. 150 28 98 24
anthropic https://simonwillison.net/tags/anthropic/ [Anthropic](https://www.anthropic.com/) are the AI research company behind [Claude](https://claude.ai/). 137 30 82 25
llms https://simonwillison.net/tags/llms/ Large Language Models (LLMs) are the class of technology behind generative text AI systems like OpenAI's [ChatGPT](https://simonwillison.net/tags/chatgpt/), Google's [Gemini](https://simonwillison.net/tags/gemini/) and Anthropic's [Claude](https://simonwillison.net/tags/claude/). 1085 171 656 258
openai https://simonwillison.net/tags/openai/ [OpenAI](https://openai.com/) build [ChatGPT](https://chatgpt.com/) and the GPT series of Large Language Models. 292 72 165 55
mistral https://simonwillison.net/tags/mistral/ [Mistral AI](https://mistral.ai/) release both openly licensed and API-hosted Language Models. 32 6 25 1
boring-technology https://simonwillison.net/tags/boring-technology/ [boringtechnology.club](https://boringtechnology.club/) 8 4 4 0
datasette-lite https://simonwillison.net/tags/datasette-lite/ Datasette running entirely in the browser using Pyodide and WebAssembly, at [lite.datasette.io](https://lite.datasette.io/) 18 13 5 0
sqlite-utils https://simonwillison.net/tags/sqlite-utils/ [sqlite-utils](https://sqlite-utils.datasette.io/) is a Python library and command-line utility for creating and manipulating SQLite databases. 87 60 27 0
shot-scraper https://simonwillison.net/tags/shot-scraper/ [shot-scraper](https://shot-scraper.datasette.io/) is a command-line utility for taking screenshots of websites and scraping content from them using JavaScript. 29 13 16 0
training-data https://simonwillison.net/tags/training-data/ Data used to train LLMs and other machine learning models. 51 8 27 16
sqlite https://simonwillison.net/tags/sqlite/ [SQLite](https://www.sqlite.org/) is the world's [most widely deployed database engine](https://www.sqlite.org/mostdeployed.html). 296 75 206 15
embeddings https://simonwillison.net/tags/embeddings/ See [Embeddings: What they are and why they matter](https://simonwillison.net/2023/Oct/23/embeddings/). 52 7 44 1
nomic https://simonwillison.net/tags/nomic/ [Nomic AI](https://www.nomic.ai) develop various interesting AI projects, including [GPT4All](https://gpt4all.io/) and powerful [embedding models](https://gpt4all.io/). 7 0 7 0
python https://simonwillison.net/tags/python/ The [Python](https://www.python.org/) programming language. 1148 211 906 31
o1 https://simonwillison.net/tags/o1/ OpenAI’s [o1 family](https://openai.com/o1/) of models. 30 5 14 11
psf https://simonwillison.net/tags/psf/ The [Python Software Foundation](https://www.python.org/psf-landing/) is the non-profit organization devoted to advancing open source technology related to the Python programming language. 13 4 9 0
dsf https://simonwillison.net/tags/dsf/ The [Django Software Foundation](https://www.djangoproject.com/foundation/). 10 1 8 1
ladybird https://simonwillison.net/tags/ladybird/ [Ladybird](https://ladybird.org/) is "a truly independent web browser". 5 0 4 1
beeware https://simonwillison.net/tags/beeware/ [BeeWare](https://beeware.org/) "allows you to write your app in Python and release it on multiple platforms". 3 0 3 0
vs-code https://simonwillison.net/tags/vs-code/ Microsoft's partially open source text editor [Visual Studio Code](https://code.visualstudio.com/). 4 0 4 0
val-town https://simonwillison.net/tags/val-town/ [Val Town](https://www.val.town/) provides a web interface for building, running and deploying server-side JavaScript apps. 8 1 7 0
prompt-injection https://simonwillison.net/tags/prompt-injection/ **Prompt Injection** is a security attack against applications built on top of Large Language Models, [introduced here](https://simonwillison.net/2022/Sep/12/prompt-injection/) and further described in [this series of posts](https://simonwillison.net/series/prompt-injection/). 96 26 59 11
overture https://simonwillison.net/tags/overture/ [Overture](https://overturemaps.org/) provides interoperable open map data. 6 1 5 0
llm-pricing https://simonwillison.net/tags/llm-pricing/ Posts about the pricing of various LLMs. See also my [pricing calculator](https://tools.simonwillison.net/llm-prices). 33 7 25 1
smollm https://simonwillison.net/tags/smollm/ [SmolLM](https://huggingface.co/blog/smollm) is a series of small Large Language Models by Hugging Face. 5 1 4 0
structured-extraction https://simonwillison.net/tags/structured-extraction/ Using LLMs to extract structured data from unstructured text and images. 10 2 8 0
pelican-riding-a-bicycle https://simonwillison.net/tags/pelican-riding-a-bicycle/ My [benchmark for LLMs](https://simonwillison.net/2024/Oct/25/pelicans-on-a-bicycle/): "Generate an SVG of a pelican riding a bicycle" 26 9 17 0
claude-artifacts https://simonwillison.net/tags/claude-artifacts/ A feature of [Claude](https://claude.ai/) where it can create shareable, executable HTML and JavaScript pages. See [Everything I built with Claude Artifacts this week](https://simonwillison.net/2024/Oct/21/claude-artifacts/) for several detailed examples. 31 10 20 1
cors https://simonwillison.net/tags/cors/ CORS is [Cross-origin resource sharing](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS), a mechanism by which websites can make APIs available to JavaScript running on other domains around the web. 20 9 10 1
pydantic https://simonwillison.net/tags/pydantic/ [Pydantic](https://docs.pydantic.dev/) is a data validation library for Python based around Python's type annotation feature. 12 1 10 1
ollama https://simonwillison.net/tags/ollama/ [Ollama](https://ollama.com/) is a tool for downloading and running Large Language Models. 29 8 21 0
prompt-caching https://simonwillison.net/tags/prompt-caching/ Some LLM providers offer a feature where common prompt prefixes can be cached, providing a performance boost and price reduction. 10 1 7 2
deepseek https://simonwillison.net/tags/deepseek/ [DeepSeek](https://www.deepseek.com/) is an AI lab from Chinese hedge fund [High-Flyer](https://en.wikipedia.org/wiki/High-Flyer_(company)). 22 2 16 4
ai-agents https://simonwillison.net/tags/ai-agents/ I don't think the term "agents" [has a useful](https://simonwillison.net/2024/Dec/31/llms-in-2024/#-agents-still-haven-t-really-happened-yet), widely shared definition. 36 6 23 7
openai-operator https://simonwillison.net/tags/openai-operator/ [Operator](https://openai.com/index/introducing-operator/) is OpenAI's cloud-hosted browser automation "agent". 3 0 3 0
long-context https://simonwillison.net/tags/long-context/ Tracking how LLMs are getting better at accepting large amounts of input. 15 6 7 2
qwen https://simonwillison.net/tags/qwen/ [Qwen](https://huggingface.co/Qwen) is the Large Language Model family built by Alibaba Cloud. 25 6 17 2
multi-modal-output https://simonwillison.net/tags/multi-modal-output/ LLMs that can output non-textual media content such as images an audio. 13 3 9 1
llama-cpp https://simonwillison.net/tags/llama-cpp/ [llama.cpp](https://github.com/ggerganov/llama.cpp) is an LLM inference library written in C/C++. 21 7 13 1
pleias https://simonwillison.net/tags/pleias/ [PleIAs](https://pleias.fr/) is a French AI lab specializing in "energy-efficient LLMs for information-intensive and highly-regulated industries". 3 0 3 0
datasette-public-office-hours https://simonwillison.net/tags/datasette-public-office-hours/ We run these public office hours community livestream sessions once or twice a month on the [Datasette Discord](https://datasette.io/discord). 6 3 3 0
limbo https://simonwillison.net/tags/limbo/ [Limbo](https://github.com/tursodatabase/limbo) is an implementation of a SQLite-compatible database engine in Rust. 3 0 3 0
code-interpreter https://simonwillison.net/tags/code-interpreter/ ChatGPT Code Interpreter is a mode of [ChatGPT](https://chatgpt.com/) that allows it to write and then execute Python code in a sandboxed environment. OpenAI sometimes call this feature [data analysis](https://help.openai.com/en/articles/8437071-data-analysis-with-chatgpt). 23 14 9 0
chrome https://simonwillison.net/tags/chrome/ Google's [Chrome](https://www.google.com/chrome/) browser. 33 3 23 7
slop https://simonwillison.net/tags/slop/ **Slop** describes AI-generated content that is both *unrequested* and *unreviewed*. See [Slop is the new name for unwanted AI-generated content](https://simonwillison.net/2024/May/8/slop/). 24 3 13 8
llamafile https://simonwillison.net/tags/llamafile/ [llamafile](https://github.com/Mozilla-Ocho/llamafile) provides a mechanism to distribute and run LLMs as a single file. 6 4 2 0
django https://simonwillison.net/tags/django/ The [Django](https://www.djangoproject.com/) web framework. 571 101 457 13
go https://simonwillison.net/tags/go/ The [Go](https://go.dev/) programming language. 38 4 30 4
command-r https://simonwillison.net/tags/command-r/ The [Command R](https://cohere.com/command) family of language models from Cohere. 2 0 2 0
github-codespaces https://simonwillison.net/tags/github-codespaces/ [GitHub Codespaces](https://github.com/features/codespaces) provides cloud development environments accessible through a browser. 10 3 7 0
annotated-talks https://simonwillison.net/tags/annotated-talks/ Annotated versions of talks I have given, with extensive notes and additional links. Here's [how I make these](https://simonwillison.net/2023/Aug/6/annotated-presentations/). 24 24 0 0
datasette-desktop https://simonwillison.net/tags/datasette-desktop/ Datasette as an Electron desktop application, at [datasette.io/desktop](https://datasette.io/desktop) 6 5 1 0
vision-llms https://simonwillison.net/tags/vision-llms/ LLMs that can also be used to interpret images and video, such as GPT-4o, Claude 3 and Gemini Pro. 63 17 42 4
prompt-engineering https://simonwillison.net/tags/prompt-engineering/ The subtle art and craft of effectively prompting and building software on top of LLMs. 138 32 83 23
github-copilot https://simonwillison.net/tags/github-copilot/ [GitHub Copilot](https://github.com/features/copilot) is GitHub's extension for VS Code that uses LLMs to provide context-driven code completion suggestions. 13 4 8 1
github-actions https://simonwillison.net/tags/github-actions/ GitHub's [Actions](https://github.com/features/actions) tool for repository automation. 55 28 27 0
git-history https://simonwillison.net/tags/git-history/ [git-history](https://github.com/simonw/git-history) is a tool for analyzing data scraped to a GitHub repository using SQLite. 6 5 1 0
djp https://simonwillison.net/tags/djp/ [DJP](https://djp.readthedocs.io/) is a plugin system for Django. 5 2 3 0
git-scraping https://simonwillison.net/tags/git-scraping/ [Git scraping](https://simonwillison.net/2020/Oct/9/git-scraping/) is a technique where data is scraped from an external source into a Git repository in order to record changes to that data over time. 34 22 12 0
django-sql-dashboard https://simonwillison.net/tags/django-sql-dashboard/ [Django SQL Dashboard](https://django-sql-dashboard.datasette.io) adds a read-only PostgreSQL SQL dashboard interface to a Django project. 14 8 6 0
mlx https://simonwillison.net/tags/mlx/ The [MLX framework](https://github.com/ml-explore/mlx) for running machine learning models on Apple Silicon. 25 7 18 0
notebooklm https://simonwillison.net/tags/notebooklm/ Google's [NotebookLM](https://notebooklm.google/) "personalized research assistant". 5 1 2 2
observable-plot https://simonwillison.net/tags/observable-plot/ The <a href="https://observablehq.com/plot/">Observable Plot</a> JavaScript visualization library. 9 1 8 0
gpus https://simonwillison.net/tags/gpus/ GPUs aren't just for graphics anymore. 14 0 10 4
sqlite-busy https://simonwillison.net/tags/sqlite-busy/ The `SQLITE_BUSY` error - if you're seeing it in WAL mode that's probably because you aren't using `BEGIN IMMEDIATE` on all of your transactions that perform at least one write. 6 1 5 0
grok https://simonwillison.net/tags/grok/ The Grok family of LLMs from [xAI](https://x.ai/). 4 0 4 0
ai2 https://simonwillison.net/tags/ai2/ Ai2 is the [Allen Institute for Artificial Intelligence](https://allenai.org/). 5 0 4 1
podcast-appearances https://simonwillison.net/tags/podcast-appearances/ Times when I went on a podcast. 30 15 15 0
view-transitions https://simonwillison.net/tags/view-transitions/ [CSS view transitions](https://developer.mozilla.org/en-US/docs/Web/CSS/CSS_view_transitions) and the [view transition JavaScript API](https://developer.mozilla.org/en-US/docs/Web/API/View_Transition_API). 3 0 3 0
deep-research https://simonwillison.net/tags/deep-research/ The pattern where LLM tools put together a report based on many searches chained together. Also the name of similar products from Google Gemini, OpenAI and Perplexity. 5 2 2 1
gemma https://simonwillison.net/tags/gemma/ Google's [Gemma family](https://ai.google.dev/gemma) of openly licensed Large Language Models. 7 1 6 0
ai-ethics https://simonwillison.net/tags/ai-ethics/ Ethical concerns related to building and using AI systems. 163 21 69 73
firefox https://simonwillison.net/tags/firefox/ Mozilla's [Firefox](https://www.mozilla.org/en-US/firefox/) browser. 89 7 74 8
llm-release https://simonwillison.net/tags/llm-release/ New releases of various [LLMs](https://simonwillison.net/tags/llms/). 102 22 79 1
vibe-coding https://simonwillison.net/tags/vibe-coding/ As [defined here](https://simonwillison.net/2025/Feb/6/andrej-karpathy/) - not the same thing as [AI-assisted programming](https://simonwillison.net/tags/ai-assisted-programming/), though there's some overlap. 14 4 8 2
ai-assisted-programming https://simonwillison.net/tags/ai-assisted-programming/ Using AI tools such as Large Language Models to help write code. [Vibe coding](https://simonwillison.net/tags/vibe-coding/) is the less responsible subset of this. 162 42 96 24
ai-energy-usage https://simonwillison.net/tags/ai-energy-usage/ How much energy is used by AI systems? 6 1 4 1
ai https://simonwillison.net/tags/ai/ "[AI is whatever hasn't been done yet](https://en.wikipedia.org/wiki/AI_effect)"—Larry Tesler 1270 171 751 348
model-context-protocol https://simonwillison.net/tags/model-context-protocol/ [Model Context Proocol](https://modelcontextprotocol.io/introduction) (MCP) is an open protocol that standardizes how applications provide context to and tools to LLMs. 6 2 3 1
css-custom-properties https://simonwillison.net/tags/css-custom-properties/ Also known as CSS variables. 7 1 6 0
baked-data https://simonwillison.net/tags/baked-data/ The [Baked Data](https://simonwillison.net/2021/Jul/28/baked-data/) architectural pattern. 11 7 3 1
files-to-prompt https://simonwillison.net/tags/files-to-prompt/ [files-to-prompt](https://github.com/simonw/files-to-prompt) concatenate a directory full of files into a single prompt for use with LLMs. 23 5 18 0
local-llms https://simonwillison.net/tags/local-llms/ LLMs that can run on consumer hardware like laptops or mobile phones. 112 32 78 2
exfiltration-attacks https://simonwillison.net/tags/exfiltration-attacks/ **Exfiltration attacks** are [prompt injection](https://simonwillison.net/tags/promptinjection/) attacks against chatbots that have access to private information, where that information is [exfiltrated](https://en.wikipedia.org/wiki/Data_exfiltration) by the attacker. One common form of this is **Markdown exfiltration** where an attacker tricks the bot into rendering a Markdown image that leaks data encoded in the URL to an external server. 25 7 17 1
llm-reasoning https://simonwillison.net/tags/llm-reasoning/ Improving performance of LLMs through spending more tokens "reasoning" about a problem, as seen in OpenAI's [o-series](https://platform.openai.com/docs/guides/reasoning), DeepSeek's [R1](https://arxiv.org/abs/2501.12948), Qwen's [QwQ](https://github.com/QwenLM/QwQ), Google's [Gemini 2.5](https://blog.google/technology/google-deepmind/gemini-model-thinking-updates-march-2025/) and Anthropic's [Claude 3.7 Sonnet](https://blog.google/technology/google-deepmind/gemini-model-thinking-updates-march-2025/). 45 13 23 9
uv https://simonwillison.net/tags/uv/ [uv](https://github.com/astral-sh/uv) is an "extremely fast Python package and project manager, written in Rust". 57 6 48 3
gemini https://simonwillison.net/tags/gemini/ The Gemini family of multimodal LLMs developed by Google DeepMind. 84 29 51 4
generative-ai https://simonwillison.net/tags/generative-ai/ Machine learning systems that can generate new content: text, images, audio, video and more. 1099 168 665 266
ai-assisted-search https://simonwillison.net/tags/ai-assisted-search/ Using LLMs to build better search engines, and providing search tool access to LLMs. 26 7 16 3
soap https://simonwillison.net/tags/soap/ Simple Object Access Protocol. It wasn't very simple. 14 7 6 1
microphone-ads-conspiracy https://simonwillison.net/tags/microphone-ads-conspiracy/ There's a conspiracy theory that apps on your phone are serving you targeted ads based on spying on what you say through your microphone. I do not believe that this is true. 4 2 2 0
geoguessing https://simonwillison.net/tags/geoguessing/ Guessing the location of an image based on visual clues. 3 1 1 1
ai-personality https://simonwillison.net/tags/ai-personality/ The weird craft of establishing a personality for an AI system. 16 4 7 5
chatbot-arena https://simonwillison.net/tags/chatbot-arena/ The [Chatbot Arena](https://lmarena.ai/) ranks LLMs based on votes from a community of users for the best response from anonymized model pairs. 25 7 15 3
semantic-diffusion https://simonwillison.net/tags/semantic-diffusion/ [Semantic diffusion](https://martinfowler.com/bliki/SemanticDiffusion.html) occurs when a new piece of jargon inevitably takes on an unintended and incorrect definition as it uncontrollably spreads beyond its origin. 4 3 1 0
tokenization https://simonwillison.net/tags/tokenization/ How Large Language Models split text up into tokens. 10 2 8 0
Copy and export data

Duration: 11.29ms