Tags with descriptions

Owned by simonw, visibility: Public

SQL query
WITH tag_counts AS (
  SELECT 
    tag, 
    description,
    (SELECT count(DISTINCT entry_id) 
     FROM blog_entry_tags 
     WHERE blog_entry_tags.tag_id = blog_tag.id) AS entry_count,
    (SELECT count(DISTINCT blogmark_id) 
     FROM blog_blogmark_tags 
     WHERE blog_blogmark_tags.tag_id = blog_tag.id) AS blogmark_count,
    (SELECT count(DISTINCT quotation_id) 
     FROM blog_quotation_tags 
     WHERE blog_quotation_tags.tag_id = blog_tag.id) AS quotation_count
  FROM blog_tag
  WHERE description IS NOT NULL AND description != ''
)
SELECT 
  tag,
  'https://simonwillison.net/tags/' || tag || '/' AS url,
  description,
  (entry_count + blogmark_count + quotation_count) AS post_count,
  entry_count,
  blogmark_count,
  quotation_count
FROM tag_counts

75 rows

| tag | url | description | post_count | entry_count | blogmark_count | quotation_count |
| --- | --- | --- | --- | --- | --- | --- |
| clip | https://simonwillison.net/tags/clip/ | [CLIP](https://github.com/openai/CLIP) is an embedding model by OpenAI that supports comparisons between images and text. | 8 | 2 | 6 | 0 |
| annotated-release-notes | https://simonwillison.net/tags/annotated-release-notes/ | I sometimes publish extended annotated versions of the release notes for my projects. | 24 | 22 | 2 | 0 |
| evals | https://simonwillison.net/tags/evals/ | Evaluations and benchmarks of AI systems, for example to compare different models and prompting strategies. | 15 | 1 | 10 | 4 |
| marimo | https://simonwillison.net/tags/marimo/ | [Marimo](https://marimo.io/) is an open-source reactive notebook for Python. | 4 | 0 | 4 | 0 |
| weeknotes | https://simonwillison.net/tags/weeknotes/ | I publish weeknotes once every two or three weeks documenting what I've been working on recently. | 192 | 192 | 0 | 0 |
| datasette-cloud | https://simonwillison.net/tags/datasette-cloud/ | The SaaS hosted version of Datasette, at [datasette.cloud](https://www.datasette.cloud/) | 45 | 33 | 12 | 0 |
| s3-credentials | https://simonwillison.net/tags/s3-credentials/ | [s3-credentials](https://s3-credentials.readthedocs.io/) is a CLI tool for creating and managing credentials for S3 buckets. | 9 | 7 | 2 | 0 |
| llm-tool-use | https://simonwillison.net/tags/llm-tool-use/ | Tool use is when an LLM is instructed to occasionally request that an external tool be run on its behalf, with the result passed back to the model for further processing. Sometimes also known as function calling, and one of several ideas that might be referred to as "agents". | 13 | 2 | 11 | 0 |
| ham-radio | https://simonwillison.net/tags/ham-radio/ | The [amateur radio](https://en.wikipedia.org/wiki/Amateur_radio) hobby. | 3 | 0 | 3 | 0 |
| phi | https://simonwillison.net/tags/phi/ | Microsoft's [Phi family](https://azure.microsoft.com/en-us/products/phi) of small Large Language Models. | 9 | 1 | 8 | 0 |
| llm | https://simonwillison.net/tags/llm/ | [LLM](https://llm.datasette.io/) is my command-line tool for running prompts against Large Language Models. | 123 | 49 | 72 | 2 |
| recall | https://simonwillison.net/tags/recall/ | Microsoft Recall is a feature planned for Windows 11 that allows users to search through automated screenshots taken of their previous activity on their PC. | 4 | 0 | 3 | 1 |
| rag | https://simonwillison.net/tags/rag/ | RAG stands for **Retrieval Augmented Generation**. It's a trick where you find additional context relevant to the user's request using other means (such as full-text or vector search) and populate that context as part of the prompt to a Large Language Model. | 29 | 8 | 17 | 4 |
| projects | https://simonwillison.net/tags/projects/ | Posts about projects I have worked on. | 449 | 248 | 201 | 0 |
| datasette | https://simonwillison.net/tags/datasette/ | [Datasette](https://datasette.io/) is an open source tool for exploring and publishing data. | 424 | 236 | 186 | 2 |
| claude | https://simonwillison.net/tags/claude/ | [Claude](https://www.anthropic.com/claude) is Anthropic's family of Large Language Models. | 111 | 23 | 70 | 18 |
| anthropic | https://simonwillison.net/tags/anthropic/ | [Anthropic](https://www.anthropic.com/) are the AI research company behind [Claude](https://claude.ai/). | 105 | 23 | 63 | 19 |
| llms | https://simonwillison.net/tags/llms/ | Large Language Models (LLMs) are the class of technology behind generative text AI systems like OpenAI's [ChatGPT](https://simonwillison.net/tags/chatgpt/), Google's [Gemini](https://simonwillison.net/tags/gemini/) and Anthropic's [Claude](https://simonwillison.net/tags/claude/). | 877 | 138 | 529 | 210 |
| openai | https://simonwillison.net/tags/openai/ | [OpenAI](https://openai.com/) build [ChatGPT](https://chatgpt.com/) and the GPT series of Large Language Models. | 243 | 62 | 138 | 43 |
| mistral | https://simonwillison.net/tags/mistral/ | [Mistral AI](https://mistral.ai/) release both openly licensed and API-hosted Language Models. | 21 | 4 | 16 | 1 |
| boring-technology | https://simonwillison.net/tags/boring-technology/ | [boringtechnology.club](https://boringtechnology.club/) | 7 | 3 | 4 | 0 |
| datasette-lite | https://simonwillison.net/tags/datasette-lite/ | Datasette running entirely in the browser using Pyodide and WebAssembly, at [lite.datasette.io](https://lite.datasette.io/) | 16 | 12 | 4 | 0 |
| sqlite-utils | https://simonwillison.net/tags/sqlite-utils/ | [sqlite-utils](https://sqlite-utils.datasette.io/) is a Python library and command-line utility for creating and manipulating SQLite databases. | 86 | 60 | 26 | 0 |
| shot-scraper | https://simonwillison.net/tags/shot-scraper/ | [shot-scraper](https://shot-scraper.datasette.io/) is a command-line utility for taking screenshots of websites and scraping content from them using JavaScript. | 25 | 13 | 12 | 0 |
| training-data | https://simonwillison.net/tags/training-data/ | Data used to train LLMs and other machine learning models. | 49 | 8 | 26 | 15 |
| sqlite | https://simonwillison.net/tags/sqlite/ | [SQLite](https://www.sqlite.org/) is the world's [most widely deployed database engine](https://www.sqlite.org/mostdeployed.html). | 286 | 75 | 196 | 15 |
| embeddings | https://simonwillison.net/tags/embeddings/ | See [Embeddings: What they are and why they matter](https://simonwillison.net/2023/Oct/23/embeddings/). | 47 | 7 | 39 | 1 |
| nomic | https://simonwillison.net/tags/nomic/ | [Nomic AI](https://www.nomic.ai) develop various interesting AI projects, including [GPT4All](https://gpt4all.io/) and powerful [embedding models](https://gpt4all.io/). | 5 | 0 | 5 | 0 |
| python | https://simonwillison.net/tags/python/ | The [Python](https://www.python.org/) programming language. | 1085 | 192 | 865 | 28 |
| o1 | https://simonwillison.net/tags/o1/ | OpenAI’s [o1 family](https://openai.com/o1/) of models. | 22 | 5 | 6 | 11 |
| psf | https://simonwillison.net/tags/psf/ | The [Python Software Foundation](https://www.python.org/psf-landing/) is the non-profit organization devoted to advancing open source technology related to the Python programming language. | 12 | 4 | 8 | 0 |
| dsf | https://simonwillison.net/tags/dsf/ | The [Django Software Foundation](https://www.djangoproject.com/foundation/). | 10 | 1 | 8 | 1 |
| ladybird | https://simonwillison.net/tags/ladybird/ | [Ladybird](https://ladybird.org/) is "a truly independent web browser". | 5 | 0 | 4 | 1 |
| beeware | https://simonwillison.net/tags/beeware/ | [BeeWare](https://beeware.org/) "allows you to write your app in Python and release it on multiple platforms". | 3 | 0 | 3 | 0 |
| vs-code | https://simonwillison.net/tags/vs-code/ | Microsoft's partially open source text editor [Visual Studio Code](https://code.visualstudio.com/). | 4 | 0 | 4 | 0 |
| markdown-exfiltration | https://simonwillison.net/tags/markdown-exfiltration/ | **Markdown Exfiltration** is a [prompt injection](https://simonwillison.net/tags/promptinjection/) attack against chatbots that have access to private information and can render Markdown images. An attacker can trick the bot into rendering an image that leaks data encoded in the URL to an external server. | 15 | 3 | 11 | 1 |
| val-town | https://simonwillison.net/tags/val-town/ | [Val Town](https://www.val.town/) provides a web interface for building, running and deploying server-side JavaScript apps. | 7 | 1 | 6 | 0 |
| prompt-injection | https://simonwillison.net/tags/prompt-injection/ | **Prompt Injection** is a security attack against applications built on top of Large Language Models, [introduced here](https://simonwillison.net/2022/Sep/12/prompt-injection/) and further described in [this series of posts](https://simonwillison.net/series/prompt-injection/). | 85 | 23 | 51 | 11 |
| overture | https://simonwillison.net/tags/overture/ | [Overture](https://overturemaps.org/) provides interoperable open map data. | 6 | 1 | 5 | 0 |
| edge-llms | https://simonwillison.net/tags/edge-llms/ | LLMs that can run on consumer hardware like laptops or mobile phones. | 68 | 18 | 48 | 2 |
| llm-pricing | https://simonwillison.net/tags/llm-pricing/ | Posts about the pricing of various LLMs. See also my [pricing calculator](https://tools.simonwillison.net/llm-prices). | 16 | 4 | 12 | 0 |
| smollm | https://simonwillison.net/tags/smollm/ | [SmolLM](https://huggingface.co/blog/smollm) is a series of small Large Language Models by Hugging Face. | 4 | 0 | 4 | 0 |
| structured-extraction | https://simonwillison.net/tags/structured-extraction/ | Using LLMs to extract structured data from unstructured text and images. | 6 | 1 | 5 | 0 |
| pelican-riding-a-bicycle | https://simonwillison.net/tags/pelican-riding-a-bicycle/ | My [benchmark for LLMs](https://simonwillison.net/2024/Oct/25/pelicans-on-a-bicycle/): "Generate an SVG of a pelican riding a bicycle" | 13 | 4 | 9 | 0 |
| claude-artifacts | https://simonwillison.net/tags/claude-artifacts/ | A feature of [Claude](https://claude.ai/) where it can create shareable, executable HTML and JavaScript pages. See [Everything I built with Claude Artifacts this week](https://simonwillison.net/2024/Oct/21/claude-artifacts/) for several detailed examples. | 25 | 8 | 16 | 1 |
| cors | https://simonwillison.net/tags/cors/ | CORS is [Cross-origin resource sharing](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS), a mechanism by which websites can make APIs available to JavaScript running on other domains around the web. | 20 | 9 | 10 | 1 |
| pydantic | https://simonwillison.net/tags/pydantic/ | [Pydantic](https://docs.pydantic.dev/) is a data validation library for Python based around Python's type annotation feature. | 9 | 1 | 7 | 1 |
| ollama | https://simonwillison.net/tags/ollama/ | [Ollama](https://ollama.com/) is a tool for downloading and running Large Language Models. | 13 | 3 | 10 | 0 |
| inference-scaling | https://simonwillison.net/tags/inference-scaling/ | Improving performance of LLMs through inference - spending more tokens "thinking" about a problem, as seen in OpenAI's o1 and Qwen's QwQ. | 16 | 7 | 4 | 5 |
| prompt-caching | https://simonwillison.net/tags/prompt-caching/ | Some LLM providers offer a feature where common prompt prefixes can be cached, providing a performance boost and price reduction. | 9 | 1 | 6 | 2 |
| deepseek | https://simonwillison.net/tags/deepseek/ | [DeepSeek](https://www.deepseek.com/) is an AI lab from Chinese hedge fund [High-Flyer](https://en.wikipedia.org/wiki/High-Flyer_(company)). | 7 | 2 | 5 | 0 |
| code-interpreter | https://simonwillison.net/tags/code-interpreter/ | ChatGPT Code Interpreter is a mode of [ChatGPT](https://chatgpt.com/) that allows it to write and then execute Python code in a sandboxed environment. OpenAI sometimes call this feature [data analysis](https://help.openai.com/en/articles/8437071-data-analysis-with-chatgpt). | 21 | 13 | 8 | 0 |
| chrome | https://simonwillison.net/tags/chrome/ | Google's [Chrome](https://www.google.com/chrome/) browser. | 30 | 3 | 20 | 7 |
| slop | https://simonwillison.net/tags/slop/ | **Slop** describes AI-generated content that is both *unrequested* and *unreviewed*. See [Slop is the new name for unwanted AI-generated content](https://simonwillison.net/2024/May/8/slop/). | 16 | 2 | 11 | 3 |
| llamafile | https://simonwillison.net/tags/llamafile/ | [llamafile](https://github.com/Mozilla-Ocho/llamafile) provides a mechanism to distribute and run LLMs as a single file. | 6 | 4 | 2 | 0 |
| django | https://simonwillison.net/tags/django/ | The [Django](https://www.djangoproject.com/) web framework. | 567 | 101 | 453 | 13 |
| go | https://simonwillison.net/tags/go/ | The [Go](https://go.dev/) programming language. | 36 | 4 | 29 | 3 |
| ai-assisted-programming | https://simonwillison.net/tags/ai-assisted-programming/ | Using AI tools such as Large Language Models to help write code. | 109 | 32 | 65 | 12 |
| command-r | https://simonwillison.net/tags/command-r/ | The [Command R](https://cohere.com/command) family of language models from Cohere. | 2 | 0 | 2 | 0 |
| github-codespaces | https://simonwillison.net/tags/github-codespaces/ | [GitHub Codespaces](https://github.com/features/codespaces) provides cloud development environments accessible through a browser. | 10 | 3 | 7 | 0 |
| annotated-talks | https://simonwillison.net/tags/annotated-talks/ | Annotated versions of talks I have given, with extensive notes and additional links. Here's [how I make these](https://simonwillison.net/2023/Aug/6/annotated-presentations/). | 23 | 23 | 0 | 0 |
| datasette-desktop | https://simonwillison.net/tags/datasette-desktop/ | Datasette as an Electron desktop application, at [datasette.io/desktop](https://datasette.io/desktop) | 6 | 5 | 1 | 0 |
| vision-llms | https://simonwillison.net/tags/vision-llms/ | LLMs that can also be used to interpret images and video, such as GPT-4o, Claude 3 and Gemini Pro. | 35 | 9 | 25 | 1 |
| prompt-engineering | https://simonwillison.net/tags/prompt-engineering/ | The subtle art and craft of effectively prompting and building software on top of LLMs. | 115 | 30 | 64 | 21 |
| github-copilot | https://simonwillison.net/tags/github-copilot/ | [GitHub Copilot](https://github.com/features/copilot) is GitHub's extension for VS Code that uses LLMs to provide context-driven code completion suggestions. | 13 | 4 | 8 | 1 |
| github-actions | https://simonwillison.net/tags/github-actions/ | GitHub's [Actions](https://github.com/features/actions) tool for repository automation. | 47 | 25 | 22 | 0 |
| git-history | https://simonwillison.net/tags/git-history/ | [git-history](https://github.com/simonw/git-history) is a tool for analyzing data scraped to a GitHub repository using SQLite. | 6 | 5 | 1 | 0 |
| djp | https://simonwillison.net/tags/djp/ | [DJP](https://djp.readthedocs.io/) is a plugin system for Django. | 5 | 2 | 3 | 0 |
| git-scraping | https://simonwillison.net/tags/git-scraping/ | [Git scraping](https://simonwillison.net/2020/Oct/9/git-scraping/) is a technique where data is scraped from an external source into a Git repository in order to record changes to that data over time. | 29 | 22 | 7 | 0 |
| django-sql-dashboard | https://simonwillison.net/tags/django-sql-dashboard/ | [Django SQL Dashboard](https://django-sql-dashboard.datasette.io) adds a read-only PostgreSQL SQL dashboard interface to a Django project. | 13 | 8 | 5 | 0 |
| mlx | https://simonwillison.net/tags/mlx/ | The [MLX framework](https://github.com/ml-explore/mlx) for running machine learning models on Apple Silicon. | 7 | 3 | 4 | 0 |
| notebooklm | https://simonwillison.net/tags/notebooklm/ | Google's [NotebookLM](https://notebooklm.google/) "personalized research assistant". | 5 | 1 | 2 | 2 |
| observable-plot | https://simonwillison.net/tags/observable-plot/ | The <a href="https://observablehq.com/plot/">Observable Plot</a> JavaScript visualization library. | 9 | 1 | 8 | 0 |
| gpus | https://simonwillison.net/tags/gpus/ | GPUs aren't just for graphics anymore. | 13 | 0 | 10 | 3 |
| ai-agents | https://simonwillison.net/tags/ai-agents/ | I don't think the term "agents" has a useful, widely shared definition. | 18 | 4 | 9 | 5 |

Duration: 9.74ms
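
To try the query above without access to the original database, one option is to run it against a small in-memory SQLite database. The sketch below is only an illustration under stated assumptions: the table names come from the query itself, but the column types, the sample rows, and the use of SQLite as a stand-in engine are all made up for the demo and say nothing about how the real blog database is defined.

```python
import sqlite3

# The query from this page, reproduced unchanged apart from whitespace.
QUERY = """
WITH tag_counts AS (
  SELECT
    tag,
    description,
    (SELECT count(DISTINCT entry_id)
     FROM blog_entry_tags
     WHERE blog_entry_tags.tag_id = blog_tag.id) AS entry_count,
    (SELECT count(DISTINCT blogmark_id)
     FROM blog_blogmark_tags
     WHERE blog_blogmark_tags.tag_id = blog_tag.id) AS blogmark_count,
    (SELECT count(DISTINCT quotation_id)
     FROM blog_quotation_tags
     WHERE blog_quotation_tags.tag_id = blog_tag.id) AS quotation_count
  FROM blog_tag
  WHERE description IS NOT NULL AND description != ''
)
SELECT
  tag,
  'https://simonwillison.net/tags/' || tag || '/' AS url,
  description,
  (entry_count + blogmark_count + quotation_count) AS post_count,
  entry_count,
  blogmark_count,
  quotation_count
FROM tag_counts
"""


def build_demo_db():
    """Create an in-memory database with a hypothetical minimal schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Assumed column types; only the names appear in the query.
        CREATE TABLE blog_tag (id INTEGER PRIMARY KEY, tag TEXT, description TEXT);
        CREATE TABLE blog_entry_tags (entry_id INTEGER, tag_id INTEGER);
        CREATE TABLE blog_blogmark_tags (blogmark_id INTEGER, tag_id INTEGER);
        CREATE TABLE blog_quotation_tags (quotation_id INTEGER, tag_id INTEGER);

        -- Made-up rows: one described tag, one with an empty description,
        -- and one with a NULL description.
        INSERT INTO blog_tag VALUES
          (1, 'demo', 'A made-up tag'), (2, 'empty', ''), (3, 'nodesc', NULL);
        INSERT INTO blog_entry_tags VALUES (10, 1), (11, 1), (12, 2);
        INSERT INTO blog_blogmark_tags VALUES (20, 1);
        INSERT INTO blog_quotation_tags VALUES (30, 1), (31, 1), (32, 1);
        """
    )
    return conn


def run_query(conn):
    """Run the query and return each result row as a plain dict."""
    conn.row_factory = sqlite3.Row
    return [dict(row) for row in conn.execute(QUERY)]


rows = run_query(build_demo_db())
for row in rows:
    print(row)
```

In this demo only the `demo` tag survives the `WHERE` filter, because the other two have an empty or NULL description. Its `post_count` is the sum of the three per-type counts, the same relationship visible in every row of the results above.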