Simon Willison’s Weblog

Subscribe
Atom feed

Elsewhere

Filters: Sorted by date

Release datasette-events-forward 0.1a3 — Forward Datasette analytical events on to another Datasette instance
Release datasette-tail 0.1a2 — Tools for tailing your database
Research h3-library-benchmark — A systematic performance benchmark was conducted on two prominent Python libraries implementing Uber's H3 geospatial indexing system: h3-py (official, C-based) and h3o-python (Rust-based). Results show h3o-python consistently outperforms h3-py on core operations, achieving over 2x speedup for coordinate conversions and up to 13x faster neighbor queries, while area calculations remain comparable.
Research sqlite-query-linter — The SQLite Query Linter is a lightweight Python library that wraps the standard `sqlite3` module to provide configurable linting and rule-based analysis of SQL queries before execution. Acting as a drop-in replacement, it helps catch common syntax errors and platform incompatibilities—such as invalid types in `CAST`, use of unsupported functions, `SELECT *`, missing `WHERE` clauses, and string quoting mistakes—helping developers avoid runtime errors and improve code quality.
Release datasette-search-all 1.1.5a0 — Datasette plugin for searching all searchable tables at once
Release datasette 1.0a20 — An open source multi-tool for exploring and publishing data
Research h3o-python — h3o-python delivers efficient Python bindings for the h3o Rust library, enabling fast and convenient access to H3 geospatial indexing from Python. Utilizing PyO3 and packaged with maturin, it allows encoding geographic coordinates into 64-bit H3 cell indexes, decoding indexes, performing neighborhood queries, calculating great-circle distances, and retrieving surface area metrics—all without requiring a separate H3 installation.
Research wazero-python-claude — Wazero Python Bindings enable seamless integration of the wazero WebAssembly runtime—written in Go—with Python applications, delivering a zero-dependency solution for running WASM modules natively from Python. The project exposes a clean, Pythonic API for instantiating modules, calling exported WASM functions, and managing resources efficiently with context managers. Performance benchmarks demonstrate rapid execution and minimal overhead between Python and WASM.
Release llm-minimax 0.1 — LLM plugin for accessing MiniMax models via their API
Release llm-anthropic 0.21.1 — LLM access to models by Anthropic, including the Claude series
Release llm-anthropic 0.21 — LLM access to models by Anthropic, including the Claude series
Tool Badge Interactive REPL — Interact with a MicroPython device via the Web Serial API to execute Python commands in real-time through a browser-based REPL interface. This tool enables direct communication with compatible microcontroller boards, allowing users to run Python code, query system information, and manage files without requiring terminal software or drivers. The interface provides quick-access buttons for common operations like listing files, checking CPU frequency, and monitoring available memory.
Tool 24×24 Icon Editor — # Documentation
Tool GitHub GraphQL Explorer — Explore GitHub's GraphQL API using an interactive query interface that requires authentication via a personal access token. Enter your GitHub token to connect and start writing, executing, and testing GraphQL queries against GitHub's data in real-time. The explorer provides syntax highlighting, query validation, and response inspection capabilities within an embedded GraphiQL environment.
Tool GitHub Rate Limit Checker — Monitor your GitHub API usage and remaining rate limits with this authentication-based checker. After authenticating with your GitHub account, the tool displays detailed information about your API quotas across different resource types, including remaining calls, reset times, and visual progress indicators. The interface shows critical warnings when your limits are running low, helping you manage your API consumption effectively.
Release datasette-pretty-traces 0.6 — Prettier formatting for ?_trace=1 traces
Research blog-tags-scikit-learn — Automatically assigning meaningful tags to historic, untagged blog posts, this project leverages the Simon Willison blog database and scikit-learn to train and compare multi-label text classification models. Four approaches—TF-IDF + Logistic Regression, Multinomial Naive Bayes, Random Forest, and LinearSVC—were tested on posts’ title and body text using the 158 most frequently used tags.
Research datasette-plugin-skill — Covering every aspect of Datasette plugin development, this project creates a comprehensive skill set for authors—from bootstrapping with cookiecutter to deploying on GitHub and PyPI. It provides precise guides and working code samples for essential plugin hooks like custom SQL functions, authentication, custom views, and output formats.
Tool Terminal to HTML — Convert terminal output into shareable HTML documents with support for colored text formatting. Paste terminal output in RTF, HTML, or plain text format, and the tool instantly generates clean HTML code ready for preview or export. Save your conversions as GitHub Gists for easy sharing and collaboration.
Release pytest-unused-port 0.2 — pytest fixture finding an unused local port
Research python-markdown-comparison — Comparing seven prominent Python markdown libraries, cmarkgfm—bindings to GitHub’s C-based CommonMark/GFM parser—proved dramatically faster (10-50x) than pure Python options such as mistune, Python-Markdown, and marko. The benchmark, spanning small to large markdown documents, consistently found cmarkgfm excels in both speed and stability, making it ideal for high-volume or performance-critical applications.
Research cmarkgfm-in-pyodide — By rewriting cmarkgfm's bindings from CFFI to the Python C API, the project successfully ported GitHub's cmark-gfm Markdown parser to Pyodide. The resulting wheel is fully functional, requires no further building, and supports all GitHub Flavored Markdown features with high performance, thanks to direct C code execution via WebAssembly.
Release datasette-events-forward 0.1a2 — Forward Datasette analytical events on to another Datasette instance
Release datasette-debug-actors-from-ids 0.1a2 — Datasette plugin for trying out the actors_from_ids hook
Release datasette-visible-internal-db 0.1.1a1 — Datasette plugin for making the internal database visible for debugging
Release datasette-create-view 0.1a1 — Create a SQL view from a query
Release datasette-studio 0.1a5 — Datasette pre-configured with useful plugins. Experimental alpha.
Release datasette-load 0.1a4 — API and UI for bulk loading data into Datasette from a URL
Release datasette-import 0.1a6 — Tools for importing data into Datasette

Beats

Years

Tags