Simon Willison’s Weblog

1,455 posts tagged “datasette”

Datasette is an open source tool for exploring and publishing data.

2023

Release datasette-pretty-json 0.3 — Datasette plugin that pretty-prints any column values that are valid JSON objects or arrays
Release datasette-enrichments-re2 0.1a0 — Enrich data using regular expressions powered by re2
Release datasette-enrichments 0.1a5 — Tools for running enrichments against data stored in Datasette
Release datasette-enrichments-gpt 0.1a0 — Datasette enrichment for analyzing row data using OpenAI's GPT models

Weeknotes: DevDay, GitHub Universe, OpenAI chaos

Three weeks of conferences and Datasette Cloud work, four days of chaos for OpenAI.

[... 766 words]

Release datasette-sentry 0.4 — Datasette plugin for configuring Sentry
Release datasette-enrichments-jinja 0.1a1 — Datasette enrichment for evaluating templates in a Jinja sandbox
Release datasette-enrichments 0.1a4 — Tools for running enrichments against data stored in Datasette
Release datasette-enrichments 0.1a3 — Tools for running enrichments against data stored in Datasette
Release datasette-enrichments 0.1a2 — Tools for running enrichments against data stored in Datasette
Release datasette-enrichments-jinja 0.1a0 — Datasette enrichment for evaluating templates in a Jinja sandbox
Release datasette-enrichments 0.1a1 — Tools for running enrichments against data stored in Datasette
Release datasette-enrichments 0.1a0 — Tools for running enrichments against data stored in Datasette

Financial sustainability for open source projects at GitHub Universe

I presented a ten-minute segment at GitHub Universe on Wednesday, ambitiously titled Financial sustainability for open source projects.

[... 2,485 words]

Release datasette-edit-schema 0.7.1 — Datasette plugin for modifying table schemas
Release datasette-edit-schema 0.7 — Datasette plugin for modifying table schemas
Release datasette-ripgrep 0.8.2 — Web interface for searching your code using ripgrep, built as a Datasette plugin

Weeknotes: PyBay, AI Engineer Summit, Datasette metadata and JavaScript plugins

I’ve had a bit of a slow two weeks in terms of building things and writing code, thanks mainly to a couple of conference appearances. I did review and land a couple of major contributions to Datasette though.

[... 564 words]

Release datasette-llm-embed 0.2 — Datasette plugin adding a llm_embed(model_id, text) SQL function
Release datasette 0.64.5 — An open source multi-tool for exploring and publishing data
Release datasette-llm-embed 0.1 — Datasette plugin adding a llm_embed(model_id, text) SQL function
Release datasette-llm-embed 0.1a1 — Datasette plugin adding a llm_embed(model_id, text) SQL function

Weeknotes: the Datasette Cloud API, a podcast appearance and more

Datasette Cloud now has a documented API, plus a podcast appearance, some LLM plugins work and some geospatial excitement.

[... 1,243 words]

Getting started with the Datasette Cloud API. I wrote an introduction to the Datasette Cloud API for the company blog, with a tutorial showing how to use Python and GitHub Actions to import data from the Federal Register into a table in Datasette Cloud, then configure full-text search against it.
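To give a rough idea of what an import step like that can look like, here is a minimal sketch that builds (but does not send) a request to create a table from a list of rows. It assumes the `/<database>/-/create` JSON write endpoint from the Datasette 1.0 alphas and a bearer API token; the space URL, database name, table name, token and sample rows are all hypothetical placeholders, not values from the tutorial.

```python
import json
import urllib.request


def build_create_request(base_url, database, table, rows, token, pk="id"):
    """Build a POST request that creates `table` and inserts `rows`.

    Assumption: the instance exposes the Datasette 1.0 alpha write API at
    /<database>/-/create and accepts an "Authorization: Bearer" token.
    """
    url = f"{base_url.rstrip('/')}/{database}/-/create"
    body = json.dumps({"table": table, "rows": rows, "pk": pk}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Hypothetical sample rows standing in for Federal Register documents.
sample_rows = [
    {"id": "doc-1", "title": "Example notice", "publication_date": "2023-09-28"},
    {"id": "doc-2", "title": "Example rule", "publication_date": "2023-09-28"},
]

request = build_create_request(
    "https://example-space.datasette.cloud",  # hypothetical space URL
    "data",                                   # hypothetical database name
    "documents",                              # hypothetical table name
    sample_rows,
    token="dstok_placeholder",                # hypothetical token
)

# To actually send it: urllib.request.urlopen(request)
```

Building the request separately from sending it keeps the payload easy to inspect, which is handy when the same code runs unattended in a scheduled job.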

# 28th September 2023, 11:05 pm / datasette-cloud, apis, datasette

Release datasette-remote-actors 0.1a2 — Datasette plugin for fetching details of actors from a remote endpoint
Release datasette-auth-tokens 0.4a4 — Datasette plugin for authenticating access using API tokens

Geospatial SQL queries in SQLite using TG, sqlite-tg and datasette-sqlite-tg. Alex Garcia built sqlite-tg—a SQLite extension that uses the brand new TG geospatial library to provide a whole suite of custom SQL functions for working with geospatial data.

Here are my notes on trying out his initial alpha releases. The extension already provides tools for converting between GeoJSON, WKT and WKB, plus the all-important tg_intersects() function for testing whether two geometries, such as a polygon and a point, overlap each other.

It’s pretty useful already. Without any geospatial indexing at all I was still able to get 700ms replies to a brute-force point-in-polygon query against 150MB of GeoJSON timezone boundaries stored as JSON text in a table.
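The shape of a brute-force point-in-polygon query like that can be sketched without the extension. The example below does not use sqlite-tg or tg_intersects(); as a stand-in it registers a pure Python ray-casting function as a custom SQL function using the standard library sqlite3 module. The function name `contains_point`, the `zones` table and the two square polygons are invented for illustration.

```python
import json
import sqlite3


def point_in_ring(x, y, ring):
    """Ray casting: count edge crossings of a horizontal ray from (x, y)."""
    inside = False
    j = len(ring) - 1
    for i in range(len(ring)):
        xi, yi = ring[i][0], ring[i][1]
        xj, yj = ring[j][0], ring[j][1]
        # Only edges that straddle the ray's y coordinate can be crossed.
        if (yi > y) != (yj > y):
            x_cross = (xj - xi) * (y - yi) / (yj - yi) + xi
            if x < x_cross:
                inside = not inside
        j = i
    return inside


def point_in_polygon(x, y, rings):
    """First ring is the exterior boundary, any further rings are holes."""
    if not point_in_ring(x, y, rings[0]):
        return False
    return not any(point_in_ring(x, y, hole) for hole in rings[1:])


def contains_point(geojson_text, lon, lat):
    """SQL function: 1 if the GeoJSON (Multi)Polygon contains the point."""
    geometry = json.loads(geojson_text)
    if geometry["type"] == "Polygon":
        polygons = [geometry["coordinates"]]
    elif geometry["type"] == "MultiPolygon":
        polygons = geometry["coordinates"]
    else:
        return 0
    return int(any(point_in_polygon(lon, lat, rings) for rings in polygons))


def make_demo_db():
    """In-memory database with GeoJSON stored as plain text in a column."""
    conn = sqlite3.connect(":memory:")
    conn.create_function("contains_point", 3, contains_point)
    conn.execute("create table zones (name text, geometry text)")
    squares = {
        "west": [[[0, 0], [10, 0], [10, 10], [0, 10], [0, 0]]],
        "east": [[[10, 0], [20, 0], [20, 10], [10, 10], [10, 0]]],
    }
    for name, coordinates in squares.items():
        geometry = {"type": "Polygon", "coordinates": coordinates}
        conn.execute("insert into zones values (?, ?)", (name, json.dumps(geometry)))
    return conn


def zones_for_point(conn, lon, lat):
    """Brute force: every row's geometry is parsed and tested, no index."""
    sql = "select name from zones where contains_point(geometry, ?, ?) order by name"
    return [row[0] for row in conn.execute(sql, (lon, lat))]


conn = make_demo_db()
print(zones_for_point(conn, 5, 5))
```

Every row is parsed and tested on each query, which is the same full-scan pattern described above, just with a much slower pure Python test in place of the compiled TG library.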

# 25th September 2023, 7:45 pm / datasette, geospatial, sqlite, alex-garcia, gis, geojson, tg

Release datasette 1.0a7 — An open source multi-tool for exploring and publishing data
Release datasette 0.64.4 — An open source multi-tool for exploring and publishing data
Release datasette-upload-dbs 0.3.1 — Upload SQLite database files to Datasette