Simon Willison’s Weblog

Subscribe
Atom feed for datasette Random

1,504 posts tagged “datasette”

Datasette is an open source tool for exploring and publishing data.

2024

Ham radio general exam question pool as JSON. I scraped a pass of my Ham radio general exam this morning. One of the tools I used to help me pass was a Datasette instance with all 429 questions from the official question pool. I've published that raw data as JSON on GitHub, which I converted from the official question pool document using an Observable notebook.

Relevant TIL: How I studied for my Ham radio general exam.

# 11th May 2024, 7:16 pm / json, projects, radio, datasette, observable, ham-radio

datasette-pins — a new Datasette plugin for pinning tables and queries. Alex Garcia built this plugin for Datasette Cloud, and as with almost every Datasette Cloud features we're releasing it as an open source package as well.

datasette-pins allows users with the right permission to "pin" tables, databases and queries to their homepage. It's a lightweight way to customize that homepage, especially useful as your Datasette instance grows to host dozens or even hundreds of tables.

# 9th May 2024, 6:29 pm / plugins, datasette, datasette-cloud, alex-garcia

Weeknotes: more datasette-secrets, plus a mystery video project

Visit Weeknotes: more datasette-secrets, plus a mystery video project

I introduced datasette-secrets two weeks ago. The core idea is to provide a way for end-users to store secrets such as API keys in Datasette, allowing other plugins to access them.

[... 982 words]

Release datasette-upload-dbs 0.3.2 — Upload SQLite database files to Datasette
Release datasette-enrichments 0.4.2 — Tools for running enrichments against data stored in Datasette
Release datasette-enrichments 0.4.1 — Tools for running enrichments against data stored in Datasette
Release datasette-enrichments 0.4 — Tools for running enrichments against data stored in Datasette
Release datasette-secrets 0.2 — Manage secrets such as API keys for use with other Datasette plugins
Release datasette-test 0.3.2 — Utilities to help write tests for Datasette plugins and applications
Release datasette-test 0.3.1 — Utilities to help write tests for Datasette plugins and applications
Release datasette-test 0.3 - release yanked — Utilities to help write tests for Datasette plugins and applications

Food Delivery Leak Unmasks Russian Security Agents. This story is from April 2022 but I realize now I never linked to it.

Yandex Food, a popular food delivery service in Russia, suffered a major data leak.

The data included an order history with names, addresses and phone numbers of people who had placed food orders through that service.

Bellingcat were able to cross-reference this leak with addresses of Russian security service buildings—including those linked to the GRU and FSB.This allowed them to identify the names and phone numbers of people working for those organizations, and then combine that information with further leaked data as part of their other investigations.

If you look closely at the screenshots in this story they may look familiar: Bellingcat were using Datasette internally as a tool for exploring this data!

# 26th April 2024, 1:59 am / data-journalism, datasette, bellingcat

Release datasette-extract 0.1a6 — Import unstructured data (text and images) into structured tables
Release datasette-leaflet-geojson 0.8.2 — Datasette plugin that replaces any GeoJSON column values with a Leaflet map.
Release datasette-secrets 0.1a4 — Manage secrets such as API keys for use with other Datasette plugins
Release datasette-secrets 0.1a3 — Manage secrets such as API keys for use with other Datasette plugins
Release datasette-secrets 0.1a2 — Manage secrets such as API keys for use with other Datasette plugins
Release datasette-edit-schema 0.8a2 — Datasette plugin for modifying table schemas

Weeknotes: Llama 3, AI for Data Journalism, llm-evals and datasette-secrets

Visit Weeknotes: Llama 3, AI for Data Journalism, llm-evals and datasette-secrets

Llama 3 landed on Thursday. I ended up updating a whole bunch of different plugins to work with it, described in Options for accessing Llama 3 from the terminal using LLM.

[... 1,030 words]

Release datasette-secrets 0.1a1 — Manage secrets such as API keys for use with other Datasette plugins
Release datasette-secrets 0.1a0 — Manage secrets such as API keys for use with other Datasette plugins

AI for Data Journalism: demonstrating what we can do with this stuff right now

Visit AI for Data Journalism: demonstrating what we can do with this stuff right now

I gave a talk last month at the Story Discovery at Scale data journalism conference hosted at Stanford by Big Local News. My brief was to go deep into the things we can use Large Language Models for right now, illustrated by a flurry of demos to help provide starting points for further conversations at the conference.

[... 6,081 words]

Release datasette-cors 1.0.1 — Datasette plugin for configuring CORS headers
Release datasette-enrichments-gpt 0.4 — Datasette enrichment for analyzing row data using OpenAI's GPT models

Extracting data from unstructured text and images with Datasette and GPT-4 Turbo. Datasette Extract is a new Datasette plugin that uses GPT-4 Turbo (released to general availability today) and GPT-4 Vision to extract structured data from unstructured text and images.

I put together a video demo of the plugin in action today, and posted it to the Datasette Cloud blog along with screenshots and a tutorial describing how to use it.

# 9th April 2024, 11:03 pm / projects, ai, datasette, datasette-cloud, openai, generative-ai, gpt-4, llms, vision-llms, structured-extraction, gpt

Release datasette-public 0.2.3 — Make selected Datasette databases and tables visible to the public
Release datasette-enrichments 0.3.2 — Tools for running enrichments against data stored in Datasette
Release datasette-cors 1.0 — Datasette plugin for configuring CORS headers
Release datasette-embeddings 0.1a3 — Store and query embedding vectors in Datasette tables