Simon Willison on datasette

437 posts tagged “datasette”

Datasette is an open source tool for exploring and publishing data.

2021

Weeknotes: datasette-tiddlywiki, filters_from_request

I made some good progress on the big refactor this week, including extracting some core logic out into a new Datasette plugin hook. I also got distracted by TiddlyWiki and released a new Datasette plugin that lets you run TiddlyWiki inside Datasette.

[... 1,197 words]

7:08 am / 24th December 2021 / performance, tiddlywiki, datasette, plugins, projects, gis, weeknotes

Weeknotes: Trapped in an eternal refactor

I’m still working on refactoring Datasette’s table view. In doing so I spun out a new plugin, datasette-pretty-traces, which improves Datasette’s tooling for seeing the SQL that was executed to build a specific page.

[... 544 words]

9:01 pm / 16th December 2021 / datasette, weeknotes

Weeknotes: git-history, bug magnets and s3-credentials --public

I’ve stopped considering my projects “shipped” until I’ve written a proper blog entry about them, so yesterday I finally shipped git-history, coinciding with the release of version 0.6—a full 27 days after the first 0.1.

[... 1,013 words]

9:34 pm / 8th December 2021 / git-history, s3-credentials, datasette, weeknotes

git-history: a tool for analyzing scraped data collected using Git and SQLite

I described Git scraping last year: a technique for writing scrapers where you periodically snapshot a source of data to a Git repository in order to record changes to that source over time.

[... 2,002 words]

10:32 pm / 7th December 2021 / scraping, projects, sqlite, datasette, git-history, data-journalism, git, cli

Weeknotes: Shaving some beautiful yaks

I’ve been mostly shaving yaks this week—two in particular: the Datasette table refactor and the next release of git-history. I also built and released my first Web Component!

[... 1,307 words]

3:43 am / 1st December 2021 / web-components, datasette, projects, git-scraping, npm, weeknotes

Weeknotes: Apache proxies in Docker containers, refactoring Datasette

Updates to six major projects this week, plus finally some concrete progress towards Datasette 1.0.

[... 1,630 words]

5:43 am / 22nd November 2021 / refactoring, docker, apache, datasette, proxies, supervisord, weeknotes, sqlite-utils, git-scraping

Weeknotes: git-history, created for a Git scraping workshop

My main project this week was a 90 minute workshop I delivered about Git scraping at Coda.Br 2021, a Brazilian data journalism conference, on Friday. This inspired the creation of a brand new tool, git-history, plus smaller improvements to a range of other projects.

[... 1,239 words]

4:10 am / 15th November 2021 / sqlite-utils, projects, teaching, git-history, s3-credentials, git-scraping, datasette, my-talks, s3, aws, weeknotes

Datasette is four years old today. I marked the occasion with a short Twitter thread about the project so far.

# 13th November 2021, 6:14 pm / datasette

AWS IAM definitions in Datasette (via) As part of my ongoing quest to conquer IAM permissions, I built myself a Datasette instance that lets me run queries against all 10,441 permissions across 280 AWS services. It’s deployed by a build script running in GitHub Actions which downloads an 8.9MB JSON file from the Salesforce policy_sentry repository. policy_sentry itself creates that JSON file by running an HTML scraper against the official AWS documentation!

# 6th November 2021, 3:47 am / aws, datasette

Weeknotes: datasette-jupyterlite, s3-credentials and a Python packaging talk

My big project this week was s3-credentials, described yesterday, but I also put together a fun experimental Datasette plugin bundling JupyterLite and wrote up my PyGotham talk on Python packaging.

[... 476 words]

5:04 am / 5th November 2021 / jupyter, webassembly, datasette, projects, weeknotes, pyodide

Weeknotes: Learning Kubernetes, learning Web Components

I’ve been mainly climbing the learning curve for Kubernetes and Web Components this week. I also released Datasette 0.59.1 with Python 3.10 compatibility and an updated Docker image.

[... 1,101 words]

2:44 am / 28th October 2021 / docker, web-components, datasette, kubernetes, weeknotes

Datasette 0.59: The annotated release notes

Datasette 0.59 is out, with a miscellaneous grab-bag of improvements. Here are the annotated release notes.

[... 2,103 words]

4:59 am / 19th October 2021 / releasenotes, datasette, weeknotes, annotated-release-notes

Where does all the effort go? Looking at Python core developer activity (via) Łukasz Langa used Datasette to explore 28,780 pull requests made to the CPython GitHub repository, using some custom Python scripts (and sqlite-utils) to load in the data.

# 18th October 2021, 8:21 pm / lukasz-langa, sqlite-utils, datasette, python

Weeknotes number 100

This entry marks my 100th weeknotes, which I’ve managed to post once a week (plus or minus a few days) consistently since 13th September 2019.

[... 593 words]

1:34 am / 19th September 2021 / projects, datasette-desktop, datasette, weeknotes

Datasette Desktop 0.2.0: The annotated release notes

Datasette Desktop is a new macOS desktop application version of Datasette, an “open source multi-tool for exploring and publishing data” built on top of SQLite. I released the first version last week—I’ve just released version 0.2.0 (and a 0.2.1 bug fix) with a whole bunch of critical improvements.

[... 2,208 words]

11:30 pm / 13th September 2021 / electron, releasenotes, datasette-desktop, datasette, projects, annotated-release-notes

Datasette Desktop—a macOS desktop application for Datasette

I just released version 0.1.0 of the new Datasette macOS desktop application, the first version that end-users can easily install. I would very much appreciate your help testing it out!

[... 1,761 words]

7:15 pm / 8th September 2021 / projects, electron, github-actions, datasette, weeknotes, datasette-desktop

Datasette Desktop 0.1.0 (via) This is the first installable version of the new Datasette Desktop macOS application I’ve been building. Please try it out and leave feedback on Twitter or on the GitHub Discussions thread linked from the release notes.

# 8th September 2021, 5:14 am / projects, electron, datasette, datasette-desktop

Building a desktop application for Datasette (and weeknotes)

This week I started experimenting with a desktop application version of Datasette—with the goal of providing people who aren’t comfortable with the command-line the ability to get Datasette up and running on their own personal computers.

[... 1,423 words]

5:13 am / 30th August 2021 / electron, projects, datasette, sqlite-utils, weeknotes, datasette-desktop

Weeknotes: Getting my personal Dogsheep up and running again

I gave a talk about Dogsheep at Noisebridge’s Five Minutes of Fame on Thursday. Just one problem: my regular Dogsheep demo was broken, so I ended up building it from scratch again. In doing so I fixed a few bugs in some Dogsheep tools.

[... 1,311 words]

4:46 am / 22nd August 2021 / datasette, linux, nginx, dogsheep, weeknotes

Datasette on Codespaces, sqlite-utils API reference documentation and other weeknotes

This week I broke my streak of not sending out the Datasette newsletter, figured out how to use Sphinx for Python class documentation, worked out how to run Datasette on GitHub Codespaces, implemented Datasette column metadata and got tantalizingly close to a solution for an elusive Datasette feature.

[... 2,164 words]

4:57 am / 14th August 2021 / sql, sqlite, datasette, sqlite-utils, documentation, github, weeknotes, mypy, github-codespaces

Everything new in Datasette since January, plus Django SQL Dashboard. I sent out the first Datasette newsletter since late January, covering everything that’s new in Datasette and sqlite-utils this year and introducing Django SQL Dashboard.

# 10th August 2021, 1:28 am / sqlite-utils, django-sql-dashboard, datasette

Stanford School Enrollment Project (via) This is Project Pelican: I’ve been working with the Big Local News team at Stanford helping bundle up and release the data they’ve been collecting on school enrollment statistics around the USA. This Datasette instance has data from 33 states for every year since 2015—3.3m rows total. Be sure to check out the accompanying documentation!

# 8th August 2021, 12:23 am / journalism, datasette, data-journalism

Apply conversion functions to data in SQLite columns with the sqlite-utils CLI tool

Earlier this week I released sqlite-utils 3.14 with a powerful new command-line tool: sqlite-utils convert, which applies a conversion function to data stored in a SQLite column.

[... 1,941 words]

6:05 am / 6th August 2021 / performance, projects, sqlite, datasette, sqlite-utils, data-science, weeknotes, cli

Weeknotes: datasette-remote-metadata, sqlite-transform --multi

I mentioned Project Pelican (still a codename until the end of the embargo) last week. This week it inspired a new plugin, datasette-remote-metadata.

[... 595 words]

7:32 pm / 29th July 2021 / sqlite, plugins, datasette, projects, weeknotes

The Baked Data architectural pattern

I’ve been exploring an architectural pattern for publishing websites over the past few years that I call the “Baked Data” pattern. It provides many of the advantages of static site generators while avoiding most of their limitations. I think it deserves to be used more widely.

[... 1,896 words]

8:23 pm / 28th July 2021 / staticgenerator, datasette, baked-data, sqlite, design-patterns

Datasette—an ecosystem of tools for working with small data

This is the transcript and video from a talk I gave at PyGotham 2020 about using SQLite, Datasette and Dogsheep to work with small data.

[... 4,655 words]

6:13 pm / 22nd July 2021 / dogsheep, my-talks, datasette, sqlite, small-data, annotated-talks

Weeknotes: sqlite-transform 1.1, Datasette 0.58.1, datasette-graphql 1.5

Work on Project Pelican inspires new features and improvements across a number of different projects.

[... 1,419 words]

9:29 pm / 21st July 2021 / sqlite, datasette, projects, data-journalism, graphql, weeknotes

Datasette downloads per day (with Observable Plot) (via) I built an Observable notebook that imports PyPI package download data from datasette.io (itself scraped from pypistats.org using a scheduled GitHub Action) and plots it using Observable Plot. Datasette downloads from PyPI apparently jumped from ~800/day in May to ~4,000/day in July—would love to know why!

# 17th July 2021, 5:01 pm / observable, datasette, observable-plot

Datasette 0.58: The annotated release notes

I released Datasette 0.58 last night, with new plugin hooks, Unix domain socket support, a major faceting performance fix and a few other improvements. Here are the annotated release notes.

[... 1,062 words]

2:21 am / 16th July 2021 / csrf, releasenotes, sqlite, datasette, annotated-release-notes, d-richard-hipp, samesite

Weeknotes: Fun with Unix domain sockets

A small enhancement to Datasette this week: I’ve added support for proxying via Unix domain sockets.

[... 809 words]

6:57 pm / 13th July 2021 / proxy, datasette, weeknotes
