Simon Willison’s Weblog


July 2020

88 posts: 6 entries, 21 links, 7 quotes, 54 beats

July 13, 2020

Release datasette-auth-passwords 0.2a — Datasette plugin for authentication using passwords
Release datasette-auth-passwords 0.2 — Datasette plugin for authentication using passwords
Release datasette-auth-passwords 0.2.1 — Datasette plugin for authentication using passwords
Release datasette-auth-passwords 0.3 — Datasette plugin for authentication using passwords

datasette-auth-passwords. My latest plugin: datasette-auth-passwords provides a mechanism for signing into Datasette using a username and password (which is verified in order to set a ds_actor authentication cookie). So far it only supports passwords that are hard-coded into Datasette’s configuration via environment variables, but I plan to add database-backed user accounts in the future.

# 11:39 pm / authentication, passwords, plugins, projects, datasette

July 14, 2020

TIL Password hashing in Python with pbkdf2 — I was researching password hashing for [datasette-auth-passwords](https://github.com/simonw/datasette-auth-passwords). I wanted very secure defaults that would work using the Python standard library without any extra dependencies.
Release datasette-plugin-demos 0.4 — Some example plugins for Datasette
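
The pbkdf2 TIL above asks for secure defaults using only the Python standard library. A minimal sketch of that idea follows. The function names, the `algorithm$iterations$salt$hash` string layout, and the iteration count are illustrative assumptions, not necessarily what the TIL or datasette-auth-passwords uses.

```python
import base64
import hashlib
import hmac
import secrets

# Assumed values for illustration; tune the iteration count for your hardware.
ALGORITHM = "pbkdf2_sha256"
ITERATIONS = 260_000


def hash_password(password, salt=None, iterations=ITERATIONS):
    """Return a self-describing hash string: algorithm$iterations$salt$hash."""
    if salt is None:
        # 16 random bytes, hex encoded, so the salt never contains "$"
        salt = secrets.token_hex(16)
    digest = hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt.encode("utf-8"), iterations
    )
    b64_hash = base64.b64encode(digest).decode("ascii")
    return f"{ALGORITHM}${iterations}${salt}${b64_hash}"


def verify_password(password, password_hash):
    """Recompute the hash with the stored salt and compare in constant time."""
    if password_hash.count("$") != 3:
        return False
    algorithm, iterations, salt, _ = password_hash.split("$", 3)
    if algorithm != ALGORITHM:
        return False
    try:
        iterations = int(iterations)
    except ValueError:
        return False
    expected = hash_password(password, salt, iterations)
    # hmac.compare_digest avoids leaking information through timing
    return hmac.compare_digest(password_hash, expected)
```

Storing the salt and iteration count inside the hash string means the work factor can be raised later without invalidating existing hashes, since each stored value records the parameters it was created with.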

July 17, 2020

Develomentor podcast: Simon Willison – Data Journalism, The Importance of Side Projects. Grant Ingersoll interviewed me for the Develomentor podcast. We talked about my career so far, and how much of it was driven by side-projects that I've worked on individually or with Natalie.

Here's the mp3, or access it through Buzzsprout.

# 1:33 am / data-journalism, interview, podcasts, careers, podcast-appearances

Weeknotes: datasette-auth-passwords, a Datasette logo and a whole lot more

All sorts of project updates this week.

[... 913 words]

Release datasette-insert 0.1a — Datasette plugin for inserting and updating data

July 18, 2020

Tempering Expectations for GPT-3 and OpenAI’s API. Insightful commentary on GPT-3 (which is producing some ridiculously cool demos at the moment thanks to the invite-only OpenAI API) from Max Woolf.

# 7:29 pm / machine-learning, ai, max-woolf, gpt-3, openai, generative-ai, llms

When I was curating my generated tweets, I estimated 30-40% of the tweets were usable comedically, a massive improvement over the 5-10% usability from my GPT-2 tweet generation. However, a 30-40% success rate implies a 60-70% failure rate, which is patently unsuitable for a production application.

Max Woolf

# 7:33 pm / machine-learning, ai, max-woolf, llms, gpt-2

Release github-to-sqlite 2.4 — Save data from GitHub to a SQLite database
Release twitter-to-sqlite 0.21.2 — Save data from Twitter to a SQLite database

July 19, 2020

Instead of seeing instrumentation as a last-ditch effort of strings and metrics, we must think about propagating the full context of a request and emitting it at regular pulses. No pull request should ever be accepted unless the engineer can answer the question, “How will I know if this breaks?”

Charity Majors

# 4:05 pm / observability, charity-majors

Release datasette-insert 0.2a — Datasette plugin for inserting and updating data

July 20, 2020

Release datasette-insert 0.3 — Datasette plugin for inserting and updating data
Release datasette-auth-tokens 0.2.2 — Datasette plugin for authenticating access using API tokens
TIL Upgrading a Heroku PostgreSQL database with pg:copy — Figured this out in https://github.com/simonw/simonwillisonblog/issues/132 - I was running PostgreSQL 9.x and I wanted 11.x in order to use the new `search_type="websearch"` option in Django 3.1.

22 Principles for Great Product Managers. By Alex Reeve, a PM at LinkedIn. These are really strong. I particularly liked the “leading your team” section which emphasizes ensuring your team understand the goal and the path to reach it, and that you know what winning will look like and how to tell.

# 8:17 pm / management, product-management

Quite simply, it’s the product manager’s job to articulate two simple things:

  • What game are we playing?
  • How do we keep score?

Do these two things right, and all of a sudden a collection of brilliant individual contributors with talents in engineering, operations, quality, design and marketing will start running in the same direction. Without it, no amount of prioritization or execution management will save you.

Adam Nash

# 8:33 pm / product-management

July 21, 2020

TIL Registering temporary pluggy plugins inside tests — While implementing more finely-grained permissions for `datasette-insert-api` ([issue 8](https://github.com/simonw/datasette-insert-api/issues/8)) I decided I wanted to register a Datasette pluggy plugin for the duration of a single test.
Release datasette-insert 0.4 — Datasette plugin for inserting and updating data
Release datasette-copyable 0.1 — Datasette plugin for outputting tables in formats suitable for copy and paste
Release datasette-copyable 0.2 — Datasette plugin for outputting tables in formats suitable for copy and paste

July 22, 2020

You always get the name of the dog, the editor explained. The dog is a character in your story, and names tell readers a lot about your characters. It’s a crucial storytelling detail, and if you’re alert and inquisitive enough to ask for the name of the dog, you’ll surely not miss any other important details.

Justin Willett

# 2:29 pm / journalism

Release datasette-clone 0.5 — Create a local copy of database files from a Datasette instance

The unofficial Google Cloud Run FAQ. This is really useful: a no-fluff, content-rich explanation of Google Cloud Run hosted as a GitHub repo that actively accepts pull requests from the community. It’s maintained by Ahmet Alp Balkan, a Cloud Run engineer who states “Googlers: If you find this repo useful, you should recognize the work internally, as I actively fight for alternative forms of content like this”. One of the hardest parts of working with AWS and GCP is digging through the marketing materials to figure out what the product actually does, so the more alternative forms of documentation like this the better.

# 5:20 pm / documentation, google, cloudrun

TIL Updating a Markdown table of contents with a GitHub Action — [markdown-toc](https://github.com/jonschlinkert/markdown-toc) is a Node script that parses a Markdown file and generates a table of contents for it, based on the headings.

July 23, 2020

Release twitter-to-sqlite 0.21.3 — Save data from Twitter to a SQLite database
