Weeknotes: DevDay, GitHub Universe, OpenAI chaos
22nd November 2023
Three weeks of conferences and Datasette Cloud work, four days of chaos for OpenAI.
The second week of November was chaotically busy for me. On the Monday I attended the OpenAI DevDay conference, which saw a bewildering array of announcements. I shipped LLM 0.12 that day with support for the brand new GPT-4 Turbo model (2-3x cheaper than GPT-4, faster and with a new increased 128,000 token limit), and built ospeak that evening as a CLI tool for working with their excellent new text-to-speech API.
On Tuesday I recorded a podcast episode with the Latent Space crew talking about what was released at DevDay, and attended a GitHub Universe pre-summit for open source maintainers.
Then on Wednesday I spoke at GitHub Universe itself. I published a full annotated version of my talk here: Financial sustainability for open source projects at GitHub Universe. It was only ten minutes long but it took a lot of work to put together—ten minutes requires a lot of editing and planning to get right.
(I later used the audio from that talk to create a cloned version of my voice, with shockingly effective results!)
With all of my conferences for the year out of the way, I spent the next week working with Alex Garcia on Datasette Cloud. Alex has been building out datasette-comments, an excellent new plugin which will allow Datasette users to collaborate on data by leaving comments on individual rows—ideal for collaborative investigative reporting.
Meanwhile I’ve been putting together the first working version of enrichments—a feature I’ve been threatening to build for a couple of years now. The key idea here is to make it easy to apply enrichment operations—geocoding, language model prompt evaluation, OCR etc—to rows stored in Datasette. I’ll have a lot more to share about this soon.
The biggest announcement at OpenAI DevDay was GPTs—the ability to create and share customized GPT configurations. It took me another week to fully understand those, and I wrote about my explorations in Exploring GPTs: ChatGPT in a trench coat?.
And then last Friday everything went completely wild, when the board of directors of the non-profit that controls OpenAI fired Sam Altman over a vague accusation that he was “not consistently candid in his communications with the board”.
It’s four days later now and the situation is still shaking itself out. It inspired me to write about a topic I’ve wanted to publish for a while though: Deciphering clues in a news article to understand how it was reported.
sqlite-utils 3.35.2 and shot-scraper 1.3
I’ll duplicate the full release notes for two of my projects here, because I want to highlight the contributions from external developers.
--load-extension=spatialiteoption and find_spatialite() utility function now both work correctly on
arm64Linux. Thanks, Mike Coats. (#599)
- Fix for bug where
sqlite-utils insertcould cause your terminal cursor to disappear. Thanks, Luke Plant. (#433)
datetime.timedeltavalues are now stored as
TEXTcolumns. Thanks, Harald Nezbeda. (#522)
- Test suite is now also run against Python 3.12.
- Screenshots taken using
shot-scraper --interactive $URL—which allows you to interact with the page in a browser window and then hit
<enter>to take the screenshot—it no longer reloads the page before taking the shot (which ignored your activity). #125
- Improved accessibility of documentation. Thanks, Paolo Melchiorre. #120
Releases these weeks
Datasette plugin for configuring Sentry
Tools for running enrichments against data stored in Datasette
CLI tool for running text through OpenAI Text to speech
Access large language models from the command-line
Datasette plugin for modifying table schemas
Python CLI utility and library for manipulating SQLite databases
LLM plugin for models hosted by Anyscale Endpoints
A command-line utility for taking automated screenshots of websites
TIL these weeks
- Cloning my voice with ElevenLabs—2023-11-16
- Summing columns in remote Parquet files using DuckDB—2023-11-14
More recent articles
- The killer app of Gemini Pro 1.5 is video - 21st February 2024
- Weeknotes: a Datasette release, an LLM release and a bunch of new plugins - 9th February 2024
- LLM 0.13: The annotated release notes - 26th January 2024
- Weeknotes: datasette-test, datasette-build, PSF board retreat - 21st January 2024
- Talking about Open Source LLMs on Oxide and Friends - 17th January 2024
- Publish Python packages to PyPI with a python-lib cookiecutter template and GitHub Actions - 16th January 2024
- What I should have said about the term Artificial Intelligence - 9th January 2024
- Weeknotes: Page caching and custom templates for Datasette Cloud - 7th January 2024
- It's OK to call it Artificial Intelligence - 7th January 2024