Entries

Open questions for AI engineering

Last week I gave the closing keynote at the AI Engineer Summit in San Francisco. I was asked by the organizers to both summarize the conference, summarize the last year of activity in the space and give the audience something to think about by posing some open questions for them to take home.

[... 6,928 words]

2:18 pm / 17th October 2023 / my-talks, ai, generative-ai, llms, llm, annotated-talks, code-interpreter, coding-agents

Multi-modal prompt injection image attacks against GPT-4V

GPT4-V is the new mode of GPT-4 that allows you to upload images as part of your conversations. It’s absolutely brilliant. It also provides a whole new set of vectors for prompt injection attacks.

[... 889 words]

2:24 am / 14th October 2023 / security, ai, openai, prompt-injection, generative-ai, gpt-4, exfiltration-attacks, vision-llms, johann-rehberger

Weeknotes: the Datasette Cloud API, a podcast appearance and more

Datasette Cloud now has a documented API, plus a podcast appearance, some LLM plugins work and some geospatial excitement.

[... 1,243 words]

12:03 am / 1st October 2023 / journalism, projects, sqlite, ai, datasette, weeknotes, datasette-cloud, alex-garcia, generative-ai, llms, llm

Things I’ve learned about building CLI tools in Python

I build a lot of command-line tools in Python. It’s become my favorite way of quickly turning a piece of code into something I can use myself and package up for other people to use too.

[... 1,235 words]

12:12 am / 30th September 2023 / cli, python

Talking Large Language Models with Rooftop Ruby

I’m on the latest episode of the Rooftop Ruby podcast with Collin Donnell and Joel Drapper, talking all things LLM.

[... 15,489 words]

3:39 pm / 29th September 2023 / interviews, podcasts, speaking, ai, generative-ai, llms, llm, code-interpreter, podcast-appearances, coding-agents

Weeknotes: Embeddings, more embeddings and Datasette Cloud

Since my last weeknotes, a flurry of activity. LLM has embeddings support now, and Datasette Cloud has driven some major improvements to the wider Datasette ecosystem.

[... 2,427 words]

5:10 am / 17th September 2023 / plugins, projects, datasette, weeknotes, datasette-cloud, sqlite-utils, alex-garcia, embeddings, llm

Build an image search engine with llm-clip, chat with models with llm chat

LLM is my combination CLI tool and Python library for working with Large Language Models. I just released LLM 0.10 with two significant new features: embedding support for binary files and the llm chat command.

[... 1,188 words]

8:33 pm / 12th September 2023 / cli, projects, ai, annotated-release-notes, generative-ai, local-llms, llms, embeddings, llm, clip

LLM now provides tools for working with embeddings

LLM is my Python library and command-line tool for working with language models. I just released LLM 0.9 with a new set of features that extend LLM to provide tools for working with embeddings.

[... 3,521 words]

8:32 pm / 4th September 2023 / cli, open-source, projects, sqlite, ai, generative-ai, vector-search, llms, embeddings, llm, rag

Datasette 1.0a4 and 1.0a5, plus weeknotes

Two new alpha releases of Datasette, plus a keynote at WordCamp, a new LLM release, two new LLM plugins and a flurry of TILs.

[... 2,709 words]

2:33 pm / 30th August 2023 / plugins, projects, datasette, weeknotes, sqlite-utils, annotated-release-notes, alex-garcia, llm

Making Large Language Models work for you

I gave an invited keynote at WordCamp 2023 in National Harbor, Maryland on Friday.

[... 14,189 words]

2:35 pm / 27th August 2023 / speaking, my-talks, wordpress, ai, generative-ai, llms, llm, annotated-talks, code-interpreter, rag, coding-agents

Datasette Cloud, Datasette 1.0a3, llm-mlc and more

Datasette Cloud is now a significant step closer to general availability. The Datasette 1.03 alpha release is out, with a mostly finalized JSON format for 1.0. Plus new plugins for LLM and sqlite-utils and a flurry of things I’ve learned.

[... 1,690 words]

11:19 pm / 16th August 2023 / plugins, projects, datasette, weeknotes, datasette-cloud, sqlite-utils, llm

How I make annotated presentations

Giving a talk is a lot of work. I go by a rule of thumb I learned from Damian Conway: a minimum of ten hours of preparation for every one hour spent on stage.

[... 2,128 words]

5:15 pm / 6th August 2023 / alt-text, localstorage, ocr, projects, speaking, my-talks, tools, ai, generative-ai, llms, ai-assisted-programming, anthropic, claude, annotated-talks

Weeknotes: Plugins for LLM, sqlite-utils and Datasette

The principle theme for the past few weeks has been plugins.

[... 1,203 words]

12:32 am / 5th August 2023 / cli, plugins, projects, my-talks, datasette, weeknotes, sqlite-utils, llm

Catching up on the weird world of LLMs

I gave a talk on Sunday at North Bay Python where I attempted to summarize the last few years of development in the space of LLMs—Large Language Models, the technology behind tools like ChatGPT, Google Bard and Llama 2.

[... 10,489 words]

2:51 pm / 3rd August 2023 / ethics, python, my-talks, ai, openai, generative-ai, chatgpt, llms, llm, anthropic, claude, annotated-talks, code-interpreter, ai-ethics, coding-agents

Run Llama 2 on your own Mac using LLM and Homebrew

Llama 2 is the latest commercially usable openly licensed Large Language Model, released by Meta AI a few weeks ago. I just released a new plugin for my LLM utility that adds support for Llama 2 and many other llama-cpp compatible models.

[... 1,423 words]

6:56 pm / 1st August 2023 / homebrew, macos, plugins, projects, ai, generative-ai, llama, local-llms, llms, llm, llama-cpp

sqlite-utils now supports plugins

sqlite-utils 3.34 is out with a major new feature: support for plugins.

[... 1,327 words]

5:06 pm / 24th July 2023 / plugins, projects, sqlite, sqlite-utils, alex-garcia

Accessing Llama 2 from the command-line with the llm-replicate plugin

The big news today is Llama 2, the new openly licensed Large Language Model from Meta AI. It’s a really big deal:

[... 1,206 words]

7:30 pm / 18th July 2023 / cli, plugins, projects, ai, generative-ai, llama, local-llms, llms, replicate, llm, llm-release

Weeknotes: Self-hosted language models with LLM plugins, a new Datasette tutorial, a dozen package releases, a dozen TILs

A lot of stuff to cover from the past two and a half weeks.

[... 1,742 words]

5:55 am / 16th July 2023 / plugins, projects, tutorials, ai, datasette, weeknotes, sqlite-utils, generative-ai, local-llms, llms, symbex, llm

My LLM CLI tool now supports self-hosted language models via plugins

LLM is my command-line utility and Python library for working with large language models such as GPT-4. I just released version 0.5 with a huge new feature: you can now install plugins that add support for additional models to the tool, including models that can run on your own hardware.

[... 1,656 words]

2:24 pm / 12th July 2023 / cli, projects, ai, generative-ai, local-llms, llms, llm, anthropic

Weeknotes: symbex, LLM prompt templates, a bit of a break

I had a holiday to the UK for a family wedding anniversary and mostly took the time off... except for building symbex, which became one of those projects that kept on inspiring new features.

[... 1,120 words]

4:30 pm / 27th June 2023 / projects, ai, weeknotes, generative-ai, llms, symbex, llm

Symbex: search Python code for functions and classes, then pipe them into a LLM

I just released a new Python CLI tool called Symbex. It’s a search tool, loosely inspired by ripgrep, which lets you search Python code for functions and classes by name or wildcard, then see just the source code of those matching entities.

[... 1,183 words]

10:11 pm / 18th June 2023 / cli, projects, python, ai, generative-ai, chatgpt, llms, symbex

Understanding GPT tokenizers

Large language models such as GPT-3/4, LLaMA and PaLM work in terms of tokens. They take text, convert it into tokens (integers), then predict which tokens should come next.

[... 1,575 words]

8:37 pm / 8th June 2023 / projects, ai, gpt-3, openai, generative-ai, gpt-4, llms, tokenization

Weeknotes: Parquet in Datasette Lite, various talks, more LLM hacking

I’ve fallen a bit behind on my weeknotes. Here’s a catchup for the last few weeks.

[... 769 words]

9:14 pm / 4th June 2023 / projects, speaking, tutorials, datasette, parquet, weeknotes, datasette-lite, llms

It’s infuriatingly hard to understand how closed models train on their input

One of the most common concerns I see about large language models regards their training data. People are worried that anything they say to ChatGPT could be memorized by it and spat out to other users. People are concerned that anything they store in a private repository on GitHub might be used as training data for future versions of Copilot.

[... 1,465 words]

6:09 pm / 4th June 2023 / ai, openai, generative-ai, chatgpt, llms, anthropic, claude, training-data

ChatGPT should include inline tips

In OpenAI isn’t doing enough to make ChatGPT’s limitations clear James Vincent argues that OpenAI’s existing warnings about ChatGPT’s confounding ability to convincingly make stuff up are not effective.

[... 1,488 words]

7:23 pm / 30th May 2023 / design, prototyping, ai, max-woolf, openai, generative-ai, chatgpt, llms, anthropic, claude

Lawyer cites fake cases invented by ChatGPT, judge is not amused

Legal Twitter is having tremendous fun right now reviewing the latest documents from the case Mata v. Avianca, Inc. (1:22-cv-01461). Here’s a neat summary:

[... 2,844 words]

7:09 pm / 27th May 2023 / ethics, law, ai, openai, generative-ai, chatgpt, llms, ai-ethics, hallucinations

llm, ttok and strip-tags—CLI tools for working with ChatGPT and other LLMs

I’ve been building out a small suite of command-line tools for working with ChatGPT, GPT-4 and potentially other language models in the future.

[... 1,328 words]

9:04 pm / 18th May 2023 / cli, projects, ai, openai, generative-ai, chatgpt, llms, llm, tokenization

Delimiters won’t save you from prompt injection

Prompt injection remains an unsolved problem. The best we can do at the moment, disappointingly, is to raise awareness of the issue. As I pointed out last week, “if you don’t understand it, you are doomed to implement it.”

[... 1,010 words]

3:51 pm / 11th May 2023 / security, ai, openai, prompt-engineering, prompt-injection, generative-ai, llms, andrew-ng

Weeknotes: sqlite-utils 3.31, download-esm, Python in a sandbox

A couple of speaking appearances last week—one planned, one unplanned. Plus sqlite-utils 3.31, download-esm and a new TIL.

[... 608 words]

10:07 pm / 10th May 2023 / projects, python, weeknotes, deno, sqlite-utils, pyodide

Big Opportunities in Small Data

I gave an invited keynote at Citus Con 2023, the PostgreSQL conference. Below is the abstract, video, slides and links from the presentation.

[... 385 words]

3:06 am / 8th May 2023 / postgresql, sqlite, my-talks, datasette, small-data

Simon Willison’s Weblog