Entries

Leaked Google document: “We Have No Moat, And Neither Does OpenAI”

SemiAnalysis published something of a bombshell leaked document this morning: Google “We Have No Moat, And Neither Does OpenAI”.

[... 1,073 words]

4:05 pm / 4th May 2023 / google, open-source, openai, generative-ai, local-llms, llms, paper-review

Midjourney 5.1

Midjourney released version 5.1 of their image generation model on Tuesday. Here’s their announcement on Twitter—if you have a Discord account there’s a more detailed Discord announcement here.

[... 396 words]

3:42 pm / 4th May 2023 / ai, generative-ai, midjourney, text-to-image

Prompt injection explained, with video, slides, and a transcript

I participated in a webinar this morning about prompt injection, organized by LangChain and hosted by Harrison Chase, with Willem Pienaar, Kojin Oshiba (Robust Intelligence), and Jonathan Cohen and Christopher Parisien (Nvidia Research).

[... 3,120 words]

8:22 pm / 2nd May 2023 / security, my-talks, ai, prompt-engineering, prompt-injection, generative-ai, llms, annotated-talks, exfiltration-attacks

download-esm: a tool for downloading ECMAScript modules

I’ve built a new CLI tool, download-esm, which takes the name of an npm package and will attempt to download the ECMAScript module version of that package, plus all of its dependencies, directly from the jsDelivr CDN—and then rewrite all of the import statements to point to those local copies.

[... 1,240 words]

4:47 am / 2nd May 2023 / cli, ecmascript, javascript, projects, npm, ai-assisted-programming

Let’s be bear or bunny

The Machine Learning Compilation group (MLC) are my favourite team of AI researchers at the moment.

[... 599 words]

6:37 pm / 1st May 2023 / ai, generative-ai, llama, local-llms, llms, mlc, vicuna, hallucinations

Weeknotes: Miscellaneous research into Rye, ChatGPT Code Interpreter and openai-to-sqlite

I gave myself some time off stressing about my core responsibilities this week after PyCon, which meant allowing myself to be distracted by some miscellaneous research projects.

[... 891 words]

5:12 am / 1st May 2023 / projects, weeknotes, prompt-injection, chatgpt, rye, code-interpreter, coding-agents

Enriching data with GPT3.5 and SQLite SQL functions

I shipped openai-to-sqlite 0.3 yesterday with a fun new feature: you can now use the command-line tool to enrich data in a SQLite database by running values through an OpenAI model and saving the results, all in a single SQL query.

[... 1,219 words]

11:11 pm / 29th April 2023 / projects, sqlite, ai, openai, generative-ai, chatgpt, llms

The Dual LLM pattern for building AI assistants that can resist prompt injection

I really want an AI assistant: a Large Language Model powered chatbot that can answer questions and perform actions for me based on access to my private data and tools.

[... 2,632 words]

7 pm / 25th April 2023 / definitions, security, ai, prompt-engineering, prompt-injection, generative-ai, llms, exfiltration-attacks, ai-agents

Weeknotes: Citus Con, PyCon and three new niche museums

I’ve had a busy week in terms of speaking: on Tuesday I gave an online keynote at Citus Con, “Big Opportunities in Small Data”. I then flew to Salt Lake City for PyCon that evening and gave a three hour workshop on Wednesday, “Data analysis with SQLite and Python”.

[... 225 words]

4:46 am / 23rd April 2023 / conferences, museums, pycon, speaking, weeknotes

Data analysis with SQLite and Python for PyCon 2023

I’m at PyCon 2023 in Salt Lake City this week.

[... 347 words]

5:03 pm / 20th April 2023 / pycon, python, speaking, sqlite, my-talks, datasette, sqlite-utils, datasette-lite

What’s in the RedPajama-Data-1T LLM training set

RedPajama is “a project to create leading open-source models, starts by reproducing LLaMA training dataset of over 1.2 trillion tokens”. It’s a collaboration between Together, Ontocord.ai, ETH DS3Lab, Stanford CRFM, Hazy Research, and MILA Québec AI Institute.

[... 1,077 words]

6:57 pm / 17th April 2023 / ai, datasette, datasette-lite, generative-ai, llama, local-llms, llms, ai-assisted-programming, redpajama, training-data

Web LLM runs the vicuna-7b Large Language Model entirely in your browser, and it’s very impressive

A month ago I asked Could you train a ChatGPT-beating model for $85,000 and run it in a browser?. $85,000 was a hypothetical training cost for LLaMA 7B plus Stanford Alpaca. “Run it in a browser” was based on the fact that Web Stable Diffusion runs a 1.9GB Stable Diffusion model in a browser, so maybe it’s not such a big leap to run a small Large Language Model there as well.

[... 2,276 words]

3:10 pm / 16th April 2023 / chrome, ai, webassembly, generative-ai, llama, local-llms, llms, mlc, vicuna

sqlite-history: tracking changes to SQLite tables using triggers (also weeknotes)

In between blogging about ChatGPT rhetoric, micro-benchmarking with ChatGPT Code Interpreter and Why prompt injection is an even bigger problem now I managed to ship the beginnings of a new project: sqlite-history.

[... 1,680 words]

5:33 pm / 15th April 2023 / projects, sqlite, datasette, weeknotes, ai-assisted-programming

Prompt injection: What’s the worst that can happen?

Activity around building sophisticated applications on top of LLMs (Large Language Models) such as GPT-3/4/ChatGPT/etc is growing like wildfire right now.

[... 2,302 words]

5:35 pm / 14th April 2023 / security, ai, openai, prompt-engineering, prompt-injection, generative-ai, chatgpt, llms, exfiltration-attacks, ai-agents

Running Python micro-benchmarks using the ChatGPT Code Interpreter alpha

Today I wanted to understand the performance difference between two Python implementations of a mechanism to detect changes to a SQLite database schema. I rendered the difference between the two as this chart:

[... 2,939 words]

1:14 am / 12th April 2023 / python, sqlite, ai, prompt-engineering, generative-ai, chatgpt, llms, ai-assisted-programming, code-interpreter, coding-agents

Thoughts on AI safety in this era of increasingly powerful open source LLMs

This morning, VentureBeat published a story by Sharon Goldman: With a wave of new LLMs, open source AI is having a moment — and a red-hot debate. It covers the explosion in activity around openly available Large Language Models such as LLaMA—a trend I’ve been tracking in my own series LLMs on personal devices—and talks about their implications with respect to AI safety.

[... 782 words]

6:41 pm / 10th April 2023 / ethics, media, open-source, ai, openai, generative-ai, chatgpt, llama, local-llms, llms, ai-ethics

The Changelog podcast: LLMs break the internet

I’m the guest on the latest episode of The Changelog podcast: LLMs break the internet. It’s a follow-up to the episode we recorded six months ago about Stable Diffusion.

[... 454 words]

12:46 am / 8th April 2023 / podcasts, speaking, ai, openai, generative-ai, chatgpt, llama, bard, llms, ai-assisted-programming, podcast-appearances

Working in public

I participated in a panel discussion this week for path to Citus Con, a series of Discord audio events that are happening in the run up to the Citus Con 2023 later this month.

[... 546 words]

12:36 am / 8th April 2023 / open-source, podcasts, speaking, my-talks, podcast-appearances

We need to tell people ChatGPT will lie to them, not debate linguistics

ChatGPT lies to people. This is a serious bug that has so far resisted all attempts at a fix. We need to prioritize helping people understand this, not debating the most precise terminology to use to describe it.

[... 1,174 words]

4:34 pm / 7th April 2023 / ethics, ai, openai, chatgpt, llms, ai-ethics, hallucinations

Weeknotes: A new llm CLI tool, plus automating my weeknotes and newsletter

I started publishing weeknotes in 2019 partly as a way to hold myself accountable but mainly as a way to encourage myself to write more.

[... 830 words]

11:28 pm / 4th April 2023 / cli, projects, datasette, observable, weeknotes, llms, llm

Semi-automating a Substack newsletter with an Observable notebook

I recently started sending out a weekly-ish email newsletter consisting of content from my blog. I’ve mostly automated that, using an Observable Notebook to generate the HTML. Here’s how that system works.

[... 2,520 words]

5:55 pm / 4th April 2023 / blogging, projects, datasette, observable, cors, newsletter, substack

Think of language models like ChatGPT as a “calculator for words”

One of the most pervasive mistakes I see people using with large language model tools like ChatGPT is trying to use them as a search engine.

[... 1,162 words]

4:20 pm / 2nd April 2023 / bing, ai, generative-ai, chatgpt, bard, llms, hallucinations

What AI can do for you on the Theory of Change podcast

Matthew Sheffield invited me on his show Theory of Change to talk about how AI models like ChatGPT, Bing and Bard work and practical applications of things you can do with them.

[... 548 words]

12:17 am / 2nd April 2023 / bing, media, podcasts, ai, generative-ai, chatgpt, bard, llms, podcast-appearances

AI-enhanced development makes me more ambitious with my projects

The thing I’m most excited about in our weird new AI-enhanced reality is the way it allows me to be more ambitious with my projects.

[... 3,336 words]

2:38 pm / 27th March 2023 / projects, ai, generative-ai, chatgpt, github-copilot, applescript, llms, ai-assisted-programming

I built a ChatGPT plugin to answer questions about data hosted in Datasette

Yesterday OpenAI announced support for ChatGPT plugins. It’s now possible to teach ChatGPT how to make calls out to external APIs and use the responses to help generate further answers in the current conversation.

[... 1,801 words]

3:43 pm / 24th March 2023 / projects, datasette, openai, chatgpt, llms, ai-assisted-programming, hallucinations, chatgpt-plugins

Weeknotes: AI won’t slow down, a new newsletter and a huge Datasette refactor

I’m a few weeks behind on my weeknotes, but it’s not through lack of attention to my blog. AI just keeps getting weirder and more interesting.

[... 1,255 words]

11:47 pm / 22nd March 2023 / projects, ai, datasette, weeknotes, generative-ai, llms

Don’t trust AI to talk accurately about itself: Bard wasn’t trained on Gmail

Earlier this month I wrote about how ChatGPT can’t access the internet, even though it really looks like it can. Consider this part two in the series. Here’s another common and non-intuitive mistake people make when interacting with large language model AI systems: asking them questions about themselves.

[... 1,950 words]

3:13 am / 22nd March 2023 / bing, ethics, gmail, google, privacy, ai, generative-ai, chatgpt, bard, llms, training-data, ai-ethics, hallucinations

A conversation about prompt engineering with CBC Day 6

I’m on Canadian radio this morning! I was interviewed by Peter Armstrong for CBC Day 6 about the developing field of prompt engineering.

[... 1,742 words]

4 pm / 18th March 2023 / media, ai, prompt-engineering, generative-ai, llms

Could you train a ChatGPT-beating model for $85,000 and run it in a browser?

I think it’s now possible to train a large language model with similar functionality to GPT-3 for $85,000. And I think we might soon be able to run the resulting model entirely in the browser, and give it capabilities that leapfrog it ahead of ChatGPT.

[... 1,751 words]

3:43 pm / 17th March 2023 / ai, webassembly, generative-ai, chatgpt, llama, local-llms, llms, bloom, mlc, transformers-js, llm-tool-use, llama-cpp

Stanford Alpaca, and the acceleration of on-device large language model development

On Saturday 11th March I wrote about how Large language models are having their Stable Diffusion moment. Today is Monday. Let’s look at what’s happened in the past three days.

[... 2,055 words]

7:19 pm / 13th March 2023 / open-source, stanford, ai, gpt-3, generative-ai, llama, local-llms, llms, fine-tuning, llama-cpp, paper-review

Simon Willison’s Weblog