Simon Willison: LLMs on personal devices

Series: LLMs on personal devices

Large language models that can run on our own devices open up exciting new ways in which these tools can be used.

Large language models are having their Stable Diffusion moment

The open release of the Stable Diffusion image generation model back in August 2022 was a key moment. I wrote how Stable Diffusion is a really big deal at the time.

[... 1,810 words]

7:15 pm / 11th March 2023 / facebook, ai, gpt3, openai, generative-ai, chatgpt, whisper, llama, homebrew-llms, llms

Stanford Alpaca, and the acceleration of on-device large language model development

On Saturday 11th March I wrote about how Large language models are having their Stable Diffusion moment. Today is Monday. Let’s look at what’s happened in the past three days.

[... 2,055 words]

7:19 pm / 13th March 2023 / open-source, stanford, ai, gpt3, generative-ai, llama, homebrew-llms, llms, fine-tuning

Could you train a ChatGPT-beating model for $85,000 and run it in a browser?

I think it’s now possible to train a large language model with similar functionality to GPT-3 for $85,000. And I think we might soon be able to run the resulting model entirely in the browser, and give it capabilities that leapfrog it ahead of ChatGPT.

[... 1,751 words]

3:43 pm / 17th March 2023 / ai, webassembly, generative-ai, chatgpt, llama, homebrew-llms, llms, bloom, mlc, transformers-js

Thoughts on AI safety in this era of increasingly powerful open source LLMs

This morning, VentureBeat published a story by Sharon Goldman: With a wave of new LLMs, open source AI is having a moment — and a red-hot debate. It covers the explosion in activity around openly available Large Language Models such as LLaMA—a trend I’ve been tracking in my own series LLMs on personal devices—and talks about their implications with respect to AI safety.

[... 781 words]

6:41 pm / 10th April 2023 / ethics, media, open-source, ai, openai, generative-ai, chatgpt, llama, homebrew-llms, llms

Web LLM runs the vicuna-7b Large Language Model entirely in your browser, and it’s very impressive

A month ago I asked Could you train a ChatGPT-beating model for $85,000 and run it in a browser?. $85,000 was a hypothetical training cost for LLaMA 7B plus Stanford Alpaca. “Run it in a browser” was based on the fact that Web Stable Diffusion runs a 1.9GB Stable Diffusion model in a browser, so maybe it’s not such a big leap to run a small Large Language Model there as well.

[... 2,276 words]

3:10 pm / 16th April 2023 / chrome, ai, webassembly, generative-ai, llama, homebrew-llms, llms, mlc, vicuna

Let’s be bear or bunny

The Machine Learning Compilation group (MLC) are my favourite team of AI researchers at the moment.

[... 599 words]

6:37 pm / 1st May 2023 / ai, generative-ai, llama, homebrew-llms, llms, mlc, vicuna

Leaked Google document: “We Have No Moat, And Neither Does OpenAI”

SemiAnalysis published something of a bombshell leaked document this morning: Google “We Have No Moat, And Neither Does OpenAI”.

[... 1,073 words]

4:05 pm / 4th May 2023 / google, open-source, openai, generative-ai, homebrew-llms, llms

My LLM CLI tool now supports self-hosted language models via plugins

LLM is my command-line utility and Python library for working with large language models such as GPT-4. I just released version 0.5 with a huge new feature: you can now install plugins that add support for additional models to the tool, including models that can run on your own hardware.

[... 1,656 words]

2:24 pm / 12th July 2023 / projects, ai, generative-ai, homebrew-llms, llms, llm

Run Llama 2 on your own Mac using LLM and Homebrew

Llama 2 is the latest commercially usable openly licensed Large Language Model, released by Meta AI a few weeks ago. I just released a new plugin for my LLM utility that adds support for Llama 2 and many other llama-cpp compatible models.

[... 1,423 words]

6:56 pm / 1st August 2023 / homebrew, macosx, plugins, projects, ai, generative-ai, llama, homebrew-llms, llms, llm

llamafile is the new best way to run a LLM on your own computer

Mozilla’s innovation group and Justine Tunney just released llamafile, and I think it’s now the single best way to get started running Large Language Models (think your own local copy of ChatGPT) on your own computer.

[... 650 words]

8:54 pm / 29th November 2023 / mozilla, ai, generative-ai, cosmopolitan, llama, llms, llamafile, justine-tunney

Many options for running Mistral models in your terminal using LLM

Mistral AI is the most exciting AI research lab at the moment. They’ve now released two extremely powerful smaller Large Language Models under an Apache 2 license, and have a third much larger one that’s available via their API.

[... 2,063 words]

6:18 pm / 18th December 2023 / plugins, projects, ai, generative-ai, llms, llm, mistral, llamafile