Items tagged ai, rag in 2023

Filters: Year: 2023 × ai × rag × Sorted by date

6 results

Exploring GPTs: ChatGPT in a trench coat?

The biggest announcement from last week’s OpenAI DevDay (and there were a LOT of announcements) was GPTs. Users of ChatGPT Plus can now create their own, custom GPT chat bots that other Plus subscribers can then talk to.

[... 5699 words]

3:39 pm / 15th November 2023 / projects, ai, generativeai, chatgpt, llms, codeinterpreter, rag

Embeddings: What they are and why they matter

Embeddings are a really neat trick that often come wrapped in a pile of intimidating jargon.

[... 5835 words]

1:36 pm / 23rd October 2023 / ai, generativeai, embeddings, llm, annotatedtalks, rag

LLM now provides tools for working with embeddings

LLM is my Python library and command-line tool for working with language models. I just released LLM 0.9 with a new set of features that extend LLM to provide tools for working with embeddings.

[... 3466 words]

8:32 pm / 4th September 2023 / opensource, projects, sqlite, ai, generativeai, llms, embeddings, llm, rag

Llama 2 is about as factually accurate as GPT-4 for summaries and is 30X cheaper. Anyscale offer (cheap, fast) API access to Llama 2, so they’re not an unbiased source of information—but I really hope their claim here that Llama 2 70B provides almost equivalent summarization quality to GPT-4 holds up. Summarization is one of my favourite applications of LLMs, partly because it’s key to being able to implement Retrieval Augmented Generation against your own documents—where snippets of relevant documents are fed to the model and used to answer a user’s question. Having a really high performance openly licensed summarization model is a very big deal. # 30th August 2023, 2:37 pm

Making Large Language Models work for you

I gave an invited keynote at WordCamp 2023 in National Harbor, Maryland on Friday.

[... 14188 words]

2:35 pm / 27th August 2023 / speaking, wordpress, ai, generativeai, llms, llm, annotatedtalks, rag

How to implement Q&A against your documentation with GPT3, embeddings and Datasette

If you’ve spent any time with GPT-3 or ChatGPT, you’ve likely thought about how useful it would be if you could point them at a specific, current collection of text or documentation and have it use that as part of its input for answering questions.

[... 3491 words]

11:47 pm / 13th January 2023 / projects, search, sqlite, ai, datasette, gpt3, generativeai, vectorsearch, llms, embeddings, rag

Simon Willison’s Weblog