Simon Willison’s Weblog

Subscribe

Tuesday, 21st February 2023

In defense of prompt engineering

Prompt engineering as a discipline doesn’t get nearly the respect it deserves.

[... 924 words]

FlexGen (via) This looks like a very big deal. FlexGen is a paper and accompanying code that massively reduces the resources needed to run some of the current top performing open source GPT-style large language models. People on Hacker News report being able to use it to run models like opt-30b on their own hardware, and it looks like it opens up the possibility of running even larger models on hardware available outside of dedicated research labs.

# 6:41 pm / ai, gpt3, generative-ai, llms

2023 » February

MTWTFSS
  12345
6789101112
13141516171819
20212223242526
2728