Simon Willison’s Weblog

Subscribe

Sunday, 16th November 2025

With AI now, we are able to write new programs that we could never hope to write by hand before. We do it by specifying objectives (e.g. classification accuracy, reward functions), and we search the program space via gradient descent to find neural networks that work well against that objective.

This is my Software 2.0 blog post from a while ago. In this new programming paradigm then, the new most predictive feature to look at is verifiability. If a task/job is verifiable, then it is optimizable directly or via reinforcement learning, and a neural net can be trained to work extremely well. It's about to what extent an AI can "practice" something.

The environment has to be resettable (you can start a new attempt), efficient (a lot attempts can be made), and rewardable (there is some automated process to reward any specific attempt that was made).

Andrej Karpathy

# 6:29 pm / ai, andrej-karpathy, generative-ai, llms, ai-agents

2025 » November

MTWTFSS
     12
3456789
10111213141516
17181920212223
24252627282930