Simon Willison’s Weblog

Subscribe

Items tagged mistral, claude in 2024

Filters: Year: 2024 × mistral × claude × Sorted by date


The GPT-4 barrier has finally been broken

Four weeks ago, GPT-4 remained the undisputed champion: consistently at the top of every key benchmark, but more importantly the clear winner in terms of “vibes”. Almost everyone investing serious time exploring LLMs agreed that it was the most capable default model for the majority of tasks—and had been for more than a year.

[... 697 words]

Types

Years

Months

Tags