Simon Willison’s Weblog

30th April 2026 - Link Blog

Our evaluation of OpenAI's GPT-5.5 cyber capabilities. The UK's AI Security Institute previously evaluated Claude Mythos; now they've evaluated GPT-5.5 for finding security vulnerabilities and found it to be comparable to Mythos. Unlike Mythos, though, it's generally available right now.

Posted 30th April 2026 at 11:03 pm

This is a link post by Simon Willison, posted on 30th April 2026.

Tags: ai, openai, generative-ai, llms, anthropic, claude, ai-security-research, gpt
