OpenAI: Introducing our latest image generation model in the API. The astonishing native image generation capability of GPT-4o - a feature which continues to not have an obvious name - is now available via OpenAI's API.
It's quite expensive. OpenAI's estimates are:
Image outputs cost approximately $0.01 (low), $0.04 (medium), and $0.17 (high) for square images
Since this is a true multi-modal model capability - the images are created using a GPT-4o variant, which can now output text, audio and images - I had expected this to come as part of their chat completions or responses API. Instead, they've chosen to add it to the existing /v1/images/generations API, previously used for DALL-E.
They gave it the terrible name gpt-image-1 - no hint of the underlying GPT-4o in that name at all.
I'm contemplating adding support for it as a custom LLM subcommand via my llm-openai plugin, see issue #18 in that repo.
Recent articles
- A new SQL-powered permissions system in Datasette 1.0a20 - 4th November 2025
- New prompt injection papers: Agents Rule of Two and The Attacker Moves Second - 2nd November 2025
- Hacking the WiFi-enabled color screen GitHub Universe conference badge - 28th October 2025