Simon Willison’s Weblog

OpenAI: Introducing our latest image generation model in the API. The astonishing native image generation capability of GPT-4o - a feature which continues to not have an obvious name - is now available via OpenAI's API.

It's quite expensive. OpenAI's estimates are:

Image outputs cost approximately $0.01 (low), $0.04 (medium), and $0.17 (high) for square images
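To get a feel for how those per-image estimates add up, here is a rough back-of-the-envelope sketch. It uses only the three approximate square-image prices quoted above; the `estimate_cost_cents` helper and the batch sizes are made up for illustration, and real bills will depend on image size and on the tokens in the prompt.

```python
# Approximate prices quoted above for square images, in US cents.
# Integers are used to avoid floating point rounding in the totals.
PRICE_CENTS_PER_SQUARE_IMAGE = {"low": 1, "medium": 4, "high": 17}


def estimate_cost_cents(counts):
    """Return the estimated total in cents for a {quality: image_count} mapping.

    Hypothetical helper for illustration only; not part of any OpenAI library.
    """
    return sum(PRICE_CENTS_PER_SQUARE_IMAGE[quality] * n for quality, n in counts.items())


if __name__ == "__main__":
    # Illustrative batch: 100 low, 50 medium and 10 high quality images.
    total = estimate_cost_cents({"low": 100, "medium": 50, "high": 10})
    print(f"${total / 100:.2f}")
```

At these rates a single high quality image costs as much as 17 low quality ones, so it is probably worth iterating on prompts at low quality first.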

Since this is a true multi-modal model capability - the images are created using a GPT-4o variant, which can now output text, audio and images - I had expected this to come as part of their chat completions or responses API. Instead, they've chosen to add it to the existing /v1/images/generations API, previously used for DALL-E.

They gave it the terrible name gpt-image-1 - no hint of the underlying GPT-4o in that name at all.
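Here is a minimal sketch of what calling the model through that endpoint might look like, using only the Python standard library. The endpoint path and model name come from the text above; the `quality` and `size` parameters and the `b64_json` field in the response are my assumptions about the API shape and should be checked against OpenAI's reference documentation. The helper names and the prompt are hypothetical.

```python
import base64
import json
import os
import urllib.request

API_URL = "https://api.openai.com/v1/images/generations"


def build_request(prompt, api_key, quality="low", size="1024x1024"):
    """Build (but do not send) a POST request for the images endpoint.

    The "quality" and "size" keys are assumed parameter names.
    """
    payload = {
        "model": "gpt-image-1",
        "prompt": prompt,
        "quality": quality,
        "size": size,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def save_first_image(response_body, path):
    """Decode the first image from a JSON response and write it to disk.

    Assumes the image arrives base64-encoded under data[0]["b64_json"].
    """
    data = json.loads(response_body)
    image_bytes = base64.b64decode(data["data"][0]["b64_json"])
    with open(path, "wb") as f:
        f.write(image_bytes)


if __name__ == "__main__":
    # Only hits the network (and costs money) if an API key is configured.
    api_key = os.environ.get("OPENAI_API_KEY")
    if api_key:
        request = build_request("A pelican riding a bicycle", api_key)
        with urllib.request.urlopen(request) as response:
            save_first_image(response.read(), "pelican.png")
```

Keeping the request construction separate from the network call makes it easy to inspect the payload before spending anything.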

I'm contemplating adding support for it as a custom LLM subcommand via my llm-openai plugin; see issue #18 in that repo.