Midjourney released version 5.1 of their image generation model on Tuesday. Here’s their announcement on Twitter—if you have a Discord account there’s a more detailed Discord announcement here.
They claim that “V5.1 is more opinionated (like V4) and is MUCH easier to use with short prompts”—in comparison to v5.
Last night (9:30pm PST on Wednesday May 3rd) they switched 5.1 to be the default; previously you had to add --v 5.1 to a prompt in order to use it.
To compare the v5 and v5.1 models, I ran the prompt "pelicans having a tea party" through them both.
v5 is the version of Midjourney that came out on March 15th, and really felt like a turning point in that it was the first to reliably produce photorealistic images. If you’ve seen the flurry of memes of the Pope in a Balenciaga puffy jacket, you’ve seen Midjourney 5.
I find the difference between the two so interesting. The v5 one went for photo-realism—the pelicans are in a natural setting, and while they are standing near a tea service none of them are really interacting with it beyond looking at it.
For 5.1, the model seems to have made very different choices. These pelicans are in a formal setting: a tea room, albeit in some cases with an oil painting of the ocean behind them. The style is more illustrative than photographic, and definitely more whimsical. They’re interacting with the tea, which means the model has added creepy little hands in three cases and in one case given them pink claws, albeit in addition to their existing wings.
I think 5.1 does a better job with this admittedly vague and silly prompt.
I use Midjourney pretty regularly now, exclusively for entertainment. It’s a lot of fun.