Introducing Midjourney V5: The Next Generation of Text-to-Image AI

Written by Sebastian on March 24, 2023 · 7 min read

tutorial

March 15th the latest version 5 of Midjourney, one of the most advanced text-to-image AIs, was released. Building on the strengths of its predecessors, Midjourney V5 brings you a range of powerful new features and improvements that will take your image generation experience to new heights. In this blog post, we’ll take you through the features and enhancements of Midjourney V5 and demonstrate how it raises the bar for AI-generated imagery.

What is Midjourney V5?

The Midjourney project began as an ambitious endeavor to create an AI capable of transforming textual descriptions into stunning visual images. Over last months, it has released multiple versions, each refining the capabilities of the AI and making it more powerful and versatile.

The latest version Midjourney V5 marks a significant milestone in the evolution of our AI technology. It builds upon the previous versions’ success and adds groundbreaking new features to deliver an unparalleled image generation experience for users.

How To Get Access To Midjourney

In order to explore some of Midjourney’s new capabilities you need to get

Step 1: Join the Midjourney Discord server

Image

Step 2: Find a Newbies Channel

Image

Step 3: Use the /imagine command

Note: If you don’t see a pop-up when typing the /imagine command, try logging out, updating the Discord app, and logging back in. Commands only work in bot channels and will not work in regular channels like #trial-support.

Image

Step 4: Process the Job

In order to make use of version 5 you have to add the

--v 5

parameter to the end of your prompt, or use the /settings command and select 5️⃣ MJ Version 5 to enable the new features and enhancements of Midjourney V5.

New Features and Enhancements of Midjourney V5

Now that you’ve access to Midjourney’s functionality we’re ready to take a look at the most important features and enhancements of version 5 in the following.

Higher native resolution

Midjourney V5 boasts a higher native resolution compared to V4, resulting in sharper and more detailed images. This improvement not only allows for better image quality but also provides additional potential for further upscaling.

The 5th version of Midjourney delivers a significant enhancement in resolution, now capable of generating images up to 1024x1024 pixels, which is twice the resolution of previous versions.

Unlimited aspect ratios

With V5, users can now generate images in any desired aspect ratio, providing greater flexibility for creative projects and opening up new possibilities for unique and eye-catching visuals.

To apply a specific aspect ratio you need to add the following parameter to your prompt:

--ar [aspect ration]

E.g. if you want to get your image generated in a 16:9 aspect ration you need to use:

--ar 16:9

Sharper, more detailed images

Midjourney V5 produces images with enhanced sharpness and detail compared to its predecessor. By examining side-by-side comparisons of images generated by V4 and V5, users can appreciate the significant improvements in image quality and clarity.

Let’s do a side by side comparison with the following very simple prompt. First, the prompt for version 4:

an artist in her studio --v 4

Result of version Midjourney V4:

Image

Second, the prompt using version 5:

an artist in her studio --v 5

Result of Midjourney V5:

Image

Improved coherence and composition

V5 excels at generating images with better coherence and composition. It now handles large groups of people more effectively and produces more realistic hands and fewer artifacts, resulting in more aesthetically pleasing and accurate images.

Wider range of supported styles

The stylistic range of Midjourney V5 has been expanded, allowing users to generate images in a more extensive selection of styles. From intricate landscapes to detailed architecture, V5 can now capture and render a broader array of visual aesthetics.

Let’s try it out by using the following prompt:

Balcony of a hotel design by Philippe Starck, atmospheric background style 
of blade runner 2049, photography in the style of 
Thomas Struth --ar 16:9 --v 5

Which delivers the following result:

Image

Here you can see that different styles are combined to generate this image.

Enhanced natural language processing

Midjourney V5 features a more sophisticated natural language processing system, which allows for better interpretation of text prompts. This improvement results in more accurate and contextually relevant image generation. In comparison to former version your prompt will benefit even more from being written in the form of sentences rather than just listing keywords.

For example consider the following prompt:

An inside view of a wizard's glasshouse. The glasshouse is attached to a 
house and has a sitting area that's surrounded by plants and small trees. 
Morning and a beautiful light passes through the glass. the photograph is 
taken with a Canon EOS 5D with a wide aperture lens set to F/8 and ISO 
set to 400. --ar 16:9 --s 500 --v 5

In the following screenshot you can see the result returned by Midjourney:

Image

Let’s decide for option 4 and by clicking on U4 retrieve the upscaled version:

Image

As you can see this result has benefit a lot from the prompt description that was not only provided as a list of keywords but instead as a description of the scene in natural language.

Support for tiling

The new tiling feature enables users to generate repeating patterns and textures seamlessly. This functionality is particularly useful for creating backgrounds, wallpapers, and other design elements that require consistency and continuity.

The --tile parameter generates images that can be used as repeating tiles to create seamless patterns for fabrics, wallpapers and textures, let’s do a quick example:

Prompt:

vibrant green frogs watercolor --v 5 --tile

Result:

Image

Improved image prompts and remixes

Midjourney V5 handles image prompts and remixes more effectively than its predecessor. By comparing V4 and V5 image prompt handling, users will notice a significant difference in the quality and relevance of the generated images.

Support for image weights

The new image weights feature offers users greater control over the final output. By adjusting image weights, users can fine-tune the prominence of different elements within the generated image, enabling greater customization and creative expression.

When using a photo or picture as a reference, you can now adjust its influence by employing the

--iw N

parameter, with N values ranging from 0.5 to 2.0.

Conclusion

Midjourney V5 represents a major leap forward in text-to-image AI technology. With its enhanced resolution, advanced features, and improved performance, it offers users a powerful and versatile tool for their creative endeavors.

It’s exciting to see how users will employ Midjourney V5’s capabilities to bring their ideas to life and create stunning visuals that push the boundaries of what AI-generated imagery can achieve.