Sponsored By: AE Studio
This essay is brought to you by AE Studio, your custom software development, and AI Partner. Built for founders, AE helps you achieve rapid success with custom MVPs, drive innovation, and maximize ROI.
If you have to regularly generate images using an AI-powered tool like Midjourney, you’re probably familiar with the challenges. Perhaps you ask it for an illustration of people at work, and it serves up solely Mad Men-era shots overladen with testosterone. Maybe you tell it to make you a bouquet without roses, and it gives you a flower quadrant worthy of Beauty and the Beast. During one particularly funny week at Every, we discovered that Midjourney’s ultimate Achilles heel was “person dreaming of piggybank.” The AI could not wrap its “head” around that concept, regardless of what details we fed it. (The results were nightmare fuel.)
I’ve spent a lot of time experimenting with Midjourney’s capabilities in recent months—I come from an advertising design background, so I’m particularly motivated to master the new medium. I waded through Midjourney’s detailed technical documentation, followed smart creators tweeting Midjourney tips, and stalked public channels in Midjourney’s Discord.
I learned that there are nearly infinite possibilities for directing Midjourney’s output. It’s incredibly powerful and adaptable, and you can exert much more control over the results than you’d think from just looking at Midjourney’s UI. It’s not an intuitive interface though—it takes research and practice to make full use of Midjourney’s command language (much like mastering keyboard shortcuts in your favorite apps).
At Every, we rely on Midjourney to generate our article images, so I pulled together a guide for our internal use. When we realized it might be a helpful resource for others, we decided to publish it for our subscribers.
In this article, I’ll walk you through the most powerful and useful techniques I’ve come across. We’ll cover:
- Getting started in Midjourney
- Understanding Midjourney’s quirks with interpreting prompts
- Customizing Midjourney’s image outputs after the fact
- Experimenting with a range of styles and content
- Uploading and combining images to make new ones via image injections
- Brainstorming art options with parameters like “chaos” and “weird”
- Finalizing your Midjourney output’s aspect ratio
And much more.
Are you ready to take your business to the next level?
AE Studio can help you achieve your goals faster than the competition with our custom software and AI solutions. We work closely with founders and executives to understand your needs and create a solution that is tailored to your success. Whether you need an MVP to test your idea, or a custom AI solution to drive innovation, AE can help.
We are the world's most effective team of developers, data scientists, and designers. We have a proven track record of success, and we are committed to helping you achieve your goals.
Schedule a free scoping session today to see how AE can help you transform your business.
Getting started (for beginners)
The most challenging part about using Midjourney is that there’s no official web app yet, and the whole front-end user experience happens through Discord. So the order of operations to get set up is a little convoluted, and you have to create your Midjourney images through Discord chatrooms (channels).
If you already have some basic familiarity with Midjourney and its UI, you may want to jump ahead to the “Promptwork” section. If you’re a beginner, follow Midjourney’s quick start instructions to get set up. Before you move on in this guide, just make sure:
- You’ve created a Discord account
- You’ve subscribed to Midjourney (there is no free trial anymore unfortunately)
- You’ve joined Midjourney’s Discord server
- You’ve joined a “newbies” channel in Midjourney’s Discord server
In whatever newbie channel you choose to join, you will see thousands of other people submitting prompts, and the images Midjourney generates for them. I learned a lot in the beginning just from studying other people’s approaches.
Private Midjourney access
If your company or community has its own Discord server, then you can add a Midjourney bot for internal use. That way, you can generate images without working in a cluttered, public newbies channel, and you can more easily peruse the images generated by your co-workers or direct community. (Note: Midjourney still reserves the right to serve up your images in other places, like their showcase page.) You can also enable Direct Messages with a Midjourney bot if you prefer to work privately (so only you can see them).
So, now that you’re all set up, how do you use Midjourney?
The magic happens with /imagine
To interact with Midjourney you must first type a Command in whatever Discord newbies channel you’ve joined. A Command refers to the action that you want Midjourney to perform. The most common one is /imagine followed by your text prompt telling Midjourney what kind of image to create.
/imagine <your text prompt>
You can be as specific or as vague as you want in your prompt. Then hit enter to see what the model cooks up. In less than a minute you should see a grid of 4 images that Midjourney sent back to you. Here’s an example you can try:
/imagine cute robot, white background
After getting the grid output you will see a few options beneath it.
- U → Upscale: Selects one of the four and makes it big / improves its resolution so you can use it.
- V → Variate: Generates variations from the selected output
- 🔄 → Rerun: Use this option if you you like to rerun the same prompt again and see more results
The options apply in a clockwise direction. So if you really like the top right image (2), then you would click U2. and get the upscaled version almost instantly. To download the upscaled image: click on it to open it to full size, and then right-click and choose save image.
Congratulations, now you know how to use Midjourney like 95% of its users. If you want to learn what the other 5% knows, keep reading.
So you generated an image, now what?
After an image is upscaled you are presented with a few options.
Variation:
The first row of buttons allows you to regenerate new options that can vary subtly or strongly from the original depending on what you are looking for. It’s an easy way to riff and brainstorm.
Zoom out:
The middle row of buttons allows you to zoom out from your output and generate additional visual context surrounding it. In other words, Midjourney will add more of the “background scene” behind the main subject in your image.
Panning:
The last row of buttons lets you expand your image as implied by the buttons, left, right, up and down. It generates more of the scene in a specific direction (the same way panning your camera would in the real world). This also gives your images new aspect ratios.
Promptwork: Where the magic happens
I usually recommend starting with a simple prompt so you understand how Midjourney interprets the concept. Your prompt can be a sentence, a word, a letter, or even just an emoji.
The fewer words you include, the more influence every word has on the output. It gives the language descending priority—so if you write a whole paragraph, the words and phrases at the start of the prompt will shape the output the most.
You don’t need to include explicit instructions like “make me an image” or “create an illustration.” Instead, get right to the point of describing the visual you want. For example:
The model then grabs each “token” you write (i.e., Elmo, skydiving, jungle) and compares them to its training data to generate the desired image.
Less specific prompts are great for experimenting and gathering unexpected results—it’s fun to see how the mysterious black box of AI interprets a prompt like “heartache” or “sustenance.”
More detailed prompts give you greater control over the end product. “Cooking pasta” will differ dramatically from “old Italian man making fettuccine, 1950s kitchen photography.” I usually take the “design squiggle” approach to image generation, coming up with a bunch of options via simple prompts in the beginning, then honing in on a particular approach by adding more detail.
Changing just one token in the prompt can reshape the image dramatically. Case in point: Banana Factory, Banana Machine, Banana Toy, and Banana Art.
Gettin’ picky with it (the prompts, that is)
You can experiment with a range of aesthetic and content variables to generate different kinds of images. Just remember that the more concepts you include, the more data that Midjourney has to consider when generating an output.
Some options:
- Subject: person, animal, character, location, object, etc.
- Medium: photograph, painting, illustration, sculpture, cartoon, doodle, tapestry, watercolor, 3D render, etc.
- Environment: indoors, outdoors, on the moon, in Narnia, underwater, the Emerald City, etc.
- Lighting: soft, ambient, overcast, neon, studio lights, warm, etc
- Color: vibrant, desaturated, muted, bright, monochromatic, colorful, black and white, pastel, etc.
- Atmosphere: foggy, sunny, misty, cold, dirty, etc.
- Mood: sedate, calm, dramatic, raucous, minimalist, energetic, etc.
- Composition: portrait, macro, scan, headshot, closeup, birds-eye view, etc.
- Style: Wes Anderson, minimalist, Van Gogh, Anime, Solarpunk, gothic, graffiti, medieval, etc..
To play with the look and feel of an image, add the phrase “in the style of” to your prompt, like “NYC in the style of stained glass” or “banana toy in the style of Andy Warhol.” The options are endless—you can channel styles of painting, cinematography, periods in history, and more. For inspiration check out this extensive list made by Metaroids.
Caption: Midjourney outputs for the prompt “NYC in the style of Pixar / Basquiat / stained glass.”
A common mistake: Midjourney doesn’t understand ‘no’
Midjourney isn’t always the most intuitive user experience...
This post continues for paid subscribers. It's a 4,000 word bible covering:
- Understanding Midjourney’s quirks with interpreting prompts
- Customizing Midjourney’s image outputs after the fact
- Experimenting with a range of styles and content
- Uploading and combining images to make new ones via image injections
- Brainstorming art options with parameters like “chaos” and “weird”
- Finalizing your Midjourney output’s aspect ratio
Curious? Subscribe below.
Find Out What
Comes Next in Tech.
Start your free trial.
New ideas to help you build the future—in your inbox, every day. Trusted by over 75,000 readers.
SubscribeAlready have an account? Sign in
What's included?
- Unlimited access to our daily essays by Dan Shipper, Evan Armstrong, and a roster of the best tech writers on the internet
- Full access to an archive of hundreds of in-depth articles
- Priority access and subscriber-only discounts to courses, events, and more
- Ad-free experience
- Access to our Discord community
Thanks to our Sponsor: AE Studio
Thanks again to our sponsor AE Studio, the go-to partner for founders and executives wanting to take their ideas to the next level. AE works closely with founders and executives to create solutions that are tailored to your specific needs. Whether you're looking to launch a new product, improve your customer experience, or automate your business processes, they can help.
Their team of experts has a proven track record of success having helped businesses of all sizes achieve their goals, from startups to Fortune 500 companies.
So what are you waiting for? Schedule a free scoping session today and see how AE can help you transform your business.