AI Image Prompts: The Formula That Works on Every Model

Most prompt guides give you a list and send you on your way. You copy something, paste it, get mediocre results, and wonder what you're doing wrong.

The problem usually isn't the AI. It's the prompt structure.

I've spent a lot of time testing prompts across Midjourney, Stable Diffusion, DALL·E, Gemini, and Leonardo. The single biggest thing separating good results from great ones is understanding why a prompt works, not just what it says.

Before I get into the breakdown: I put together a tested prompt library called AI Image Generation Prompts on Gumroad. It's 100+ curated styles across cinematic, anime, fantasy, sci-fi, illustration, and more, organized by category and copy-paste ready for all the major models. If you want results without the learning curve, start there. If you want to understand what makes prompts tick, read on.

Why Most Prompts Fail

Here's what a beginner prompt looks like:

❝

"a knight in a forest"

Here's what an experienced one looks like:

❝

"a battle-worn knight standing in a misty ancient forest, dramatic side lighting, photorealistic, ultra detailed armor with scratches and moss, shallow depth of field, cinematic mood, shot on Sony A7R IV"

Same subject. Completely different output. The difference isn't talent. It's knowing which ingredients the model needs to produce what you're picturing.

The Anatomy of a Prompt That Actually Works

Every strong image prompt has the same building blocks. You don't need all of them every time, but once you know them you can't un-know them.

Subject — Be specific. Not "a woman" but "a middle-aged Japanese botanist examining a rare orchid."

Environment — Where is the scene? Indoor, outdoor, time of day, weather, setting details. The model fills gaps with whatever it thinks is most common. You don't want that.

Lighting — The most underused element. "Golden hour," "harsh neon," "soft overcast," "candlelit" produce completely different images from the same subject. Lighting sets the emotional register of the whole image.

Style and Art Direction — Is this a photograph, an oil painting, a graphic novel panel, a Studio Ghibli still? Be explicit.

Mood and Emotion — Melancholy, joyful, tense, ethereal. This isn't about adding a single word at the end. It's about threading emotional tone through every descriptor.

Camera and Composition — "Close-up," "wide establishing shot," "bird's-eye view," "35mm lens," "f/1.8 bokeh." These control how the scene is framed, not just what's in it.

Quality Modifiers — "Ultra detailed," "photorealistic," "8K," "sharp focus," "cinematic color grading" tell the model to treat this as a high-effort output. Use sparingly and match them to the style.

Put these together and your prompts stop being vague requests and start being actual visual briefs.

The Same Prompt Behaves Differently on Different Models

This is something almost nobody writes about, and it matters if you switch between tools.

Midjourney responds well to art movements, photographer names, and film stocks. It interprets abstract descriptions loosely and produces aesthetically polished results even from vague inputs. The tradeoff is that extreme precision sometimes fights it.

Stable Diffusion rewards precision. Specific negative prompts and technical modifiers make a real difference. Leave things vague and it fills gaps in unpredictable ways.

DALL·E is strong with plain language. You can describe a scene like you're talking to a person and get something close to what you pictured. Abstract stylistic requests are its weaker spot.

Gemini (Nano Banana) handles photorealistic and mixed-media styles well. It's also better than most at rendering readable text within images, which matters for mockups and graphic design prompts.

Leonardo is popular with game designers and concept artists. Creature design, environment art, and fantasy illustration are where it tends to shine.

Knowing this means you stop blaming the prompt when the real issue is model mismatch.

40 Prompts by Category

These follow the anatomy framework above. Use them as-is or as starting points to build from.

Cinematic

A detective standing under a flickering street lamp in a rainy 1940s city, low-key noir lighting, film grain, desaturated colors, 35mm anamorphic lens
Two astronauts on a desolate red planet at dusk, massive storm on the horizon, dramatic backlighting, wide shot, photorealistic, cinematic color grading
A chase through a neon-lit Hong Kong alley, motion blur, wet pavement reflections, handheld camera feel, teal and orange palette
An old lighthouse keeper watching a storm from the window, warm interior light against dark rolling waves, long exposure feel, moody
Soldiers crossing a foggy river at dawn, muted olive and grey tones, documentary photography style, grainy, distant silhouettes

Photography (Realistic)

Portrait of a street vendor in Marrakech, natural golden hour sidelight, 85mm lens, shallow depth of field, candid expression
Aerial view of a salt flat at sunrise, perfect mirror reflection of clouds, minimalist, pastel sky, ultra wide angle
Close-up of an elderly carpenter's hands holding a chisel, warm workshop light, fine wood dust visible, macro lens, black and white
A child running through a field of sunflowers, backlit by late afternoon sun, motion blur on legs, warm film tones
A crowded Tokyo crossing at night, long exposure light trails, vibrant signage, blue hour sky

Anime and Manga

A female swordmaster on a cliff overlooking a burning village, dramatic manga panel composition, ink lines, high contrast shading
Two rivals facing off in a rain-soaked arena, dynamic action pose, speed lines, Shonen style, intense expressions
A cozy ramen shop interior at night, rain on the windows, warm light, slice-of-life anime style, Studio Ghibli influence
A cyberpunk mech pilot checking their visor before launch, Akira-influenced design, industrial colors, detailed linework
A nine-tailed spirit fox in a moonlit bamboo grove, ethereal glow, Makoto Shinkai color palette, painterly background

Fantasy

A dragon librarian cataloguing ancient spellbooks underground, candlelight, moss-covered stone shelves, whimsical
A dwarf blacksmith forging a legendary sword in a lava cave, heat distortion, sparks and embers, heroic pose
A travelling witch resting in a forest clearing, broomstick and potions nearby, soft dappled sunlight, magical realism
An underwater elven city with bioluminescent coral towers, deep blue light, fish drifting through the streets
A knight made of autumn leaves standing guard at a forest gate, surreal, hyperdetailed, golden light

Sci-Fi

A lone hacker in a server farm city, wires and glowing drives everywhere, teal ambient light, cyberpunk, first-person perspective
A generation ship arriving at a ringed exoplanet after 200 years, vast scale, warm planetary light, space dust, awe-inspiring
A robot farmer tending crops on Mars, utilitarian suit design, red dust in the air, sunrise, bittersweet mood
Neural interface surgery in a clean white clinic, clinical lighting, chrome instruments, near-future aesthetic
An alien market bazaar on a space station, hundreds of species, vibrant colors, neon signs in fictional languages, wide angle

Illustration

A vintage travel poster for a fictional ocean city, art deco design, muted teal and gold palette, hand-printed texture
A children's book illustration of a bear chef baking a cake, soft watercolors, warm pastels, charming round character design
An editorial illustration of a person overwhelmed by social media, bold graphic shapes, muted palette, metaphorical, magazine style
A fantasy kingdom map with forests, mountains, and a volcano, parchment texture, hand-drawn label style, sepia tones
An isometric city at night inside a glass snow globe, tiny detailed buildings, cozy warm lights, ultra detailed

Portrait

A closeup portrait of an Inuit elder, deep wrinkles, proud expression, fur-trimmed hood, overcast Arctic light, film photography style
A young musician backstage before a concert, nervous energy, dim yellow bulbs around a mirror, documentary feel
A scientist studying coral reef samples at her desk, surrounded by tanks and specimens, natural window light, focused
A drag performer backstage applying rhinestones, dramatic makeup half-done, warm vanity light, joyful confidence
A poet on a rooftop at sunset writing in a worn notebook, golden backlight, wind in their hair, contemplative

Product and Commercial

A luxury perfume bottle on black marble, dramatic single-beam spotlight, water droplets on glass, commercial photography
A minimalist running shoe floating on pure white, soft studio light, 3/4 angle, product photography, ultra sharp
An artisan coffee cup on a rainy window sill, moody cafe background, bokeh lights, lifestyle photography, warm
A gaming headset surrounded by blue circuit-board light trails, dark background, tech product photography, bold
A skincare serum bottle in fresh botanicals, clean white light, flatlay, beauty industry aesthetic

What Actually Kills Good Prompts

Being vague on mood. "Beautiful" and "stunning" mean nothing to the model. "Melancholy, overcast, unsaturated" means something.

Stacking conflicting styles. "Photorealistic anime oil painting in Studio Ghibli style with cinematic lighting" sends the model in four directions at once. Pick a direction and commit.

Forgetting camera framing. Most people describe what is in the image but not how it's being seen. Adding a lens type, shot distance, or angle improves composition more than most people expect.

Using the same quality modifiers for every prompt. "Ultra detailed 8K hyperrealistic" actively hurts certain styles, particularly illustration and anime. Match your modifiers to the visual language you're targeting.

Skipping negative prompts. Most models support them. Use them to remove common artifacts: extra limbs, blurry faces, watermarks, washed-out contrast.

How to Iterate Instead of Starting Over

When a result is almost right, most people scrap the whole prompt and start again. Usually the wrong call.

Change one variable at a time. If the composition is good but the lighting is wrong, only change the lighting descriptor. If the style is right but the subject looks off, refine the subject description. Think of it as adjusting a camera lens rather than switching cameras.

After twenty or thirty iterations across different prompts, you start seeing which modifiers reliably improve which styles. That pattern recognition compounds fast. Keeping even a short note of what worked saves a lot of repetition.

The Short Version

Writing good prompts is a learnable skill. Once you understand that a prompt is a visual brief rather than a wish, the consistency of your outputs changes quickly.

Subject, environment, lighting, style, mood, camera, quality. Get those seven elements working together and you stop getting lucky with AI images and start getting consistent.

If you'd rather start with prompts that are already tested and organized, the AI Image Generation Prompts library has 100+ styles across every major category, formatted for copy-paste use across Midjourney, Stable Diffusion, DALL·E, Gemini, Leonardo, and more. It's the fastest way to go from "why does this look wrong" to results you can actually use.