Scale is everything in fantasy concept art, and it’s the hardest thing to get AI to understand. This prompt formula from r/PromptEngineering cracks it.
TL;DR: Three specific modifiers (“dramatic scale contrast,” “wide landscape composition,” and “cinematic lighting”) are doing the heavy lifting in a dragon vs. warrior image prompt that’s generating genuinely cinematic results on Hifun AI.
The Prompt That Started It
The post’s author, u/AdSome4897, shared the following base prompt after experimenting with large-creature-vs-small-subject compositions:
massive ancient dragon descending toward a lone warrior, epic fantasy valley, dramatic scale contrast, cinematic lighting, wide landscape composition, ultra detailed fantasy concept art, storm clouds, glowing dragon eyes
On the surface it looks like a typical fantasy prompt. But this person wasn’t just throwing adjectives at it; they were engineering a specific visual relationship between two subjects. That’s a different task entirely. Most fantasy prompts describe a scene. This one describes a dynamic: a massive thing and a small thing in deliberate tension, with the camera positioned to make that tension legible. The word choices aren’t decorative. They’re structural instructions about how the frame should be organized.
The platform matters here too. Hifun AI, where these outputs were tested, handles compositional language reasonably well when it’s explicit. The lesson carries to Midjourney, Stable Diffusion, and Flux, but the more compositionally explicit your prompt, the more consistent your results across any model.
Breaking Down What’s Actually Working
The original poster identified three phrases that moved the results from generic to cinematic. Here’s why each one matters:
- 🐉 “Dramatic scale contrast” tells the model that the size difference between subjects is the visual point of the image, not a byproduct. Without it, models tend to equalize subjects in the frame. The AI defaults to making things look “balanced,” which actively destroys the effect you’re going for. This phrase overrides that default and signals that imbalance is the goal.
- 🌄 “Wide landscape composition” pulls the camera back and gives the small figure room to exist in the environment. Tight compositions make scale invisible. When you crop close, you lose the environmental context that makes a dragon feel enormous. The landscape itself becomes a measuring stick; when the valley stretches behind the warrior and the dragon fills the sky, the brain calculates the size difference automatically.
- “Cinematic lighting” shifts the output from illustration mode into film concept art mode. It introduces directional, dramatic light that reinforces the mood rather than just illuminating the scene. Think of how a single strong backlight on a small figure, with the large subject blocking part of the sky, creates silhouette drama. That’s cinematic lighting doing compositional work, not just aesthetic work.
One community member added a detail worth noting: specificity on both subjects helps. Phrases like “dragon scales shimmering” or “warrior’s cloak billowing” give the model texture to work with on each element. When you’re composing two subjects in tension, the more distinct each one is, the more the contrast reads. Vague subjects collapse into each other visually. Specific ones hold their individual identity even when one towers over the other. Think of it as giving the model two separate character briefs rather than one scene description.
Another commenter pointed out that adding a ground-level camera angle (“low angle shot” or “worm’s-eye view”) amplifies the effect further. When the camera sits at the small subject’s eye level, the large subject dominates the upper frame naturally. It’s a reinforcement of the scale logic already baked into the other three modifiers.
Use Cases Beyond Dragons
The framework here isn’t genre-specific. The underlying logic (dominant large subject, scale contrast, wide composition, cinematic lighting) transfers cleanly across contexts:
- Sci-fi: colossal mech looming over a lone pilot, debris field, dramatic scale contrast, cinematic lighting, wide industrial composition
- Horror: ancient leviathan rising from fog, single figure on shoreline, wide coastal composition, atmospheric lighting
- Game design reference: boss creature + environment + scale reference in a single concept art shot
- Marketing/brand: product as monolith, human silhouette for scale, same compositional logic applied commercially
The three core modifiers stay consistent. The subject, setting, and atmosphere are what you swap. If you’re building a style guide or a prompt library for a specific project, these three phrases belong in your locked layer, the constants that travel with every variation you test.
Prompt of the Day
Want to test the structure yourself? Use this template as a starting point:
[massive subject] looming over [lone small figure], [setting], dramatic scale contrast, cinematic lighting, wide landscape composition, ultra detailed [style] concept art, [atmospheric element], [one specific visual detail per subject]
Fill in the brackets for your genre and run it. The three modifiers in the middle are the ones doing the structural work; keep those consistent while you iterate on everything else. A useful exercise: run the prompt once with all three modifiers, then strip them one at a time and compare. The difference between having “dramatic scale contrast” and not having it is often the difference between an image that reads immediately and one that needs explanation.
The Takeaway
Most prompts fail at scale not because the model can’t generate it, but because the prompt never asked for it explicitly. Saying “massive dragon” describes a subject. Saying “dramatic scale contrast” describes a visual relationship. That distinction is where this formula earns its keep. AI image models are pattern matchers; they need relational language to produce relational compositions, not just noun-based descriptions of things that happen to be in the same frame.
If you’re working on any kind of two-subject composition (fantasy, sci-fi, or otherwise), these three phrases are worth keeping in your default toolkit. They’re low-cost additions to any prompt, and the upside when they land is significant.
Try the template and share what you get.
Prompt for generating cinematic dragon vs warrior fantasy scenes
by u/AdSome4897 in PromptEngineering