피파 한 줄 정리: Character consistency는 single technique이 아니라 stack — reference sheet + 독특한 visual 앵커 + face 도구 (--oref / IP-Adapter) + 일관된 prompt 표현.
Mental model: Imagine you're creating a children's book. Your main character — let's call her Mia — appears on every page in different poses, outfits, and settings. A human illustrator achieves consistency by keeping a character sheet pinned to their wall: Mia's face from every angle, her proportions, her signature red boots, her curly brown hair. Every time they draw a new page, they glance at the sheet. AI image generation needs the same anchor, just delivered differently.
Strategy 1: The Reference Sheet
Generate a multi-view character sheet first, then use it as a reference for all subsequent generations. A good character sheet shows:
- Face from front, 3/4 angle, and profile
- Full body with proportions visible
- Signature outfit details
- Key accessories or distinguishing features
- 2–3 key expressions
Leonardo AI benchmarks show 92% consistency when using character sheets across 50+ pose variations. The sheet gives the model a strong visual prior for what the character should look like.
Strategy 2: Costume and Feature Anchors
Even without a formal reference sheet, you can improve consistency by giving the character distinctive, unusual features that are easy for the model to reproduce:
- Hair: Unusual color (teal blue) or distinctive style (asymmetric bob with a red streak)
- Clothing: Signature garment (yellow raincoat, leather jacket with patches)
- Accessories: Memorable props (round glasses, a scar across the eyebrow, a specific necklace)
The more unique the feature, the more consistently the model reproduces it. "Brown hair" gives the model enormous freedom. "Teal hair in a side braid with gold beads" is much more constrained.
Strategy 3: Face Anchoring (Platform-Specific)
Modern platforms offer dedicated face consistency features:
- Midjourney V7 --oref: Upload a face photo, set --ow to control consistency strength. Achieves ~95% consistency at mid-weight settings.
- DALL-E Gen_ID: Generates a persistent character identity within a single conversation. Simpler but limited to ~80% consistency and session-bound.
- IP-Adapter (Stable Diffusion / ComfyUI): Most flexible. Inject face embeddings with fine-grained weight control. Requires more setup but offers the most control.
Strategy 4: Consistent Prompt Language
Describe your character the same way every time. Create a character prompt block that you copy-paste into every generation:
Changing even small details in the prompt ("amber eyes" → "golden eyes" → "hazel eyes") creates inconsistency, because each variation activates a slightly different region of the model's learned space.
- Use a multi-view reference sheet as the character's visual foundation.
- Anchor with distinctive, unusual features — the more unique, the more consistent.
- Use platform-specific face anchoring tools (--oref, Gen_ID, IP-Adapter).
- Maintain identical prompt language across all generations for the same character.
- Stack multiple strategies for best results — no single technique is sufficient alone.