Consistent Characters

Your videos and images can now use the same character for brand consistency

Jun 10, 2025

If you happened to find your way here from my YouTube video on creating consistent characters in Gemini Veo3... Welcome. Glad you stopped by.

Use this prompt to have ChatGPT fully describe the avatar image you have created and uploaded to the chat.

Once you run this prompt, you’ll get a detailed analysis of your avatar in bullet list format. Copy the entire ChatGPT result and paste it into a new prompt. Then use the “Convert ChatGPT’s result to JSON” prompt (below the Character Description prompt, at the bottom of the page).

---Here’s the Character Description prompt---

CONTEXT: I am looking to create videos with text-to-video solutions like Google’s Gemini VEO3 and Runway ML. Part of the challenge of this process is the ability to create consistent characters. I will load a photo into ChatGPT and have ChatGPT fully describe the person in the image based on the list of characteristics below. This way, VEO3 can recreate the same person for each new scene. This will require extensive detail on all aspects of the subject.

ROLE: You are a highly renowned forensic facial examiner and medical aesthetician with more than 20 years of experience analyzing and describing facial features.

Here is the list of characteristics for you to reference when completely describing the person in the photo:

1. Identity & Demographic Cues

Apparent age range

Gender presentation

Ancestry / ethnic cues (skin undertone, facial bone structure, hair pattern)

Approximate height and build (short, tall, slim, broad, etc.)

Typical posture (upright, relaxed, slouched)

2. Facial Structure

Head shape (oval, round, square, heart, diamond)

Forehead height and slope

Cheekbone prominence and angle

Jawline contour (sharp, soft, rounded)

Chin shape (cleft, pointed, broad)

3. Hair

Color (base tone, highlights, lowlights)

Length and cut (buzzed, bob, shoulder-length, waist-length)

Texture (straight, wavy, curly, coily)

Hairline pattern and parting

Styling details (braids, ponytail, loose, gelled)

4. Eyes & Brows

Eye shape (almond, hooded, deep-set, monolid)

Iris color and pattern (hazel with golden flecks, dark brown, ice blue, etc.)

Sclera tint (bright white, slightly veined)

Eyebrow shape, thickness, grooming (arched, bushy, thin, natural)

Eyelash length and fullness

5. Nose, Mouth & Ears

Nose bridge height, width, tip shape, nostril flare

Lip fullness, cupid’s bow definition, natural color tint

Teeth visibility (gap, straight, slight overlap)

Ear size, lobe attachment, piercings

6. Skin

Base tone (light olive, deep mahogany, medium tan)

Surface texture (smooth, fine lines, visible pores)

Undertone (cool, neutral, warm)

Freckles, moles, scars, birthmarks

Makeup or bare skin (foundation finish, blush placement, highlights)

7. Facial & Body Hair

Beard or mustache style, length, density, edge lines

Sideburn length and fade

Arm, leg, or chest hair visibility (for uncovered areas)

8. Body Proportions

Shoulder width versus hip width

Torso length

Limb length (long-legged, short-armed, proportional)

Muscle definition (athletic, average, soft)

9. Clothing

Garment type and fit (tailored suit, loose hoodie, cropped denim jacket)

Color palette and patterns

Fabric texture (wool tweed, cotton jersey, silk satin)

Layering and closures (single-breasted, zip-front, button-down)

Condition and style cues (pressed, distressed, vintage, modern)

10. Accessories

Eyewear (frame shape, material, lens tint)

Jewelry (ear studs, hoop earrings, layered necklaces, wristwatch)

Headwear (cap, beanie, fedora)

Bags, belts, gloves, scarves

Tech items (earbuds, smartwatch)

11. Distinguishing Marks & Modifiers

Tattoos (placement, style, color, size)

Piercings beyond ears (nose ring, eyebrow bar)

Medical devices (hearing aid, walking cane)

Temporary elements (bandages, stage makeup)

12. Posture, Gesture & Expression

Weight distribution (leaning on left hip, balanced)

Hand position (arms crossed, hands in pockets)

Facial expression baseline (neutral, friendly smile, intense glare)

Micro-expressions (raised brow, slight smirk)

13. Lighting & Color Grading

Key light direction and quality (soft window light from camera left, hard top light)

Fill and rim presence

Color temperature (warm 3200 K, cool 5500 K)

Shadow depth and softness

Overall grade (high-contrast noir, pastel low-contrast, cinematic teal-orange)

14. Camera & Framing

Lens focal length (35 mm environmental, 85 mm portrait, 16 mm wide)

Aperture / depth of field (shallow bokeh, deep focus)

Angle (eye level, low-angle hero, high-angle)

Crop (full body, three-quarter, tight headshot)

Motion blur expectations for moving shots

15. Environment & Context (if visible)

Setting type (urban street, studio seamless, forest clearing)

Background detail level (clean backdrop, textured brick wall)

Props in hand or nearby (coffee cup, guitar, clipboard)

Color interplay between subject and setting

16. Movement & Action Cues (for animation continuity)

Typical gait or walk cycle (brisk, casual, stooped)

Signature gestures (adjusting glasses, brushing hair back)

Interaction style with objects or other characters

17. Stylistic Tags & References

Era or subculture influence (’90s grunge, cyberpunk, 1950s classic)

Cinematic or photographic references (film grain Kodak Portra 400, glossy Vogue editorial)

Descriptive keywords used consistently across prompts (e.g., “Jordan K., shaved-side undercut, emerald eyes, freckled nose”)

---Convert ChatGPT’s result to JSON with this prompt---

Please write this description in JSON format. Don’t elaborate or create any type of story. Simply use the categories provided to define the person’s face in JSON inclusive of the descriptive details.
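If it helps to picture the end result, here is a minimal Python sketch of what the JSON might look like and how you could reuse it for every scene. Treat all of it as an assumption: the key names, the `build_scene_prompt` helper, and the "CHARACTER / SCENE" layout are made up for illustration, and the values are placeholders borrowed from the examples in the checklist above, not output from a real image. ChatGPT may name and nest the fields differently.

```python
import json

# Hypothetical, abbreviated character sheet. The keys mirror a few of the
# checklist categories; the values are placeholders, not a real analysis.
character = {
    "hair": {
        "length_and_cut": "shoulder-length",
        "texture": "wavy",
    },
    "eyes_and_brows": {
        "eye_shape": "almond",
        "iris_color_and_pattern": "hazel with golden flecks",
    },
    "skin": {
        "base_tone": "medium tan",
        "undertone": "warm",
    },
}


def build_scene_prompt(character: dict, scene: str) -> str:
    """Prefix a scene description with the same character JSON every time."""
    character_json = json.dumps(character, indent=2, ensure_ascii=False)
    return f"CHARACTER:\n{character_json}\n\nSCENE: {scene}"


# Two example scenes that share one character description.
scenes = [
    "Walking down an urban street, eye level, three-quarter crop.",
    "Tight headshot in front of a textured brick wall.",
]
prompts = [build_scene_prompt(character, s) for s in scenes]

print(prompts[0])
```

The point of the sketch is that the character block stays byte-for-byte identical from scene to scene and only the scene line changes, which is the idea behind describing the avatar once and reusing that description.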
