Back to blog
Video9 min read

Kling AI Prompt Format: 6-Part Framework + Examples (2026)

Kling AI prompt format explained. 6-part framework: subject, action, context, style, camera, motion. With 20 tested prompts and motion brush tips.

NH
Nafiul Hasan
Founder, Prompt Architects

title: "Kling AI Prompt Format: 6-Part Framework + Examples (2026)" slug: "23-kling-ai-prompt-format" description: "Kling AI prompt format explained. 6-part framework: subject, action, context, style, camera, motion. With 20 tested prompts and motion brush tips." publishedAt: "2026-07-26" updatedAt: "2026-07-26" postNum: 23 pillar: 3 targetKeyword: "kling ai prompt format" keywords:

  • "kling ai prompt format"
  • "kling ai prompts"
  • "kling video prompt"
  • "ai video prompt" ogImage: "https://prompt-architects.com/og/23-kling-ai-prompt-format.png" author: name: "Nafiul Hasan" role: "Founder, Prompt Architects" url: "https://prompt-architects.com/about" ctaFeature: "video" related: [21, 22, 24] faq:
  • q: "What's the best prompt format for Kling AI?" a: "The 6-part framework: subject + action + context + style + camera + motion. Order matters — front-load subject and action; camera and motion modifiers go in the back half. Kling responds especially well to explicit motion descriptions because its training emphasized motion fidelity."
  • q: "How is Kling different from Veo 3 prompt-wise?" a: "Kling weights motion language more heavily. 'Slow gimbal arc orbit' or 'fast whip pan' produce tighter results in Kling than equivalent phrasing in Veo 3. Kling also handles motion brush (paint motion paths onto reference images) which Veo 3 doesn't."
  • q: "How long should a Kling prompt be?" a: "100-250 words for most shots. Below 60, output gets generic. Above 350, motion intent dilutes. For multi-shot sequences, use shorter focused prompts per shot rather than one long prompt."
  • q: "Does Kling generate audio?" a: "Currently no native audio (as of April 2026). Kling outputs silent video; you score and dub separately. Veo 3 is the leader on synchronized audio. For projects where audio matters and Kling's stylized motion isn't critical, Veo 3 wins."
  • q: "Should I use text-to-video or image-to-video in Kling?" a: "Image-to-video (I2V) when you have a reference image you love. Kling's I2V is best-in-class — preserves source identity, lighting, composition while adding motion. Text-to-video for original concepts. Many pros generate the still in Midjourney first, then animate via Kling I2V."

TL;DR: Kling AI uses a 6-part prompt framework (subject + action + context + style + camera + motion). Motion language weighted heavily. Best-in-class image-to-video. 20 tested prompts below.

The 6-part Kling structure

PartWhat it doesExample
1. SubjectWho or what is in frame"30yo woman with curly red hair, charcoal wool coat, leather portfolio"
2. ActionWhat they're doing"walking briskly across cobblestone street, glancing back over shoulder"
3. ContextWhere, when, atmosphere (3-5 elements max)"Paris, autumn dusk, light rain, Notre Dame in background, soft fog"
4. StyleVisual aesthetic anchor"cinematic film look, 35mm film grain, melancholic palette"
5. CameraFraming + lens + movement"medium close-up tracking shot, 35mm lens, slight handheld feel"
6. MotionExplicit motion intent (Kling's strength)"smooth gimbal motion at walking pace, subtle horizontal camera drift"

Why motion gets its own block

Kling trained heavily on motion fidelity. While Veo 3 and Sora respect motion cues embedded in scene description, Kling does measurably better when motion has its own explicit block.

Without explicit motion:
"A woman walks across a cobblestone street."

With explicit motion (Kling-optimal):
"A woman walks across a cobblestone street.
Motion: smooth gimbal tracking from her right side at walking
pace. Subtle horizontal camera drift. Hair moves naturally with
walking rhythm."

The second version produces tighter motion behavior in Kling specifically.

A complete example

Subject: A 30-year-old woman with curly red hair, light freckles,
wearing a long charcoal wool coat, holding a leather portfolio.

Action: Walking briskly across a wet cobblestone street, glancing
back over her shoulder once mid-walk.

Context: Paris at dusk in late autumn, light rain falling,
Notre Dame visible in soft focus background, lamp posts lit,
atmospheric haze.

Style: Cinematic film look, 35mm film grain, golden hour mixed
with cool streetlamp blue, melancholic palette.

Camera: Medium close-up tracking shot from her right side, 35mm
lens, slight handheld feel for intimacy.

Motion: Smooth gimbal arc following at walking pace. Subject
holds frame center-right. Subtle vertical camera bob mimicking
walking rhythm. Hair and coat move naturally with motion.

Result: 8-10 second clip with reliable subject motion, camera flow, and atmospheric coherence.

20 tested Kling prompts

Cinematic narrative (5)

1. Solo character moment

Subject: [character + 3 distinguishing features]
Action: [single beat — looking, reaching, pausing]
Context: [location + time + atmospheric layer]
Style: cinematic film look, 35mm grain, [palette]
Camera: medium close-up, locked-off static, 35mm lens
Motion: minimal camera; subject performs one slow deliberate action

2. Two-character dialogue

Subject: two people, [descriptions]
Action: in conversation, slight smile from one, considered nod from other
Context: [setting + ambient details]
Style: cinematic, [palette]
Camera: medium two-shot, slight depth of field
Motion: subtle facial micro-expressions, minimal camera drift

3. Tracking shot through environment

Subject: [character], walking purposefully
Action: traverses [location] at steady pace
Context: [atmospheric details]
Style: cinematic, [palette]
Camera: medium tracking shot from behind/side, 35mm lens
Motion: smooth gimbal follow at walking pace, gentle camera drift

4. Slow push-in on object

Subject: [object — letter, photograph, key item]
Action: stationary, dust motes drifting in light
Context: [setting — desk, mantel, table]
Style: warm cinematic, shallow depth of field
Camera: dolly push from medium to close-up, 50mm lens
Motion: slow steady forward push; environmental dust drift

5. Wide establishing reveal

Subject: [character or anchor element]
Action: stationary; environmental elements move (clouds, water, leaves)
Context: [vast scene]
Style: cinematic wide, atmospheric haze, golden hour
Camera: wide shot, 24mm lens, slow gimbal arc
Motion: slow sweeping camera reveals subject in environment

Product / commercial (5)

6. Hero product turntable

Subject: [product with material/finish details]
Action: rotating slowly on dark walnut surface
Context: studio backdrop, deep shadow
Style: luxury commercial photography, side-lit
Camera: medium close-up, 50mm lens, locked-off static
Motion: smooth 360° turntable rotation; reflections shift

7. Liquid pour

Subject: [liquid + container]
Action: pouring into vessel
Context: dark backdrop, single rim light
Style: high-contrast commercial
Camera: medium shot side-on, 50mm
Motion: slow-motion pour, splash dynamics, droplet fall

8. Lifestyle product placement

Subject: [product] in domestic context
Action: stationary; environmental life around it (steam, motion in background)
Context: [home setting + warm light]
Style: lifestyle commercial, warm hygge
Camera: medium shot, slight angle
Motion: steam rises, light shifts, no camera movement

9. Hand-reach product

Subject: [product] on surface, hand entering frame
Action: hand reaches and lifts product
Context: [surface + lighting]
Style: clean commercial
Camera: top-down or 3/4 angle, locked
Motion: hand enters from edge, lifts product smoothly out of frame

10. Reveal from dust

Subject: [product] on pedestal
Action: dust cloud parts revealing product
Context: dark backdrop, single key light
Style: dramatic commercial
Camera: medium static
Motion: dust dissipates revealing product; product remains static

Action / kinetic (5)

11. Skater trick

Subject: [skater] mid-trick
Action: kickflip / ollie / grind
Context: urban skatepark or street
Style: high-contrast action photography
Camera: low angle, 24mm wide, dynamic
Motion: 60fps slow-motion, board flips, body rotates

12. Runner at sunrise

Subject: [runner in technical wear]
Action: running on track
Context: track at dawn, golden first light
Style: athletic commercial
Camera: medium tracking from side, 35mm
Motion: gimbal moves at runner's pace, slight motion blur background

13. Cooking sequence

Subject: hands [chopping / sizzling / plating]
Action: continuous cooking motion
Context: warm kitchen, overhead practical light
Style: food editorial
Camera: top-down or 3/4 close-up, 50mm
Motion: rhythmic chopping, steam rises, ingredients move

14. Crowd movement

Subject: market crowd, no central subject
Action: people moving in different directions through space
Context: marketplace, dappled light
Style: documentary observational
Camera: top-down or high-angle wide, 24mm
Motion: time-lapse-like flow of people; camera locked

15. Vehicle drive-by

Subject: [vehicle] passing
Action: drives across frame
Context: [environment]
Style: cinematic, [time of day + palette]
Camera: locked-off side-on, 50mm
Motion: vehicle enters left, exits right at speed; motion blur

Mood / abstract (5)

16. Slow-motion fabric

Subject: silk fabric in wind
Action: undulating motion
Context: dark backdrop or cloud sky
Style: abstract slow-motion
Camera: medium close-up, 85mm
Motion: 120fps slow-motion fabric undulation; gentle gimbal drift

17. Particles in light

Subject: dust motes / particles
Action: drifting through shaft of light
Context: dim atmospheric room
Style: ethereal abstract
Camera: medium close-up, 50mm
Motion: particles drift slowly; camera locked

18. Liquid macro

Subject: surface tension of [liquid]
Action: drop falls, ripples spread
Context: black backdrop, side light
Style: macro art photography
Camera: extreme close-up, macro lens
Motion: 240fps ultra slow-motion ripple expansion

19. Time-lapse clouds

Subject: cloud formation
Action: clouds shifting overhead
Context: open sky, golden hour
Style: time-lapse landscape
Camera: locked-off wide, 24mm
Motion: 4× speed cloud movement; no camera motion

20. Geometric morph

Subject: geometric shapes
Action: morphing between forms
Context: neutral abstract space
Style: minimal motion design
Camera: locked-off centered, 50mm
Motion: smooth shape interpolation; camera static

Motion brush (Kling's signature feature)

Kling lets you paint motion paths onto a reference image. Instead of describing motion in text, you literally draw the motion intent.

Workflow

  1. Upload reference image (or generate in Midjourney first)
  2. Switch to motion brush mode
  3. Paint regions where motion should occur (hair, water, fabric, vehicles)
  4. Set intensity per region (0-100%)
  5. Add direction vectors where applicable
  6. Generate

Use cases where motion brush wins:

  • Cinemagraphs (mostly still image, one element animated)
  • Selective motion (water flows but everything else stays static)
  • Direction control (smoke rises specifically up-and-left)
  • Brand-asset animation (existing logo or product photo gets subtle motion)

Image-to-video (I2V) workflow

Kling's I2V is widely considered best-in-class as of April 2026.

Standard I2V flow

1. Generate or select source image
2. Upload to Kling I2V mode
3. Describe motion intent (or use motion brush)
4. Set duration (5s or 10s)
5. Set aspect ratio (preserved from source by default)
6. Generate

Result: video that respects source image's identity, lighting, composition while adding believable motion.

When I2V > T2V

  • Brand-consistent imagery (you already have approved stills)
  • Concept exploration (you generated a still you love; want it animated)
  • Cost / time control (Midjourney still + Kling I2V is faster than text-to-video iteration)

Camera modifier reference (Kling-tested)

Kling responds well to camera language similar to Veo 3, with slight differences in motion emphasis.

CategoryModifiers
Framingwide shot, medium shot, medium close-up, close-up, extreme close-up, two-shot
Movementstatic, gimbal smooth, dolly in/out, tracking shot, handheld, whip pan, crane up/down, orbit
Angleeye-level, low angle, high angle, top-down, Dutch tilt
Lens24mm wide, 35mm standard, 50mm portrait, 85mm telephoto, macro
Speed24fps cinematic, 60fps slow-mo, 120fps ultra slow-mo, time-lapse

Common mistakes

  1. Vague motion. "She moves" produces unpredictable movement. Specify: "she walks slowly, gentle hair movement, subtle smile change."
  2. Conflicting motion + camera. "Static camera + tracking shot" confuses output. Pick one.
  3. Single-prompt long sequences. Kling caps at 10s natively. For longer sequences, generate per shot and edit together.
  4. Skipping style block. "Cinematic" is generic. Be specific: "35mm film grain, golden hour palette, anamorphic lens flare."
  5. Forgetting aspect ratio. Default 16:9. Specify 9:16 for Stories/Reels, 1:1 for square.

Power moves

  1. Use Midjourney → Kling pipeline. Generate striking still in MJ, animate via Kling I2V.
  2. Motion brush for cinemagraphs. Mostly still image, one element alive — high engagement on social.
  3. Save 6-part templates with {{placeholders}} for repeated structures.
  4. Combine T2V + I2V: generate environment via T2V, animate hero via I2V, composite in post.

Tools that ship Kling 6-part templates + motion brush integration (Prompt Architects) save the structure-typing friction. Same skill underneath.

Frequently asked questions

What's the best prompt format for Kling AI?
The 6-part framework: subject + action + context + style + camera + motion. Order matters — front-load subject and action; camera and motion modifiers go in the back half. Kling responds especially well to explicit motion descriptions because its training emphasized motion fidelity.
How is Kling different from Veo 3 prompt-wise?
Kling weights motion language more heavily. 'Slow gimbal arc orbit' or 'fast whip pan' produce tighter results in Kling than equivalent phrasing in Veo 3. Kling also handles motion brush (paint motion paths onto reference images) which Veo 3 doesn't.
How long should a Kling prompt be?
100-250 words for most shots. Below 60, output gets generic. Above 350, motion intent dilutes. For multi-shot sequences, use shorter focused prompts per shot rather than one long prompt.
Does Kling generate audio?
Currently no native audio (as of April 2026). Kling outputs silent video; you score and dub separately. Veo 3 is the leader on synchronized audio. For projects where audio matters and Kling's stylized motion isn't critical, Veo 3 wins.
Should I use text-to-video or image-to-video in Kling?
Image-to-video (I2V) when you have a reference image you love. Kling's I2V is best-in-class — preserves source identity, lighting, composition while adding motion. Text-to-video for original concepts. Many pros generate the still in Midjourney first, then animate via Kling I2V.
Free Chrome Extension

Stop rewriting prompts. Start shipping.

Works with ChatGPT, Claude, Gemini, Grok, Midjourney, Ideogram, Veo3 & Kling. 5.0★ on the Chrome Web Store.

Add to Chrome — Free