Available models
| Model | Type | Strengths |
|---|---|---|
| Kling v3 | Text/Image-to-video | Best human motion, cinematic, 3-15s |
| Kling v2.6 | Text/Image-to-video | Fast, cost-effective, 3-10s |
| Kling v3 Motion Control | Controlled video | Camera path and subject motion |
| Kling Avatar v2 | Avatar | Talking head from a single photo |
| Kling Lip Sync | Audio-driven | Sync lips to any audio track |
Text-to-video
All examples below reuse the same client.
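The client setup and request code are not shown in this section, so the following is only a minimal sketch of how a text-to-video request could be validated before it is sent. The function name `build_text_to_video_request`, the model identifiers `kling-v3` and `kling-v2.6`, and the payload field names are assumptions for illustration, not a confirmed API schema. Only the duration ranges come from the model table above.

```python
# Duration limits in seconds, taken from the model table above.
# The keys are hypothetical identifiers, not confirmed API model names.
DURATION_LIMITS = {
    "kling-v3": (3, 15),
    "kling-v2.6": (3, 10),
}


def build_text_to_video_request(prompt: str, model: str = "kling-v3", duration: int = 5) -> dict:
    """Return a request payload after checking the duration against the model's range."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if model not in DURATION_LIMITS:
        raise ValueError(f"unknown model: {model!r}")
    low, high = DURATION_LIMITS[model]
    if not low <= duration <= high:
        raise ValueError(f"{model} supports {low}-{high}s clips, got {duration}s")
    # Field names are illustrative; map them to your provider's actual schema.
    return {"model": model, "prompt": prompt.strip(), "duration": duration}
```

Checking the duration locally avoids spending a paid request on a clip length the chosen model does not accept.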
Image-to-video
Motion control
Avatar generation
Create a talking-head video from a single reference photo.
Lip sync
Tips
- v3 vs v2.6. v3 has better quality and human motion. v2.6 is faster and cheaper.
- Duration affects cost. Start with 3-4s clips to test prompts.
- Motion control. Available presets: dolly_zoom_in, dolly_zoom_out, pan_left, pan_right, tilt_up, tilt_down, orbit.
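
A mistyped preset name is easy to catch before a request is made. The sketch below checks a name against the seven motion-control presets listed above. The helper `normalize_preset` and its tolerance for hyphens, spaces, and capitalization are assumptions for illustration and are not part of any Kling API.

```python
# Preset names copied from the motion-control tip above.
MOTION_PRESETS = frozenset({
    "dolly_zoom_in",
    "dolly_zoom_out",
    "pan_left",
    "pan_right",
    "tilt_up",
    "tilt_down",
    "orbit",
})


def normalize_preset(name: str) -> str:
    """Return the canonical preset name, or raise ValueError if it is not listed."""
    # Hypothetical convenience: accept "Pan-Left" or "pan left" as "pan_left".
    key = name.strip().lower().replace("-", "_").replace(" ", "_")
    if key not in MOTION_PRESETS:
        allowed = ", ".join(sorted(MOTION_PRESETS))
        raise ValueError(f"unknown preset {name!r}; expected one of: {allowed}")
    return key
```

For example, `normalize_preset(" Pan-Left ")` returns `"pan_left"`, while `normalize_preset("zoom")` raises a `ValueError` that lists the accepted names.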