Kling v3

Kling AI's flagship 3.0 video model. Cinematic text-to-video and image-to-video with native audio (sound effects, ambient, lip-synced dialogue), multi-shot storyboarding (up to 6 shots), element/character referencing, and end-frame control. Supports 3-15s clips. Pro tier targets 1080p text-to-video; Standard tier handles image-to-video.

Context
via Fal

Benchmarks