- Home
- Text to Video
- Kling 3.0
Kling 3.0 β Multi-Shot Video With Native Multilingual Audio
Kling 3.0 is Kuaishou's text-to-video and image-to-video model. A single Kling 3.0 render outputs a 15-second clip with up to 6 storyboarded shots, native multilingual audio, and full multimodal input (text, image, audio, video). Released 2026-02-04.


