We're excited to announce that Seedance 2.0 is now available on seedance2.media. This is ByteDance's latest multimodal video generation model — and the first to generate video with synchronized audio in a single pass.
You can start using Seedance 2.0 right now from your browser at seedance2.media/ai-video-generator.
What is Seedance 2.0
Seedance 2.0 is a major leap from the previous generation. It's not just a text-to-video model — it's a unified multimodal system that accepts text, images, video clips, and audio as input, giving you director-level control over your video output.
The most significant upgrade: built-in audio generation. Every video can include synchronized sound — dialogue, ambient noise, music, sound effects — all generated by the AI to match the visuals. No separate audio tools, no manual syncing, no post-production required.
Key Features
Synchronized Audio
Seedance 2.0 generates video and audio together. A rainstorm scene includes rain sounds. A café scene includes ambient chatter. A cinematic drone shot includes a sweeping orchestral score. You can toggle audio on or off per generation.
Multimodal Input
Three generation modes are supported:
- Text to Video — Describe your scene in natural language
- Image to Video — Upload reference images (up to 9) to guide the visual style
- Omni-Reference — Combine images, video clips, and audio references for maximum creative control
Up to 15 Seconds
Generate clips of 5, 10, or 15 seconds — a significant increase from the 10–12 second limit of Seedance 1.x.
7 Aspect Ratios
| Aspect Ratio | Use Case |
|---|---|
| Auto | AI selects the best fit |
| 16:9 | YouTube, presentations |
| 9:16 | TikTok, Reels, Shorts |
| 1:1 | Instagram, social ads |
| 4:3 | Product demos |
| 3:4 | Pinterest, portrait |
| 21:9 | Cinematic ultrawide |
Director-Level Control
Seedance 2.0 understands complex cinematic language — camera movements, lighting, depth of field, visual effects, and mid-video style transitions. The model emphasizes physical accuracy: weight, inertia, and natural motion dynamics.
How It Compares
| Feature | Seedance 2.0 | Seedance 1.5 | Runway Gen-3 | Kling 1.6 | Pika 2.1 |
|---|---|---|---|---|---|
| Built-in Audio | Yes | Yes | No | No | No |
| Multimodal Input | Text + Image + Video + Audio | Text + Image | Text + Image | Text + Image | Text + Image |
| Max Duration | 15s | 12s | 10s | 10s | 5s |
| Aspect Ratios | 7 | 6 | 3 | 3 | 3 |
| Resolution | 480p / 720p | Up to 1080p | Up to 720p | Up to 1080p | Up to 1080p |
| Style Transitions | Yes | No | No | No | No |
Pricing
Seedance 2.0 uses a credit-based system:
| Resolution | 5 seconds | 10 seconds | 15 seconds |
|---|---|---|---|
| 480p | 120 credits | 240 credits | 360 credits |
| 720p | 240 credits | 480 credits | 720 credits |
Audio is included at no extra cost. See our pricing page for credit packages and subscription plans.
Try It Now
Seedance 2.0 is live and ready to use. Select Seedance 2.0 from the model dropdown on our video generator to get started.

