Home Seedream 5 Pricing Start creating
Dreamina Seedance 2.0

Seedance 2.0 Prompt Guide

Professional-grade video generation with native audio, deep multimodal referencing, and seamless editing — distilled into 18 annotated prompt examples.

18
Example prompts
5
Guide sections
Native
Audio + video

About this guide

Dreamina Seedance 2.0 is a professional-grade video generation model that natively supports joint audio and video output, with outstanding semantic understanding and multimodal interaction. This guide walks through the core techniques — text formulas, reference control, text rendering, image/video references, and non-destructive editing — with 18 official examples.

All videos and images below are autonomously generated by Seedance/Seedream visual models. Reused with permission from BytePlus ModelArk.
01

General principles

1.1 Basic formula for text instructions

Seedance 2.0 excels at following natural language logic. Flexibly combine these elements to match your creative intent:

  • Subject + action — the logical foundation. Clearly define who is performing what action.
  • Atmosphere — set the overall tone by describing spatial background, lighting details, or a specific visual style.
  • Sound design — advanced instructions can include scene or ambient sound effects for immersive, synchronized audiovisual output.
1.2 Reference control for multimodal inputs

Beyond text descriptions, lock the ideal frame state with reference materials. Seedance 2.0 supports deep referencing of images, audio, and video.

  • In your prompt, clearly specify the reference object — e.g. “Use the composition of Image 1” or “Match the motion of Video 2”.
  • The model extracts core features from the reference and merges them with your text, maintaining high fidelity and predictability while still allowing creative variation.
02

Text rendering

Seedance 2.0 generates readable text across T2V (Text-to-Video), I2V (Image-to-Video), R2V (Reference-to-Video), and V2V (Video-to-Video). It adapts font style and color to your scene automatically, and gives you granular control over style, timing, and layout.

2.1 Slogans

Seedance 2.0 auto-detects the scene context to match the most appropriate font aesthetic. For strict brand consistency, pair with a logo reference (see 3.2).

Prompt template
[Text Content] + [Timing] + [Positioning] + [Entrance/Appearance Style], [Visual Attributes (Color, Font Style)]
2.2 Subtitles
Prompt template
Display subtitles at the bottom-center with the text. The subtitles must be perfectly synchronized with the audio rhythm and pacing.
2.3 Speech bubbles
Prompt template
[Character] says, "[Dialogue]." Speech bubbles appear around the character containing the spoken text.
Best practices
  • Common vocabulary. Standard, widely recognized words render most accurately.
  • Avoid obscure words. Dictionary-deep terms may produce inconsistent glyphs.
  • Minimize special symbols. Complex or non-standard punctuation hurts font fidelity.
2.1 · Example 1
Output
Reference image
Reference image 1 Image 1
Prompt
Hand-drawn comic style: Three people are sitting around a table enjoying the fried chicken shown in Image 1, with a friendly and joyful atmosphere. The frame then gradually blurs, and the text "Bite", "Laugh", "Seedance" in order appears in the center of the screen.
2.2 · Example 2 Voiceover
Output
Reference image
Reference image 1 Image 1
Prompt
I2V: A time-lapse of a mountain landscape transitioning from a vast, starry night to a vibrant dawn. Voiceover: A deep, serene male voice says: 'In the vast silence of the cosmos, our world is but a fleeting moment. Yet, within it, life defiantly thrives.' > Text Integration: Render the narration as subtitles at the bottom-center. Subtitles must be perfectly synchronized with audio timing.
2.2 · Example 3 Dubbing
Output
Reference image
Reference image 1 Image 1
Prompt
R2V: A shot of these two people in Image 1 chatting in a modern office. The woman speaks first with a playful tone: "You always arrive right on time, don't you just love that perfect timing?" followed by the man's smiling reply: "I have my own rhythm." > Text Integration: Render the dialogue as subtitles at the bottom-center of the screen. Subtitles should appear sequentially as each character speaks.
2.3 · Example 4 Playground scene
Output
Reference image
Reference image 1 Image 1
Prompt
The two characters from Image 1, both dressed in sportswear, are running on the school playground. The girl looks at the boy, smiling confidently as she says: "We can definitely do it!". Cut to a close-up of the boy. He hesitates and replies: "Are you sure?". Cut back to a medium close-up of the girl. She speaks in a light, upbeat tone: "Yes!" Her demeanor is bright and resolute. Speech bubbles containing the corresponding lines appear around the speaking character.
2.3 · Example 5 Apple field
Output
Reference image
Reference image 1 Image 1
Prompt
Refer to the character design of the girl in Image 1 and Image 2. The scene is set in an apple field: the girl picks one apple, takes a bite, smiles and says "This is the real deal!". A speech bubble pops up beside the girl, with this line written inside.
03

Image reference

Seedance 2.0 supports multi-perspective references for subjects, and multi-image references for scene layouts and sequences. If your workflow requires a specific order, upload images in sequence and reference them in your prompt as Image 1, Image 2, … Image N.

3.1 Multi-perspective subject reference
Prompt template
Refer to / Extract / Combine / Use the [Subject] from [Image N] to generate [Scene Description], maintaining consistent [Subject] features.
3.2 Multi-image reference
Prompt template
Refer to / Extract / Combine / Follow the [Description of referenced elements] from [Image N] to generate [Scene Description], while maintaining the consistency of [Referenced Elements].
3.1 · Example 1 Consumer electronics
Output
Reference image
Reference image 1 Image 1
Prompt
Use the cameras featured in Image 1, Image 2 and Image 3. Replace the original background with a white one, and place the cameras on a white table. The shooting lens first focuses on the cameras in close-up, then slowly rotates 360° with the cameras as the main subject, clearly displaying the front, sides and back of each camera.
3.1 · Example 2 Home & lifestyle
Output
Reference image
Reference image 1 Image 1
Prompt
In a warm-toned home setting, present the thermos shown in the reference image in a medium shot. Then smoothly push the camera into a close-up of the thermos. Next, a hand naturally enters the frame off-screen, gently grips the thermos body and picks it up. The camera follows the slight rotating motion of the hand to showcase the thermos.
3.1 · Example 3 Characters
Output
Reference image
Reference image 1 Image 1
Prompt
Refer to the image of the woman in Image 1, Image 2 and Image 3, and generate a scene of her eating a cake in a coffee shop.
3.2 · Example 4 Logo reference
Output
Reference image
Reference image 1 Image 1
Prompt
The scene is set on an aerial corridor in a neon-drenched futuristic metropolis, where flying vehicles and holographic ads intertwine. Featuring the girl from Reference Image 2, the sequence opens with a medium shot of her releasing a silver floating lantern embedded with a holographic projection. The camera then pulls back to reveal floating lanterns flooding the sky, which gradually converge at the center of the frame to form the logo from Reference Image 1. The entire piece adopts a 3D cyberpunk sci-fi animation style.
3.2 · Example 5 Multi-subject reference
Output
Reference image
Reference image 1 Image 1
Prompt
Using the cat and dog from the reference Image 1 and Image 2 as prototypes, the scene unfolds in a cozy apartment. The dog is lying on the ground eating dog food when the cat approaches, extending a paw to nudge the dog. The dog pauses its meal upon noticing the cat, and the cat snuggles up next to the dog. The entire scene features a warm colored tone.
3.2 · Example 6 Multi-element reference
Output
Reference image
Reference image 1 Image 1
Prompt
The scene is set in the restaurant from Image 4 with people coming and going. The girl from Image 1, wearing the clothes from Image 2, is organizing the items on the counter. The boy, a customer, from Image 3 approaches her to ask for her contact information. The logo from Image 5 remains in the bottom right corner throughout.
3.2 · Example 7 Multi-panel sequence reference
Output
Reference image
Reference image 1 Image 1
Prompt
Refer to the sequence in Image 1 to create an intense high-energy fight sequence. All frame compositions from Image 1 shall be presented in strict predefined order, after which the two characters engage in fierce, fast-paced combat.
3.2 · Example 8 Sequence reference
Output
Reference image
Reference image 1 Image 1
Prompt
Refer to the composition in Image 3. A girl (her character design refers to Image 1) is waiting for her father to finish cooking, and she says: "아빠, 배고파요! 밥 다 됐어요?" Then the camera pans right and cuts to the frame and composition shown in Image 4. The father (his character design refers to Image 2) replies to her: "거의 다 됐어, 조금만 기다려!" Next, the camera cuts back to a close-up shot of the daughter's slightly disappointed facial expression, and she says: "아직 멀었어요? 맛있는 냄새 나는데..." Then the shot switches to a close-up of the father's face, and he says: "이제 진짜 금방이야. '빨리빨리' 하지 말고 손부터 씻고 와!"
04

Video reference

Seedance 2.0 supports video-based referencing for motion, camera movement, and visual effects. Upload videos in sequence and reference them as Video 1, Video 2, … Video N.

4.1 Motion reference
Prompt template
Refer to the [Motion Description] from [Video N] to generate [Scene Description], keeping the motion details consistent.
4.2 Camera motion reference
Prompt template
Refer to the [Camera Movement Description] from [Video N] to generate [Scene Description], keeping the scene consistent.
4.3 Visual effects (VFX) reference
Prompt template
Refer to the [VFX Effects Description] from [Video N] to generate [Scene Description], keeping the special effects consistent.
4.1 · Example 1 Artistic
Output
References
Reference image 1 Image 1
Video 1
Prompt
Refer to the character movements and shot language in Video 1 to create a fight scene with the character from Image 2 on the left and the character from Image 1 on the right. Include intense background music.
4.1 · Example 2 Marketing
Output
Reference video
Video 1
Prompt
Referencing the running shape of the horse in the video, generate a scene: a golden steed runs on the grassland, then freezes its magnificent running posture and turns into a horse-shaped gold pendant.
4.2 · Example 3
Output
References
Reference image 1 Image 1
Video 1
Prompt
Referring to the camera movement in Video 1, create a concept video for a science and technology park, with the tall building in Image 1 as the visual center, also using a first-person diving perspective, to reflect the sense of technology in the park from Image 1.
4.3 · Example 4 Video production
Output
References
Reference image 1 Image 1
Video 1
Prompt
Refer to the golden particle effects in Video 1, so that when the character in Image 1 plays the flute, the same particle effects surround their body.
4.3 · Example 5 Creative FX
Output
References
Reference image 1 Image 1
Video 1
Prompt
Refer to the special effects shown in Video 1 to generate identical wings for the girl in Image 1, ensuring the wing formation trajectory follows the exact same motion path and sequence depicted in the video.
05

Video editing

Seedance 2.0 supports non-destructive video editing — adding, removing, or modifying elements; extending forward or backward; and completing tracks across multiple clips. Original segments are preserved for perfect continuity.

5.1 Adding, removing, or modifying elements
Prompt template
Adding: At [Timestamp] and [Spatial Location] of [Video N], add [intended element].
Removing: Remove [Element] from [Video N], keeping the rest unchanged.
Modifying: Replace [original element] in [Video N] with [intended element].
5.2 Extending videos
Prompt template
Extend [Video N] forward/backward + [Description of extended content]
Generate content before/after [Video N] + [Description of extended content]
5.3 Completing tracks
Prompt template
[Video 1] + [Transition Description] + followed by [Video 2] + [Transition Description] + followed by [Video 3]
Note: Track completion supports up to 3 video clips with a combined duration of 15 seconds. The model auto-trims connecting segments for seamless synthesis.
5.1 · Example 1 Add elements
Output
Reference video
Video 1
Prompt
Add snacks such as fried chicken and pizza to the countertop in Video 1.
5.1 · Example 2 Remove elements
Output
Reference video
Video 1
Prompt
Remove everything that isn't office stuff from the table in Video 1, keeping the rest of the video content unchanged.
5.1 · Example 3 Modify elements
Output
References
Reference image 1 Image 1
Video 1
Prompt
Replace the perfume featured in Video 1 with the face cream from Image 1, with all original motions and camera work preserved.
5.2 · Example 4 Extend forward
Output
Reference video
Video 1
Prompt
Generate the content after Video 1: the two men who are late run towards them, the five people finally meet and have a friendly chat.
5.2 · Example 5 Extend backward
Output
Reference video
Video 1
Prompt
Extend the opening segment of Video 1: Set up an over-the-shoulder shot of the man in a hoodie, and the man says: "It's not that bad. You're just stressed. Everyone goes through this, you just need to keep going."
5.3 · Example 6
Output
Reference videos
Video 1
Video 2
Prompt
Video 1. The moment a leaf falls to the ground, it sets off a special effect of golden particles. A gust of wind blows by, leading into Video 2.

Ready to create with Seedance 2.0?

Start generating video with joint audio, reference control, and non-destructive editing — right on Doitong.

Start creating