bytedance/seedance-2.0/reference-to-video
ByteDance's most advanced reference-to-video model. Generate video from up to 9 images, 3 videos, and 3 audio clips with native audio and cinematic camera control.
Inference
Commercial use
Partner
Prompt examples
Examples are generated using the Seedance 2 Reference to Video. You can customize them by clicking on the "Playground" button.
Style: Hybrid visual style — photorealistic, documentary-level environment combined with stylized 3D animated characters. The subject and the fan are fully 3D animated characters seamlessly composited into a live-action realistic world. Single continuous unbroken shot from a handheld camera within a dense crowd. Natural micro-shake, eye-level perspective.
Character Style: The subject from @Image1 is rendered as a polished 3D animated character with stylized proportions, soft subsurface skin shading, expressive features, and clean rim lighting — while maintaining a perfectly consistent face and the exact outfit from the reference image. The fan they interact with is also a 3D animated character in the same rendering style. Both characters retain cinematic CG quality with realistic interaction with the surrounding light (flash bounces, streetlight highlights, shadow casting on real ground).
Lighting & Environment: Fully photorealistic. Nighttime at an upscale event in New York City. Illuminated by real streetlights and camera flashes. Mixed reflections on polished surfaces (phones, cars), soft realistic shadows, and a slight atmospheric haze for depth. The crowd, barricades, hotel facade, SUVs, and street are all live-action realistic — only the two main characters are stylized 3D.
Subject: The 3D animated subject maintains a calm, controlled presence with a subtle, confident smile, perfectly matching the face and outfit from @Image1.
Action Sequence: The shot begins completely immersed in a restless, chaotic realistic crowd behind barricades. The view is partially obscured by real people raising smartphones to record. As the camera lifts slightly above shoulder level, the 3D animated subject exits a luxury hotel in the background. Bright media flashes erupt, illuminating the CG character against the realistic environment. Real security personnel step into frame, pushing the crowd back, causing the camera to shake naturally. Through shifting gaps in the crowd, the animated subject walks forward clearly into center frame. The subject pauses to interact with a 3D animated fan, leaning in briefly for a selfie while giving a calm, controlled wave. The camera pans to follow as a luxury convoy of three premium black SUVs (photorealistic) pulls up. A real security guard opens the back door of the middle SUV. The animated subject steps inside, rolls down the window to wave one last time, and the vehicles begin to pull away as the realistic crowd jumps to capture the moment.
Audio: Loud, chaotic crowd cheering and whistling. Overlapping voices shouting the subject's name. A barrage of rapid camera shutter clicks. Distant New York City sirens and traffic. The rustling of heavy fabric and footsteps. The deep, heavy bass of an SUV engine idling and pulling away.
aspect_ratioauto