Snap Inc.1 University of Trento2 UC Merced3 Fondazione Bruno Kessler4
Work performed while interning at Snap Inc.*
Snap Video can assist designers in the generation of long stories. We make use of an LLM to generate a story plot, video prompts for different scenes, and scripts for audio narrations. We generate all video assets using our model while tuning the video prompts to obtain the desired visuals, and synthesize the audio narration.
Postproduction software is used to assemble the final video. The generated video assets are trimmed and composed into a sequence to form the video track, to which text overlays are added. Background music is inserted and the synthesized audio narration is aligned to the video content to generate the final result.