
VideoPoet by Google is a generative video model that creates and edits videos from text, image, audio, or video prompts using a unified autoregressive framework.
VideoPoet by Google is a large language model for video that generates and transforms video content directly from text, images, or existing footage. Built as a unified multimodal model, it supports text-to-video, image-to-video, video stylization, and video editing within a single framework. Users can create short video clips from written prompts, extend or modify existing videos, or animate static images by describing desired motion, style, or atmosphere.
The model operates autoregressively on visual and audio tokens, enabling synchronized generation of both video frames and sound. This allows VideoPoet to produce coherent, temporally consistent motion and basic audio that align with the input description. It can perform tasks such as adding motion to still images, changing the visual style of a clip, performing subject-driven edits, or generating looping or seamlessly transitioning video segments.
Please sign in to comment
💬 No comments yet
Be the first to share your thoughts!
Explore 466+ top alternatives to VideoPoet by Google

Magiclight.AI automatically generates complete videos up to 50 minutes long from user-provided ideas, scripts, or stories, enabling efficient creation of finished video content.
Typeframes is a web-based tool that generates short promotional videos from text prompts, combining stock footage, AI voices, music, and animated text for social media.
Creatomate is an API that generates and automates videos from templates, supporting both no-code workflows and code integrations via Zapier, Make, Node.js, PHP, Ruby, and Python.
Hidream AI is a Chinese AIGC platform that enables text-to-image, image-to-image, text-to-video, image-to-video creation, intelligent image editing, layout, and community-based design sharing.