Koox Image To Video AI converts static images into short videos with realistic motion and cinematic-style effects. Upload a photo, add a text prompt, and wait for the render—the system generates the animation and scene dynamics for you.
Control with first and last frames
You can upload a starting frame and an ending frame to better control composition and the final shot. The model then generates smooth motion between them.
- Input formats: JPG, PNG, WebP
- Max file size: up to 10 MB
Audio and aspect ratio
Koox Image To Video AI can add an audio track so you get a video with sound right away. Videos are generated in a 16:9 aspect ratio, which fits common uses like YouTube, presentations, and web publishing.
- Optional audio track
- Built-in audio enhancement
- 16:9 output
Generation speed and related tools
Estimated generation time is about 1.5 minutes per clip. The site also offers other models—from Face Swap to Text to Image—so you can handle multiple visual tasks in one place.
- ~1.5 minutes per video (estimated)
- Additional models available on the same site

