Now available at Zyka

Grok Imagine Video v1.5 Is Here.

xAI's #1-ranked image-to-video model. One still image, one prompt — cinematic video with native audio in seconds.

Powered by Zyka.ai·Used by filmmakers & marketers·30+ frontier models
Grok Imagine Video v1.5 preview
Sora 2·
Kling v3·
Seedance 2.0·
Nano Banana·
Veo 3.1·
Flux 2·
Grok Imagine·
WAN 2.6·
LTX 2.3·
GPT Image 2·
Sora 2·
Kling v3·
Seedance 2.0·
Nano Banana·
Veo 3.1·
Flux 2·
Grok Imagine·
WAN 2.6·
LTX 2.3·
GPT Image 2·
THE MODEL

What is Grok Imagine Video v1.5?

Grok Imagine Video v1.5 is xAI's dedicated image-to-video and text-to-video engine, built on Aurora, the autoregressive architecture trained on 110,000 NVIDIA GB200 GPUs.

It is separate from the Grok chatbot: a focused video generation model for animating stills, generating scenes from prompts, and producing native audio in the same pass.

On launch day, it ranked #1 on Image-to-Video Arena, surpassing Seedance 2.0 and Google Veo while reducing character warping and visual inconsistency versus v1.0.

Extend from Frame lets teams build multi-shot sequences from the last frame instead of starting over, and Zyka places it beside Kling v3, Seedance 2.0, Sora 2, and 30+ other models.

Built for creatorsFilmmakersMarketers
Image → Video
Source image becomes the exact first frame
Native Audio
Music, SFX & lip-sync in one generation
Extend from Frame
Chain clips without re-generating
Aurora Architecture
Reduced warping, stronger consistency
DEMOS

Demos That Are Blowing Minds

Hover any clip to preview, click to watch full-screen with sound. Each demo links straight to a prompt you can try on Zyka.

01Dialogue + Audio

UGC Product Spot

A creator delivers a polished beauty pitch at a vanity, with friendly eye contact, soft studio lighting, natural hand gestures, and clean synced speech.

Try on Zyka
02Product Motion

Product Hero Slow Spin

A neon running shoe rotates on moss in bright daylight, mixing close product detail, outdoor texture, and a smooth commercial reveal.

Try on Zyka
03Camera Motion

Fjord With Orchestral Lift

A forward glide over a cold fjord builds into a wide cinematic landscape, with drifting mist, rippling water, a red boat, and swelling music.

Try on Zyka
04Ambient Audio

Rainy Fishing-Town Turn

A weathered fisherman turns toward camera as rain rolls through the harbor, pairing a quiet character beat with crisp ambience and mood.

Try on Zyka
05Image to Video

Sunlit Portrait Motion

A still indoor portrait becomes a quiet cinematic moment as the subject lifts her face into warm window light and the room breathes around her.

Try on Zyka
06Native Audio

Courtside Fitness Beat

A sunny athletic scene holds steady on the subject while subtle body movement, outdoor ambience, and synced sound keep the shot energetic.

Try on Zyka
07Face Motion

Studio Face Turn

A clean studio headshot animates into a controlled face turn, preserving facial detail while adding a subtle blink, expression shift, and polished lighting.

Try on Zyka
08Fast Preview

API Playground Output

An open-top driving shot turns into a fast lifestyle preview, with wind, road motion, cockpit detail, and upbeat action pacing.

Try on Zyka
WHY IT MATTERS

Why Grok Imagine v1.5 Feels Different

The Aurora architecture closes the gap between a still and a finished film.

Ranked #1 on Launch Day

Jumped 52 Elo points over v1.0 on the Image-to-Video Arena leaderboard — past Seedance 2.0 and Google Veo in one release.

Native Audio, Zero Extra Steps

Background music, sound effects, and lip-synced dialogue generate in the same pass as the video. No audio pipeline to stitch afterward.

Aurora Consistency Engine

Autoregressive architecture trained on 110,000 NVIDIA GB200 GPUs. Faces, hands and multi-subject scenes hold together across the full clip.

Built for Production Chains

Extend from Frame stacks clips without regenerating from scratch. Shoot a 90-second sequence from a single starting image.

Every Grok Imagine Video v1.5 clip is generated via xAI's Aurora engine. Available on Zyka alongside 30+ frontier models — one subscription, every model.
USE CASES

How Teams Use Grok Imagine v1.5

Practical workflows for creators, marketers, and production teams building short-form video from prompts and visual inputs.

Talking-character shorts

A portrait plus a line of dialogue returns a lip-synced clip with voice, ready for social and explainer content.

Product motion ads

Animate a product photo into a short spot with a clear camera move and built-in sound design, no audio editing afterward.

Music-driven clips

Generate footage with a matching score and effects in a single pass, so the cut and the sound arrive together.

Story sequences

Extend a clip from its last frame to build a longer beat from one starting image, then keep chaining shots.

Style and character guidance

Use visual inputs to hold a look or a character steady across several generations from the same brief.

Storyboard to motion

Turn a single keyframe into moving footage to preview a shot before committing to a full production.

STEAL THESE

Steal These Prompts

Trending Grok Imagine v1.5-style prompts. Tap any to start on Zyka.

Product

Animate this perfume bottle: slow 180° rotation, liquid catching light inside, a few drops falling in slow motion from the dropper. Clean white background, soft studio music, 720p 10 seconds.

Try on Zyka
Portrait

Bring this illustration to life: the subject turns their head slowly toward camera, eyes open and blink, a gentle breeze moves their hair. Keep every detail of the original art style. Soft ambient sound, 8 seconds.

Try on Zyka
Cinematic

A spaceship hangar at blue hour: ground crew move in slow motion, welding sparks drift upward, a massive cargo door begins to lower in the background. Orchestral score, 720p 15 seconds, wide anamorphic lens.

Try on Zyka
Nature

This macro photo of a dewdrop on a spider web comes to life: the drop trembles as morning wind moves the web, tiny rainbow refractions shift inside. Natural ambient birdsong and breeze, 720p 8 seconds.

Try on Zyka
Architecture

A brutalist apartment block at dawn: windows light up one by one from the ground floor upward, pigeons take flight from the roof, morning city sounds grow from silence. Slow upward tilt, 720p 12 seconds.

Try on Zyka

Ready to create with the best AI video models?

Join thousands of creators generating videos, images and voices with the world's best AI models — all in one place.

Kling v3 Motion ControlSeedance 2.0Nano BananaSora 2Veo 3.1