SAN FRANCISCO — xAI pushed its visual AI capabilities into new territory on June 3, announcing Grok Imagine 1.5 Preview — an update that adds image-to-video generation to the platform and delivers higher-quality still images for developers and enterprises building on the Grok API.
The launch comes as the company's combined compute and model portfolio continues to expand at a rapid pace, adding another dimension to the multimodal assistant Elon Musk has positioned as the backbone of xAI's consumer and developer ecosystem.
Image-to-Video at 720p
The headline feature of Grok Imagine 1.5 is grok-imagine-video-1.5-preview, an image-to-video model that converts still images into cinematic short clips at resolutions up to 720p, driven by text prompts. Developers pass an existing image alongside a descriptive prompt and receive a fluid video output that brings the scene to life.
The model is available immediately through the xAI API at x.ai/api/imagine. xAI positioned the capability as an entry point for product demos, ad creation, and social content — use cases that previously required expensive dedicated video generation services.
Shortly after the launch, Elon Musk demonstrated the capability publicly by posting an AI-generated trailer for the Iliad, the ancient Greek epic, created entirely with Grok Imagine 1.5. The clip drew widespread attention and underscored the creative range the model supports.
Quality Mode Raises the Bar for Still Images
Alongside the video model, xAI introduced a Quality Mode for the Grok Imagine image generation and editing API. The mode runs four high-quality images per generation request rather than the standard single output, delivering improvements across detail, lighting, shadow rendering, and text accuracy within images.


