How was the video made in Grok Imagine?
To create this video, I used Grok Imagine, an AI-powered visual generation tool that transforms detailed descriptions into realistic visual scenes.
It all started with carefully crafting the script. Beyond simply writing a description, the idea was to imagine the scene as if planning a film shot. First, I defined the setting: a typical supermarket aisle, well-lit, with shelves full of everyday products and no recognizable brands to avoid visual distractions.
Nano Banana Pro Prompt

Interior aisle of a generic convenience store, similar to a modern supermarket, with no visible logos or branding. Shelves are fully stocked with boxes of cereal, cookies, and various other products, all neatly arranged. Several ordinary people are shopping, some browsing products and others pushing shopping carts. The shoppers act indifferently toward each other, without interaction or eye contact.
In the center of the aisle is Ryu (Street Fighter), integrated into the scene as an out-of-context element, but with an appearance identical to the video game character: white karate gi with rolled-up sleeves, black belt, and red gloves. His style must remain true to the original video game design, while the entire environment and the other characters maintain extreme photorealism.
Realistic supermarket lighting, uniform white light, soft shadows, natural depth of field, detailed textures, 4K photorealistic quality, high level of detail, cinematic look.
4:5 aspect ratio.
The key to the experiment was introducing an element completely out of context: Ryu from Street Fighter. The instruction was clear: he had to look exactly like he does in the video game, with his white karate gi, rolled-up sleeves, and red gloves, but placed in the middle of a completely realistic environment. This contrast between reality and fiction is what gives the scene its power.
Finally, the visual details: typical supermarket lighting, soft shadows, natural depth of field, and a photorealistic 4K finish with a 4:5 aspect ratio to ensure it worked well in portrait formats.
Once the prompt was entered into Grok Imagine, it was just a matter of adjusting a few parameters and letting the tool do its work. The result was a video with a very cinematic aesthetic, where an iconic video game character appears integrated into a completely everyday situation, as if he had always been there.
Grok Imagine Prompt (Image to Video)
SCENE / SETTING
Interior aisle of a generic supermarket. Tall, neat shelves, fully stocked with cereal boxes and various products. Several people are shopping in a calm, everyday manner, pushing shopping carts. The lighting is uniform, typical of a supermarket, with soft shadows.
The entire setting and the people must be depicted in high-quality photorealism.MAIN CHARACTER
In the center of the aisle is Ryu (Street Fighter).
Ryu must strictly maintain his original video game appearance (illustrated style, not realistic human):White karate uniform
Black belt
Barefoot
Red gloves
Athletic body, very defined musculature
Red headband tied around his forehead
Mandatory instruction: Ryu must remain as an illustrated video game character. Everything else (people, setting, products) must be completely photorealistic.ACTION
Ryu begins the scene by performing a backflip. Landing on his feet, he adopts a firm stance and begins unleashing rapid, powerful blows toward the cereal boxes on the left side of the aisle. The movements are exaggerated and stylized, true to the classic animations of the Street Fighter video game.As Ryu attacks the products, the people around him react with surprise: they turn their heads to look at him and begin to hurriedly leave the aisle, pushing their carts away from the scene.
VISUAL STYLE
Clear contrast between the illustrated character and the hyperrealistic environment. Fluid movement, cinematic feel, high level of detail, realistic lighting, visual coherence between action and environment.