Grok Imagine Text To Video
model_id: grok-imagine-video-t2v
Model Type: Stable Diffusion
Generate stunning videos with native audio from text or images; up to 15s duration, 720p/480p resolution, 8 aspect ratios, 24fps—ideal for social clips an...
API Code Snippet
curl --location --request POST 'https://stablediffusionapi.com/api/v4/dreambooth' \
curl --header 'Content-Type: application/json' \
{
"key": "api-key",
"model_id": "grok-imagine-video-t2v",
"prompt": "ultra realistic close up portrait ((beautiful pale cyberpunk female with heavy black eyeliner)), blue eyes, shaved side haircut, hyper detail, cinematic lighting, magic neon, dark red city, Canon EOS R3, nikon, f/1.4, ISO 200, 1/160s, 8K, RAW, unedited, symmetrical balance, in-frame, 8K",
"negative_prompt": "painting, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, deformed, ugly, blurry, bad anatomy, bad proportions, extra limbs, cloned face, skinny, glitchy, double torso, extra arms, extra hands, mangled fingers, missing lips, ugly face, distorted face, extra legs, anime",
"width": "512",
"height": "512",
"samples": "1",
"num_inference_steps": "30",
"seed": null,
"guidance_scale": 7.5,
"webhook": null,
"track_id": null
}
No. of images generated
372
Images
Generated
Images generated with Grok Imagine Text To Video and its prompt