Meta Llama 3.2 11B Vision Instruct Turbo
model_id: meta-llama-Llama-3.2-11B-Vision-Instruct-Turbo
Model Type: Stable Diffusion
11B parameter multimodal model for image-text tasks, offering high-speed image captioning and visual question answering with a 128K token context window.<...
API Code Snippet
curl --location --request POST 'https://stablediffusionapi.com/api/v4/dreambooth' \
curl --header 'Content-Type: application/json' \
{
"key": "api-key",
"model_id": "meta-llama-Llama-3.2-11B-Vision-Instruct-Turbo",
"prompt": "ultra realistic close up portrait ((beautiful pale cyberpunk female with heavy black eyeliner)), blue eyes, shaved side haircut, hyper detail, cinematic lighting, magic neon, dark red city, Canon EOS R3, nikon, f/1.4, ISO 200, 1/160s, 8K, RAW, unedited, symmetrical balance, in-frame, 8K",
"negative_prompt": "painting, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, deformed, ugly, blurry, bad anatomy, bad proportions, extra limbs, cloned face, skinny, glitchy, double torso, extra arms, extra hands, mangled fingers, missing lips, ugly face, distorted face, extra legs, anime",
"width": "512",
"height": "512",
"samples": "1",
"num_inference_steps": "30",
"seed": null,
"guidance_scale": 7.5,
"webhook": null,
"track_id": null
}
No. of images generated
5
Images
Generated
Images generated with Meta Llama 3.2 11B Vision Instruct Turbo and its prompt