Meta Llama 3.2 90B Vision Instruct Turbo
model_id: meta-llama-Llama-3.2-90B-Vision-Instruct-Turbo
Model Type: Stable Diffusion
Multimodal 90-billion-parameter LLM processes text and high-res images (up to 1120x1120px), excels in visual reasoning, captioning, Q&A, and multilingual...
API Code Snippet
curl --location --request POST 'https://stablediffusionapi.com/api/v4/dreambooth' \
curl --header 'Content-Type: application/json' \
{
"key": "api-key",
"model_id": "meta-llama-Llama-3.2-90B-Vision-Instruct-Turbo",
"prompt": "ultra realistic close up portrait ((beautiful pale cyberpunk female with heavy black eyeliner)), blue eyes, shaved side haircut, hyper detail, cinematic lighting, magic neon, dark red city, Canon EOS R3, nikon, f/1.4, ISO 200, 1/160s, 8K, RAW, unedited, symmetrical balance, in-frame, 8K",
"negative_prompt": "painting, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, deformed, ugly, blurry, bad anatomy, bad proportions, extra limbs, cloned face, skinny, glitchy, double torso, extra arms, extra hands, mangled fingers, missing lips, ugly face, distorted face, extra legs, anime",
"width": "512",
"height": "512",
"samples": "1",
"num_inference_steps": "30",
"seed": null,
"guidance_scale": 7.5,
"webhook": null,
"track_id": null
}
No. of images generated
5
Images
Generated
Images generated with Meta Llama 3.2 90B Vision Instruct Turbo and its prompt