Meta Llama 3.2 11B Vision Instruct Turbo

model_id: meta-llama-Llama-3.2-11B-Vision-Instruct-Turbo

Model Type: Stable Diffusion

An 11B-parameter multimodal model for image-text tasks, offering high-speed image captioning and visual question answering with a 128K token context window.

API Code Snippet

  
  curl --location --request POST 'https://stablediffusionapi.com/api/v4/dreambooth' \
  --header 'Content-Type: application/json' \
  --data-raw '{
  "key": "api-key",
  "model_id": "meta-llama-Llama-3.2-11B-Vision-Instruct-Turbo",
  "prompt": "ultra realistic close up portrait ((beautiful pale cyberpunk female with heavy black eyeliner)), blue eyes, shaved side haircut, hyper detail, cinematic lighting, magic neon, dark red city, Canon EOS R3, nikon, f/1.4, ISO 200, 1/160s, 8K, RAW, unedited, symmetrical balance, in-frame, 8K",
  "negative_prompt": "painting, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, deformed, ugly, blurry, bad anatomy, bad proportions, extra limbs, cloned face, skinny, glitchy, double torso, extra arms, extra hands, mangled fingers, missing lips, ugly face, distorted face, extra legs, anime",
  "width": "512",
  "height": "512",
  "samples": "1",
  "num_inference_steps": "30",
  "seed": null,
  "guidance_scale": 7.5,
  "webhook": null,
  "track_id": null
  }'
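
The request body in the snippet above can also be assembled programmatically, which avoids shell-quoting mistakes in long prompts. The following is a minimal sketch using only the Python standard library. The helper names build_payload and build_request are hypothetical, the field names and default values are copied from the curl snippet, and the response format is not documented on this page, so the sketch only constructs the request and does not send it.

```python
import json
import urllib.request

# Endpoint and default model_id are taken from the curl snippet on this page.
API_URL = "https://stablediffusionapi.com/api/v4/dreambooth"
MODEL_ID = "meta-llama-Llama-3.2-11B-Vision-Instruct-Turbo"


def build_payload(api_key, prompt, negative_prompt="", model_id=MODEL_ID):
    """Hypothetical helper: return the JSON body as a dict, mirroring the curl fields."""
    return {
        "key": api_key,
        "model_id": model_id,
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "width": "512",
        "height": "512",
        "samples": "1",
        "num_inference_steps": "30",
        "seed": None,          # serialized as JSON null
        "guidance_scale": 7.5,
        "webhook": None,
        "track_id": None,
    }


def build_request(payload):
    """Hypothetical helper: wrap the payload in a POST request without sending it."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # "api-key" is a placeholder, as in the curl snippet.
    req = build_request(build_payload("api-key", "ultra realistic close up portrait"))
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # To actually send the request, pass req to urllib.request.urlopen(req).
```

Serializing with json.dumps turns Python None into JSON null and handles any quotes inside the prompt, which is the main advantage over hand-writing the body in a shell string.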

No. of images generated: 5

Images Generated

Images generated with Meta Llama 3.2 11B Vision Instruct Turbo and its prompt
