Image Generation
OpenRouter supports image generation through models that have "image" in their output_modalities. These models can create images from text prompts when you specify the appropriate modalities in your request.
Model Discovery
You can find image generation models in several ways:
On the Models Page
Visit the Models page and filter by output modalities to find models capable of image generation. Look for models that list "image" in their output modalities.
In the Chatroom
When using the Chatroom, click the Image button to automatically filter and select models with image generation capabilities. If no image-capable model is active, you’ll be prompted to add one.
API Usage
To generate images, send a request to the /api/v1/chat/completions endpoint with the modalities parameter set to include both "image" and "text".
Basic Image Generation
Image Configuration Options
Gemini image-generation models support additional configuration through the image_config parameter. Read more about using Gemini Image Gen models here: https://ai.google.dev/gemini-api/docs/image-generation
Aspect Ratio
Set image_config.aspect_ratio to request specific aspect ratios for generated images.
Supported aspect ratios:
1:1→ 1024×1024 (default)2:3→ 832×12483:2→ 1248×8323:4→ 864×11844:3→ 1184×8644:5→ 896×11525:4→ 1152×8969:16→ 768×134416:9→ 1344×76821:9→ 1536×672
Image Size (Gemini only)
Set image_config.image_size to control the resolution of generated images. This parameter is currently only supported by Gemini models.
Supported sizes:
1K→ Standard resolution (default)2K→ Higher resolution4K→ Highest resolution
You can combine both aspect_ratio and image_size in the same request:
Streaming Image Generation
Image generation also works with streaming responses:
Response Format
When generating images, the assistant message includes an images field containing the generated images:
Image Format
- Format: Images are returned as base64-encoded data URLs
- Types: Typically PNG format (
data:image/png;base64,) - Multiple Images: Some models can generate multiple images in a single response
- Size: Image dimensions vary by model capabilities
Model Compatibility
Not all models support image generation. To use this feature:
- Check Output Modalities: Ensure the model has
"image"in itsoutput_modalities - Set Modalities Parameter: Include
"modalities": ["image", "text"]in your request - Use Compatible Models: Examples include:
google/gemini-2.5-flash-image-previewblack-forest-labs/flux.2-problack-forest-labs/flux.2-flexsourceful/riverflow-v2-standard-preview- Other models with image generation capabilities
Best Practices
- Clear Prompts: Provide detailed descriptions for better image quality
- Model Selection: Choose models specifically designed for image generation
- Error Handling: Check for the
imagesfield in responses before processing - Rate Limits: Image generation may have different rate limits than text generation
- Storage: Consider how you’ll handle and store the base64 image data
Troubleshooting
No images in response?
- Verify the model supports image generation (
output_modalitiesincludes"image") - Ensure you’ve included
"modalities": ["image", "text"]in your request - Check that your prompt is requesting image generation
Model not found?
- Use the Models page to find available image generation models
- Filter by output modalities to see compatible models