Image Generation

Generate new images from text prompts or edit existing images in your workspace. Save results to your project with preview in chat. This experimental feature requires an OpenRouter API key.

Experimental Feature

Image Generation is an experimental feature that requires enabling in settings and configuring an OpenRouter API key.

Quick Start

Enable Image Generation in Settings > Experimental and save your OpenRouter API key.
Switch to any coding model you prefer (e.g., Claude 3.7 Sonnet, Gemini Pro, GPT-5 mini).
In chat, type an explicit instruction to create an image (for example: "Generate an image of a low-poly kangaroo mascot on a dark background").
Approve the action and choose a save path (e.g., images/kangaroo.png). Roo saves the file and shows a preview.

Key Features

Create new images from text prompts using natural language
Edit and transform existing images in your workspace
Saves to your workspace at a path you choose; appropriate extension (.png or .jpg) is auto-added if missing
Shows a preview of the generated/edited image in the conversation
Currently uses Gemini 2.5 Flash Image Preview models via OpenRouter
Simple on/off toggle under Experimental settings

Use Cases

Image Generation

Before: You had to copy prompts to an external site, download the result, then move it into your workspace.

With this feature: Ask Roo to generate an image, approve, pick a save location, and continue editing with the image already in your project.

Image Editing

Before: Export image, upload to external editor, make changes, download, import back to project.

With this feature: Ask Roo to transform your existing image directly - it reads the file, applies your edits, and saves the result in your project.

How It Works

Simple Flow

Turn on Image Generation and set your OpenRouter API key.
Use any coding model; Roo exposes an Image tool to the model.
When you instruct "Generate an image …", Roo enhances the prompt and invokes that tool, which calls the Gemini 2.5 Flash Image Preview model via OpenRouter.
Roo writes the resulting image to your chosen path and shows a preview in the chat.

Behind the Scenes

When enabled, Roo sends your prompt (and optionally an existing image) to an image-capable model through OpenRouter. The generated or edited image returned by OpenRouter is saved to the path you specify inside your current workspace. Roo shows a preview in the chat and the file appears in your file explorer.

Requirements

OpenRouter account and API key
Internet access
An open, writable workspace folder

Configuration

1. Enable Image Generation (Experimental)

Purpose: Turns the feature on so Roo can create images on request
Default: Off
Location: Settings > Experimental

2. OpenRouter API Key

Purpose: Authorizes image generation requests
Default: Empty (required)
Get your key: https://openrouter.ai/keys

3. Image Generation Model

Purpose: Selects which model to use for generation
Default: Gemini 2.5 Flash Image Preview
Available Models: Currently limited to Gemini 2.5 Flash Image Preview and its free variant

Using Image Generation

In chat, instruct Roo to generate an image and specify clear characteristics (subject, style, lighting, composition, aspect ratio).
Confirm the action when prompted. Roo may ask you to choose a save path (for example: images/sunset.png).
Roo generates the image and saves it. If you don't include an extension, the appropriate extension (.png or .jpg) is added based on the output format.
See the image preview in the chat and locate the file in your workspace.

Example Prompts

Create an image of a friendly low-poly kangaroo mascot on a dark gradient background, 1024x1024 PNG.
Generate a watercolor illustration of a mountain lake at sunrise with light mist, soft lighting, pastel palette, 16:9 aspect ratio.
Example using GPT-5 mini: Generate an image of a retro terminal wallpaper with green phosphor glow and subtle scanlines, minimal noise, 3840x2160.

Editing Existing Images

Roo can also transform and edit existing images in your workspace:

Ask Roo to edit an image, describing the transformation you want
Specify both the input image path and where to save the result
Roo will apply your requested edits to the existing image

Supported Input Formats: PNG, JPG, JPEG, GIF, WEBP

Example Requests:

"Transform photos/portrait.jpg into a watercolor painting and save as art/watercolor-portrait.png"
"Upscale and enhance images/logo.png to higher resolution"
"Apply a vintage filter to screenshots/app.png"

Note: Both the input image path and output path must be accessible (not blocked by .rooignore)

Tips for Better Results

Be Specific

Include these elements in your prompts:

Style: artistic medium, art movement, or specific artist style
Mood: emotional tone, atmosphere
Color palette: specific colors or color schemes
Camera/lighting: angle, perspective, lighting conditions
Aspect ratio: dimensions or orientation

Limitations

Experimental feature; availability and model list are limited
Currently limited to Gemini 2.5 Flash Image Preview models
One image is produced per request
Output formats supported: PNG or JPG
Supported input formats for editing: PNG, JPG, JPEG, GIF, WEBP only
Image paths must be accessible (not blocked by .rooignore restrictions)
Usage may be subject to your OpenRouter plan limits and costs

Status

This feature is experimental and may change or be removed in future versions. Provide feedback through GitHub Issues.

Quick Start​

Key Features​

Use Cases​

Image Generation​

Image Editing​

How It Works​

Simple Flow​

Behind the Scenes​

Requirements​

Configuration​

1. Enable Image Generation (Experimental)​

2. OpenRouter API Key​

3. Image Generation Model​

Using Image Generation​

Example Prompts​

Editing Existing Images​

Tips for Better Results​

Be Specific​

Limitations​

Status​

Quick Start

Key Features

Use Cases

Image Generation

Image Editing

How It Works

Simple Flow

Behind the Scenes

Requirements

Configuration

1. Enable Image Generation (Experimental)

2. OpenRouter API Key

3. Image Generation Model

Using Image Generation

Example Prompts

Editing Existing Images

Tips for Better Results

Be Specific

Limitations

Status