GPT Image 1.5 Features: Generation, Editing, Inpainting, and Output Controls

Breakdown of GPT Image 1.5 features covering text-to-image generation, image editing, inpainting, multi-turn editing, streaming, transparent backgrounds, output customisation, and known limitations.

Arooj Ishtiaq

Fri Mar 27 2026 • Updated Wed Apr 22 2026

9 mins Read

ON THIS PAGE

GPT Image 1.5 is OpenAI's latest and most advanced image generation model, built for superior instruction following, text rendering, detailed editing, and real-world knowledge. It is the most capable model in the GPT Image family, and the one OpenAI recommends for the best overall output quality.

This article covers OpenAI GPT Image 1.5 features, including how each feature works in practice, and where the model's limitations sit.

GPT Image 1.5 Features Overview at a Glance

Feature	Supported	Notes
Text-to-image generation	Yes	Single image or multiple images per request
Image editing	Yes	Edit existing images using a text prompt
Inpainting with mask	Yes	Edit a specific region while preserving the rest
Multi-turn image editing	Yes	Iteratively refine images across multiple instructions
Streaming with partial images	Yes	Receive up to 3 partial previews during generation
Transparent backgrounds	Yes	PNG and WebP formats only
High input fidelity	Yes	Preserves fine details from reference images more accurately
Multiple image references	Yes	Combine multiple input images in a single generation
Auto size, quality, background	Yes	Model selects the best option based on the prompt
Output formats	Yes	PNG (default), JPEG, WebP
Image variations	No	Not supported on GPT Image 1.5
Fine-tuning	No	Not supported

How GPT Image 1.5 Turns Prompts Into Visuals?

GPT Image 1.5 generates images from text prompts with superior instruction following compared to its predecessors in the DALL-E series. It understands the real-world knowledge and context behind a prompt rather than simply matching words to pixels, producing more accurate outputs across complex or compositionally demanding subjects.

Key generation capabilities:

Generate a single image or multiple images in one request
Returns
base64-encoded image data by default, ready to save or display immediately
Supports an auto quality setting where the model selects the best output quality based on the prompt

When using GPT Image 1.5, creators can leverage GPT Image 1.5 features for precise instruction-following and real-world context interpretation. ImagineArt's AI image generator provides a dedicated workspace for generation across all available models.

Recommended read: What is GPT Image 1.5?

Image Editing With a Prompt

GPT Image 1.5 supports image editing through a dedicated edit capability. You provide an existing image alongside a text prompt describing the change you want, and the model applies the edit while preserving unchanged areas.

You can also provide multiple images as references to generate a new composed image. Practical applications include:

Combining individual product images into a styled arrangement without reshooting
Placing a product into a new background context while preserving product detail
Generating campaign imagery from existing brand assets without rebuilding compositions from scratch

For product teams working with reference-based imagery, ImagineArt AI product photo generator is the most relevant starting point for this type of multi-reference product photography workflow.

High Input Fidelity

GPT Image 1.5 supports a high input fidelity mode that preserves details from reference images more accurately in the output. This is particularly relevant when input images contain faces or logos that need to carry through the edit without degradation. With GPT Image 1.5, the first five input images benefit from higher fidelity preservation when this mode is enabled.

This makes it more reliable for:

Brand asset work where logos and identity elements must remain visually intact
Portrait editing where facial accuracy across the edit is a production requirement
Product imagery where material texture and colour fidelity cannot be lost in the process

Inpainting: Edit One Region, Preserve Everything Else

Inpainting allows you to edit a targeted area of an existing image using a mask that indicates which region should be replaced, while everything outside that region remains completely unchanged.

An important distinction from DALL-E 2 is that masking with GPT Image 1.5 is entirely prompt-guided. The model uses the mask as directional guidance rather than treating its edges as hard pixel boundaries, which produces more natural-looking results at the transition between edited and preserved areas.

Common inpainting use cases:

Replacing a product background in an e-commerce image without touching the product
Removing or replacing a specific object in a scene
Updating text or label elements on packaging without regenerating the full image
Adding an element to an existing composition while preserving the surrounding context

For teams doing precision object-level editing, ImagineArt AI object remover and AI object replacer are dedicated tools that handle targeted removal and replacement as part of a broader editing workflow.

Multi-Turn Editing: Refine Images Step by Step

Multi-turn editing allows you to refine an image across a sequence of separate instructions, where each new instruction builds on the output of the previous one. Rather than describing an entire final result in a single prompt, you work toward it progressively through incremental changes.

This approach is well-suited for:

Creative workflows where the desired output is not fully defined upfront
Client feedback workflows where changes are applied across review rounds
An iterative design exploration where early outputs inform what the next instruction should address

For teams building iterative creative production into a structured pipeline, the ImagineArt AI Workflow system supports automated pipelines that chain generation and editing steps into repeatable sequences.

Recommended read: How to Use GPT Image 1.5? | ImagineArt

Streaming: Preview Your Image While It Generates

GPT Image 1.5 supports streaming, which means you can receive partial previews of an image as it is being generated rather than waiting for the full output. Up to three partial images can be streamed during a single generation.

This is most useful when:

You want early visibility into the direction of an output before it completes
You are building interactive applications where progressive visual feedback improves user experience
You want to cancel and adjust a prompt early if the partial output is clearly not going in the right direction

Transparent Backgrounds

GPT Image 1.5 supports transparent background generation, which produces images where the background is removed entirely rather than filled with a colour or scene. This is useful for assets that need to be placed on different backgrounds, layered into designs, or exported as standalone subjects.

Transparent background support:

Available on PNG and WebP output formats only
Works best when quality is set to medium or high
Directly applicable to product photography, icon design, logo generation, and any asset intended for layered use

For creators producing assets for layered design work, the ImagineArt AI image editor and background changer are complementary tools for working with transparent or replaced backgrounds after generation.

Output Customisation: Size, Quality, Format and Compression Controls

GPT Image 1.5 gives you direct control over how output images are configured. All options below apply to GPT Image 1.5 specifically.

Size options:

1024x1024 (square)
1536x1024 (landscape)
1024x1536 (portrait)
Auto (model selects based on the prompt)

Quality options:

Low, Medium, High, or Auto
Higher quality settings produce more detailed outputs, but take longer to generate and cost more
Square images at standard quality are the fastest to generate

Output format options:

PNG (default)
JPEG (faster than PNG, recommended when generation speed matters)
WebP

Compression:

Available for JPEG and WebP formats
Adjustable from 0 to 100 percent

For teams managing generation costs across different quality settings and output formats, ImagineArt's subscription plans page provides a full breakdown of credit costs and plan tiers to help plan production budgets accurately.

How Is Generation Cost Calculated?

GPT Image 1.5 generates images by producing image tokens, and both the time and cost of generation are proportional to the number of tokens required. Larger image sizes and higher quality settings produce more tokens and therefore cost more.

Token counts by quality setting:

Quality	Square 1024×1024	Portrait 1024×1536	Landscape 1536×1024
Low	272 tokens	408 tokens	400 tokens
Medium	1,056 tokens	1,584 tokens	1,568 tokens
High	4,160 tokens	6,240 tokens	6,208 tokens

The total cost of a generation is the sum of output image tokens, input text tokens for the prompt, and input image tokens if you are using the edits endpoint with reference images. If streaming is enabled, each partial image adds 100 image output tokens to the total.

Known Limitations of GPT Image 1.5 Features

OpenAI is transparent about the current limitations of GPT Image 1.5 features. Understanding these upfront helps set accurate expectations before building workflows around the model.

Latency: Complex prompts may take up to 2 minutes to process. This is a known characteristic of high-quality generation and should be factored into time-sensitive production workflows
Text rendering: Although significantly improved over the DALL-E series, the model can still struggle with precise text placement and clarity in some compositions
Consistency: The model may occasionally struggle to maintain visual consistency for recurring characters or brand elements across multiple separate generations
Composition control: Despite improved instruction following, the model may have difficulty placing elements precisely in structured or layout-sensitive compositions

OpenAI has now released GPT Image 2, which builds on these features with further improvements to image quality, text rendering, and multi-subject composition.

Achieve Better Results Faster

GPT Image 1.5 gives you the tools to produce high-quality visuals with total control. By delivering accurate text, seamless editing, and rapid output, it eliminates the common frustrations of digital creation.

Your time is valuable. Choose the tool that works as fast as you do and delivers professional results on the first try.

Arooj Ishtiaq

Arooj is a SaaS content writer specializing in AI models and applied technology. At ImagineArt, she creates sharp, product-focused content that helps creators and businesses understand, adopt, and get real value from AI tools.

How to Use GPT Image 1.5? | ImagineArt

Arooj Ishtiaq

AI Image Generation

What Is GPT Image 1.5? OpenAI's New Image Generation Model Explained

Arooj Ishtiaq

GPT Image 1.5 Features: Generation, Editing, Inpainting, and Output Controls

GPT Image 1.5 Features Overview at a Glance

How GPT Image 1.5 Turns Prompts Into Visuals?

Image Editing With a Prompt

High Input Fidelity

Inpainting: Edit One Region, Preserve Everything Else

Multi-Turn Editing: Refine Images Step by Step

Streaming: Preview Your Image While It Generates

Transparent Backgrounds

Output Customisation: Size, Quality, Format and Compression Controls

How Is Generation Cost Calculated?

Known Limitations of GPT Image 1.5 Features

Achieve Better Results Faster

Arooj Ishtiaq

More topic you may like

How to Use GPT Image 1.5? | ImagineArt

What Is GPT Image 1.5? OpenAI's New Image Generation Model Explained