ImagineArt 2.0 Edit — Beach with retriever

ImagineArt 2.0 Edit

Compose with images. Direct with one line.

Bring the subject. Bring the scene. Bring the wardrobe. Write one instruction. ImagineArt 2.0 Edit composes a frame you couldn't have shot.

Images

Start Editing Free Contact Sales for API

Eight capabilities · One unified model Direct, don't prompt.

01 — Composition

Multi-Subject
Composition

Two people, a product, a place — composed into one frame. Up to four images, intentionally arranged.

02 — Identity

Same face.
New world.
Every time.

One input portrait. Eight cinematic worlds. Identity preserved scene after scene.

03 — Outfit Swap

Re-dress
in one frame

04 — Style

Paint in
any voice

Painterly aesthetics. Film-stock transfer. Brand-aligned grading.

05 — Why It's Different

Direction over
prompting

4 Input images per request

8 Aspect ratios + auto

2048px Output resolution

1 Generation. Pixel-faithful.

Up to 4

Input Images

v2.0

8×

Native Aspect Ratios

+ Auto

01 — Multi-Subject Composition

Two people. A product.
A place. One frame.

Most editing models choose between subject fidelity and scene fidelity. ImagineArt 2.0 Edit holds all of them at once — character likeness, wardrobe fidelity, environmental light, and chemistry between the subjects — in a single composed image. Direct a scene the way a director casts a film, and locations it.

See more compositions

Images 3 of 4

01 Subject A

02 Subject B

03 Scene

Output · 16:9

Place both subjects together at a wooden dining table inside a luxurious villa interior — facing each other, engaged in natural conversation, sharing subtle laughter and expressive eye contact.

Images 3 of 4

01 Subject A

02 Subject B

03 Scene

Composition variant — Mediterranean terrace at golden hour

Output · 16:9

Same two subjects — re-cast to a new location. Identity, wardrobe, and chemistry travel together; the terrace light wraps both of them as if they were photographed there.

02 — Identity Preservation

One face.
Eight worlds.

Consistency reads at a glance only when you can see many outputs together. A single portrait — flapper, astronaut, samurai, rock star — and the same identity walks unchanged into every frame.

See more identities

Input · 1 of 1

Casting note

A single portrait, every era.

Freckles. Hazel eyes. Auburn curl. Same micro-features rendered across every output — across genre, lighting, and stylization.

1920s speakeasy, beaded fringe, champagne coupe.

Astronaut in the ISS cupola, Earth below.

Renaissance noblewoman — Rembrandt sfumato.

Samurai at dawn, lacquered red-and-black armor.

Bond-style spy, Monte Carlo casino, neon glass.

Saharan explorer at golden hour, camel silhouette.

Wuxia heroine above the bamboo forest.

'70s rock musician — Annie Leibovitz stage glow.

03 — Outfit Swap

Re-dress
without re-shooting.

Hand the model a garment photo — the gown's sequins, the jacket's seams, the fabric's drape — and ImagineArt 2.0 Edit re-clothes the subject without losing face, body proportion, or pose. Editorial-grade fabric reflection and shadow, in one generation.

See more swaps

Images 2 of 4

01 Model

02 Garment

Output · 3:4

Every sequin lands. Face, body, and pose preserved. Subtle red-carpet bokeh added on direction.

04 — Product on Model

Flat product.
Real model. One shot.

E-commerce composites used to mean three days, a studio, and a retoucher. Hand a flat product shot and a casting portrait to ImagineArt 2.0 Edit — the model places the bag, stitches the leather, lands the brass hardware, and casts a real shadow under it.

See more product shots

Images 2 of 4

01 Product

02 Model

Output · 3:4

Pebbled grain, brass hardware, stitch count — preserved. The bag belongs in her hand.

Multi-Subject Composition

Identity Preservation

Outfit Swap

Product on Model

Style Transfer

Background Replacement

Subject Insertion

Pose Transfer

Multi-Subject Composition

Identity Preservation

Outfit Swap

Product on Model

Style Transfer

Background Replacement

Subject Insertion

Pose Transfer

05 — Style Transfer

Paint a photograph
in any voice.

Hand a content image and a style example. ImagineArt 2.0 Edit holds the structural truth of the photograph — its architecture, geometry, and light direction — and re-renders the entire surface in the painted aesthetic of your style example. Brand-aligned grading, hand-painted homages, film-stock translations — preserved geometry, transformed surface.

See more stylizations

01 · Content New York · Manhattan

02 · Style Van Gogh · oil painting

Output · 3:2

06 — Background Replacement

Same subject.
Anywhere you need.

Studio grey becomes Italian piazza. Sweater, beard, eye direction, expression — preserved. New light falls believably across the subject from the new environment. No relight pass, no compositing — one generation, photoreal contact.

More relocations

Input

Studio · neutral grey backdrop

Output

"Place him in an Italian piazza at golden hour."

Input

Studio · neutral grey backdrop

Output

"Put him in the cockpit over Patagonian glaciers."

07 / 08 — Insertion & Pose Transfer

Subject. Pose.
Composited natively.

Drop a subject into a plate. Re-pose an identity. ImagineArt 2.0 Edit treats every input as a directable role — casting, blocking, and lighting decisions become part of a single generation, not a downstream Photoshop pipeline.

Explore workflows

07 · Subject Insertion 2 of 4

01 Subject

02 Scene

Output · 16:9

Every fur strand backlit by the same sun the beach was shot under. Wet-sand reflection beneath each paw.

08 · Pose & Expression Transfer 2 of 4

01 Identity

02 Pose

Output · 3:4

Identity from frame one. Pose from frame two. Single rim light direction stitched across both.

09 — Aspect Ratios & Formats

Every canvas.
Native, never cropped.

ImagineArt 2.0 Edit reads your first input image and matches its frame — or accepts an explicit aspect ratio. Square for grids. Ultra-wide for film. Vertical for shorts. Eight canvas ratios plus auto.

16:9 Cinematic

21:9 Ultrawide

9:16 Stories & Reels

4:3 Classic

1:1 Square

All supported aspect ratios — and auto

21:9

16:9

5:4

4:3

1:1

3:4

4:5

9:16

auto

11 — How It Works

Three steps.
One generation.

Bring your images. Write one instruction. Get a finished frame — at whatever aspect ratio you need. Or let auto match the first one.

Cast up to four images

A subject, a scene, a garment, a product — hand the model anywhere from one to four images. Each role is implicit in your instruction.

Write one instruction

Direct rather than describe. Tell the model what should happen — who goes where, what gets worn, what gets replaced. Plain English. One sentence is usually enough.

Receive a finished frame

One 2048-pixel image. Lighting, composition, fabric, faces — already integrated. No layer stack, no relight pass, no compositing.

A finished frame composed by ImagineArt 2.0 Edit

Direction over Prompting

Identity Preserved

Up to Four Images

Native Aspect Ratios

2048-Pixel Output

One Generation

Pixel-Faithful

Image-Grounded

Direction over Prompting

Identity Preserved

Up to Four Images

Native Aspect Ratios

2048-Pixel Output

One Generation

Pixel-Faithful

Image-Grounded

ImagineArt 2.0 Edit

Multi-SubjectComposition

Same face.New world.Every time.

Re-dressin one frame

Paint inany voice

Direction overprompting

Two people. A product.A place. One frame.

One face.Eight worlds.

Re-dresswithout re-shooting.

Flat product.Real model. One shot.

Paint a photographin any voice.

Same subject.Anywhere you need.

Subject. Pose.Composited natively.

Every canvas.Native, never cropped.

Three steps.One generation.

Cast up to four images

Write one instruction

Receive a finished frame

Direct your first frame.

Multi-Subject
Composition

Same face.
New world.
Every time.

Re-dress
in one frame

Paint in
any voice

Direction over
prompting

Two people. A product.
A place. One frame.

One face.
Eight worlds.

Re-dress
without re-shooting.

Flat product.
Real model. One shot.

Paint a photograph
in any voice.

Same subject.
Anywhere you need.

Subject. Pose.
Composited natively.

Every canvas.
Native, never cropped.

Three steps.
One generation.