Try FLUX.1-Kontext [dev] in the Workbench
Run this model interactively, tune parameters, and compare outputs.
flux-kontext-dev
black-forest-labs/FLUX.1-Kontext-dev is a 12-billion-parameter multimodal large vision model (LVM) designed for high-fidelity image editing and generation from both text and image inputs.
It excels at iterative image editing, keeping characters, styles, and objects consistent across successive edits with minimal visual drift. The model handles both local and global modifications, following instructions for precise regional edits as well as full scene transformations.
Some other noteworthy features of black-forest-labs/FLUX.1-Kontext-dev include:
- Preserving unique elements (like characters or objects) across different scenes and environments, even after several edits.
- Generating novel images with style or character references guided by either text or example images.
| Metric | Value |
|---|---|
| Parameter Count | 12 billion |
| Mixture of Experts | No |
| Context Length | Not applicable (vision model) |
| Multilingual | No |
| Quantized* | Yes |
| Precision* | FP8 |
Example request
Requests can be sent in three modes: sync, async, and async with SSE.
See the image editing reference for more details.
Fetch model details
The models endpoint returns the full model object, including its json_request_schema.
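As a rough illustration of how the returned schema might be used, the sketch below splits a schema's fields into required and optional. It assumes that json_request_schema follows JSON Schema conventions (a `properties` map and a `required` list) and that it arrives either as a parsed object or as a JSON string; neither is confirmed by this page. The `sample_model` object and the `summarize_schema` helper are hypothetical.

```python
import json


def summarize_schema(model: dict) -> dict:
    """Split schema fields into required and optional names.

    Assumes json_request_schema uses JSON Schema's "properties" and
    "required" keys; that shape is an assumption, not documented here.
    """
    schema = model.get("json_request_schema") or {}
    if isinstance(schema, str):
        # Tolerate a schema delivered as a JSON-encoded string.
        schema = json.loads(schema)
    properties = schema.get("properties", {})
    required = set(schema.get("required", []))
    return {
        "required": sorted(name for name in properties if name in required),
        "optional": sorted(name for name in properties if name not in required),
    }


# Hypothetical model object, trimmed to a few of the fields documented below.
sample_model = {
    "json_request_schema": {
        "properties": {
            "input_image": {"type": "string", "format": "uri"},
            "prompt": {"type": "string"},
            "guidance": {"type": "number", "default": 2.5},
        },
        "required": ["input_image", "prompt"],
    }
}

summary = summarize_schema(sample_model)
print(summary)
```

If the real schema is shaped differently, only the two `schema.get(...)` lookups should need to change.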
Request parameters
Required parameters
| Field | Type | Default | Description |
|---|---|---|---|
| input_image | string | "https://hub.oxen.ai/api/repos/ox/Oxen-Character-Simple-Vector-Graphic/file/main/images/reference/bloxy_white_bg.png" | Image to use as reference. Must be jpeg, png, gif, or webp. Format: uri. |
| prompt | string | "High quality illustration of the character riding a cow, with a text bubble saying 'Woohoo!' - preserve all colors and traits of the original character, and render the cow in the same style" | Text description of what you want to generate, or the instruction on how to edit the given image. |
Optional parameters
| Field | Type | Default | Description |
|---|---|---|---|
| aspect_ratio | string | "match_input_image" | Aspect ratio of the generated image. Use "match_input_image" to match the aspect ratio of the input image. One of: 1:1, 16:9, 21:9, 3:2, 2:3, 4:5, 5:4, 3:4, 4:3, 9:16, 9:21, match_input_image. |
| num_inference_steps | integer | 28 | Number of inference steps. Range: 4 – 50. |
| guidance | number | 2.5 | Guidance scale for generation. Range: 0 – 10. |
| seed | integer | (none) | Random seed for reproducible generation. Leave blank for random. |
| output_format | string | "webp" | Output image format. One of: webp, jpg, png. |
| output_quality | integer | 80 | Quality of the saved output image, where 100 is the best and 0 the lowest. Not relevant for png output. Range: 0 – 100. |
| disable_safety_checker | boolean | false | Disable the NSFW safety checker. |
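The limits in the two tables above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: `build_payload` is a hypothetical helper that assembles only the parameter fields and rejects values outside the documented ranges. How those fields are wrapped in the actual request body (model name, endpoint, authentication) is not covered here.

```python
# Allowed values and ranges copied from the parameter tables above.
ASPECT_RATIOS = {
    "1:1", "16:9", "21:9", "3:2", "2:3", "4:5", "5:4",
    "3:4", "4:3", "9:16", "9:21", "match_input_image",
}
OUTPUT_FORMATS = {"webp", "jpg", "png"}
RANGES = {
    "num_inference_steps": (4, 50),
    "guidance": (0, 10),
    "output_quality": (0, 100),
}
OPTIONAL_FIELDS = set(RANGES) | {
    "aspect_ratio", "seed", "output_format", "disable_safety_checker",
}


def build_payload(input_image: str, prompt: str, **options) -> dict:
    """Return the parameter fields as a dict (hypothetical helper).

    Raises ValueError when a required field is empty, an option name is
    not in the tables, or a value falls outside its documented limits.
    """
    if not input_image or not prompt:
        raise ValueError("input_image and prompt are required")
    unknown = set(options) - OPTIONAL_FIELDS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    for name, (low, high) in RANGES.items():
        if name in options and not low <= options[name] <= high:
            raise ValueError(f"{name} must be between {low} and {high}")
    if options.get("aspect_ratio", "match_input_image") not in ASPECT_RATIOS:
        raise ValueError("unsupported aspect_ratio")
    if options.get("output_format", "webp") not in OUTPUT_FORMATS:
        raise ValueError("unsupported output_format")
    # Options that were not passed are omitted so server defaults apply.
    return {"input_image": input_image, "prompt": prompt, **options}


# Placeholder image URL and prompt, for illustration only.
payload = build_payload(
    "https://example.com/reference.png",
    "Render the character riding a cow in the same style",
    num_inference_steps=30,
    output_format="png",
    seed=42,
)
print(payload)
```

Leaving unset options out of the dict, rather than filling in the table's defaults, keeps the payload small and lets the service apply its own defaults if they change.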