Overview
Fine-tune image editing models to learn specific transformations: style transfer, object manipulation, background changes, or any image-to-image task.Your Data
Your training data needs three columns:- Control/Input image - The original image
- Target/Output image - The transformed image
- Caption - Text describing the transformation
edits.parquet:
| control_image | edited_image | caption |
|---|---|---|
| inputs/001.jpg | outputs/001.jpg | add sunglasses to the person |
| inputs/002.jpg | outputs/002.jpg | change background to beach |
| inputs/003.jpg | outputs/003.jpg | apply vintage filter |
Minimal Example
Key Parameters
Only these fields are required to start:| Parameter | Description | Example |
|---|---|---|
control_image_column | Input/original image column | "control_image", "input", "source" |
image_column | Output/transformed image column | "edited_image", "output", "target" |
caption_column | Transformation description column | "caption", "prompt", "description" |
steps | Number of training steps (2000-5000 typical) | 3000 |
Supported Models
Popular choices for image editing:black-forest-labs/FLUX.1-Kontext-dev- High quality, versatileQwen/Qwen-Image-Edit- Fast, good for quick iterations
See the full model list for all available options.
Data Requirements
For best results:- Quantity: 20-100 image pairs minimum
- Quality: High resolution, aligned transformations
- Captions: Clear descriptions of what changed between input and output
- Consistency: Transformations should follow a consistent pattern or style
Sample During Training
Add sample prompts to see progress during training:Monitor Progress
Next Steps
- Advanced parameters - Resolution, LoRA, sampling settings
- Deploy your model - Use your fine-tuned model
- Full tutorial - Complete walkthrough with deployment
Common Issues
Images not aligned
Images not aligned
Ensure input and output images show the same scene/subject. The model learns the transformation between them.
Edits too subtle or too strong
Edits too subtle or too strong
Adjust
learning_rate (lower for subtle, higher for stronger). Default is 0.0002.Out of memory error
Out of memory error
Reduce
batch_size to 1 and sample_height/sample_width to 512 or 768.Model not learning the style
Model not learning the style
Ensure your captions consistently describe the transformation. Train for more steps (4000-5000) or increase dataset size.