Overview
Fine-tune image generation models to create images in your specific style, brand, or artistic direction. Perfect for branded content, character design, or artistic styles.Your Data
Your training data should have:- Image column - Paths or URLs to your training images
- Caption column - Text descriptions of each image
images.parquet:
| image | caption |
|---|---|
| images/001.jpg | a red sports car in cyberpunk style |
| images/002.jpg | a blue sedan in cyberpunk style |
| images/003.jpg | a motorcycle in cyberpunk style |
Minimal Example
Key Parameters
Only these fields are required to start:| Parameter | Description | Example |
|---|---|---|
image_column | Name of your image column | "image", "file", "path" |
caption_column | Name of your caption/prompt column | "caption", "prompt", "description" |
steps | Number of training steps (1000-3000 typical) | 2000 |
Supported Models
Popular choices for image generation:black-forest-labs/FLUX.1-dev- High quality, state-of-the-artblack-forest-labs/FLUX.2-dev- Latest version, even better qualityQwen/Qwen-Image- Fast, good for quick iterations
See the full model list for all available options.
Data Requirements
For best results:- Quantity: 10-50 images minimum, 100-500 images ideal
- Quality: High resolution, consistent style
- Captions: Descriptive prompts that explain what makes your images unique
- Consistency: Images should share common elements (style, subject, theme)
Monitor Progress
Image models generate sample outputs during training. Check them to see progress:Next Steps
- Advanced parameters - Learning rate, LoRA, sampling settings
- Deploy your model - Generate images with your fine-tuned model
- Parameter guide - Understanding training parameters
Common Issues
Images not loading
Images not loading
Ensure image paths are relative to your repository root, or use full URLs. Check that images are committed to your Oxen repository.
Out of memory error
Out of memory error
Reduce
batch_size to 1. Image models require significant GPU memory.Results don't match my style
Results don't match my style
Try training for more steps (3000-5000) or increase your dataset size. Ensure captions clearly describe the unique aspects of your style.
Training taking too long
Training taking too long
Start with 1000 steps for testing. FLUX models take 1-2 hours on GPU for 2000 steps.