Stable Diffusion XL (SDXL) is a major upgrade from SD 1.5, offering enhanced realism, sharper details, better composition, and improved text rendering. However, hereβs the thing: SDXL behaves differently than earlier models and requires specific techniques for best results. This guide is designed to give you professional-grade results with clear explanations and actionable settings β no fluff, just what works.
This guide is based on practical testing and trusted resources from:
- Stability AI SDXL documentation
- Hugging Face SDXL model insights
- Community research and expert workflows
- StableDiffusionArt.com
- Industry prompt engineers and workflow experts
β Table of Contents
- What makes SDXL different?
- Recommended software & environments
- SDXL model structure explained
- Best generation settings (tested configurations)
- Prompting for SDXL β structure and techniques
- Negative prompting best practices
- Refiner usage β when and how to apply
- Resolution and aspect ratios
- SDXL LoRA and training compatibility
- SDXL with ControlNet
- Upscaling strategies
- Performance tips
- Troubleshooting
- Conclusion
π 1. What Makes SDXL Different?
SDXL is not just a bigger version of Stable Diffusion 1.5 β itβs actually a next-generation model built for realism, composition control, and high-resolution output. Understanding how it differs from SD1.5 is key to using it effectively β theyβre more different than you might think.
SDXL improves over SD1.5, but it requires more GPU VRAM (minimum 8GB recommended) β keep that in mind. Key differences:
| Feature | SD 1.5 | SDXL |
|---|---|---|
| Output quality | Good | High |
| Detail & realism | Medium | High |
| Text generation | Poor | Improved |
| Handles complex prompts | Limited | Yes |
| Base resolution | 512Γ512 | 1024Γ1024 |
| VRAM required | 6GB | 8β12GB |
Important: SDXL responds differently to promptsβshort tag-based prompts from SD1.5 do not work well. SDXL prefers descriptive sentence-style prompts β think full sentences, not just keywords. This is a common mistake people make.
π 2. Recommended Software
SDXL works well in the following UIs:
| Software | Why use it |
|---|---|
| ComfyUI | Best for SDXL workflows and refiners |
| Stable Diffusion WebUI Forge | Faster SDXL performance |
| AUTOMATIC1111 (latest) | Works but slower |
| InvokeAI | Best for inpainting & unified canvas |
ComfyUI and Forge are highly recommended for SDXL.
π 3. SDXL Model Structure
SDXL uses two models:
- Base model β Creates initial image structure
- Refiner model β Improves details and textures
Both can be used together for optimal quality, especially in portrait and product rendering use cases.
βοΈ 4. Best SDXL Settings (Recommended for Quality and Stability)
These settings are based on benchmark testing across Forge, ComfyUI, and A1111 environments. They balance quality and render time. These are tested settings for high quality:
| Setting | Value |
|---|---|
| Steps | 25β35 |
| Sampler | DPM++ 2M Karras |
| CFG Scale | 5β7 |
| Refiner switch | At step 0.75 |
| Seed | -1 (random) |
| Resolution | 1024Γ1024 (base) |
For portraits use: Euler a or DPM++ SDE.
π‘ 5. Prompting for SDXL β Best Practices
Unlike SD1.5 which prefers short tag-style prompts, SDXL works best with natural language prompts. You should write descriptive phrases like a photographer or filmmaker β the more descriptive, the better. Think storytelling, not just keywords.
β SD1.5 vs SDXL Prompt Example
| Model | Weak Prompt | Strong Prompt |
|---|---|---|
| SD1.5 | βcyberpunk girl, neonβ | β Works well |
| SDXL | βportrait of a cyberpunk woman, neon lights, dramatic rim light, shallow depth of field, detailed skinβ | β Best results |
β Prompt Template for SDXL
[Subject], [Scene], [Lighting], [Camera], [Style], [Details]β Good SDXL Prompt Example
Cinematic portrait of a Scandinavian woman with freckles, soft studio lighting, 85mm lens photography, film look, ultra detailed skin texture, sharp depth of field, magazine editorial styleπ 6. Negative Prompting for SDXL
Negative prompts help control quality β theyβre your way of telling SDXL what you donβt want. SDXL does not need long negative lists like SD 1.5, which is nice because you can keep things simpler.
β Recommended Negative Prompt
low quality, blurry, pixelated, distorted, extra limbs, watermark, text, deformed handsOptional Advanced Negative Prompt
bad anatomy, low detail, overexposed, underexposed, noisy, overly saturated, cartoonish, artifactsπ 7. SDXL Refiner β When and How to Use It
SDXL includes a base model and an optional refiner model. The refiner improves fine details like eyes, skin, shadows, and edges β itβs basically a polish pass that can make a big difference in final quality.
When to Use the Refiner
| Use Case | Refiner Needed? |
|---|---|
| Portraits | β Yes |
| Realistic Photography | β Yes |
| Products/Logos | β Yes |
| Anime/Concept Art | Optional |
| Fast Preview Tests | β No |
Recommended Refiner Settings
| Setting | Value |
|---|---|
| Refiner Switch | 0.65 β 0.80 |
| Steps (Base + Refiner) | 15 + 10 |
| Sampler | DPM++ 2M Karras |
π 8. Aspect Ratios & Resolution for SDXL
SDXL was trained at 1024Γ1024 but supports flexible resolutions.
Best Resolutions for SDXL
| Ratio | Resolution |
|---|---|
| Square | 1024Γ1024 |
| Portrait | 832Γ1216 / 896Γ1152 |
| Landscape | 1152Γ896 / 1216Γ832 |
| Ultra-Wide | 1536Γ640 |
Avoid unusual values like 1000Γ1000 or 900Γ900 β they reduce model quality.
π 9. SDXL with ControlNet
ControlNet works well with SDXL but requires SDXL-compatible models.
Recommended ControlNet Models for SDXL
| Model | Use Case |
|---|---|
| controlnet-canny-sdxl | Edge maps |
| controlnet-depth-sdxl | Depth & lighting |
| controlnet-openpose-sdxl | Human poses |
Enable pixel-perfect for best results.
π‘ 10. Using LoRA with SDXL β Best Practices
LoRA models for SD 1.5 are not compatible with SDXL. You must use SDXL LoRAs only.
Correct LoRA Folder Paths
Place LoRA files here:
models/Lora/Recommended LoRA Strengths
| Type | Strength |
|---|---|
| Character LoRA | 0.6 β 0.9 |
| Style LoRA | 0.4 β 0.7 |
| Clothing/Item LoRA | 0.3 β 0.6 |
Use no more than 3 LoRAs at once to maintain model stability.
π 11. Upscaling for SDXL β High Quality Strategy
SDXL images can be upscaled without losing detail.
Best Upscaling Methods
| Method | Tool | Quality |
|---|---|---|
| HighRes Fix | A1111/Forge | β Good |
| Latent Upscale | ComfyUI | ββ Better |
| 4x-UltraSharp | ComfyUI/ESRGAN | βββ Excellent |
Recommended HighRes Fix Settings
| Option | Value |
|---|---|
| Denoise strength | 0.35 β 0.45 |
| Upscale by | 1.5x β 2x |
| Steps | 15 β 20 |
π 12. Recommended Samplers for SDXL
Based on testing:
| Goal | Sampler |
|---|---|
| Fast previews | Euler a |
| Best balanced quality | DPM++ 2M Karras |
| Portraits | DPM++ SDE |
| Sharp details | DPM++ 3M SDE |
π 13. SDXL Workflow Examples
Workflow A β Standard SDXL (Beginner Friendly)
- Load SDXL Base model
- Set resolution 1024Γ1024
- Steps 30, Sampler DPM++ 2M Karras
- Generate base image
- Optional: Apply upscaler (4x UltraSharp)
Workflow B β SDXL with Refiner (High Quality)
- Generate with SDXL Base (70% of steps)
- Switch to SDXL Refiner (30% of steps)
- Use DPM++ SDE for refined detail
π‘ 14. Performance Tips (VRAM Saving)
Need more Speed?
Rent an RTX 4090 on RunPod and generate complex SDXL refiner setups in seconds.
| GPU VRAM | Recommended SDXL Settings |
|---|---|
| 4β6GB | Use 768Γ768 + medvram |
| 8GB | 1024Γ1024 default |
| 12GB+ | 2-pass upscale workflow |
Tips:
- Lower resolution first, upscale later
- Use βschnellβ samplers for previews
- Avoid too many LoRAs (VRAM heavy)
π οΈ 15. Troubleshooting
| Problem | Solution |
|---|---|
| Flat images | Lower CFG to 5β6 |
| Washed out images | Increase contrast in prompt |
| Blurry output | Use refiner |
| Hands look bad | Use ControlNet βopenposeβ |
| Missing detail | Increase steps to 35 |
π References and further reading
- Stability AI β Stable Diffusion XL 1.0 release announcement
- Hugging Face β SDXL base 1.0 model card (stabilityai)
- Hugging Face β SDXL refiner 1.0 model card (stabilityai)
- Civitai β SDXL models and community LoRAs
π Related Guides
- FLUX in ComfyUI
- FLUX in Stable Diffusion Forge
- Stable Diffusion Prompting
- Install Stable Diffusion on Mac
β Conclusion
SDXL is a powerful model for realistic and artistic image generation when used correctly. With the right settings, refined prompting, and control workflows, it produces significantly better detail and coherence than SD1.5 β but you need to use it the right way, not the SD1.5 way. Thatβs the key takeaway here.