Instructions to use Bl4ckSpaces/SpaceDiffusion-XL-V-PRED-2.0 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use Bl4ckSpaces/SpaceDiffusion-XL-V-PRED-2.0 with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("Bl4ckSpaces/SpaceDiffusion-XL-V-PRED-2.0", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
- Local Apps
- Draw Things
- DiffusionBee
π SpaceDiffusion-XL V-PRED 2.0 (v1 Final)
Welcome to the apex of anime and illustration generation. SpaceDiffusion-XL V-PRED 2.0 is a highly advanced, V-Prediction-based SDXL model designed to push the boundaries of anime aesthetics, anatomical precision, and complex prompt adherence.
This model is not a simple mix; it is the culmination of advanced merging architecture, relentless DoRA fine-tuning, and highly curated datasets.
𧬠Model Lineage & Architecture
SpaceDiffusion-XL V-PRED 2.0 is born from a rigorous and complex developmental pipeline:
- The Foundation (AetherV-XL): The core base of this model is AetherV-XL, which was crafted using advanced merging techniques (Singular Value Decomposition / SVD and Semantic Routing) to perfectly balance the structural understanding of NoobAI-XL (V-Pred) and the breathtaking aesthetic versatility of Illustrious-XL (V-Pred).
- The Forging (DoRA Fine-tuning): AetherV-XL was not left as a mere merge. It underwent intense, multi-day fine-tuning using DoRA (Weight-Decomposed Low-Rank Adaptation) at Rank 64 across dual-GPU clusters to aggressively enhance detail rendering, lighting logic, and character consistency.
- The Final Fusion: The resulting DoRA weights were permanently fused into the base model at full FP32 precision before being precisely quantized down to FP16 for optimal inference performance without losing a single bit of quality.
π Dataset Knowledge Base
- Primary Knowledge: Comprehensive coverage of anime, illustration, and pop culture subjects up to late 2025.
- Modern Injection: Includes a highly concentrated, specialized micro-injection of early 2026 data (via our final DoRA finetuning phase using multi-resolution smart bucketing).
β οΈ MANDATORY SETTINGS (READ BEFORE USING!)
Because this model uses the V-Prediction schedule, standard SDXL settings will result in deep-fried or completely broken images. You MUST follow these exact parameters:
- Prediction Type:
v_prediction(CRITICAL!) - CLIP Skip:
2(CRITICAL for anime styles. Do not use Clip Skip 1). - Prompting Style: Danbooru Tagging (e.g.,
1girl, solo, looking at viewer, blue hair...). - Recommended Sampler:
Euler a,DPM++ 2M Karras, orDPM++ 3M SDE Karras. - Steps:
25 - 40 - CFG Scale:
5.0 - 7.5 - Resolution: Highly optimized for
1024x1024,832x1216,1024x1536, and up to1.7xmega-resolutions (1280x1280,1088x1600,1600x1088).
π οΈ How to Use (Tutorial)
1. ComfyUI (Recommended)
To use this model in ComfyUI, you must tell the sampler to use V-Prediction.
- Load the
SpaceDiffusion_XL_V_PRED_2_0_v1_FINAL.safetensorsusing the standard Load Checkpoint node. - Add a ModelSamplingDiscrete node.
- Connect the
modeloutput from the Checkpoint node to themodelinput of the ModelSamplingDiscrete node. - Set the
samplingsetting on the ModelSamplingDiscrete node tov_prediction. - Connect the
modeloutput to your KSampler. - Ensure your CLIP Text Encoders are set to CLIP Skip -2.
2. A1111 / Forge WebUI
- Place the model in your
models/Stable-diffusionfolder. - Go to Settings > Stable Diffusion.
- Look for the setting related to SDXL and V-Prediction. Ensure your WebUI is updated to a version that auto-detects V-Prediction from safetensors metadata.
- Set CLIP skip to
2in the UI settings.
3. Diffusers (Python)
import torch
from diffusers import StableDiffusionXLPipeline, EulerDiscreteScheduler
model_id = "Bl4ckSpaces/SpaceDiffusion-XL-V-PRED-2.0"
# Explicitly define the V-Prediction scheduler
scheduler = EulerDiscreteScheduler.from_pretrained(
model_id,
subfolder="scheduler",
prediction_type="v_prediction"
)
pipe = StableDiffusionXLPipeline.from_single_file(
"SpaceDiffusion_XL_V_PRED_2_0_v1_FINAL.safetensors",
scheduler=scheduler,
torch_dtype=torch.float16
).to("cuda")
# Don't forget Clip Skip 2 is natively handled if you use the correct prompt weighting or Compel!
prompt = "masterpiece, best quality, ultra-detailed, 1girl, solo, glowing eyes, cyberpunk city"
image = pipe(prompt, num_inference_steps=30, guidance_scale=6.5).images[0]
image.save("output.png")
- Downloads last month
- 426