Generate images using Hugging Face or local models based on Stable Diffusion. Compare images generated by different models from the same prompt. Batch image creation.

Enhanced Stable Diffusion Client/Server System

Python 3.8+ FastAPI Gradio License: MIT


A modern, user-friendly interface for Stable Diffusion image generation with advanced features

Installation • Features • Documentation • Contributing


🚀 Quick Start

# Clone the repository
git clone https://github.com/SikamikanikoBG/ImageGenerator
cd ImageGenerator

# Install dependencies
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt

# Start the server
python server.py

# In a new terminal, start the client
python client.py

Visit http://localhost:7860 to access the web interface.


Overview

A comprehensive Stable Diffusion interface system combining a FastAPI server with a Gradio web client. This system provides advanced image generation capabilities, model management, and image-to-video conversion features through an intuitive user interface.

Features

  • 🎨 Interactive Web Interface: Intuitive Gradio-based UI with organized tabs and controls
  • 🤖 Model Management:
    • Support for both local .safetensors models and online Hugging Face models
    • Model comparison capabilities
    • Automatic model scanning and loading
  • πŸ–ΌοΈ Image Generation:
    • Batch processing support
    • Custom scheduler configurations
    • Adjustable generation parameters
    • Real-time status updates
  • 🎥 Image-to-Video Conversion:
    • Multiple animation presets
    • Region-based animation (face, body, background)
    • Customizable motion types
    • Duration and frame control
  • 📊 Advanced Features:
    • Side-by-side model comparison
    • Automatic memory optimization
    • Comprehensive metadata tracking
    • Progress monitoring
  • 💾 Project Management:
    • Organized output structure
    • Metadata preservation
    • Prompt set management

Server Setup

Requirements

  • Python 3.8+
  • CUDA-capable GPU
  • PyTorch with CUDA support
  • FastAPI and dependencies

Installation

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt

Starting the Server

python server.py

The server will:

  1. Scan for local models in the models directory
  2. Display available models (local and online)
  3. Prompt for model selection
  4. Initialize the selected model
  5. Start the FastAPI server on port 8001

Model Directory Structure

models/
├── model1.safetensors    # Local model file
├── model1.json           # Optional metadata
├── model2.safetensors
└── model2.json

Model Metadata Format

{
    "model_type": "SD",     // or "SDXL"
    "base_model": "SD 1.5",
    "description": "Model description",
    "merged_from": ["model1", "model2"]
}
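
As a rough sketch of the scan step, the pairing of weight files with their optional metadata might look like the following. The `scan_models` helper and its return shape are illustrative, not the project's actual code, and metadata files are assumed to be plain JSON (without the `//` comments shown in the example above):

```python
import json
from pathlib import Path

def scan_models(models_dir="models"):
    """Pair each local .safetensors file with its optional .json metadata."""
    models = []
    for weights in sorted(Path(models_dir).glob("*.safetensors")):
        meta_path = weights.with_suffix(".json")
        # Missing metadata is allowed; fall back to sensible defaults.
        metadata = json.loads(meta_path.read_text()) if meta_path.exists() else {}
        models.append({
            "name": weights.stem,
            "path": str(weights),
            "model_type": metadata.get("model_type", "SD"),
            "metadata": metadata,
        })
    return models
```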

Client Usage

Starting the Client

python client.py [--port PORT] [--share] [--debug]

Interface Tabs

1. 🔌 Connection

  • Server URL configuration (default: http://localhost:8001)
  • Model refresh functionality
  • Connection status monitoring

2. πŸ“ Project

  • Prompt set management through YAML files
  • Prompt set selection
  • Configuration status display

3. βš™οΈ Settings

  • Model selection (single or multiple for comparison)
  • Image dimensions (256-1024 pixels)
  • Generation steps (1-100)
  • Guidance scale (1-20)
  • Scheduler configuration
  • Batch size control
  • Output directory management

4. ✏️ Prompt

  • Main prompt input
  • Negative prompt input
  • Real-time validation

5. πŸ–ΌοΈ Output

  • Generation controls
  • Progress monitoring
  • Image gallery
  • Status updates

6. 🎥 Image to Video

  • Animation presets:
    • Subtle: 20 frames, 2 seconds
    • Normal: 24 frames, 2 seconds
    • Slow: 40 frames, 8 seconds
    • Ultra slow: 40 frames, 12 seconds
  • Region selection
  • Motion type configuration
  • Custom output settings
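
The presets above pair a frame count with a duration, which together fix the frame rate. A minimal sketch of that mapping (the `ANIMATION_PRESETS` dict and `preset_fps` helper are hypothetical names for illustration, not the project's API):

```python
# Hypothetical mapping of the documented presets to (frames, duration) pairs.
ANIMATION_PRESETS = {
    "subtle":     {"frames": 20, "duration_s": 2},
    "normal":     {"frames": 24, "duration_s": 2},
    "slow":       {"frames": 40, "duration_s": 8},
    "ultra_slow": {"frames": 40, "duration_s": 12},
}

def preset_fps(name):
    """Frame rate implied by a preset: frame count divided by duration."""
    preset = ANIMATION_PRESETS[name]
    return preset["frames"] / preset["duration_s"]
```

For example, "subtle" plays at 10 fps while "ultra slow" drops to roughly 3.3 fps, which is what makes the longer presets feel slower at the same frame count.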

Generation Parameters

{
    "prompt": str,
    "negative_prompt": str = "",
    "width": int = 512,          # 384-2048
    "height": int = 512,         # 384-2048
    "num_steps": int = 30,       # 1-150
    "guidance_scale": float = 7.5, # 1.0-20.0
    "scheduler_type": str = "dpmsolver++",
    "karras_sigmas": bool = True,
    "enable_attention_slicing": bool = True,
    "enable_vae_slicing": bool = True,
    "enable_vae_tiling": bool = True
}
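
One way to consume this schema from a script is to build the payload with the documented defaults and range-check the numeric fields before sending it. This `build_generation_payload` helper is a sketch under the parameter ranges listed above, not part of the project:

```python
def build_generation_payload(prompt, **overrides):
    """Fill in the documented defaults, then range-check the common fields."""
    payload = {
        "prompt": prompt,
        "negative_prompt": "",
        "width": 512,
        "height": 512,
        "num_steps": 30,
        "guidance_scale": 7.5,
        "scheduler_type": "dpmsolver++",
        "karras_sigmas": True,
        "enable_attention_slicing": True,
        "enable_vae_slicing": True,
        "enable_vae_tiling": True,
    }
    payload.update(overrides)
    # Ranges taken from the comments in the schema above.
    for key, lo, hi in (("width", 384, 2048), ("height", 384, 2048),
                        ("num_steps", 1, 150), ("guidance_scale", 1.0, 20.0)):
        if not lo <= payload[key] <= hi:
            raise ValueError(f"{key}={payload[key]} outside [{lo}, {hi}]")
    return payload
```

A request could then be sent with, e.g., `requests.post("http://localhost:8001/generate", json=build_generation_payload("a castle at dusk", width=768))`.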

API Endpoints

Server API

  • POST /generate: Generate images with specified parameters
  • GET /models: List available models
  • GET /health: Check server status and GPU information
  • POST /compare: Generate images with multiple models

Health Check Response

{
    "status": "ok",
    "cuda_available": true,
    "model_loaded": true,
    "current_model": {
        "name": "Model Name",
        "type": "SD/SDXL",
        "base_model": "Base Model Info",
        "default_size": 512
    },
    "gpu_info": {
        "name": "GPU Name",
        "total_memory_gb": "16.00",
        "used_memory_gb": "4.00",
        "free_memory_gb": "12.00"
    }
}
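
Because `free_memory_gb` is returned as a string, a client has to parse it before using it, e.g. to pick a batch size. The helpers below are illustrative, and the 3 GB-per-image figure is a made-up heuristic, not a measured requirement:

```python
def free_gpu_gb(health):
    """Extract free GPU memory in GB from a /health response dict."""
    return float(health["gpu_info"]["free_memory_gb"])

def suggest_batch_size(health, gb_per_image=3.0, max_batch=8):
    """Rough heuristic: one image per ~3 GB free, clamped to [1, max_batch]."""
    return max(1, min(max_batch, int(free_gpu_gb(health) / gb_per_image)))
```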

Output Structure

outputs/
├── images/
│   ├── prefix_model_timestamp.png
│   ├── prefix_model_timestamp_0.png
│   └── ...
└── metadata/
    ├── prefix_model_timestamp.yaml
    └── prefix_model_timestamp_0.yaml

Best Practices

Performance Optimization

  1. Monitor GPU memory through the health endpoint
  2. Use batch sizes appropriate for your GPU
  3. Enable memory optimizations for large models
  4. Consider model type when selecting resolution

Generation Tips

  1. Start with default settings
  2. Use comparison mode to evaluate models
  3. Save successful prompts
  4. Monitor generation status
  5. Select appropriate animation presets

Error Handling

  • Verify server connection before generation
  • Monitor generation status
  • Check error messages in status area
  • Use appropriate model for desired resolution
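
Verifying the connection before generating can be as simple as probing `/health` and treating any network or parse failure as "server down". This `server_reachable` sketch uses only the standard library; the function name and error handling are illustrative:

```python
import json
from urllib import request, error

def server_reachable(base_url="http://localhost:8001", timeout=3.0):
    """Return the parsed /health payload, or None if the server is unreachable."""
    try:
        with request.urlopen(base_url.rstrip("/") + "/health",
                             timeout=timeout) as resp:
            return json.loads(resp.read().decode())
    except (error.URLError, OSError, ValueError):
        # Connection refused, timeout, or malformed response all count as down.
        return None
```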

Development

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Implement changes
  4. Submit a pull request

Building From Source

git clone https://github.com/SikamikanikoBG/ImageGenerator
cd ImageGenerator
pip install -r requirements.txt

License

MIT License

Acknowledgments

  • Stability AI for Stable Diffusion
  • Hugging Face for model distribution
  • Gradio team for the UI framework
  • FastAPI team for the server framework
