Initial implementation of AudioCraft Studio
Complete web interface for Meta's AudioCraft AI audio generation: - Gradio UI with tabs for all 5 model families (MusicGen, AudioGen, MAGNeT, MusicGen Style, JASCO) - REST API with FastAPI, OpenAPI docs, and API key auth - VRAM management with ComfyUI coexistence support - SQLite database for project/generation history - Batch processing queue for async generation - Docker deployment optimized for RunPod with RTX 4090 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
42
.env.example
Normal file
42
.env.example
Normal file
@@ -0,0 +1,42 @@
|
|||||||
|
# AudioCraft Studio Configuration
|
||||||
|
# Copy this file to .env and customize as needed
|
||||||
|
|
||||||
|
# Server Configuration
|
||||||
|
AUDIOCRAFT_HOST=0.0.0.0
|
||||||
|
AUDIOCRAFT_GRADIO_PORT=7860
|
||||||
|
AUDIOCRAFT_API_PORT=8000
|
||||||
|
|
||||||
|
# Paths (relative to project root)
|
||||||
|
AUDIOCRAFT_DATA_DIR=./data
|
||||||
|
AUDIOCRAFT_OUTPUT_DIR=./outputs
|
||||||
|
AUDIOCRAFT_CACHE_DIR=./cache
|
||||||
|
|
||||||
|
# VRAM Management
|
||||||
|
# Reserve this much VRAM for ComfyUI (GB)
|
||||||
|
AUDIOCRAFT_COMFYUI_RESERVE_GB=10
|
||||||
|
# Safety buffer to prevent OOM (GB)
|
||||||
|
AUDIOCRAFT_SAFETY_BUFFER_GB=1
|
||||||
|
# Unload idle models after this many minutes
|
||||||
|
AUDIOCRAFT_IDLE_UNLOAD_MINUTES=15
|
||||||
|
# Maximum number of models to keep loaded
|
||||||
|
AUDIOCRAFT_MAX_CACHED_MODELS=2
|
||||||
|
|
||||||
|
# API Authentication
|
||||||
|
# Generate a secure random key for production
|
||||||
|
AUDIOCRAFT_API_KEY=your-secret-api-key-here
|
||||||
|
|
||||||
|
# Generation Defaults
|
||||||
|
AUDIOCRAFT_DEFAULT_DURATION=10.0
|
||||||
|
AUDIOCRAFT_MAX_DURATION=300.0
|
||||||
|
AUDIOCRAFT_DEFAULT_BATCH_SIZE=1
|
||||||
|
AUDIOCRAFT_MAX_BATCH_SIZE=8
|
||||||
|
AUDIOCRAFT_MAX_QUEUE_SIZE=100
|
||||||
|
|
||||||
|
# Database
|
||||||
|
AUDIOCRAFT_DATABASE_URL=sqlite+aiosqlite:///./data/audiocraft.db
|
||||||
|
|
||||||
|
# Logging
|
||||||
|
AUDIOCRAFT_LOG_LEVEL=INFO
|
||||||
|
|
||||||
|
# PyTorch Optimization (recommended)
|
||||||
|
PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True
|
||||||
76
.gitignore
vendored
Normal file
76
.gitignore
vendored
Normal file
@@ -0,0 +1,76 @@
|
|||||||
|
# Python
|
||||||
|
__pycache__/
|
||||||
|
*.py[cod]
|
||||||
|
*$py.class
|
||||||
|
*.so
|
||||||
|
.Python
|
||||||
|
build/
|
||||||
|
develop-eggs/
|
||||||
|
dist/
|
||||||
|
downloads/
|
||||||
|
eggs/
|
||||||
|
.eggs/
|
||||||
|
lib/
|
||||||
|
lib64/
|
||||||
|
parts/
|
||||||
|
sdist/
|
||||||
|
var/
|
||||||
|
wheels/
|
||||||
|
*.egg-info/
|
||||||
|
.installed.cfg
|
||||||
|
*.egg
|
||||||
|
|
||||||
|
# Virtual environments
|
||||||
|
.venv/
|
||||||
|
venv/
|
||||||
|
ENV/
|
||||||
|
|
||||||
|
# IDE
|
||||||
|
.idea/
|
||||||
|
.vscode/
|
||||||
|
*.swp
|
||||||
|
*.swo
|
||||||
|
*~
|
||||||
|
|
||||||
|
# Testing
|
||||||
|
.pytest_cache/
|
||||||
|
.coverage
|
||||||
|
htmlcov/
|
||||||
|
.tox/
|
||||||
|
.nox/
|
||||||
|
|
||||||
|
# Type checking
|
||||||
|
.mypy_cache/
|
||||||
|
|
||||||
|
# Project specific
|
||||||
|
data/
|
||||||
|
outputs/
|
||||||
|
cache/
|
||||||
|
*.db
|
||||||
|
*.sqlite
|
||||||
|
*.sqlite3
|
||||||
|
|
||||||
|
# Logs
|
||||||
|
*.log
|
||||||
|
logs/
|
||||||
|
|
||||||
|
# Environment
|
||||||
|
.env
|
||||||
|
.env.local
|
||||||
|
.env.*.local
|
||||||
|
|
||||||
|
# Model weights (downloaded from HuggingFace)
|
||||||
|
*.bin
|
||||||
|
*.safetensors
|
||||||
|
*.pt
|
||||||
|
*.pth
|
||||||
|
|
||||||
|
# Audio files (generated)
|
||||||
|
*.wav
|
||||||
|
*.mp3
|
||||||
|
*.flac
|
||||||
|
*.ogg
|
||||||
|
|
||||||
|
# Temp files
|
||||||
|
/tmp/
|
||||||
|
*.tmp
|
||||||
83
Dockerfile
Normal file
83
Dockerfile
Normal file
@@ -0,0 +1,83 @@
|
|||||||
|
# AudioCraft Studio Dockerfile for RunPod
|
||||||
|
# Optimized for NVIDIA RTX 4090 (24GB VRAM)
|
||||||
|
|
||||||
|
FROM nvidia/cuda:12.1-cudnn8-runtime-ubuntu22.04
|
||||||
|
|
||||||
|
# Set environment variables
|
||||||
|
ENV DEBIAN_FRONTEND=noninteractive
|
||||||
|
ENV PYTHONUNBUFFERED=1
|
||||||
|
ENV PYTHONDONTWRITEBYTECODE=1
|
||||||
|
ENV PIP_NO_CACHE_DIR=1
|
||||||
|
ENV PIP_DISABLE_PIP_VERSION_CHECK=1
|
||||||
|
|
||||||
|
# CUDA settings
|
||||||
|
ENV CUDA_HOME=/usr/local/cuda
|
||||||
|
ENV PATH="${CUDA_HOME}/bin:${PATH}"
|
||||||
|
ENV LD_LIBRARY_PATH="${CUDA_HOME}/lib64:${LD_LIBRARY_PATH}"
|
||||||
|
|
||||||
|
# AudioCraft settings
|
||||||
|
ENV AUDIOCRAFT_OUTPUT_DIR=/workspace/outputs
|
||||||
|
ENV AUDIOCRAFT_DATA_DIR=/workspace/data
|
||||||
|
ENV AUDIOCRAFT_MODEL_CACHE=/workspace/models
|
||||||
|
ENV AUDIOCRAFT_HOST=0.0.0.0
|
||||||
|
ENV AUDIOCRAFT_GRADIO_PORT=7860
|
||||||
|
ENV AUDIOCRAFT_API_PORT=8000
|
||||||
|
|
||||||
|
# Install system dependencies
|
||||||
|
RUN apt-get update && apt-get install -y --no-install-recommends \
|
||||||
|
git \
|
||||||
|
curl \
|
||||||
|
wget \
|
||||||
|
ffmpeg \
|
||||||
|
libsndfile1 \
|
||||||
|
libsox-dev \
|
||||||
|
sox \
|
||||||
|
build-essential \
|
||||||
|
python3.10 \
|
||||||
|
python3.10-venv \
|
||||||
|
python3.10-dev \
|
||||||
|
python3-pip \
|
||||||
|
&& rm -rf /var/lib/apt/lists/*
|
||||||
|
|
||||||
|
# Set Python 3.10 as default
|
||||||
|
RUN update-alternatives --install /usr/bin/python python /usr/bin/python3.10 1 \
|
||||||
|
&& update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.10 1
|
||||||
|
|
||||||
|
# Upgrade pip
|
||||||
|
RUN pip install --upgrade pip setuptools wheel
|
||||||
|
|
||||||
|
# Create workspace directory
|
||||||
|
WORKDIR /workspace
|
||||||
|
|
||||||
|
# Create necessary directories
|
||||||
|
RUN mkdir -p /workspace/outputs /workspace/data /workspace/models /workspace/app
|
||||||
|
|
||||||
|
# Copy requirements first for caching
|
||||||
|
COPY requirements.txt /workspace/app/
|
||||||
|
WORKDIR /workspace/app
|
||||||
|
|
||||||
|
# Install PyTorch with CUDA support
|
||||||
|
RUN pip install torch==2.1.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu121
|
||||||
|
|
||||||
|
# Install other requirements
|
||||||
|
RUN pip install -r requirements.txt
|
||||||
|
|
||||||
|
# Install AudioCraft from source for latest features
|
||||||
|
RUN pip install git+https://github.com/facebookresearch/audiocraft.git
|
||||||
|
|
||||||
|
# Copy application code
|
||||||
|
COPY . /workspace/app/
|
||||||
|
|
||||||
|
# Create non-root user for security (optional, RunPod often uses root)
|
||||||
|
# RUN useradd -m -u 1000 audiocraft && chown -R audiocraft:audiocraft /workspace
|
||||||
|
# USER audiocraft
|
||||||
|
|
||||||
|
# Expose ports
|
||||||
|
EXPOSE 7860 8000
|
||||||
|
|
||||||
|
# Health check
|
||||||
|
HEALTHCHECK --interval=30s --timeout=10s --start-period=60s --retries=3 \
|
||||||
|
CMD curl -f http://localhost:7860/ || exit 1
|
||||||
|
|
||||||
|
# Default command
|
||||||
|
CMD ["python", "main.py"]
|
||||||
197
README.md
Normal file
197
README.md
Normal file
@@ -0,0 +1,197 @@
|
|||||||
|
# AudioCraft Studio
|
||||||
|
|
||||||
|
A comprehensive web interface for Meta's AudioCraft AI audio generation models, optimized for RunPod deployment with NVIDIA RTX 4090 GPUs.
|
||||||
|
|
||||||
|
## Features
|
||||||
|
|
||||||
|
### Models Supported
|
||||||
|
- **MusicGen** - Text-to-music generation with melody conditioning
|
||||||
|
- **AudioGen** - Text-to-sound effects and environmental audio
|
||||||
|
- **MAGNeT** - Fast non-autoregressive music generation
|
||||||
|
- **MusicGen Style** - Style-conditioned music from reference audio
|
||||||
|
- **JASCO** - Chord and drum-conditioned music generation
|
||||||
|
|
||||||
|
### Core Capabilities
|
||||||
|
- **Gradio Web UI** - Intuitive interface with real-time generation
|
||||||
|
- **REST API** - Full-featured API with OpenAPI documentation
|
||||||
|
- **Batch Processing** - Queue system for multiple generations
|
||||||
|
- **Project Management** - Organize and browse generation history
|
||||||
|
- **VRAM Management** - Smart model loading/unloading, ComfyUI coexistence
|
||||||
|
- **Waveform Visualization** - Visual audio feedback
|
||||||
|
|
||||||
|
## Quick Start
|
||||||
|
|
||||||
|
### Local Development
|
||||||
|
|
||||||
|
```bash
|
||||||
|
# Clone repository
|
||||||
|
git clone https://github.com/your-username/audiocraft-ui.git
|
||||||
|
cd audiocraft-ui
|
||||||
|
|
||||||
|
# Create virtual environment
|
||||||
|
python -m venv venv
|
||||||
|
source venv/bin/activate # Linux/Mac
|
||||||
|
# or: venv\Scripts\activate # Windows
|
||||||
|
|
||||||
|
# Install dependencies
|
||||||
|
pip install -r requirements.txt
|
||||||
|
|
||||||
|
# Run application
|
||||||
|
python main.py
|
||||||
|
```
|
||||||
|
|
||||||
|
Access the UI at `http://localhost:7860`
|
||||||
|
|
||||||
|
### Docker
|
||||||
|
|
||||||
|
```bash
|
||||||
|
# Build and run
|
||||||
|
docker-compose up --build
|
||||||
|
|
||||||
|
# Or build manually
|
||||||
|
docker build -t audiocraft-studio .
|
||||||
|
docker run --gpus all -p 7860:7860 -p 8000:8000 audiocraft-studio
|
||||||
|
```
|
||||||
|
|
||||||
|
### RunPod Deployment
|
||||||
|
|
||||||
|
1. Build and push Docker image:
|
||||||
|
```bash
|
||||||
|
docker build -t your-dockerhub/audiocraft-studio:latest .
|
||||||
|
docker push your-dockerhub/audiocraft-studio:latest
|
||||||
|
```
|
||||||
|
|
||||||
|
2. Create RunPod template using `runpod.yaml` as reference
|
||||||
|
|
||||||
|
3. Deploy with RTX 4090 or equivalent GPU
|
||||||
|
|
||||||
|
## Configuration
|
||||||
|
|
||||||
|
Configuration via environment variables:
|
||||||
|
|
||||||
|
| Variable | Default | Description |
|
||||||
|
|----------|---------|-------------|
|
||||||
|
| `AUDIOCRAFT_HOST` | `0.0.0.0` | Server bind address |
|
||||||
|
| `AUDIOCRAFT_GRADIO_PORT` | `7860` | Gradio UI port |
|
||||||
|
| `AUDIOCRAFT_API_PORT` | `8000` | REST API port |
|
||||||
|
| `AUDIOCRAFT_OUTPUT_DIR` | `./outputs` | Generated audio output |
|
||||||
|
| `AUDIOCRAFT_DATA_DIR` | `./data` | Database and config |
|
||||||
|
| `AUDIOCRAFT_COMFYUI_RESERVE_GB` | `10` | VRAM reserved for ComfyUI |
|
||||||
|
| `AUDIOCRAFT_MAX_LOADED_MODELS` | `2` | Max models in memory |
|
||||||
|
| `AUDIOCRAFT_IDLE_UNLOAD_MINUTES` | `15` | Auto-unload idle models |
|
||||||
|
|
||||||
|
See `.env.example` for full configuration options.
|
||||||
|
|
||||||
|
## API Usage
|
||||||
|
|
||||||
|
### Authentication
|
||||||
|
|
||||||
|
```bash
|
||||||
|
# Get API key from Settings page or generate via CLI
|
||||||
|
curl -X POST http://localhost:8000/api/v1/system/api-key/regenerate \
|
||||||
|
-H "X-API-Key: YOUR_CURRENT_KEY"
|
||||||
|
```
|
||||||
|
|
||||||
|
### Generate Audio
|
||||||
|
|
||||||
|
```bash
|
||||||
|
# Synchronous generation
|
||||||
|
curl -X POST http://localhost:8000/api/v1/generate \
|
||||||
|
-H "X-API-Key: YOUR_API_KEY" \
|
||||||
|
-H "Content-Type: application/json" \
|
||||||
|
-d '{
|
||||||
|
"model": "musicgen",
|
||||||
|
"variant": "medium",
|
||||||
|
"prompts": ["upbeat electronic dance music with synth leads"],
|
||||||
|
"duration": 10
|
||||||
|
}'
|
||||||
|
|
||||||
|
# Async (queue) generation
|
||||||
|
curl -X POST http://localhost:8000/api/v1/generate/async \
|
||||||
|
-H "X-API-Key: YOUR_API_KEY" \
|
||||||
|
-H "Content-Type: application/json" \
|
||||||
|
-d '{
|
||||||
|
"request": {
|
||||||
|
"model": "musicgen",
|
||||||
|
"prompts": ["ambient soundscape"],
|
||||||
|
"duration": 30
|
||||||
|
},
|
||||||
|
"priority": 5
|
||||||
|
}'
|
||||||
|
```
|
||||||
|
|
||||||
|
### Check Job Status
|
||||||
|
|
||||||
|
```bash
|
||||||
|
curl http://localhost:8000/api/v1/generate/jobs/{job_id} \
|
||||||
|
-H "X-API-Key: YOUR_API_KEY"
|
||||||
|
```
|
||||||
|
|
||||||
|
Full API documentation available at `http://localhost:8000/api/docs`
|
||||||
|
|
||||||
|
## Architecture
|
||||||
|
|
||||||
|
```
|
||||||
|
audiocraft-ui/
|
||||||
|
├── config/
|
||||||
|
│ ├── settings.py # Pydantic settings
|
||||||
|
│ └── models.yaml # Model registry
|
||||||
|
├── src/
|
||||||
|
│ ├── core/
|
||||||
|
│ │ ├── base_model.py # Abstract model interface
|
||||||
|
│ │ ├── gpu_manager.py # VRAM management
|
||||||
|
│ │ ├── model_registry.py # Model loading/caching
|
||||||
|
│ │ └── oom_handler.py # OOM recovery
|
||||||
|
│ ├── models/
|
||||||
|
│ │ ├── musicgen/ # MusicGen adapter
|
||||||
|
│ │ ├── audiogen/ # AudioGen adapter
|
||||||
|
│ │ ├── magnet/ # MAGNeT adapter
|
||||||
|
│ │ ├── musicgen_style/ # Style adapter
|
||||||
|
│ │ └── jasco/ # JASCO adapter
|
||||||
|
│ ├── services/
|
||||||
|
│ │ ├── generation_service.py
|
||||||
|
│ │ ├── batch_processor.py
|
||||||
|
│ │ └── project_service.py
|
||||||
|
│ ├── storage/
|
||||||
|
│ │ └── database.py # SQLite storage
|
||||||
|
│ ├── api/
|
||||||
|
│ │ ├── app.py # FastAPI app
|
||||||
|
│ │ └── routes/ # API endpoints
|
||||||
|
│ └── ui/
|
||||||
|
│ ├── app.py # Gradio app
|
||||||
|
│ ├── components/ # Reusable UI components
|
||||||
|
│ ├── tabs/ # Model generation tabs
|
||||||
|
│ └── pages/ # Projects, Settings
|
||||||
|
├── main.py # Entry point
|
||||||
|
├── Dockerfile
|
||||||
|
└── docker-compose.yml
|
||||||
|
```
|
||||||
|
|
||||||
|
## ComfyUI Coexistence
|
||||||
|
|
||||||
|
AudioCraft Studio is designed to run alongside ComfyUI on the same GPU:
|
||||||
|
|
||||||
|
1. Set `AUDIOCRAFT_COMFYUI_RESERVE_GB` to reserve VRAM for ComfyUI
|
||||||
|
2. Models are automatically unloaded when idle
|
||||||
|
3. Coordination file at `/tmp/audiocraft_comfyui_coord.json` prevents conflicts
|
||||||
|
|
||||||
|
## Development
|
||||||
|
|
||||||
|
```bash
|
||||||
|
# Install dev dependencies
|
||||||
|
pip install -r requirements-dev.txt
|
||||||
|
|
||||||
|
# Run tests
|
||||||
|
pytest
|
||||||
|
|
||||||
|
# Format code
|
||||||
|
black src/ config/
|
||||||
|
ruff check src/ config/
|
||||||
|
|
||||||
|
# Type checking
|
||||||
|
mypy src/
|
||||||
|
```
|
||||||
|
|
||||||
|
## License
|
||||||
|
|
||||||
|
This project uses Meta's AudioCraft library. See [AudioCraft License](https://github.com/facebookresearch/audiocraft/blob/main/LICENSE).
|
||||||
5
config/__init__.py
Normal file
5
config/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""Configuration module for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from config.settings import Settings, get_settings
|
||||||
|
|
||||||
|
__all__ = ["Settings", "get_settings"]
|
||||||
151
config/models.yaml
Normal file
151
config/models.yaml
Normal file
@@ -0,0 +1,151 @@
|
|||||||
|
# AudioCraft Model Registry Configuration
|
||||||
|
# This file defines all available models and their configurations
|
||||||
|
|
||||||
|
models:
|
||||||
|
musicgen:
|
||||||
|
enabled: true
|
||||||
|
display_name: "MusicGen"
|
||||||
|
description: "Text-to-music generation with optional melody conditioning"
|
||||||
|
default_variant: medium
|
||||||
|
variants:
|
||||||
|
small:
|
||||||
|
hf_id: facebook/musicgen-small
|
||||||
|
vram_mb: 1500
|
||||||
|
max_duration: 30
|
||||||
|
description: "Fast, lightweight model (300M params)"
|
||||||
|
medium:
|
||||||
|
hf_id: facebook/musicgen-medium
|
||||||
|
vram_mb: 5000
|
||||||
|
max_duration: 30
|
||||||
|
description: "Balanced quality and speed (1.5B params)"
|
||||||
|
large:
|
||||||
|
hf_id: facebook/musicgen-large
|
||||||
|
vram_mb: 10000
|
||||||
|
max_duration: 30
|
||||||
|
description: "Highest quality, slower (3.3B params)"
|
||||||
|
melody:
|
||||||
|
hf_id: facebook/musicgen-melody
|
||||||
|
vram_mb: 5000
|
||||||
|
max_duration: 30
|
||||||
|
conditioning:
|
||||||
|
- melody
|
||||||
|
description: "Melody-conditioned generation (1.5B params)"
|
||||||
|
stereo-small:
|
||||||
|
hf_id: facebook/musicgen-stereo-small
|
||||||
|
vram_mb: 1800
|
||||||
|
max_duration: 30
|
||||||
|
channels: 2
|
||||||
|
description: "Stereo output, fast (300M params)"
|
||||||
|
stereo-medium:
|
||||||
|
hf_id: facebook/musicgen-stereo-medium
|
||||||
|
vram_mb: 6000
|
||||||
|
max_duration: 30
|
||||||
|
channels: 2
|
||||||
|
description: "Stereo output, balanced (1.5B params)"
|
||||||
|
stereo-large:
|
||||||
|
hf_id: facebook/musicgen-stereo-large
|
||||||
|
vram_mb: 12000
|
||||||
|
max_duration: 30
|
||||||
|
channels: 2
|
||||||
|
description: "Stereo output, highest quality (3.3B params)"
|
||||||
|
stereo-melody:
|
||||||
|
hf_id: facebook/musicgen-stereo-melody
|
||||||
|
vram_mb: 6000
|
||||||
|
max_duration: 30
|
||||||
|
channels: 2
|
||||||
|
conditioning:
|
||||||
|
- melody
|
||||||
|
description: "Stereo melody-conditioned (1.5B params)"
|
||||||
|
|
||||||
|
audiogen:
|
||||||
|
enabled: true
|
||||||
|
display_name: "AudioGen"
|
||||||
|
description: "Text-to-sound effects generation"
|
||||||
|
default_variant: medium
|
||||||
|
variants:
|
||||||
|
medium:
|
||||||
|
hf_id: facebook/audiogen-medium
|
||||||
|
vram_mb: 5000
|
||||||
|
max_duration: 10
|
||||||
|
description: "Sound effects generator (1.5B params)"
|
||||||
|
|
||||||
|
magnet:
|
||||||
|
enabled: true
|
||||||
|
display_name: "MAGNeT"
|
||||||
|
description: "Fast non-autoregressive music generation"
|
||||||
|
default_variant: medium-10secs
|
||||||
|
variants:
|
||||||
|
small-10secs:
|
||||||
|
hf_id: facebook/magnet-small-10secs
|
||||||
|
vram_mb: 1500
|
||||||
|
max_duration: 10
|
||||||
|
description: "Fast 10-second clips (300M params)"
|
||||||
|
medium-10secs:
|
||||||
|
hf_id: facebook/magnet-medium-10secs
|
||||||
|
vram_mb: 5000
|
||||||
|
max_duration: 10
|
||||||
|
description: "Quality 10-second clips (1.5B params)"
|
||||||
|
small-30secs:
|
||||||
|
hf_id: facebook/magnet-small-30secs
|
||||||
|
vram_mb: 1800
|
||||||
|
max_duration: 30
|
||||||
|
description: "Fast 30-second clips (300M params)"
|
||||||
|
medium-30secs:
|
||||||
|
hf_id: facebook/magnet-medium-30secs
|
||||||
|
vram_mb: 6000
|
||||||
|
max_duration: 30
|
||||||
|
description: "Quality 30-second clips (1.5B params)"
|
||||||
|
|
||||||
|
musicgen-style:
|
||||||
|
enabled: true
|
||||||
|
display_name: "MusicGen Style"
|
||||||
|
description: "Style-conditioned music generation from reference audio"
|
||||||
|
default_variant: medium
|
||||||
|
variants:
|
||||||
|
medium:
|
||||||
|
hf_id: facebook/musicgen-style
|
||||||
|
vram_mb: 5000
|
||||||
|
max_duration: 30
|
||||||
|
conditioning:
|
||||||
|
- style
|
||||||
|
description: "Style transfer from reference audio (1.5B params)"
|
||||||
|
|
||||||
|
jasco:
|
||||||
|
enabled: true
|
||||||
|
display_name: "JASCO"
|
||||||
|
description: "Chord and drum-conditioned music generation"
|
||||||
|
default_variant: chords-drums-400M
|
||||||
|
variants:
|
||||||
|
chords-drums-400M:
|
||||||
|
hf_id: facebook/jasco-chords-drums-400M
|
||||||
|
vram_mb: 2000
|
||||||
|
max_duration: 10
|
||||||
|
conditioning:
|
||||||
|
- chords
|
||||||
|
- drums
|
||||||
|
description: "Chord/drum control, fast (400M params)"
|
||||||
|
chords-drums-1B:
|
||||||
|
hf_id: facebook/jasco-chords-drums-1B
|
||||||
|
vram_mb: 4000
|
||||||
|
max_duration: 10
|
||||||
|
conditioning:
|
||||||
|
- chords
|
||||||
|
- drums
|
||||||
|
description: "Chord/drum control, higher quality (1B params)"
|
||||||
|
|
||||||
|
# Default generation parameters
|
||||||
|
defaults:
|
||||||
|
generation:
|
||||||
|
duration: 10
|
||||||
|
temperature: 1.0
|
||||||
|
top_k: 250
|
||||||
|
top_p: 0.0
|
||||||
|
cfg_coef: 3.0
|
||||||
|
|
||||||
|
# VRAM thresholds for warnings
|
||||||
|
vram:
|
||||||
|
warning_threshold: 0.85 # 85% utilization warning
|
||||||
|
critical_threshold: 0.95 # 95% utilization critical
|
||||||
|
|
||||||
|
# Presets are loaded from data/presets/*.yaml
|
||||||
|
presets_dir: "./data/presets"
|
||||||
94
config/settings.py
Normal file
94
config/settings.py
Normal file
@@ -0,0 +1,94 @@
|
|||||||
|
"""Application settings with environment variable support."""
|
||||||
|
|
||||||
|
from functools import lru_cache
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Optional
|
||||||
|
|
||||||
|
from pydantic import Field
|
||||||
|
from pydantic_settings import BaseSettings, SettingsConfigDict
|
||||||
|
|
||||||
|
|
||||||
|
class Settings(BaseSettings):
|
||||||
|
"""Application configuration with environment variable support.
|
||||||
|
|
||||||
|
All settings can be overridden via environment variables prefixed with AUDIOCRAFT_.
|
||||||
|
Example: AUDIOCRAFT_API_PORT=8080
|
||||||
|
"""
|
||||||
|
|
||||||
|
model_config = SettingsConfigDict(
|
||||||
|
env_prefix="AUDIOCRAFT_",
|
||||||
|
env_file=".env",
|
||||||
|
env_file_encoding="utf-8",
|
||||||
|
extra="ignore",
|
||||||
|
)
|
||||||
|
|
||||||
|
# Server Configuration
|
||||||
|
host: str = Field(default="0.0.0.0", description="Server bind host")
|
||||||
|
gradio_port: int = Field(default=7860, description="Gradio UI port")
|
||||||
|
api_port: int = Field(default=8000, description="FastAPI port")
|
||||||
|
|
||||||
|
# Paths
|
||||||
|
data_dir: Path = Field(default=Path("./data"), description="Data directory")
|
||||||
|
output_dir: Path = Field(default=Path("./outputs"), description="Generated audio output")
|
||||||
|
cache_dir: Path = Field(default=Path("./cache"), description="Model cache directory")
|
||||||
|
models_config: Path = Field(
|
||||||
|
default=Path("./config/models.yaml"), description="Model registry config"
|
||||||
|
)
|
||||||
|
|
||||||
|
# VRAM Management
|
||||||
|
comfyui_reserve_gb: float = Field(
|
||||||
|
default=10.0, description="VRAM reserved for ComfyUI (GB)"
|
||||||
|
)
|
||||||
|
safety_buffer_gb: float = Field(
|
||||||
|
default=1.0, description="Safety buffer to prevent OOM (GB)"
|
||||||
|
)
|
||||||
|
idle_unload_minutes: int = Field(
|
||||||
|
default=15, description="Unload models after idle time (minutes)"
|
||||||
|
)
|
||||||
|
max_cached_models: int = Field(
|
||||||
|
default=2, description="Maximum number of models to keep loaded"
|
||||||
|
)
|
||||||
|
|
||||||
|
# API Authentication
|
||||||
|
api_key: Optional[str] = Field(default=None, description="API key for authentication")
|
||||||
|
cors_origins: list[str] = Field(
|
||||||
|
default=["*"], description="Allowed CORS origins"
|
||||||
|
)
|
||||||
|
|
||||||
|
# Generation Defaults
|
||||||
|
default_duration: float = Field(default=10.0, description="Default generation duration")
|
||||||
|
max_duration: float = Field(default=300.0, description="Maximum generation duration")
|
||||||
|
default_batch_size: int = Field(default=1, description="Default batch size")
|
||||||
|
max_batch_size: int = Field(default=8, description="Maximum batch size")
|
||||||
|
max_queue_size: int = Field(default=100, description="Maximum generation queue size")
|
||||||
|
|
||||||
|
# Database
|
||||||
|
database_url: str = Field(
|
||||||
|
default="sqlite+aiosqlite:///./data/audiocraft.db",
|
||||||
|
description="Database connection URL",
|
||||||
|
)
|
||||||
|
|
||||||
|
# Logging
|
||||||
|
log_level: str = Field(default="INFO", description="Logging level")
|
||||||
|
|
||||||
|
def ensure_directories(self) -> None:
|
||||||
|
"""Create required directories if they don't exist."""
|
||||||
|
self.data_dir.mkdir(parents=True, exist_ok=True)
|
||||||
|
self.output_dir.mkdir(parents=True, exist_ok=True)
|
||||||
|
self.cache_dir.mkdir(parents=True, exist_ok=True)
|
||||||
|
(self.data_dir / "presets").mkdir(parents=True, exist_ok=True)
|
||||||
|
|
||||||
|
@property
|
||||||
|
def database_path(self) -> Path:
|
||||||
|
"""Extract database file path from URL."""
|
||||||
|
if self.database_url.startswith("sqlite"):
|
||||||
|
# Handle both sqlite:/// and sqlite+aiosqlite:///
|
||||||
|
path = self.database_url.split("///")[-1]
|
||||||
|
return Path(path)
|
||||||
|
raise ValueError("Only SQLite databases are supported")
|
||||||
|
|
||||||
|
|
||||||
|
@lru_cache
|
||||||
|
def get_settings() -> Settings:
|
||||||
|
"""Get cached settings instance."""
|
||||||
|
return Settings()
|
||||||
64
docker-compose.yml
Normal file
64
docker-compose.yml
Normal file
@@ -0,0 +1,64 @@
|
|||||||
|
# Docker Compose for local development and testing
|
||||||
|
# For RunPod deployment, use the Dockerfile directly
|
||||||
|
|
||||||
|
version: '3.8'
|
||||||
|
|
||||||
|
services:
|
||||||
|
audiocraft:
|
||||||
|
build:
|
||||||
|
context: .
|
||||||
|
dockerfile: Dockerfile
|
||||||
|
container_name: audiocraft-studio
|
||||||
|
ports:
|
||||||
|
- "7860:7860" # Gradio UI
|
||||||
|
- "8000:8000" # REST API
|
||||||
|
volumes:
|
||||||
|
# Persistent storage
|
||||||
|
- audiocraft-outputs:/workspace/outputs
|
||||||
|
- audiocraft-data:/workspace/data
|
||||||
|
- audiocraft-models:/workspace/models
|
||||||
|
# Development: mount source code
|
||||||
|
- ./src:/workspace/app/src:ro
|
||||||
|
- ./config:/workspace/app/config:ro
|
||||||
|
environment:
|
||||||
|
- AUDIOCRAFT_HOST=0.0.0.0
|
||||||
|
- AUDIOCRAFT_GRADIO_PORT=7860
|
||||||
|
- AUDIOCRAFT_API_PORT=8000
|
||||||
|
- AUDIOCRAFT_DEBUG=false
|
||||||
|
- AUDIOCRAFT_COMFYUI_RESERVE_GB=0 # No ComfyUI in this compose
|
||||||
|
- NVIDIA_VISIBLE_DEVICES=all
|
||||||
|
deploy:
|
||||||
|
resources:
|
||||||
|
reservations:
|
||||||
|
devices:
|
||||||
|
- driver: nvidia
|
||||||
|
count: 1
|
||||||
|
capabilities: [gpu]
|
||||||
|
restart: unless-stopped
|
||||||
|
healthcheck:
|
||||||
|
test: ["CMD", "curl", "-f", "http://localhost:7860/"]
|
||||||
|
interval: 30s
|
||||||
|
timeout: 10s
|
||||||
|
retries: 3
|
||||||
|
start_period: 60s
|
||||||
|
|
||||||
|
# Optional: Run alongside ComfyUI
|
||||||
|
# comfyui:
|
||||||
|
# image: your-comfyui-image
|
||||||
|
# container_name: comfyui
|
||||||
|
# ports:
|
||||||
|
# - "8188:8188"
|
||||||
|
# volumes:
|
||||||
|
# - comfyui-data:/workspace
|
||||||
|
# deploy:
|
||||||
|
# resources:
|
||||||
|
# reservations:
|
||||||
|
# devices:
|
||||||
|
# - driver: nvidia
|
||||||
|
# count: 1
|
||||||
|
# capabilities: [gpu]
|
||||||
|
|
||||||
|
volumes:
|
||||||
|
audiocraft-outputs:
|
||||||
|
audiocraft-data:
|
||||||
|
audiocraft-models:
|
||||||
147
main.py
Normal file
147
main.py
Normal file
@@ -0,0 +1,147 @@
|
|||||||
|
#!/usr/bin/env python3
|
||||||
|
"""Main entry point for AudioCraft Studio."""
|
||||||
|
|
||||||
|
import asyncio
|
||||||
|
import logging
|
||||||
|
import sys
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
# Add project root to path
|
||||||
|
sys.path.insert(0, str(Path(__file__).parent))
|
||||||
|
|
||||||
|
from config.settings import get_settings
|
||||||
|
from src.core.gpu_manager import GPUMemoryManager
|
||||||
|
from src.core.model_registry import ModelRegistry
|
||||||
|
from src.services.generation_service import GenerationService
|
||||||
|
from src.services.batch_processor import BatchProcessor
|
||||||
|
from src.services.project_service import ProjectService
|
||||||
|
from src.storage.database import Database
|
||||||
|
from src.ui.app import create_app
|
||||||
|
|
||||||
|
|
||||||
|
# Configure logging
|
||||||
|
logging.basicConfig(
|
||||||
|
level=logging.INFO,
|
||||||
|
format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
|
||||||
|
handlers=[
|
||||||
|
logging.StreamHandler(),
|
||||||
|
logging.FileHandler("audiocraft.log"),
|
||||||
|
],
|
||||||
|
)
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
async def initialize_services():
|
||||||
|
"""Initialize all application services."""
|
||||||
|
settings = get_settings()
|
||||||
|
|
||||||
|
# Initialize database
|
||||||
|
logger.info("Initializing database...")
|
||||||
|
db = Database(settings.database_path)
|
||||||
|
await db.initialize()
|
||||||
|
|
||||||
|
# Initialize GPU manager
|
||||||
|
logger.info("Initializing GPU manager...")
|
||||||
|
gpu_manager = GPUMemoryManager(
|
||||||
|
device_id=0,
|
||||||
|
comfyui_reserve_bytes=int(settings.comfyui_reserve_gb * 1024**3),
|
||||||
|
)
|
||||||
|
|
||||||
|
# Initialize model registry
|
||||||
|
logger.info("Initializing model registry...")
|
||||||
|
model_registry = ModelRegistry(
|
||||||
|
gpu_manager=gpu_manager,
|
||||||
|
max_loaded=settings.max_loaded_models,
|
||||||
|
idle_timeout_seconds=settings.idle_unload_minutes * 60,
|
||||||
|
)
|
||||||
|
|
||||||
|
# Initialize services
|
||||||
|
logger.info("Initializing services...")
|
||||||
|
generation_service = GenerationService(
|
||||||
|
model_registry=model_registry,
|
||||||
|
gpu_manager=gpu_manager,
|
||||||
|
output_dir=settings.output_dir,
|
||||||
|
)
|
||||||
|
|
||||||
|
batch_processor = BatchProcessor(
|
||||||
|
generation_service=generation_service,
|
||||||
|
max_queue_size=settings.max_queue_size,
|
||||||
|
)
|
||||||
|
|
||||||
|
project_service = ProjectService(
|
||||||
|
db=db,
|
||||||
|
output_dir=settings.output_dir,
|
||||||
|
)
|
||||||
|
|
||||||
|
return {
|
||||||
|
"db": db,
|
||||||
|
"gpu_manager": gpu_manager,
|
||||||
|
"model_registry": model_registry,
|
||||||
|
"generation_service": generation_service,
|
||||||
|
"batch_processor": batch_processor,
|
||||||
|
"project_service": project_service,
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
|
def main():
|
||||||
|
"""Main entry point."""
|
||||||
|
settings = get_settings()
|
||||||
|
|
||||||
|
logger.info("=" * 60)
|
||||||
|
logger.info("AudioCraft Studio")
|
||||||
|
logger.info("=" * 60)
|
||||||
|
logger.info(f"Host: {settings.host}")
|
||||||
|
logger.info(f"Gradio Port: {settings.gradio_port}")
|
||||||
|
logger.info(f"API Port: {settings.api_port}")
|
||||||
|
logger.info(f"Output Dir: {settings.output_dir}")
|
||||||
|
logger.info("=" * 60)
|
||||||
|
|
||||||
|
# Initialize services
|
||||||
|
logger.info("Initializing services...")
|
||||||
|
|
||||||
|
try:
|
||||||
|
services = asyncio.run(initialize_services())
|
||||||
|
except Exception as e:
|
||||||
|
logger.error(f"Failed to initialize services: {e}")
|
||||||
|
logger.warning("Starting in demo mode without backend services")
|
||||||
|
services = {}
|
||||||
|
|
||||||
|
# Create and launch app
|
||||||
|
logger.info("Creating Gradio application...")
|
||||||
|
app = create_app(
|
||||||
|
generation_service=services.get("generation_service"),
|
||||||
|
batch_processor=services.get("batch_processor"),
|
||||||
|
project_service=services.get("project_service"),
|
||||||
|
gpu_manager=services.get("gpu_manager"),
|
||||||
|
model_registry=services.get("model_registry"),
|
||||||
|
)
|
||||||
|
|
||||||
|
# Start batch processor if available
|
||||||
|
batch_processor = services.get("batch_processor")
|
||||||
|
if batch_processor:
|
||||||
|
logger.info("Starting batch processor...")
|
||||||
|
asyncio.run(batch_processor.start())
|
||||||
|
|
||||||
|
# Launch the app
|
||||||
|
logger.info("Launching application...")
|
||||||
|
try:
|
||||||
|
app.launch(
|
||||||
|
server_name=settings.host,
|
||||||
|
server_port=settings.gradio_port,
|
||||||
|
share=False,
|
||||||
|
show_api=settings.api_enabled,
|
||||||
|
)
|
||||||
|
except KeyboardInterrupt:
|
||||||
|
logger.info("Shutting down...")
|
||||||
|
finally:
|
||||||
|
# Cleanup
|
||||||
|
if batch_processor:
|
||||||
|
asyncio.run(batch_processor.stop())
|
||||||
|
if "db" in services:
|
||||||
|
asyncio.run(services["db"].close())
|
||||||
|
|
||||||
|
logger.info("Shutdown complete")
|
||||||
|
|
||||||
|
|
||||||
|
if __name__ == "__main__":
|
||||||
|
main()
|
||||||
89
pyproject.toml
Normal file
89
pyproject.toml
Normal file
@@ -0,0 +1,89 @@
|
|||||||
|
[project]
|
||||||
|
name = "audiocraft-ui"
|
||||||
|
version = "0.1.0"
|
||||||
|
description = "Sophisticated AI audio web application based on Facebook's AudioCraft"
|
||||||
|
readme = "README.md"
|
||||||
|
license = { text = "MIT" }
|
||||||
|
requires-python = ">=3.10"
|
||||||
|
authors = [{ name = "AudioCraft UI Team" }]
|
||||||
|
keywords = ["audio", "music", "generation", "ai", "audiocraft", "gradio"]
|
||||||
|
classifiers = [
|
||||||
|
"Development Status :: 3 - Alpha",
|
||||||
|
"Intended Audience :: Developers",
|
||||||
|
"License :: OSI Approved :: MIT License",
|
||||||
|
"Programming Language :: Python :: 3",
|
||||||
|
"Programming Language :: Python :: 3.10",
|
||||||
|
"Programming Language :: Python :: 3.11",
|
||||||
|
"Topic :: Multimedia :: Sound/Audio",
|
||||||
|
"Topic :: Scientific/Engineering :: Artificial Intelligence",
|
||||||
|
]
|
||||||
|
|
||||||
|
dependencies = [
|
||||||
|
# Core ML
|
||||||
|
"torch>=2.1.0",
|
||||||
|
"torchaudio>=2.1.0",
|
||||||
|
"audiocraft>=1.3.0",
|
||||||
|
"xformers>=0.0.22",
|
||||||
|
|
||||||
|
# UI
|
||||||
|
"gradio>=4.0.0",
|
||||||
|
|
||||||
|
# API
|
||||||
|
"fastapi>=0.104.0",
|
||||||
|
"uvicorn[standard]>=0.24.0",
|
||||||
|
"python-multipart>=0.0.6",
|
||||||
|
|
||||||
|
# GPU Monitoring
|
||||||
|
"pynvml>=11.5.0",
|
||||||
|
|
||||||
|
# Storage
|
||||||
|
"aiosqlite>=0.19.0",
|
||||||
|
|
||||||
|
# Configuration
|
||||||
|
"pydantic>=2.5.0",
|
||||||
|
"pydantic-settings>=2.1.0",
|
||||||
|
"pyyaml>=6.0",
|
||||||
|
|
||||||
|
# Audio Processing
|
||||||
|
"numpy>=1.24.0",
|
||||||
|
"scipy>=1.11.0",
|
||||||
|
"librosa>=0.10.0",
|
||||||
|
"soundfile>=0.12.0",
|
||||||
|
]
|
||||||
|
|
||||||
|
[project.optional-dependencies]
|
||||||
|
dev = [
|
||||||
|
"pytest>=7.4.0",
|
||||||
|
"pytest-asyncio>=0.21.0",
|
||||||
|
"pytest-cov>=4.1.0",
|
||||||
|
"ruff>=0.1.0",
|
||||||
|
"mypy>=1.6.0",
|
||||||
|
]
|
||||||
|
|
||||||
|
[project.scripts]
|
||||||
|
audiocraft-ui = "src.main:main"
|
||||||
|
|
||||||
|
[build-system]
|
||||||
|
requires = ["hatchling"]
|
||||||
|
build-backend = "hatchling.build"
|
||||||
|
|
||||||
|
[tool.hatch.build.targets.wheel]
|
||||||
|
packages = ["src"]
|
||||||
|
|
||||||
|
[tool.ruff]
|
||||||
|
line-length = 100
|
||||||
|
target-version = "py310"
|
||||||
|
|
||||||
|
[tool.ruff.lint]
|
||||||
|
select = ["E", "F", "I", "N", "W", "UP"]
|
||||||
|
ignore = ["E501"]
|
||||||
|
|
||||||
|
[tool.mypy]
|
||||||
|
python_version = "3.10"
|
||||||
|
warn_return_any = true
|
||||||
|
warn_unused_configs = true
|
||||||
|
ignore_missing_imports = true
|
||||||
|
|
||||||
|
[tool.pytest.ini_options]
|
||||||
|
asyncio_mode = "auto"
|
||||||
|
testpaths = ["tests"]
|
||||||
30
requirements.txt
Normal file
30
requirements.txt
Normal file
@@ -0,0 +1,30 @@
|
|||||||
|
# Core ML
|
||||||
|
torch>=2.1.0
|
||||||
|
torchaudio>=2.1.0
|
||||||
|
audiocraft>=1.3.0
|
||||||
|
xformers>=0.0.22
|
||||||
|
|
||||||
|
# UI
|
||||||
|
gradio>=4.0.0
|
||||||
|
|
||||||
|
# API
|
||||||
|
fastapi>=0.104.0
|
||||||
|
uvicorn[standard]>=0.24.0
|
||||||
|
python-multipart>=0.0.6
|
||||||
|
|
||||||
|
# GPU Monitoring
|
||||||
|
pynvml>=11.5.0
|
||||||
|
|
||||||
|
# Storage
|
||||||
|
aiosqlite>=0.19.0
|
||||||
|
|
||||||
|
# Configuration
|
||||||
|
pydantic>=2.5.0
|
||||||
|
pydantic-settings>=2.1.0
|
||||||
|
pyyaml>=6.0
|
||||||
|
|
||||||
|
# Audio Processing
|
||||||
|
numpy>=1.24.0
|
||||||
|
scipy>=1.11.0
|
||||||
|
librosa>=0.10.0
|
||||||
|
soundfile>=0.12.0
|
||||||
77
runpod.yaml
Normal file
77
runpod.yaml
Normal file
@@ -0,0 +1,77 @@
|
|||||||
|
# RunPod Template Configuration
|
||||||
|
# Use this as reference when creating a RunPod template
|
||||||
|
|
||||||
|
name: AudioCraft Studio
|
||||||
|
description: AI-powered music and sound generation using Meta's AudioCraft
|
||||||
|
|
||||||
|
# Container settings
|
||||||
|
container:
|
||||||
|
image: your-dockerhub-username/audiocraft-studio:latest
|
||||||
|
|
||||||
|
# Or build from GitHub
|
||||||
|
# dockerfile: Dockerfile
|
||||||
|
# context: https://github.com/your-username/audiocraft-ui.git
|
||||||
|
|
||||||
|
# GPU requirements
|
||||||
|
gpu:
|
||||||
|
type: RTX 4090 # Recommended: RTX 4090, RTX 3090, A100
|
||||||
|
count: 1
|
||||||
|
minVram: 24 # GB
|
||||||
|
|
||||||
|
# Resource limits
|
||||||
|
resources:
|
||||||
|
cpu: 8
|
||||||
|
memory: 32 # GB
|
||||||
|
disk: 100 # GB (for model cache and outputs)
|
||||||
|
|
||||||
|
# Port mappings
|
||||||
|
ports:
|
||||||
|
- name: Gradio UI
|
||||||
|
internal: 7860
|
||||||
|
external: 7860
|
||||||
|
protocol: http
|
||||||
|
- name: REST API
|
||||||
|
internal: 8000
|
||||||
|
external: 8000
|
||||||
|
protocol: http
|
||||||
|
|
||||||
|
# Volume mounts
|
||||||
|
volumes:
|
||||||
|
- name: outputs
|
||||||
|
mountPath: /workspace/outputs
|
||||||
|
size: 50 # GB
|
||||||
|
- name: models
|
||||||
|
mountPath: /workspace/models
|
||||||
|
size: 30 # GB (model cache)
|
||||||
|
- name: data
|
||||||
|
mountPath: /workspace/data
|
||||||
|
size: 10 # GB
|
||||||
|
|
||||||
|
# Environment variables
|
||||||
|
env:
|
||||||
|
- name: AUDIOCRAFT_HOST
|
||||||
|
value: "0.0.0.0"
|
||||||
|
- name: AUDIOCRAFT_GRADIO_PORT
|
||||||
|
value: "7860"
|
||||||
|
- name: AUDIOCRAFT_API_PORT
|
||||||
|
value: "8000"
|
||||||
|
- name: AUDIOCRAFT_COMFYUI_RESERVE_GB
|
||||||
|
value: "10" # Reserve VRAM for ComfyUI if running alongside
|
||||||
|
  - name: AUDIOCRAFT_MAX_CACHED_MODELS  # was AUDIOCRAFT_MAX_LOADED_MODELS; .env.example defines AUDIOCRAFT_MAX_CACHED_MODELS
|
||||||
|
value: "2"
|
||||||
|
- name: AUDIOCRAFT_IDLE_UNLOAD_MINUTES
|
||||||
|
value: "15"
|
||||||
|
- name: HF_HOME
|
||||||
|
value: "/workspace/models/huggingface"
|
||||||
|
|
||||||
|
# Startup command
|
||||||
|
command: ["python", "main.py"]
|
||||||
|
|
||||||
|
# Health check
|
||||||
|
healthCheck:
|
||||||
|
path: /
|
||||||
|
port: 7860
|
||||||
|
initialDelaySeconds: 120
|
||||||
|
periodSeconds: 30
|
||||||
|
timeoutSeconds: 10
|
||||||
|
failureThreshold: 3
|
||||||
116
scripts/download_models.py
Executable file
116
scripts/download_models.py
Executable file
@@ -0,0 +1,116 @@
|
|||||||
|
#!/usr/bin/env python3
|
||||||
|
"""Pre-download AudioCraft models for faster startup."""
|
||||||
|
|
||||||
|
import argparse
|
||||||
|
import os
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
|
||||||
|
def download_musicgen_models(variants: list[str] | None = None):
    """Download MusicGen checkpoints into the local model cache.

    Args:
        variants: MusicGen variants to fetch. Defaults to all four
            ("small", "medium", "large", "melody") when None.
            (Fix: the default of None contradicted the previous
            ``list[str]`` annotation; widened to ``list[str] | None``.)
    """
    from audiocraft.models import MusicGen

    variants = variants or ["small", "medium", "large", "melody"]

    for variant in variants:
        print(f"Downloading MusicGen {variant}...")
        try:
            # Instantiating the model triggers the checkpoint download;
            # the reference is dropped immediately to release memory.
            model = MusicGen.get_pretrained(f"facebook/musicgen-{variant}")
            del model
            print(f" ✓ MusicGen {variant} downloaded")
        except Exception as e:
            # Best-effort: report and continue with the next variant.
            print(f" ✗ Failed to download MusicGen {variant}: {e}")
|
||||||
|
|
||||||
|
|
||||||
|
def download_audiogen_models():
    """Fetch the AudioGen "medium" checkpoint into the local cache."""
    from audiocraft.models import AudioGen

    print("Downloading AudioGen medium...")
    try:
        # Loading the model forces the weights download; the instance
        # is discarded right after.
        model = AudioGen.get_pretrained("facebook/audiogen-medium")
        del model
        print(" ✓ AudioGen medium downloaded")
    except Exception as e:
        # Best-effort: a failed download is reported, not fatal.
        print(f" ✗ Failed to download AudioGen: {e}")
|
||||||
|
|
||||||
|
|
||||||
|
def download_magnet_models(variants: list[str] | None = None):
    """Download MAGNeT checkpoints into the local model cache.

    Args:
        variants: MAGNeT variants to fetch. Defaults to the two music and
            two sound-effect variants when None.
            (Fix: the default of None contradicted the previous
            ``list[str]`` annotation; widened to ``list[str] | None``.)
    """
    from audiocraft.models import MAGNeT

    variants = variants or ["small", "medium", "audio-small-10secs", "audio-medium-10secs"]

    for variant in variants:
        print(f"Downloading MAGNeT {variant}...")
        try:
            # Instantiating the model triggers the checkpoint download;
            # the reference is dropped immediately to release memory.
            model = MAGNeT.get_pretrained(f"facebook/magnet-{variant}")
            del model
            print(f" ✓ MAGNeT {variant} downloaded")
        except Exception as e:
            # Best-effort: report and continue with the next variant.
            print(f" ✗ Failed to download MAGNeT {variant}: {e}")
|
||||||
|
|
||||||
|
|
||||||
|
def main():
    """CLI entry point: parse options, set up the cache, run downloads."""
    parser = argparse.ArgumentParser(description="Pre-download AudioCraft models")
    parser.add_argument(
        "--models",
        nargs="+",
        choices=["musicgen", "audiogen", "magnet", "all"],
        default=["all"],
        help="Models to download",
    )
    parser.add_argument(
        "--musicgen-variants",
        nargs="+",
        default=["small", "medium"],
        help="MusicGen variants to download",
    )
    parser.add_argument(
        "--magnet-variants",
        nargs="+",
        default=["small", "medium"],
        help="MAGNeT variants to download",
    )
    parser.add_argument(
        "--cache-dir",
        type=str,
        default=None,
        help="Model cache directory",
    )
    args = parser.parse_args()

    if args.cache_dir:
        # Point both the HuggingFace and torch-hub caches at the same dir.
        os.environ["HF_HOME"] = args.cache_dir
        os.environ["TORCH_HOME"] = args.cache_dir
        Path(args.cache_dir).mkdir(parents=True, exist_ok=True)

    selected = args.models
    if "all" in selected:
        selected = ["musicgen", "audiogen", "magnet"]

    banner = "=" * 50
    print(banner)
    print("AudioCraft Model Downloader")
    print(banner)
    print(f"Cache directory: {os.environ.get('HF_HOME', 'default')}")
    print(f"Models to download: {selected}")
    print(banner)

    if "musicgen" in selected:
        download_musicgen_models(args.musicgen_variants)

    if "audiogen" in selected:
        download_audiogen_models()

    if "magnet" in selected:
        download_magnet_models(args.magnet_variants)

    print(banner)
    print("Download complete!")
    print(banner)


if __name__ == "__main__":
    main()
|
||||||
55
scripts/start.sh
Executable file
55
scripts/start.sh
Executable file
@@ -0,0 +1,55 @@
|
|||||||
|
#!/bin/bash
# Startup script for AudioCraft Studio.
# Used in the Docker container and on RunPod; prints environment sanity
# checks, provisions an API key on first boot, then execs the app.

set -e

echo "=========================================="
echo " AudioCraft Studio"
echo "=========================================="

# Create runtime directories if they don't exist.
mkdir -p "${AUDIOCRAFT_OUTPUT_DIR:-/workspace/outputs}"
mkdir -p "${AUDIOCRAFT_DATA_DIR:-/workspace/data}"
# Fix: .env.example documents AUDIOCRAFT_CACHE_DIR, while this script read
# only AUDIOCRAFT_MODEL_CACHE. Accept both (legacy name wins) so either
# configuration style works.
mkdir -p "${AUDIOCRAFT_MODEL_CACHE:-${AUDIOCRAFT_CACHE_DIR:-/workspace/models}}"

# Check GPU availability (informational only).
echo "Checking GPU..."
if command -v nvidia-smi &> /dev/null; then
    nvidia-smi --query-gpu=name,memory.total,memory.free --format=csv
else
    echo "Warning: nvidia-smi not found"
fi

# Check Python and dependencies.
echo "Python version:"
python --version

echo "PyTorch version:"
python -c "import torch; print(f'PyTorch: {torch.__version__}, CUDA: {torch.cuda.is_available()}')"

# Check AudioCraft installation (source installs may lack __version__).
echo "AudioCraft version:"
python -c "import audiocraft; print(audiocraft.__version__)" 2>/dev/null || echo "AudioCraft installed from source"

# Generate an API key on first boot. NOTE(review): the plaintext key is
# printed to stdout and will end up in container logs — confirm acceptable.
if [ ! -f "${AUDIOCRAFT_DATA_DIR:-/workspace/data}/.api_key" ]; then
    echo "Generating API key..."
    python -c "
from src.api.auth import get_key_manager
km = get_key_manager()
if not km.has_key():
    key = km.generate_new_key()
    print(f'Generated API key: {key}')
    print('Store this key securely - it will not be shown again!')
"
fi

# Start the application. exec replaces the shell so the app becomes PID 1
# and receives SIGTERM directly from the container runtime.
echo "Starting AudioCraft Studio..."
echo "Gradio UI: http://0.0.0.0:${AUDIOCRAFT_GRADIO_PORT:-7860}"
echo "REST API: http://0.0.0.0:${AUDIOCRAFT_API_PORT:-8000}"
echo "API Docs: http://0.0.0.0:${AUDIOCRAFT_API_PORT:-8000}/api/docs"
echo "=========================================="

exec python main.py "$@"
|
||||||
3
src/__init__.py
Normal file
3
src/__init__.py
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
"""AudioCraft Studio - AI Audio Generation Web Application."""
|
||||||
|
|
||||||
|
__version__ = "0.1.0"
|
||||||
5
src/api/__init__.py
Normal file
5
src/api/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""REST API for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.api.app import create_api_app
|
||||||
|
|
||||||
|
__all__ = ["create_api_app"]
|
||||||
150
src/api/app.py
Normal file
150
src/api/app.py
Normal file
@@ -0,0 +1,150 @@
|
|||||||
|
"""FastAPI application for AudioCraft Studio REST API."""
|
||||||
|
|
||||||
|
from typing import Any, Optional
|
||||||
|
from contextlib import asynccontextmanager
|
||||||
|
|
||||||
|
from fastapi import FastAPI, Request
|
||||||
|
from fastapi.middleware.cors import CORSMiddleware
|
||||||
|
from fastapi.responses import JSONResponse
|
||||||
|
import time
|
||||||
|
|
||||||
|
from config.settings import get_settings
|
||||||
|
from src.api.routes import (
|
||||||
|
generation_router,
|
||||||
|
projects_router,
|
||||||
|
models_router,
|
||||||
|
system_router,
|
||||||
|
)
|
||||||
|
from src.api.routes.generation import set_services as set_generation_services
|
||||||
|
from src.api.routes.projects import set_services as set_project_services
|
||||||
|
from src.api.routes.models import set_services as set_model_services
|
||||||
|
from src.api.routes.system import set_services as set_system_services
|
||||||
|
|
||||||
|
|
||||||
|
@asynccontextmanager
async def lifespan(app: FastAPI):
    """Application lifespan handler.

    Currently a placeholder: nothing is initialized before serving and
    nothing is torn down afterwards.
    """
    # Startup (no-op for now)
    yield
    # Shutdown (no-op for now)
|
||||||
|
|
||||||
|
|
||||||
|
def create_api_app(
    generation_service: Any = None,
    batch_processor: Any = None,
    project_service: Any = None,
    gpu_manager: Any = None,
    model_registry: Any = None,
) -> FastAPI:
    """Build and configure the FastAPI application.

    Args:
        generation_service: Service for handling generations
        batch_processor: Service for batch/queue processing
        project_service: Service for project management
        gpu_manager: GPU memory manager
        model_registry: Model registry for loading/unloading

    Returns:
        Configured FastAPI application
    """
    settings = get_settings()

    # Docs/OpenAPI endpoints are only exposed while the API is enabled.
    docs_on = settings.api_enabled
    app = FastAPI(
        title="AudioCraft Studio API",
        description="REST API for AI-powered music and sound generation",
        version="1.0.0",
        docs_url="/api/docs" if docs_on else None,
        redoc_url="/api/redoc" if docs_on else None,
        openapi_url="/api/openapi.json" if docs_on else None,
        lifespan=lifespan,
    )

    # Allow the configured browser origins to call the API.
    app.add_middleware(
        CORSMiddleware,
        allow_origins=settings.cors_origins,
        allow_credentials=True,
        allow_methods=["*"],
        allow_headers=["*"],
    )

    @app.middleware("http")
    async def add_process_time_header(request: Request, call_next):
        # Expose server-side processing time to callers for profiling.
        started = time.time()
        response = await call_next(request)
        response.headers["X-Process-Time"] = str(time.time() - started)
        return response

    @app.exception_handler(Exception)
    async def global_exception_handler(request: Request, exc: Exception):
        # Hide exception details unless debug mode is enabled.
        return JSONResponse(
            status_code=500,
            content={
                "error": "Internal server error",
                "detail": str(exc) if settings.debug else "An unexpected error occurred",
            },
        )

    # Hand the shared service instances to each route module.
    set_generation_services(generation_service, batch_processor)
    set_project_services(project_service)
    set_model_services(model_registry)
    set_system_services(gpu_manager, batch_processor, model_registry)

    # Mount every versioned router under the common /api/v1 prefix.
    for api_router in (generation_router, projects_router, models_router, system_router):
        app.include_router(api_router, prefix="/api/v1")

    @app.get("/")
    async def root():
        # Service identity / landing payload.
        return {
            "name": "AudioCraft Studio API",
            "version": "1.0.0",
            "docs": "/api/docs",
        }

    @app.get("/api/v1")
    async def api_info():
        # Directory of available endpoint groups.
        return {
            "version": "1.0.0",
            "endpoints": {
                "generation": "/api/v1/generate",
                "projects": "/api/v1/projects",
                "models": "/api/v1/models",
                "system": "/api/v1/system",
            },
        }

    return app
|
||||||
|
|
||||||
|
|
||||||
|
def run_api_server(
    app: FastAPI,
    host: Optional[str] = None,
    port: Optional[int] = None,
) -> None:
    """Serve the given FastAPI app with uvicorn.

    Args:
        app: FastAPI application
        host: Server hostname; falls back to the configured host
        port: Server port; falls back to the configured API port
    """
    # Imported lazily so merely importing this module stays cheap.
    import uvicorn

    settings = get_settings()

    uvicorn.run(
        app,
        host=host or settings.host,
        port=port or settings.api_port,
        log_level="info",
    )
|
||||||
133
src/api/auth.py
Normal file
133
src/api/auth.py
Normal file
@@ -0,0 +1,133 @@
|
|||||||
|
"""API authentication middleware."""
|
||||||
|
|
||||||
|
import secrets
|
||||||
|
import hashlib
|
||||||
|
from typing import Optional
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
from fastapi import HTTPException, Security, status
|
||||||
|
from fastapi.security import APIKeyHeader
|
||||||
|
|
||||||
|
from config.settings import get_settings
|
||||||
|
|
||||||
|
|
||||||
|
# API key header
|
||||||
|
api_key_header = APIKeyHeader(name="X-API-Key", auto_error=False)
|
||||||
|
|
||||||
|
|
||||||
|
def generate_api_key() -> str:
    """Create a fresh, URL-safe API key (32 random bytes)."""
    token = secrets.token_urlsafe(32)
    return token
|
||||||
|
|
||||||
|
|
||||||
|
def hash_api_key(key: str) -> str:
    """Return the hex SHA-256 digest of an API key, for at-rest storage."""
    digest = hashlib.sha256(key.encode())
    return digest.hexdigest()
|
||||||
|
|
||||||
|
|
||||||
|
def verify_api_key(key: str, hashed: str) -> bool:
    """Check a plaintext key against a stored hash in constant time."""
    candidate = hash_api_key(key)
    return secrets.compare_digest(candidate, hashed)
|
||||||
|
|
||||||
|
|
||||||
|
class APIKeyManager:
    """Manage API keys for authentication.

    A single key is supported; only its SHA-256 hash is persisted to disk
    and cached in memory for verification.
    """

    def __init__(self, key_file: Optional[Path] = None):
        """Initialize the key manager.

        Args:
            key_file: Path to store API key hash
        """
        self.settings = get_settings()
        # Default location lives under the configured data directory.
        self.key_file = key_file or Path(self.settings.data_dir) / ".api_key"
        self._key_hash: Optional[str] = None
        self._load_key()

    def _load_key(self) -> None:
        """Populate the in-memory hash from disk, if a key file exists."""
        if self.key_file.exists():
            self._key_hash = self.key_file.read_text().strip()

    def _save_key(self, key_hash: str) -> None:
        """Persist the hash to disk and refresh the in-memory copy."""
        self.key_file.parent.mkdir(parents=True, exist_ok=True)
        self.key_file.write_text(key_hash)
        self._key_hash = key_hash

    def generate_new_key(self) -> str:
        """Generate and store a new API key.

        Returns:
            The new API key (only shown once)
        """
        fresh = generate_api_key()
        self._save_key(hash_api_key(fresh))
        return fresh

    def verify(self, key: str) -> bool:
        """Verify an API key.

        Args:
            key: API key to verify

        Returns:
            True if valid, False otherwise
        """
        # No stored hash means nothing can ever match.
        if not self._key_hash:
            return False
        return verify_api_key(key, self._key_hash)

    def has_key(self) -> bool:
        """Check if an API key has been generated."""
        return self._key_hash is not None
|
||||||
|
|
||||||
|
|
||||||
|
# Process-wide singleton; created lazily on first access.
_key_manager: Optional[APIKeyManager] = None


def get_key_manager() -> APIKeyManager:
    """Return the global APIKeyManager, constructing it on first use."""
    global _key_manager
    if _key_manager is None:
        _key_manager = APIKeyManager()
    return _key_manager
|
||||||
|
|
||||||
|
|
||||||
|
async def verify_api_key_dependency(
    api_key: Optional[str] = Security(api_key_header),
) -> str:
    """FastAPI dependency that enforces X-API-Key authentication.

    Args:
        api_key: API key from header

    Returns:
        The verified API key

    Raises:
        HTTPException: If key is missing or invalid
    """
    settings = get_settings()

    # Authentication can be disabled entirely via configuration.
    if not settings.api_key_required:
        return "anonymous"

    # A missing header is a 401 (unauthenticated).
    if api_key is None:
        raise HTTPException(
            status_code=status.HTTP_401_UNAUTHORIZED,
            detail="API key required",
            headers={"WWW-Authenticate": "ApiKey"},
        )

    # A present-but-wrong key is a 403 (forbidden).
    if not get_key_manager().verify(api_key):
        raise HTTPException(
            status_code=status.HTTP_403_FORBIDDEN,
            detail="Invalid API key",
        )

    return api_key
|
||||||
166
src/api/models.py
Normal file
166
src/api/models.py
Normal file
@@ -0,0 +1,166 @@
|
|||||||
|
"""Pydantic models for API requests and responses."""
|
||||||
|
|
||||||
|
from datetime import datetime
|
||||||
|
from typing import Any, Optional
|
||||||
|
from enum import Enum
|
||||||
|
|
||||||
|
from pydantic import BaseModel, Field
|
||||||
|
|
||||||
|
|
||||||
|
class ModelFamily(str, Enum):
    """Available model families."""

    # The five families exposed by the UI and API; values double as the
    # wire format in requests and as model ids.
    MUSICGEN = "musicgen"
    AUDIOGEN = "audiogen"
    MAGNET = "magnet"
    MUSICGEN_STYLE = "musicgen-style"
    JASCO = "jasco"
|
||||||
|
|
||||||
|
|
||||||
|
class JobStatus(str, Enum):
    """Generation job status."""

    # String-valued so the status serializes directly in JSON responses.
    PENDING = "pending"
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"
    CANCELLED = "cancelled"
|
||||||
|
|
||||||
|
|
||||||
|
# Generation requests


class GenerationRequest(BaseModel):
    """Request to generate audio."""

    model: ModelFamily = Field(..., description="Model family to use")
    variant: str = Field("medium", description="Model variant")
    prompts: list[str] = Field(..., min_length=1, max_length=10, description="Text prompts")
    # NOTE(review): le=30 caps API durations at 30s while the configured
    # AUDIOCRAFT_MAX_DURATION default is 300 — confirm this is intended.
    duration: float = Field(10.0, ge=1, le=30, description="Duration in seconds")
    temperature: float = Field(1.0, ge=0, le=2, description="Sampling temperature")
    top_k: int = Field(250, ge=0, le=500, description="Top-K sampling")
    top_p: float = Field(0.0, ge=0, le=1, description="Top-P (nucleus) sampling")
    cfg_coef: float = Field(3.0, ge=1, le=10, description="CFG coefficient")
    seed: Optional[int] = Field(None, description="Random seed for reproducibility")
    conditioning: Optional[dict[str, Any]] = Field(None, description="Model-specific conditioning")
    project_id: Optional[str] = Field(None, description="Project to save to")


class BatchGenerationRequest(BaseModel):
    """Request to add generation to queue."""

    request: GenerationRequest
    priority: int = Field(0, ge=0, le=10, description="Job priority (higher = sooner)")


# Generation responses


class GenerationResult(BaseModel):
    """Result of a completed generation."""

    id: str = Field(..., description="Generation ID")
    audio_url: str = Field(..., description="URL to download audio")
    waveform_url: Optional[str] = Field(None, description="URL to waveform image")
    duration: float = Field(..., description="Actual duration in seconds")
    seed: int = Field(..., description="Seed used for generation")
    model: str = Field(..., description="Model used")
    variant: str = Field(..., description="Variant used")
    prompt: str = Field(..., description="Prompt used")
    created_at: datetime = Field(..., description="Creation timestamp")


class JobResponse(BaseModel):
    """Response for a queued job."""

    job_id: str = Field(..., description="Job ID for tracking")
    status: JobStatus = Field(..., description="Current status")
    position: Optional[int] = Field(None, description="Queue position if pending")
    progress: Optional[float] = Field(None, description="Progress 0-1 if running")
    result: Optional[GenerationResult] = Field(None, description="Result if completed")
    error: Optional[str] = Field(None, description="Error message if failed")


# Project models


class ProjectCreate(BaseModel):
    """Request to create a project."""

    name: str = Field(..., min_length=1, max_length=100)
    description: Optional[str] = Field(None, max_length=500)


class ProjectResponse(BaseModel):
    """Project information."""

    id: str
    name: str
    description: Optional[str]
    generation_count: int
    created_at: datetime
    updated_at: datetime


class GenerationResponse(BaseModel):
    """Generation record from database."""

    id: str
    project_id: str
    model: str
    variant: str
    prompt: str
    duration_seconds: float
    seed: int
    audio_path: str
    waveform_path: Optional[str]
    parameters: dict[str, Any]
    created_at: datetime


# Model info


class ModelVariantInfo(BaseModel):
    """Information about a model variant."""

    id: str
    name: str
    vram_mb: int
    description: str
    capabilities: list[str]


class ModelInfo(BaseModel):
    """Information about a model family."""

    id: str
    name: str
    description: str
    variants: list[ModelVariantInfo]
    loaded: bool
    current_variant: Optional[str]


# System info


class GPUStatus(BaseModel):
    """GPU memory status."""

    device_name: str
    total_gb: float
    used_gb: float
    available_gb: float
    utilization_percent: float
    temperature_c: Optional[float]


class QueueStatus(BaseModel):
    """Generation queue status."""

    queue_size: int
    active_jobs: int
    completed_today: int
    failed_today: int


class SystemStatus(BaseModel):
    """Overall system status."""

    gpu: GPUStatus
    queue: QueueStatus
    loaded_models: list[str]
    uptime_seconds: float


# Pagination


class PaginatedResponse(BaseModel):
    """Paginated list response."""

    items: list[Any]
    total: int
    page: int
    page_size: int
    pages: int
|
||||||
13
src/api/routes/__init__.py
Normal file
13
src/api/routes/__init__.py
Normal file
@@ -0,0 +1,13 @@
|
|||||||
|
"""API route modules."""
|
||||||
|
|
||||||
|
from src.api.routes.generation import router as generation_router
|
||||||
|
from src.api.routes.projects import router as projects_router
|
||||||
|
from src.api.routes.models import router as models_router
|
||||||
|
from src.api.routes.system import router as system_router
|
||||||
|
|
||||||
|
__all__ = [
|
||||||
|
"generation_router",
|
||||||
|
"projects_router",
|
||||||
|
"models_router",
|
||||||
|
"system_router",
|
||||||
|
]
|
||||||
234
src/api/routes/generation.py
Normal file
234
src/api/routes/generation.py
Normal file
@@ -0,0 +1,234 @@
|
|||||||
|
"""Generation API endpoints."""
|
||||||
|
|
||||||
|
from typing import Any
|
||||||
|
from fastapi import APIRouter, Depends, HTTPException, BackgroundTasks, status
|
||||||
|
from fastapi.responses import FileResponse
|
||||||
|
|
||||||
|
from src.api.auth import verify_api_key_dependency
|
||||||
|
from src.api.models import (
|
||||||
|
GenerationRequest,
|
||||||
|
BatchGenerationRequest,
|
||||||
|
GenerationResult,
|
||||||
|
JobResponse,
|
||||||
|
JobStatus,
|
||||||
|
)
|
||||||
|
|
||||||
|
|
||||||
|
router = APIRouter(prefix="/generate", tags=["generation"])
|
||||||
|
|
||||||
|
|
||||||
|
# Module-level service handles, injected once at app startup.
_generation_service = None
_batch_processor = None


def set_services(generation_service: Any, batch_processor: Any) -> None:
    """Inject the generation and batch-processing services used by the
    route handlers in this module."""
    global _generation_service, _batch_processor
    _generation_service = generation_service
    _batch_processor = batch_processor
|
||||||
|
|
||||||
|
|
||||||
|
@router.post(
    "/",
    response_model=GenerationResult,
    summary="Generate audio synchronously",
    description="Generate audio and wait for completion. For long generations, consider using the async endpoint.",
)
async def generate_sync(
    request: GenerationRequest,
    api_key: str = Depends(verify_api_key_dependency),
) -> GenerationResult:
    """Run a generation to completion and return the finished result."""
    # Without an injected generation service there is nothing to run.
    if _generation_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Generation service not available",
        )

    try:
        # Forward every tunable straight from the validated request body.
        result, generation = await _generation_service.generate(
            model_id=request.model.value,
            variant=request.variant,
            prompts=request.prompts,
            duration=request.duration,
            temperature=request.temperature,
            top_k=request.top_k,
            top_p=request.top_p,
            cfg_coef=request.cfg_coef,
            seed=request.seed,
            conditioning=request.conditioning,
            project_id=request.project_id,
        )

        has_waveform = bool(generation.waveform_path)
        return GenerationResult(
            id=generation.id,
            audio_url=f"/api/v1/audio/{generation.id}",
            waveform_url=f"/api/v1/audio/{generation.id}/waveform" if has_waveform else None,
            duration=result.duration,
            seed=result.seed,
            model=request.model.value,
            variant=request.variant,
            prompt=request.prompts[0],
            created_at=generation.created_at,
        )

    except Exception as e:
        # Surface the underlying failure as a 500 with its message.
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(e),
        )
|
||||||
|
|
||||||
|
|
||||||
|
@router.post(
    "/async",
    response_model=JobResponse,
    summary="Queue generation job",
    description="Add a generation to the queue for async processing.",
)
async def generate_async(
    request: BatchGenerationRequest,
    api_key: str = Depends(verify_api_key_dependency),
) -> JobResponse:
    """Enqueue a generation for background processing.

    Returns immediately with the new job id and its queue position;
    poll /jobs/{job_id} for progress and the final result.
    """
    if _batch_processor is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Batch processor not available",
        )

    params = request.request  # the wrapped GenerationRequest
    try:
        job = _batch_processor.add_job(
            model_id=params.model.value,
            variant=params.variant,
            prompts=params.prompts,
            duration=params.duration,
            temperature=params.temperature,
            top_k=params.top_k,
            top_p=params.top_p,
            cfg_coef=params.cfg_coef,
            seed=params.seed,
            conditioning=params.conditioning,
            project_id=params.project_id,
            priority=request.priority,
        )

        return JobResponse(
            job_id=job.id,
            status=JobStatus.PENDING,
            position=_batch_processor.get_position(job.id),
        )

    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.get(
    "/jobs/{job_id}",
    response_model=JobResponse,
    summary="Get job status",
    description="Check the status of a queued generation job.",
)
async def get_job_status(
    job_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> JobResponse:
    """Report the current state of a queued job.

    The response is enriched per state: queue position while pending,
    progress while running, a full GenerationResult once completed, or
    the error message if the job failed.
    """
    if _batch_processor is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Batch processor not available",
        )

    job = _batch_processor.get_job(job_id)
    if job is None:
        raise HTTPException(
            status_code=status.HTTP_404_NOT_FOUND,
            detail=f"Job {job_id} not found",
        )

    response = JobResponse(
        job_id=job.id,
        status=JobStatus(job.status.value),
    )

    state = job.status.value
    if state == "pending":
        response.position = _batch_processor.get_position(job_id)
    elif state == "running":
        response.progress = job.progress
    elif state == "completed" and job.result:
        # NOTE(review): unlike the sync endpoint, waveform_url is built
        # unconditionally here — confirm completed jobs always have one.
        response.result = GenerationResult(
            id=job.result.id,
            audio_url=f"/api/v1/audio/{job.result.id}",
            waveform_url=f"/api/v1/audio/{job.result.id}/waveform",
            duration=job.result.duration,
            seed=job.result.seed,
            model=job.model_id,
            variant=job.variant,
            prompt=job.prompts[0],
            created_at=job.completed_at,
        )
    elif state == "failed":
        response.error = job.error

    return response
||||||
|
|
||||||
|
@router.delete(
    "/jobs/{job_id}",
    summary="Cancel job",
    description="Cancel a pending or running job.",
)
async def cancel_job(
    job_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> dict:
    """Cancel a queued job; 404 when unknown or no longer cancellable."""
    if _batch_processor is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Batch processor not available",
        )

    if not _batch_processor.cancel_job(job_id):
        # The processor could not cancel: either the id is unknown or the
        # job already finished.
        raise HTTPException(
            status_code=status.HTTP_404_NOT_FOUND,
            detail=f"Job {job_id} not found or cannot be cancelled",
        )

    return {"message": f"Job {job_id} cancelled"}
||||||
|
|
||||||
|
@router.get(
    "/jobs",
    response_model=list[JobResponse],
    summary="List jobs",
    description="List all jobs in the queue.",
)
async def list_jobs(
    status_filter: str = None,
    limit: int = 50,
    api_key: str = Depends(verify_api_key_dependency),
) -> list[JobResponse]:
    """List queued jobs, optionally filtered by status.

    Queue position is reported only for pending jobs and progress only
    for running ones; both are None otherwise.
    """
    if _batch_processor is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Batch processor not available",
        )

    jobs = _batch_processor.list_jobs(status_filter=status_filter, limit=limit)

    responses: list[JobResponse] = []
    for job in jobs:
        state = job.status.value
        responses.append(
            JobResponse(
                job_id=job.id,
                status=JobStatus(state),
                position=_batch_processor.get_position(job.id) if state == "pending" else None,
                progress=job.progress if state == "running" else None,
            )
        )
    return responses
|
||||||
228
src/api/routes/models.py
Normal file
228
src/api/routes/models.py
Normal file
@@ -0,0 +1,228 @@
|
|||||||
|
"""Models API endpoints."""
|
||||||
|
|
||||||
|
from typing import Any
|
||||||
|
from fastapi import APIRouter, Depends, HTTPException, status
|
||||||
|
|
||||||
|
from src.api.auth import verify_api_key_dependency
|
||||||
|
from src.api.models import ModelInfo, ModelVariantInfo
|
||||||
|
|
||||||
|
|
||||||
|
router = APIRouter(prefix="/models", tags=["models"])
|
||||||
|
|
||||||
|
|
||||||
|
# Service dependency (injected at app startup)
_model_registry = None


def set_services(model_registry: Any) -> None:
    """Inject the model registry used by the /models endpoints.

    Called once during application startup; until then the module-level
    `_model_registry` stays None and routes degrade accordingly.
    """
    global _model_registry
    _model_registry = model_registry
||||||
|
|
||||||
|
# Static model information served by the /models endpoints.
# Keys double as route model_ids; insertion order defines listing order.
# vram_mb values are approximate working-set sizes used for planning.
MODEL_CATALOG = {
    "musicgen": {
        "id": "musicgen",
        "name": "MusicGen",
        "description": "Text-to-music generation with optional melody conditioning",
        "variants": [
            {"id": "small", "name": "Small", "vram_mb": 1500, "description": "Fast, 300M params", "capabilities": ["text"]},
            {"id": "medium", "name": "Medium", "vram_mb": 5000, "description": "Balanced, 1.5B params", "capabilities": ["text"]},
            {"id": "large", "name": "Large", "vram_mb": 10000, "description": "Best quality, 3.3B params", "capabilities": ["text"]},
            {"id": "melody", "name": "Melody", "vram_mb": 5000, "description": "With melody conditioning", "capabilities": ["text", "melody"]},
            {"id": "stereo-small", "name": "Stereo Small", "vram_mb": 1800, "description": "Stereo, 300M params", "capabilities": ["text", "stereo"]},
            {"id": "stereo-medium", "name": "Stereo Medium", "vram_mb": 6000, "description": "Stereo, 1.5B params", "capabilities": ["text", "stereo"]},
            {"id": "stereo-large", "name": "Stereo Large", "vram_mb": 12000, "description": "Stereo, 3.3B params", "capabilities": ["text", "stereo"]},
            {"id": "stereo-melody", "name": "Stereo Melody", "vram_mb": 6000, "description": "Stereo with melody", "capabilities": ["text", "melody", "stereo"]},
        ],
    },
    "audiogen": {
        "id": "audiogen",
        "name": "AudioGen",
        "description": "Text-to-sound effects and environmental audio",
        "variants": [
            {"id": "medium", "name": "Medium", "vram_mb": 5000, "description": "1.5B params", "capabilities": ["text", "sfx"]},
        ],
    },
    "magnet": {
        "id": "magnet",
        "name": "MAGNeT",
        "description": "Fast non-autoregressive music generation",
        "variants": [
            {"id": "small", "name": "Small Music", "vram_mb": 2000, "description": "Fast music, 300M params", "capabilities": ["text", "music"]},
            {"id": "medium", "name": "Medium Music", "vram_mb": 5000, "description": "Balanced music, 1.5B params", "capabilities": ["text", "music"]},
            {"id": "audio-small", "name": "Small Audio", "vram_mb": 2000, "description": "Fast sound effects", "capabilities": ["text", "sfx"]},
            {"id": "audio-medium", "name": "Medium Audio", "vram_mb": 5000, "description": "Balanced sound effects", "capabilities": ["text", "sfx"]},
        ],
    },
    "musicgen-style": {
        "id": "musicgen-style",
        "name": "MusicGen Style",
        "description": "Style-conditioned music from reference audio",
        "variants": [
            {"id": "medium", "name": "Medium", "vram_mb": 5000, "description": "1.5B params, style conditioning", "capabilities": ["text", "style"]},
        ],
    },
    "jasco": {
        "id": "jasco",
        "name": "JASCO",
        "description": "Chord and drum-conditioned music generation",
        "variants": [
            {"id": "chords", "name": "Chords", "vram_mb": 5000, "description": "Chord-conditioned generation", "capabilities": ["text", "chords"]},
            {"id": "chords-drums", "name": "Chords + Drums", "vram_mb": 5500, "description": "Full symbolic conditioning", "capabilities": ["text", "chords", "drums"]},
        ],
    },
}
|
||||||
|
|
||||||
|
|
||||||
|
@router.get(
    "/",
    response_model=list[ModelInfo],
    summary="List models",
    description="Get information about all available models.",
)
async def list_models(
    api_key: str = Depends(verify_api_key_dependency),
) -> list[ModelInfo]:
    """List every model in the static catalog, annotated with load state.

    When no registry has been injected, every model is reported as
    unloaded with no current variant.
    """
    models: list[ModelInfo] = []

    for model_id, info in MODEL_CATALOG.items():
        loaded = _model_registry.is_loaded(model_id) if _model_registry else False
        current_variant = (
            _model_registry.get_current_variant(model_id) if loaded else None
        )

        models.append(
            ModelInfo(
                id=info["id"],
                name=info["name"],
                description=info["description"],
                variants=[ModelVariantInfo(**v) for v in info["variants"]],
                loaded=loaded,
                current_variant=current_variant,
            )
        )

    return models
||||||
|
|
||||||
|
# NOTE: the fixed-path "/loaded" route MUST be registered before the
# parameterized "/{model_id}" route.  FastAPI matches routes in
# registration order, so registering "/{model_id}" first would capture
# GET /models/loaded with model_id="loaded" and return a spurious 404.
@router.get(
    "/loaded",
    response_model=list[str],
    summary="List loaded models",
    description="Get list of currently loaded models.",
)
async def list_loaded_models(
    api_key: str = Depends(verify_api_key_dependency),
) -> list[str]:
    """List currently loaded models (empty when no registry is wired)."""
    if _model_registry is None:
        return []

    return _model_registry.get_loaded_models()


@router.get(
    "/{model_id}",
    response_model=ModelInfo,
    summary="Get model info",
    description="Get detailed information about a specific model.",
)
async def get_model(
    model_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> ModelInfo:
    """Get catalog information plus live load state for one model.

    Raises:
        HTTPException 404: unknown model id.
    """
    if model_id not in MODEL_CATALOG:
        raise HTTPException(
            status_code=status.HTTP_404_NOT_FOUND,
            detail=f"Model {model_id} not found",
        )

    info = MODEL_CATALOG[model_id]
    loaded = _model_registry.is_loaded(model_id) if _model_registry else False
    current_variant = (
        _model_registry.get_current_variant(model_id) if loaded else None
    )

    return ModelInfo(
        id=info["id"],
        name=info["name"],
        description=info["description"],
        variants=[ModelVariantInfo(**v) for v in info["variants"]],
        loaded=loaded,
        current_variant=current_variant,
    )


@router.post(
    "/{model_id}/load",
    summary="Load model",
    description="Load a model into GPU memory.",
)
async def load_model(
    model_id: str,
    variant: str = "medium",
    api_key: str = Depends(verify_api_key_dependency),
) -> dict:
    """Load a model variant into GPU memory.

    NOTE(review): `variant` is not validated against the catalog's
    variant list here — presumably the registry rejects unknown
    variants; confirm.

    Raises:
        HTTPException 404: unknown model id.
        HTTPException 503: registry not injected yet.
        HTTPException 500: loading failed (e.g. out of memory).
    """
    if model_id not in MODEL_CATALOG:
        raise HTTPException(
            status_code=status.HTTP_404_NOT_FOUND,
            detail=f"Model {model_id} not found",
        )

    if _model_registry is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Model registry not available",
        )

    try:
        await _model_registry.load_model(model_id, variant)
        return {"message": f"Model {model_id} ({variant}) loaded successfully"}
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )


@router.post(
    "/{model_id}/unload",
    summary="Unload model",
    description="Unload a model from GPU memory.",
)
async def unload_model(
    model_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> dict:
    """Unload a model from GPU memory.

    Raises:
        HTTPException 404: unknown model id.
        HTTPException 503: registry not injected yet.
        HTTPException 500: unload failed.
    """
    if model_id not in MODEL_CATALOG:
        raise HTTPException(
            status_code=status.HTTP_404_NOT_FOUND,
            detail=f"Model {model_id} not found",
        )

    if _model_registry is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Model registry not available",
        )

    try:
        await _model_registry.unload_model(model_id)
        return {"message": f"Model {model_id} unloaded successfully"}
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
|
||||||
250
src/api/routes/projects.py
Normal file
250
src/api/routes/projects.py
Normal file
@@ -0,0 +1,250 @@
|
|||||||
|
"""Projects API endpoints."""
|
||||||
|
|
||||||
|
from typing import Any, Optional
|
||||||
|
from fastapi import APIRouter, Depends, HTTPException, Query, status
|
||||||
|
from fastapi.responses import FileResponse
|
||||||
|
|
||||||
|
from src.api.auth import verify_api_key_dependency
|
||||||
|
from src.api.models import (
|
||||||
|
ProjectCreate,
|
||||||
|
ProjectResponse,
|
||||||
|
GenerationResponse,
|
||||||
|
PaginatedResponse,
|
||||||
|
)
|
||||||
|
|
||||||
|
|
||||||
|
router = APIRouter(prefix="/projects", tags=["projects"])
|
||||||
|
|
||||||
|
|
||||||
|
# Service dependency (injected at app startup)
_project_service = None


def set_services(project_service: Any) -> None:
    """Inject the project service used by the /projects endpoints.

    Called once during application startup; until then the module-level
    `_project_service` stays None and routes answer 503.
    """
    global _project_service
    _project_service = project_service
||||||
|
|
||||||
|
@router.post(
    "/",
    response_model=ProjectResponse,
    status_code=status.HTTP_201_CREATED,
    summary="Create project",
    description="Create a new project for organizing generations.",
)
async def create_project(
    request: ProjectCreate,
    api_key: str = Depends(verify_api_key_dependency),
) -> ProjectResponse:
    """Create a project and return the stored record with a 201."""
    if _project_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Project service not available",
        )

    try:
        created = await _project_service.create_project(
            name=request.name,
            description=request.description,
        )
        return ProjectResponse(**created)
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.get(
    "/",
    response_model=list[ProjectResponse],
    summary="List projects",
    description="Get all projects.",
)
async def list_projects(
    api_key: str = Depends(verify_api_key_dependency),
) -> list[ProjectResponse]:
    """Return every project known to the project service."""
    if _project_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Project service not available",
        )

    try:
        rows = await _project_service.list_projects()
        return [ProjectResponse(**row) for row in rows]
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.get(
    "/{project_id}",
    response_model=ProjectResponse,
    summary="Get project",
    description="Get project details by ID.",
)
async def get_project(
    project_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> ProjectResponse:
    """Fetch a single project; 404 when the id is unknown."""
    if _project_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Project service not available",
        )

    try:
        project = await _project_service.get_project(project_id)
        if project is None:
            raise HTTPException(
                status_code=status.HTTP_404_NOT_FOUND,
                detail=f"Project {project_id} not found",
            )
        return ProjectResponse(**project)
    except HTTPException:
        # Re-raise our own 404 untouched instead of wrapping it in a 500.
        raise
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.delete(
    "/{project_id}",
    status_code=status.HTTP_204_NO_CONTENT,
    summary="Delete project",
    description="Delete a project and all its generations.",
)
async def delete_project(
    project_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> None:
    """Delete a project (204 on success).

    NOTE(review): the service result is not inspected, so deleting an
    unknown project id presumably also yields 204 — confirm whether a
    404 is wanted here.
    """
    if _project_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Project service not available",
        )

    try:
        await _project_service.delete_project(project_id)
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.get(
    "/{project_id}/generations",
    response_model=PaginatedResponse,
    summary="List generations",
    description="Get generations for a project with pagination.",
)
async def list_generations(
    project_id: str,
    page: int = Query(1, ge=1),
    page_size: int = Query(20, ge=1, le=100),
    model: Optional[str] = Query(None, description="Filter by model"),
    api_key: str = Depends(verify_api_key_dependency),
) -> PaginatedResponse:
    """Paginate a project's generations, optionally filtered by model.

    The reported `total` is an estimate: exact up to the current page,
    plus one when another page exists (a real COUNT query would be
    precise).
    """
    if _project_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Project service not available",
        )

    try:
        offset = (page - 1) * page_size
        # Fetch one extra row so we can tell whether another page exists.
        rows = await _project_service.list_generations(
            project_id=project_id,
            limit=page_size + 1,
            offset=offset,
            model_filter=model,
        )

        has_more = len(rows) > page_size
        rows = rows[:page_size]

        total = offset + len(rows) + (1 if has_more else 0)
        pages = (total + page_size - 1) // page_size  # ceiling division

        return PaginatedResponse(
            items=[GenerationResponse(**row) for row in rows],
            total=total,
            page=page,
            page_size=page_size,
            pages=pages,
        )
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.get(
    "/{project_id}/export",
    summary="Export project",
    description="Export project as ZIP file with all audio and metadata.",
)
async def export_project(
    project_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> FileResponse:
    """Build and stream a ZIP archive of the project's outputs."""
    if _project_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Project service not available",
        )

    try:
        zip_path = await _project_service.export_project_zip(project_id)
        return FileResponse(
            path=zip_path,
            filename=f"project_{project_id}.zip",
            media_type="application/zip",
        )
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.delete(
    "/{project_id}/generations/{generation_id}",
    status_code=status.HTTP_204_NO_CONTENT,
    summary="Delete generation",
    description="Delete a specific generation.",
)
async def delete_generation(
    project_id: str,
    generation_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> None:
    """Delete one generation (204 on success).

    NOTE(review): project_id is accepted but not checked against the
    generation's owning project — confirm whether ownership should be
    validated before deleting.
    """
    if _project_service is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Project service not available",
        )

    try:
        await _project_service.delete_generation(generation_id)
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
|
||||||
263
src/api/routes/system.py
Normal file
263
src/api/routes/system.py
Normal file
@@ -0,0 +1,263 @@
|
|||||||
|
"""System API endpoints."""
|
||||||
|
|
||||||
|
import time
|
||||||
|
from typing import Any
|
||||||
|
from pathlib import Path
|
||||||
|
from fastapi import APIRouter, Depends, HTTPException, status
|
||||||
|
from fastapi.responses import FileResponse
|
||||||
|
|
||||||
|
from src.api.auth import verify_api_key_dependency, get_key_manager
|
||||||
|
from src.api.models import GPUStatus, QueueStatus, SystemStatus
|
||||||
|
|
||||||
|
|
||||||
|
router = APIRouter(prefix="/system", tags=["system"])
|
||||||
|
|
||||||
|
|
||||||
|
# Service dependencies (injected at app startup)
_gpu_manager = None
_batch_processor = None
_model_registry = None
# Process start time, used for uptime reporting in /status and /health.
_start_time = time.time()


def set_services(
    gpu_manager: Any,
    batch_processor: Any,
    model_registry: Any,
) -> None:
    """Inject the services used by the /system endpoints.

    Called once during application startup; until then the endpoints
    fall back to placeholder values or 503s.
    """
    global _gpu_manager, _batch_processor, _model_registry
    _gpu_manager = gpu_manager
    _batch_processor = batch_processor
    _model_registry = model_registry
||||||
|
|
||||||
|
@router.get(
    "/status",
    response_model=SystemStatus,
    summary="System status",
    description="Get overall system status including GPU, queue, and loaded models.",
)
async def get_status(
    api_key: str = Depends(verify_api_key_dependency),
) -> SystemStatus:
    """Aggregate GPU, queue, and model state into one status snapshot.

    Each section degrades to zeroed placeholder values when the
    corresponding service has not been injected yet.
    """
    gib = 1024**3  # bytes per GiB; memory figures are reported in GB

    if _gpu_manager:
        gpu = GPUStatus(
            device_name=_gpu_manager.device_name,
            total_gb=_gpu_manager.total_memory / gib,
            used_gb=_gpu_manager.get_used_memory() / gib,
            available_gb=_gpu_manager.get_available_memory() / gib,
            utilization_percent=_gpu_manager.get_utilization(),
            temperature_c=_gpu_manager.get_temperature(),
        )
    else:
        gpu = GPUStatus(
            device_name="Unknown",
            total_gb=0,
            used_gb=0,
            available_gb=0,
            utilization_percent=0,
            temperature_c=None,
        )

    if _batch_processor:
        queue = QueueStatus(
            queue_size=len(_batch_processor.queue),
            active_jobs=_batch_processor.active_count,
            completed_today=_batch_processor.completed_count,
            failed_today=_batch_processor.failed_count,
        )
    else:
        queue = QueueStatus(
            queue_size=0,
            active_jobs=0,
            completed_today=0,
            failed_today=0,
        )

    loaded_models = _model_registry.get_loaded_models() if _model_registry else []

    return SystemStatus(
        gpu=gpu,
        queue=queue,
        loaded_models=loaded_models,
        uptime_seconds=time.time() - _start_time,
    )
||||||
|
|
||||||
|
@router.get(
    "/gpu",
    response_model=GPUStatus,
    summary="GPU status",
    description="Get detailed GPU memory and utilization status.",
)
async def get_gpu_status(
    api_key: str = Depends(verify_api_key_dependency),
) -> GPUStatus:
    """Report current GPU memory and utilization (503 without a manager)."""
    if _gpu_manager is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="GPU manager not available",
        )

    gib = 1024**3  # bytes per GiB
    return GPUStatus(
        device_name=_gpu_manager.device_name,
        total_gb=_gpu_manager.total_memory / gib,
        used_gb=_gpu_manager.get_used_memory() / gib,
        available_gb=_gpu_manager.get_available_memory() / gib,
        utilization_percent=_gpu_manager.get_utilization(),
        temperature_c=_gpu_manager.get_temperature(),
    )
||||||
|
|
||||||
|
@router.post(
    "/clear-cache",
    summary="Clear cache",
    description="Clear model cache and free GPU memory.",
)
async def clear_cache(
    api_key: str = Depends(verify_api_key_dependency),
) -> dict:
    """Clear the model registry's cache.

    NOTE(review): clear_cache() is called synchronously here while the
    unload endpoints await the registry — confirm the registry exposes
    both styles intentionally.
    """
    if _model_registry is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Model registry not available",
        )

    try:
        _model_registry.clear_cache()
        return {"message": "Cache cleared successfully"}
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.post(
    "/unload-all",
    summary="Unload all models",
    description="Unload all models from GPU memory.",
)
async def unload_all_models(
    api_key: str = Depends(verify_api_key_dependency),
) -> dict:
    """Unload every loaded model from GPU memory."""
    if _model_registry is None:
        raise HTTPException(
            status_code=status.HTTP_503_SERVICE_UNAVAILABLE,
            detail="Model registry not available",
        )

    try:
        await _model_registry.unload_all()
        return {"message": "All models unloaded successfully"}
    except Exception as exc:
        raise HTTPException(
            status_code=status.HTTP_500_INTERNAL_SERVER_ERROR,
            detail=str(exc),
        )
||||||
|
|
||||||
|
@router.get(
    "/health",
    summary="Health check",
    description="Simple health check endpoint.",
)
async def health_check() -> dict:
    """Liveness probe; intentionally the only unauthenticated route here."""
    return {
        "status": "healthy",
        "uptime_seconds": time.time() - _start_time,
    }
||||||
|
|
||||||
|
@router.post(
    "/api-key/regenerate",
    summary="Regenerate API key",
    description="Generate a new API key. The old key will be invalidated.",
)
async def regenerate_api_key(
    api_key: str = Depends(verify_api_key_dependency),
) -> dict:
    """Rotate the API key; the caller's current key stops working."""
    new_key = get_key_manager().generate_new_key()

    return {
        "api_key": new_key,
        "message": "New API key generated. Store it securely - it won't be shown again.",
    }
||||||
|
|
||||||
|
@router.get(
    "/audio/{generation_id}",
    summary="Download audio",
    description="Download generated audio file.",
)
async def download_audio(
    generation_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> FileResponse:
    """Stream the audio file for a generation.

    TODO: look up the actual file path from the database instead of
    probing the output directory for known extensions.

    NOTE(review): this route lives under the /system router prefix,
    while the generation endpoints advertise /api/v1/audio/{id} —
    confirm the app mounts these paths consistently.

    Raises:
        HTTPException 404: no audio file found for the generation id.
    """
    from config.settings import get_settings
    settings = get_settings()

    # Explicit MIME types for the probed extensions.  Fix: the original
    # f"audio/{suffix}" produced the non-standard "audio/mp3"; the
    # registered type for MP3 is "audio/mpeg".
    media_types = {
        ".wav": "audio/wav",
        ".mp3": "audio/mpeg",
        ".flac": "audio/flac",
    }

    audio_dir = Path(settings.output_dir)
    for suffix, media_type in media_types.items():
        path = audio_dir / f"{generation_id}{suffix}"
        if path.exists():
            return FileResponse(
                path=path,
                filename=path.name,
                media_type=media_type,
            )

    raise HTTPException(
        status_code=status.HTTP_404_NOT_FOUND,
        detail=f"Audio file for generation {generation_id} not found",
    )
||||||
|
|
||||||
|
@router.get(
    "/audio/{generation_id}/waveform",
    summary="Download waveform",
    description="Download waveform visualization image.",
)
async def download_waveform(
    generation_id: str,
    api_key: str = Depends(verify_api_key_dependency),
) -> FileResponse:
    """Stream the PNG waveform image for a generation (404 if absent)."""
    from config.settings import get_settings
    settings = get_settings()

    waveform_path = Path(settings.output_dir) / f"{generation_id}_waveform.png"

    if not waveform_path.exists():
        raise HTTPException(
            status_code=status.HTTP_404_NOT_FOUND,
            detail=f"Waveform for generation {generation_id} not found",
        )

    return FileResponse(
        path=waveform_path,
        filename=waveform_path.name,
        media_type="image/png",
    )
|
||||||
24
src/core/__init__.py
Normal file
24
src/core/__init__.py
Normal file
@@ -0,0 +1,24 @@
|
|||||||
|
"""Core infrastructure for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.core.base_model import (
|
||||||
|
BaseAudioModel,
|
||||||
|
GenerationRequest,
|
||||||
|
GenerationResult,
|
||||||
|
ConditioningType,
|
||||||
|
)
|
||||||
|
from src.core.gpu_manager import GPUMemoryManager, VRAMBudget
|
||||||
|
from src.core.model_registry import ModelRegistry
|
||||||
|
from src.core.oom_handler import OOMHandler, OOMRecoveryError, oom_safe
|
||||||
|
|
||||||
|
__all__ = [
|
||||||
|
"BaseAudioModel",
|
||||||
|
"GenerationRequest",
|
||||||
|
"GenerationResult",
|
||||||
|
"ConditioningType",
|
||||||
|
"GPUMemoryManager",
|
||||||
|
"VRAMBudget",
|
||||||
|
"ModelRegistry",
|
||||||
|
"OOMHandler",
|
||||||
|
"OOMRecoveryError",
|
||||||
|
"oom_safe",
|
||||||
|
]
|
||||||
535
src/core/audio_utils.py
Normal file
535
src/core/audio_utils.py
Normal file
@@ -0,0 +1,535 @@
|
|||||||
|
"""Audio utilities for processing, visualization, and export."""
|
||||||
|
|
||||||
|
import io
|
||||||
|
import logging
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Optional, Tuple, Union
|
||||||
|
|
||||||
|
import numpy as np
|
||||||
|
import torch
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
def normalize_audio(
    audio: Union[torch.Tensor, np.ndarray],
    target_db: float = -14.0,
    peak_normalize: bool = False,
) -> np.ndarray:
    """Normalize audio to target loudness.

    Args:
        audio: Audio tensor or array [channels, samples] or [samples];
            a batched [batch, channels, samples] input uses the first item.
        target_db: Target loudness in dB (LUFS-like)
        peak_normalize: If True, normalize to peak instead of RMS

    Returns:
        Normalized float32 audio as numpy array, clipped to [-1, 1].
    """
    if isinstance(audio, torch.Tensor):
        # Fix: .numpy() raises for CUDA tensors and for tensors that
        # require grad; detach and move to CPU first.
        audio = audio.detach().cpu().numpy()

    # Ensure float32
    audio = audio.astype(np.float32)

    # Handle batch dimension
    if audio.ndim == 3:
        audio = audio[0]  # Take first sample if batched

    if peak_normalize:
        # Peak normalization: scale so the absolute peak hits target_db.
        peak = np.abs(audio).max()
        if peak > 0:
            target_linear = 10 ** (target_db / 20)
            audio = audio * (target_linear / peak)
    else:
        # RMS normalization (approximating LUFS): scale so the RMS
        # level hits target_db.
        rms = np.sqrt(np.mean(audio ** 2))
        if rms > 0:
            target_rms = 10 ** (target_db / 20)
            audio = audio * (target_rms / rms)

    # Hard-limit to the valid range; RMS scaling can push peaks past 1.0.
    audio = np.clip(audio, -1.0, 1.0)

    return audio
|
||||||
|
|
||||||
|
|
||||||
|
def convert_sample_rate(
    audio: np.ndarray,
    orig_sr: int,
    target_sr: int,
) -> np.ndarray:
    """Convert audio sample rate.

    Args:
        audio: Audio array [channels, samples] or [samples]
        orig_sr: Original sample rate
        target_sr: Target sample rate

    Returns:
        Resampled audio (the input itself when rates already match).
    """
    if orig_sr == target_sr:
        return audio

    try:
        import librosa
    except ImportError:
        # Fallback: FFT-based resampling from scipy.
        logger.warning("librosa not available, using scipy for resampling")
        from scipy import signal

        out_len = int(audio.shape[-1] * (target_sr / orig_sr))
        if audio.ndim == 2:
            return np.array([signal.resample(ch, out_len) for ch in audio])
        return signal.resample(audio, out_len)

    # Preferred path: librosa resampling, channel by channel.
    if audio.ndim == 2:
        return np.array([
            librosa.resample(ch, orig_sr=orig_sr, target_sr=target_sr)
            for ch in audio
        ])
    return librosa.resample(audio, orig_sr=orig_sr, target_sr=target_sr)
|
||||||
|
|
||||||
|
|
||||||
|
def generate_waveform(
    audio: Union[torch.Tensor, np.ndarray],
    sample_rate: int,
    width: int = 800,
    height: int = 200,
    color: str = "#3b82f6",
    background: str = "#1f2937",
) -> bytes:
    """Generate waveform image as PNG bytes.

    Renders a per-pixel min/max envelope of the (mono-mixed) signal with
    matplotlib's Agg backend. Returns b"" when matplotlib is missing.

    Args:
        audio: Audio data [channels, samples] or [samples]
        sample_rate: Sample rate in Hz. NOTE(review): currently unused in
            the body — kept for interface symmetry with generate_spectrogram.
        width: Image width in pixels
        height: Image height in pixels
        color: Waveform color (hex)
        background: Background color (hex)

    Returns:
        PNG image as bytes (empty bytes when matplotlib is unavailable)
    """
    try:
        import matplotlib
        # Non-interactive backend; safe in headless/server processes.
        matplotlib.use('Agg')
        import matplotlib.pyplot as plt
    except ImportError:
        # Best-effort: visualization is optional, so degrade gracefully.
        logger.warning("matplotlib not available for waveform generation")
        return b""

    if isinstance(audio, torch.Tensor):
        # NOTE(review): .numpy() fails for CUDA or grad-tracking tensors;
        # assumes CPU tensors reach this point — confirm callers.
        audio = audio.numpy()

    # Handle dimensions: drop batch dim, then mix channels down.
    if audio.ndim == 3:
        audio = audio[0]
    if audio.ndim == 2:
        audio = audio.mean(axis=0)  # Mix to mono for visualization

    # Downsample for visualization: one min/max pair per output pixel.
    samples_per_pixel = max(1, len(audio) // width)
    num_chunks = len(audio) // samples_per_pixel

    if num_chunks > 0:
        # Trim the tail so the buffer reshapes evenly into chunks.
        audio_chunks = audio[:num_chunks * samples_per_pixel].reshape(
            num_chunks, samples_per_pixel
        )
        # Get min/max for each chunk (envelope extremes per pixel column).
        mins = audio_chunks.min(axis=1)
        maxs = audio_chunks.max(axis=1)
    else:
        # Degenerate case (shorter than one pixel's worth): plot raw samples.
        mins = maxs = audio

    # Create figure sized in inches so width/height map to pixels at dpi=100.
    fig, ax = plt.subplots(figsize=(width / 100, height / 100), dpi=100)
    fig.patch.set_facecolor(background)
    ax.set_facecolor(background)

    # Plot waveform as a filled min/max envelope plus a faint zero line.
    x = np.arange(len(mins))
    ax.fill_between(x, mins, maxs, color=color, alpha=0.7)
    ax.axhline(y=0, color=color, alpha=0.3, linewidth=0.5)

    # Style: fixed [-1, 1] amplitude range, no axes chrome.
    ax.set_xlim(0, len(mins))
    ax.set_ylim(-1, 1)
    ax.axis('off')
    plt.tight_layout(pad=0)

    # Save to bytes and release the figure to avoid leaking Agg canvases.
    buf = io.BytesIO()
    fig.savefig(buf, format='png', facecolor=background, edgecolor='none')
    plt.close(fig)
    buf.seek(0)

    return buf.read()
|
||||||
|
|
||||||
|
|
||||||
|
def generate_spectrogram(
    audio: Union[torch.Tensor, np.ndarray],
    sample_rate: int,
    width: int = 800,
    height: int = 200,
    colormap: str = "magma",
) -> bytes:
    """Generate spectrogram image as PNG bytes.

    Computes a 128-bin mel spectrogram (librosa) of the mono-mixed signal
    and renders it with matplotlib. Returns b"" when either dependency is
    missing.

    Args:
        audio: Audio data
        sample_rate: Sample rate in Hz
        width: Image width
        height: Image height
        colormap: Matplotlib colormap name

    Returns:
        PNG image as bytes (empty bytes when matplotlib/librosa are missing)
    """
    try:
        import matplotlib
        # Non-interactive backend; safe in headless/server processes.
        matplotlib.use('Agg')
        import matplotlib.pyplot as plt
        import librosa
        import librosa.display
    except ImportError:
        # Best-effort: visualization is optional, so degrade gracefully.
        logger.warning("matplotlib/librosa not available for spectrogram")
        return b""

    if isinstance(audio, torch.Tensor):
        # NOTE(review): .numpy() fails for CUDA or grad-tracking tensors;
        # assumes CPU tensors reach this point — confirm callers.
        audio = audio.numpy()

    # Handle dimensions: drop batch dim, then mix channels down.
    if audio.ndim == 3:
        audio = audio[0]
    if audio.ndim == 2:
        audio = audio.mean(axis=0)

    # Compute mel spectrogram up to the Nyquist frequency.
    S = librosa.feature.melspectrogram(
        y=audio,
        sr=sample_rate,
        n_mels=128,
        fmax=sample_rate // 2,
    )
    # Convert power to dB relative to the loudest bin.
    S_db = librosa.power_to_db(S, ref=np.max)

    # Create figure sized in inches so width/height map to pixels at dpi=100.
    fig, ax = plt.subplots(figsize=(width / 100, height / 100), dpi=100)

    librosa.display.specshow(
        S_db,
        sr=sample_rate,
        x_axis='time',
        y_axis='mel',
        cmap=colormap,
        ax=ax,
    )

    ax.axis('off')
    plt.tight_layout(pad=0)

    # Save to bytes and release the figure to avoid leaking Agg canvases.
    buf = io.BytesIO()
    fig.savefig(buf, format='png', bbox_inches='tight', pad_inches=0)
    plt.close(fig)
    buf.seek(0)

    return buf.read()
|
||||||
|
|
||||||
|
|
||||||
|
def save_audio(
    audio: Union[torch.Tensor, np.ndarray],
    sample_rate: int,
    path: Path,
    format: str = "wav",
    normalize: bool = True,
    target_db: float = -14.0,
) -> Path:
    """Save audio to file with optional normalization.

    Args:
        audio: Audio data [channels, samples], [samples], or batched
            [batch, channels, samples] (first item is used)
        sample_rate: Sample rate
        path: Output path (extension will be added if needed)
        format: Output format (wav, mp3, flac, ogg); unknown formats
            fall back to WAV
        normalize: Whether to normalize audio
        target_db: Normalization target

    Returns:
        Path to saved file (may differ from `path` when the format
        fell back to WAV)
    """
    import soundfile as sf

    if isinstance(audio, torch.Tensor):
        # Fix: .numpy() raises for CUDA tensors and for tensors that
        # require grad; detach and move to CPU first.
        audio = audio.detach().cpu().numpy()

    # Handle batch dimension
    if audio.ndim == 3:
        audio = audio[0]

    # Normalize if requested
    if normalize:
        audio = normalize_audio(audio, target_db=target_db)

    # Transpose for soundfile [samples, channels]
    if audio.ndim == 2:
        audio = audio.T

    # Ensure correct extension
    path = Path(path)
    if not path.suffix:
        path = path.with_suffix(f".{format}")

    # Save based on format
    if format in ("wav", "flac"):
        sf.write(path, audio, sample_rate)
    elif format == "mp3":
        # soundfile cannot write MP3; go via a temp WAV + pydub when possible.
        try:
            from pydub import AudioSegment

            # Save as WAV first
            wav_path = path.with_suffix(".wav")
            sf.write(wav_path, audio, sample_rate)

            # Convert to MP3
            sound = AudioSegment.from_wav(wav_path)
            sound.export(path, format="mp3", bitrate="320k")

            # Remove the temp WAV — but only if it is not the output itself.
            # Fix: when the caller passed a ".wav" path with format="mp3",
            # wav_path == path and the old code deleted the freshly
            # exported file.
            if wav_path != path:
                wav_path.unlink()
        except ImportError:
            logger.warning("pydub not available, saving as WAV instead")
            path = path.with_suffix(".wav")
            sf.write(path, audio, sample_rate)
    elif format == "ogg":
        sf.write(path, audio, sample_rate, format="ogg", subtype="vorbis")
    else:
        # Default to WAV for any unrecognized format string.
        path = path.with_suffix(".wav")
        sf.write(path, audio, sample_rate)

    return path
|
||||||
|
|
||||||
|
|
||||||
|
def load_audio(
    path: Path,
    target_sr: Optional[int] = None,
    mono: bool = False,
) -> Tuple[np.ndarray, int]:
    """Load audio from file.

    Args:
        path: Path to audio file
        target_sr: Target sample rate (None to keep original)
        mono: Convert to mono

    Returns:
        Tuple of (audio_array in [channels, samples] layout, sample_rate)
    """
    import soundfile as sf

    data, native_sr = sf.read(path)

    # Normalize layout to [channels, samples].
    data = data[np.newaxis, :] if data.ndim == 1 else data.T

    # Optional mono downmix, keeping the channel axis.
    if mono and data.shape[0] > 1:
        data = data.mean(axis=0, keepdims=True)

    # Resample only when a different target rate was requested.
    if target_sr and target_sr != native_sr:
        data = convert_sample_rate(data, native_sr, target_sr)
        return data, target_sr

    return data, native_sr
|
||||||
|
|
||||||
|
|
||||||
|
def get_audio_info(path: Path) -> dict:
    """Get audio file information.

    Args:
        path: Path to audio file

    Returns:
        Dictionary with path, duration, sample_rate, channels, format,
        subtype, and frames as reported by soundfile.
    """
    import soundfile as sf

    meta = sf.info(path)

    return dict(
        path=str(path),
        duration=meta.duration,
        sample_rate=meta.samplerate,
        channels=meta.channels,
        format=meta.format,
        subtype=meta.subtype,
        frames=meta.frames,
    )
|
||||||
|
|
||||||
|
|
||||||
|
def trim_silence(
    audio: np.ndarray,
    sample_rate: int,
    threshold_db: float = -40.0,
    min_silence_ms: int = 100,
) -> np.ndarray:
    """Trim silence from start and end of audio.

    Args:
        audio: Audio array
        sample_rate: Sample rate
        threshold_db: Silence threshold in dB
        min_silence_ms: Minimum silence duration to trim

    Returns:
        Trimmed audio (unchanged when librosa is unavailable or no
        non-silent region is found).
    """
    try:
        import librosa
    except ImportError:
        # Best-effort: trimming is optional when librosa is missing.
        logger.warning("librosa not available for silence trimming")
        return audio

    # Silence detection operates on a mono mixdown.
    mono = audio.mean(axis=0) if audio.ndim == 2 else audio

    # Get non-silent intervals.
    intervals = librosa.effects.split(
        mono,
        top_db=abs(threshold_db),
        frame_length=int(sample_rate * min_silence_ms / 1000),
    )

    if len(intervals) == 0:
        return audio

    # Keep everything between the first and last non-silent regions.
    start, end = intervals[0][0], intervals[-1][1]
    return audio[:, start:end] if audio.ndim == 2 else audio[start:end]
|
||||||
|
|
||||||
|
|
||||||
|
def apply_fade(
    audio: np.ndarray,
    sample_rate: int,
    fade_in_ms: float = 0,
    fade_out_ms: float = 0,
) -> np.ndarray:
    """Apply fade in/out to audio.

    Args:
        audio: Audio array [channels, samples] or [samples]
        sample_rate: Sample rate
        fade_in_ms: Fade in duration in milliseconds
        fade_out_ms: Fade out duration in milliseconds

    Returns:
        New array with linear fades applied; the input is left untouched.
    """
    out = audio.copy()
    total = out.shape[-1]

    if fade_in_ms > 0:
        # Linear ramp 0 -> 1 over the fade-in window, capped at the clip length.
        n_in = min(int(sample_rate * fade_in_ms / 1000), total)
        out[..., :n_in] *= np.linspace(0, 1, n_in)

    if fade_out_ms > 0:
        # Linear ramp 1 -> 0 over the fade-out window, capped at the clip length.
        n_out = min(int(sample_rate * fade_out_ms / 1000), total)
        out[..., -n_out:] *= np.linspace(1, 0, n_out)

    return out
|
||||||
|
|
||||||
|
|
||||||
|
def concatenate_audio(
    audio_list: list[np.ndarray],
    sample_rate: int,
    crossfade_ms: float = 0,
) -> np.ndarray:
    """Concatenate multiple audio segments.

    Args:
        audio_list: List of audio arrays, each [channels, samples] or
            [samples] with matching channel layouts
        sample_rate: Sample rate (must be same for all)
        crossfade_ms: Crossfade duration between segments; 0 (or a fade
            longer than either neighbor) means plain concatenation

    Returns:
        Concatenated audio. Input arrays are never modified.
    """
    if not audio_list:
        return np.array([])

    if len(audio_list) == 1:
        return audio_list[0]

    crossfade_samples = int(sample_rate * crossfade_ms / 1000)

    result = audio_list[0]

    for seg in audio_list[1:]:
        if 0 < crossfade_samples < min(result.shape[-1], seg.shape[-1]):
            # Linear equal-gain crossfade over the overlap region.
            fade_out = np.linspace(1, 0, crossfade_samples)
            fade_in = np.linspace(0, 1, crossfade_samples)

            # Fix: compute the faded tail out-of-place. The old code did
            # `result[..., -n:] *= fade_out` while `result` still aliased
            # the caller's first array, mutating the caller's data.
            overlap = (
                result[..., -crossfade_samples:] * fade_out
                + seg[..., :crossfade_samples] * fade_in
            )
            result = np.concatenate(
                [result[..., :-crossfade_samples], overlap, seg[..., crossfade_samples:]],
                axis=-1,
            )
        else:
            # Simple concatenation along the sample axis.
            result = np.concatenate([result, seg], axis=-1)

    return result
|
||||||
247
src/core/base_model.py
Normal file
247
src/core/base_model.py
Normal file
@@ -0,0 +1,247 @@
|
|||||||
|
"""Abstract base classes for AudioCraft model adapters."""
|
||||||
|
|
||||||
|
from abc import ABC, abstractmethod
|
||||||
|
from dataclasses import dataclass, field
|
||||||
|
from enum import Enum
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
import torch
|
||||||
|
|
||||||
|
|
||||||
|
class ConditioningType(str, Enum):
    """Types of conditioning supported by models.

    A str-valued Enum so members compare equal to their raw string form
    (e.g. ConditioningType.TEXT == "text") and round-trip through
    ConditioningType(value).
    """

    TEXT = "text"      # text-prompt conditioning
    MELODY = "melody"  # melody/audio reference conditioning
    STYLE = "style"    # style reference conditioning
    CHORDS = "chords"  # chord-progression conditioning
    DRUMS = "drums"    # drum-track conditioning
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class GenerationRequest:
    """Request parameters for audio generation.

    Attributes:
        prompts: List of text prompts for generation
        duration: Target duration in seconds
        temperature: Sampling temperature (higher = more random)
        top_k: Top-k sampling parameter
        top_p: Nucleus sampling parameter (0 = disabled)
        cfg_coef: Classifier-free guidance coefficient
        batch_size: Number of samples to generate per prompt
        seed: Random seed for reproducibility
        conditioning: Optional conditioning data
    """

    prompts: list[str]
    duration: float = 10.0
    temperature: float = 1.0
    top_k: int = 250
    top_p: float = 0.0
    cfg_coef: float = 3.0
    batch_size: int = 1
    seed: Optional[int] = None
    conditioning: dict[str, Any] = field(default_factory=dict)

    def __post_init__(self) -> None:
        """Validate request parameters.

        Raises:
            ValueError: If any parameter is outside its valid range.
        """
        if not self.prompts:
            raise ValueError("At least one prompt is required")
        if self.duration <= 0:
            raise ValueError("Duration must be positive")
        if self.temperature < 0:
            raise ValueError("Temperature must be non-negative")
        if self.top_k < 0:
            raise ValueError("top_k must be non-negative")
        if not 0 <= self.top_p <= 1:
            raise ValueError("top_p must be between 0 and 1")
        if self.cfg_coef < 1:
            raise ValueError("cfg_coef must be >= 1")
        # Fix: batch_size was documented but never validated; a zero or
        # negative value would otherwise pass silently.
        if self.batch_size < 1:
            raise ValueError("batch_size must be >= 1")
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class GenerationResult:
    """Result of audio generation.

    Attributes:
        audio: Generated audio tensor (shape: [batch, channels, samples])
        sample_rate: Audio sample rate in Hz
        duration: Actual duration in seconds
        model_id: ID of the model used
        variant: Model variant used
        parameters: Generation parameters used
        seed: Actual seed used (for reproducibility)
    """

    audio: torch.Tensor
    sample_rate: int
    duration: float
    model_id: str
    variant: str
    parameters: dict[str, Any]
    seed: int

    @property
    def num_samples(self) -> int:
        """Number of audio samples generated (batch dimension)."""
        return self.audio.size(0)

    @property
    def num_channels(self) -> int:
        """Number of audio channels."""
        return self.audio.size(1)

    @property
    def num_frames(self) -> int:
        """Number of audio frames (per-channel sample count)."""
        return self.audio.size(2)
|
||||||
|
|
||||||
|
|
||||||
|
class BaseAudioModel(ABC):
    """Abstract base class for AudioCraft model adapters.

    All model adapters must implement this interface to integrate with
    the model registry and generation service. Metadata (ids, VRAM
    estimate, conditioning support) is exposed as properties so the
    registry can inspect adapters without loading weights.
    """

    @property
    @abstractmethod
    def model_id(self) -> str:
        """Unique identifier for this model family (e.g., 'musicgen')."""
        ...

    @property
    @abstractmethod
    def variant(self) -> str:
        """Current model variant (e.g., 'medium', 'large')."""
        ...

    @property
    @abstractmethod
    def display_name(self) -> str:
        """Human-readable name for UI display."""
        ...

    @property
    @abstractmethod
    def description(self) -> str:
        """Brief description of the model's capabilities."""
        ...

    @property
    @abstractmethod
    def vram_estimate_mb(self) -> int:
        """Estimated VRAM usage when loaded (in megabytes)."""
        ...

    @property
    @abstractmethod
    def max_duration(self) -> float:
        """Maximum supported generation duration in seconds."""
        ...

    @property
    @abstractmethod
    def sample_rate(self) -> int:
        """Output audio sample rate in Hz."""
        ...

    @property
    @abstractmethod
    def supports_conditioning(self) -> list[ConditioningType]:
        """List of conditioning types supported by this model."""
        ...

    @property
    @abstractmethod
    def is_loaded(self) -> bool:
        """Whether the model is currently loaded in memory."""
        ...

    @property
    def device(self) -> Optional[torch.device]:
        """Device the model is loaded on, or None if not loaded.

        Default implementation always returns None; concrete adapters
        should override when they track their device.
        """
        return None

    @abstractmethod
    def load(self, device: str = "cuda") -> None:
        """Load the model into memory.

        Args:
            device: Target device ('cuda', 'cuda:0', 'cpu', etc.)

        Raises:
            RuntimeError: If loading fails
        """
        ...

    @abstractmethod
    def unload(self) -> None:
        """Unload the model and free memory.

        Should be idempotent - safe to call even if not loaded.
        """
        ...

    @abstractmethod
    def generate(self, request: GenerationRequest) -> GenerationResult:
        """Generate audio based on the request.

        Args:
            request: Generation parameters and prompts

        Returns:
            GenerationResult containing audio and metadata

        Raises:
            RuntimeError: If model is not loaded
            ValueError: If request parameters are invalid for this model
        """
        ...

    @abstractmethod
    def get_default_params(self) -> dict[str, Any]:
        """Get default generation parameters for this model.

        Returns:
            Dictionary of parameter names to default values
        """
        ...

    def validate_request(self, request: GenerationRequest) -> None:
        """Validate a generation request for this model.

        Args:
            request: Request to validate

        Raises:
            RuntimeError: If the model is not currently loaded
            ValueError: If request is invalid for this model (duration
                exceeds the maximum, or unsupported/unknown conditioning)
        """
        if not self.is_loaded:
            raise RuntimeError(f"Model {self.model_id}/{self.variant} is not loaded")

        if request.duration > self.max_duration:
            raise ValueError(
                f"Duration {request.duration}s exceeds maximum {self.max_duration}s "
                f"for {self.model_id}/{self.variant}"
            )

        # Check conditioning requirements: each non-None conditioning key
        # must name a known ConditioningType that this model supports.
        for cond_type, cond_data in request.conditioning.items():
            if cond_data is not None:
                try:
                    cond_enum = ConditioningType(cond_type)
                except ValueError:
                    raise ValueError(f"Unknown conditioning type: {cond_type}")

                if cond_enum not in self.supports_conditioning:
                    raise ValueError(
                        f"Model {self.model_id}/{self.variant} does not support "
                        f"{cond_type} conditioning"
                    )

    def __repr__(self) -> str:
        """String representation."""
        loaded = "loaded" if self.is_loaded else "not loaded"
        return f"<{self.__class__.__name__} {self.model_id}/{self.variant} ({loaded})>"
|
||||||
433
src/core/gpu_manager.py
Normal file
433
src/core/gpu_manager.py
Normal file
@@ -0,0 +1,433 @@
|
|||||||
|
"""GPU memory management for AudioCraft models."""
|
||||||
|
|
||||||
|
import gc
|
||||||
|
import json
|
||||||
|
import logging
|
||||||
|
import threading
|
||||||
|
import time
|
||||||
|
from dataclasses import dataclass
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
import torch
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class VRAMBudget:
    """VRAM budget allocation information.

    Attributes:
        total_mb: Total VRAM in megabytes
        used_mb: Currently used VRAM
        free_mb: Free VRAM
        reserved_comfyui_mb: VRAM reserved for ComfyUI
        safety_buffer_mb: Safety buffer to prevent OOM
        available_mb: VRAM available for AudioCraft models
    """

    total_mb: int
    used_mb: int
    free_mb: int
    reserved_comfyui_mb: int
    safety_buffer_mb: int
    available_mb: int

    @property
    def utilization(self) -> float:
        """Current VRAM utilization as a fraction (0-1)."""
        # Guard against a zero-capacity report to avoid division by zero.
        if self.total_mb <= 0:
            return 0.0
        return self.used_mb / self.total_mb
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class GPUState:
    """State information for inter-service coordination.

    One snapshot of a single service's VRAM situation, written to the
    shared coordination file so AudioCraft and ComfyUI can see each
    other's demands.
    """

    timestamp: float  # wall-clock time the snapshot was taken (time.time())
    service: str  # "audiocraft" or "comfyui"
    vram_used_mb: int  # VRAM currently held by the service, in MB
    vram_requested_mb: int  # VRAM the service wants to acquire, in MB
    status: str  # "idle", "working", "requesting_priority", "yielded"
|
||||||
|
|
||||||
|
|
||||||
|
class GPUMemoryManager:
|
||||||
|
"""Manages GPU memory allocation and coordination with ComfyUI.
|
||||||
|
|
||||||
|
Uses pynvml for accurate system-wide VRAM tracking and file-based
|
||||||
|
IPC for coordination with ComfyUI running on the same system.
|
||||||
|
"""
|
||||||
|
|
||||||
|
COORDINATION_FILE = Path("/tmp/audiocraft_comfyui_coord.json")
|
||||||
|
LOCK_FILE = Path("/tmp/audiocraft_comfyui_coord.lock")
|
||||||
|
STALE_THRESHOLD = 30.0 # seconds
|
||||||
|
|
||||||
|
def __init__(
    self,
    device_id: int = 0,
    comfyui_reserve_gb: float = 10.0,
    safety_buffer_gb: float = 1.0,
):
    """Initialize GPU memory manager.

    Args:
        device_id: CUDA device index
        comfyui_reserve_gb: VRAM to reserve for ComfyUI (gigabytes)
        safety_buffer_gb: Safety buffer to prevent OOM (gigabytes)
    """
    self.device_id = device_id
    self.device = torch.device(f"cuda:{device_id}")
    # Store reservations in MB; all internal accounting is MB-based.
    self.comfyui_reserve_mb = int(comfyui_reserve_gb * 1024)
    self.safety_buffer_mb = int(safety_buffer_gb * 1024)

    # Initialize NVML for direct GPU monitoring (optional; falls back
    # to torch.cuda accounting when unavailable).
    self._nvml_initialized = False
    self._nvml_handle = None
    self._init_nvml()

    # Reentrant lock guarding memory queries/updates across threads.
    self._lock = threading.RLock()

    # Callbacks for memory events (invoked elsewhere in this class).
    self._low_memory_callbacks: list[Callable[[VRAMBudget], None]] = []
    self._oom_callbacks: list[Callable[[], None]] = []

    # Create the file-based IPC state shared with ComfyUI if missing.
    self._ensure_coordination_file()
|
||||||
|
|
||||||
|
def _init_nvml(self) -> None:
    """Initialize NVML for GPU monitoring.

    Best-effort: on any failure (missing pynvml package or NVML init
    error) the manager keeps _nvml_initialized False and memory queries
    fall back to the torch.cuda path.
    """
    try:
        import pynvml

        pynvml.nvmlInit()
        self._nvml_handle = pynvml.nvmlDeviceGetHandleByIndex(self.device_id)
        # Only set after both init and handle acquisition succeed.
        self._nvml_initialized = True
        logger.info("NVML initialized successfully")
    except ImportError:
        logger.warning("pynvml not available, falling back to torch.cuda")
    except Exception as e:
        logger.warning(f"Failed to initialize NVML: {e}, falling back to torch.cuda")
|
||||||
|
|
||||||
|
def _ensure_coordination_file(self) -> None:
    """Create coordination file if it doesn't exist.

    Seeds the shared IPC file with an empty state: no per-service
    snapshots and no priority holder.

    NOTE(review): not atomic across processes — two services racing
    here could both see the file as missing; confirm whether the lock
    file is used around this call elsewhere.
    """
    if not self.COORDINATION_FILE.exists():
        initial_state = {
            "audiocraft": None,
            "comfyui": None,
            "priority": None,
            "last_update": time.time(),
        }
        # _write_coordination_state is defined elsewhere in this class
        # (outside this view); presumably serializes state to
        # COORDINATION_FILE as JSON — confirm.
        self._write_coordination_state(initial_state)
|
||||||
|
|
||||||
|
def get_memory_info(self) -> dict[str, int]:
    """Get current GPU memory status.

    Prefers NVML (system-wide figures) and falls back to torch.cuda
    (process-local figures) when NVML is unavailable.

    Returns:
        Dictionary with memory values in megabytes:
        - total: Total VRAM
        - used: Used VRAM (system-wide)
        - free: Free VRAM
        - torch_allocated: PyTorch allocated memory
        - torch_reserved: PyTorch reserved memory
        - torch_cached: PyTorch cached memory
    """
    with self._lock:
        reader = (
            self._get_memory_info_nvml
            if self._nvml_initialized
            else self._get_memory_info_torch
        )
        return reader()
|
||||||
|
|
||||||
|
def _get_memory_info_nvml(self) -> dict[str, int]:
    """Get memory info using NVML (more accurate).

    NVML reports device-wide totals (all processes, including ComfyUI),
    while the torch counters cover only this process. All values are
    converted from bytes to megabytes.
    """
    import pynvml

    info = pynvml.nvmlDeviceGetMemoryInfo(self._nvml_handle)
    torch_allocated = torch.cuda.memory_allocated(self.device)
    torch_reserved = torch.cuda.memory_reserved(self.device)

    return {
        "total": info.total // (1024 * 1024),
        "used": info.used // (1024 * 1024),
        "free": info.free // (1024 * 1024),
        "torch_allocated": torch_allocated // (1024 * 1024),
        "torch_reserved": torch_reserved // (1024 * 1024),
        # Reserved-but-unallocated memory torch could return via empty_cache().
        "torch_cached": (torch_reserved - torch_allocated) // (1024 * 1024),
    }
|
||||||
|
|
||||||
|
def _get_memory_info_torch(self) -> dict[str, int]:
    """Get memory info using torch.cuda (fallback).

    Unlike NVML, torch only sees this process's allocations, so "used"
    and "free" here ignore other processes (e.g. ComfyUI) on the device.
    All values are converted from bytes to megabytes.
    """
    props = torch.cuda.get_device_properties(self.device)
    allocated = torch.cuda.memory_allocated(self.device)
    reserved = torch.cuda.memory_reserved(self.device)

    # Note: This is less accurate for system-wide usage
    return {
        "total": props.total_memory // (1024 * 1024),
        # "used"/"free" are derived from this process's reserved pool only.
        "used": reserved // (1024 * 1024),
        "free": (props.total_memory - reserved) // (1024 * 1024),
        "torch_allocated": allocated // (1024 * 1024),
        "torch_reserved": reserved // (1024 * 1024),
        "torch_cached": (reserved - allocated) // (1024 * 1024),
    }
|
||||||
|
|
||||||
|
def get_available_budget(self) -> VRAMBudget:
    """Calculate available VRAM budget considering ComfyUI.

    Reserve policy: if ComfyUI has a fresh heartbeat and has not yielded,
    reserve the larger of the configured static reserve and ComfyUI's
    reported usage plus 2GB headroom; otherwise just the static reserve.

    Returns:
        VRAMBudget with current allocation information
    """
    mem = self.get_memory_info()

    # Check ComfyUI's actual usage via coordination file
    comfyui_state = self.get_comfyui_status()
    if comfyui_state and comfyui_state.status != "yielded":
        # Use actual ComfyUI usage + buffer, or reserve, whichever is higher
        effective_comfyui_reserve = max(
            self.comfyui_reserve_mb,
            comfyui_state.vram_used_mb + 2048,  # 2GB headroom
        )
    else:
        effective_comfyui_reserve = self.comfyui_reserve_mb

    # Available = total - system-wide used, with our own torch allocations
    # added back (we may replace our own models in place), minus the
    # ComfyUI reserve and the OOM safety buffer. Clamped at zero.
    available = max(
        0,
        mem["total"]
        - mem["used"]
        + mem["torch_allocated"]  # Our own usage doesn't count against us
        - effective_comfyui_reserve
        - self.safety_buffer_mb,
    )

    return VRAMBudget(
        total_mb=mem["total"],
        used_mb=mem["used"],
        free_mb=mem["free"],
        reserved_comfyui_mb=effective_comfyui_reserve,
        safety_buffer_mb=self.safety_buffer_mb,
        available_mb=available,
    )
|
||||||
|
|
||||||
|
def can_load_model(self, vram_required_mb: int) -> tuple[bool, str]:
    """Check whether a model of the given size fits in the current budget.

    Args:
        vram_required_mb: VRAM needed by the model

    Returns:
        Tuple of (can_load, reason_message)
    """
    available = self.get_available_budget().available_mb
    shortfall = vram_required_mb - available
    if shortfall <= 0:
        return True, "Sufficient VRAM available"

    return False, (
        f"Insufficient VRAM: need {vram_required_mb}MB, "
        f"available {available}MB (deficit: {shortfall}MB)"
    )
|
||||||
|
|
||||||
|
def force_cleanup(self) -> int:
    """Force GPU memory cleanup.

    Runs a Python GC pass (to drop lingering tensor references), releases
    torch's cached blocks back to the driver, and synchronizes the device
    so the release has taken effect before re-measuring.

    Returns:
        Freed memory in megabytes (approximate — measured as the drop in
        torch's reserved pool, so it may be 0 if nothing was releasable)
    """
    with self._lock:
        before = self.get_memory_info()

        gc.collect()
        torch.cuda.empty_cache()
        torch.cuda.synchronize(self.device)

        after = self.get_memory_info()
        freed = before["torch_reserved"] - after["torch_reserved"]

        if freed > 0:
            logger.info(f"Freed {freed}MB of GPU memory")

        return freed
|
||||||
|
|
||||||
|
def get_status(self) -> dict[str, Any]:
    """Get detailed GPU status for UI display.

    Returns:
        Dictionary of human-friendly values (GB, rounded to 2 decimals;
        utilization as a percentage rounded to 1 decimal).
    """
    mem = self.get_memory_info()
    budget = self.get_available_budget()

    def to_gb(mb: int) -> float:
        # Convert megabytes to gigabytes for display.
        return round(mb / 1024, 2)

    status = {"device": str(self.device)}
    status["total_gb"] = to_gb(mem["total"])
    status["used_gb"] = to_gb(mem["used"])
    status["free_gb"] = to_gb(mem["free"])
    status["utilization_percent"] = round(budget.utilization * 100, 1)
    status["available_for_models_gb"] = to_gb(budget.available_mb)
    status["comfyui_reserve_gb"] = to_gb(budget.reserved_comfyui_mb)
    status["torch_allocated_gb"] = to_gb(mem["torch_allocated"])
    status["torch_cached_gb"] = to_gb(mem["torch_cached"])
    return status
|
||||||
|
|
||||||
|
# ComfyUI Coordination Methods
|
||||||
|
|
||||||
|
def _read_coordination_state(self) -> dict[str, Any]:
    """Read the shared coordination state; empty dict on any failure.

    A missing file, unreadable file, or corrupt JSON all degrade to an
    empty state rather than raising — coordination is best-effort.
    """
    path = self.COORDINATION_FILE
    try:
        if not path.exists():
            return {}
        return json.loads(path.read_text())
    except (json.JSONDecodeError, IOError) as e:
        logger.warning(f"Failed to read coordination file: {e}")
        return {}
|
||||||
|
|
||||||
|
def _write_coordination_state(self, state: dict[str, Any]) -> None:
    """Write coordination state to file with locking.

    Serializes ``state`` to COORDINATION_FILE while holding an exclusive
    flock on a separate LOCK_FILE so concurrent writers (this process and
    ComfyUI) do not interleave writes. POSIX-only due to fcntl. Failures
    are logged, not raised — coordination is best-effort.
    """
    import fcntl  # POSIX-only; imported lazily so the module imports elsewhere

    try:
        self.LOCK_FILE.parent.mkdir(parents=True, exist_ok=True)
        with open(self.LOCK_FILE, "w") as lock:
            fcntl.flock(lock, fcntl.LOCK_EX)
            try:
                self.COORDINATION_FILE.write_text(json.dumps(state, indent=2))
            finally:
                # Unlock explicitly even though closing the file would also
                # release the lock — keeps the critical section obvious.
                fcntl.flock(lock, fcntl.LOCK_UN)
    except IOError as e:
        logger.warning(f"Failed to write coordination file: {e}")
|
||||||
|
|
||||||
|
def update_status(
    self,
    vram_used_mb: int,
    vram_requested_mb: int = 0,
    status: str = "idle",
) -> None:
    """Publish AudioCraft's side of the coordination handshake.

    Args:
        vram_used_mb: Current VRAM usage in MB.
        vram_requested_mb: VRAM needed for a pending operation, in MB.
        status: Current status ("idle", "working", "requesting_priority").
    """
    state = self._read_coordination_state()
    entry = {
        "timestamp": time.time(),
        "service": "audiocraft",
        "vram_used_mb": vram_used_mb,
        "vram_requested_mb": vram_requested_mb,
        "status": status,
    }
    state["audiocraft"] = entry
    state["last_update"] = time.time()
    self._write_coordination_state(state)
|
||||||
|
|
||||||
|
def get_comfyui_status(self) -> Optional[GPUState]:
    """Get ComfyUI's current status.

    Returns:
        GPUState if ComfyUI is active and status is fresh, None otherwise
        (no entry, or heartbeat older than STALE_THRESHOLD seconds).
    """
    state = self._read_coordination_state()
    comfyui_data = state.get("comfyui")

    if not comfyui_data:
        return None

    # Check if stale — a crashed/stopped ComfyUI must not pin its reserve.
    if time.time() - comfyui_data.get("timestamp", 0) > self.STALE_THRESHOLD:
        return None

    return GPUState(
        timestamp=comfyui_data["timestamp"],
        service="comfyui",
        vram_used_mb=comfyui_data.get("vram_used_mb", 0),
        vram_requested_mb=comfyui_data.get("vram_requested_mb", 0),
        status=comfyui_data.get("status", "unknown"),
    )
|
||||||
|
|
||||||
|
def request_priority(self, vram_needed_mb: int, timeout: float = 30.0) -> bool:
    """Request VRAM priority from ComfyUI.

    Signals ComfyUI to release VRAM if possible.

    Writes a priority request into the coordination file, then polls
    every 0.5s (up to ``timeout``) for ComfyUI to report "yielded".
    Blocking — do not call from a latency-sensitive thread.

    Args:
        vram_needed_mb: Amount of VRAM needed
        timeout: Seconds to wait for ComfyUI to yield

    Returns:
        True if ComfyUI acknowledged and yielded, False otherwise
    """
    state = self._read_coordination_state()
    state["priority"] = {
        "requester": "audiocraft",
        "vram_needed_mb": vram_needed_mb,
        "timestamp": time.time(),
    }
    self._write_coordination_state(state)

    logger.info(f"Requesting {vram_needed_mb}MB VRAM from ComfyUI...")

    # Wait for ComfyUI to respond
    start = time.time()
    while time.time() - start < timeout:
        comfyui = self.get_comfyui_status()
        if comfyui and comfyui.status == "yielded":
            logger.info("ComfyUI yielded VRAM")
            return True
        time.sleep(0.5)

    logger.warning("ComfyUI did not yield VRAM within timeout")
    return False
|
||||||
|
|
||||||
|
def is_comfyui_busy(self) -> bool:
    """Check if ComfyUI is actively processing.

    Returns:
        True if ComfyUI has a fresh heartbeat with status "working".
    """
    peer = self.get_comfyui_status()
    if peer is None:
        return False
    return peer.status == "working"
|
||||||
|
|
||||||
|
# Callback Registration
|
||||||
|
|
||||||
|
def on_low_memory(self, callback: Callable[[VRAMBudget], None]) -> None:
    """Register callback for low memory warnings.

    Callbacks are invoked in registration order by check_memory_pressure
    when utilization crosses its warning threshold.

    Args:
        callback: Function to call with budget info when memory is low
    """
    self._low_memory_callbacks.append(callback)
|
||||||
|
|
||||||
|
def on_oom(self, callback: Callable[[], None]) -> None:
    """Register callback for OOM events.

    NOTE(review): nothing in this file's visible code iterates
    _oom_callbacks — confirm the OOM path actually fires these.

    Args:
        callback: Function to call when OOM occurs
    """
    self._oom_callbacks.append(callback)
|
||||||
|
|
||||||
|
def check_memory_pressure(self, warning_threshold: float = 0.85) -> None:
    """Check memory pressure and trigger low-memory callbacks if needed.

    A failing callback is logged and skipped so one bad subscriber
    cannot block the others.

    Args:
        warning_threshold: Utilization threshold for warnings (0-1)
    """
    budget = self.get_available_budget()
    if budget.utilization < warning_threshold:
        return

    logger.warning(
        f"High GPU memory pressure: {budget.utilization*100:.1f}% utilized"
    )
    for cb in self._low_memory_callbacks:
        try:
            cb(budget)
        except Exception as e:
            logger.error(f"Low memory callback failed: {e}")
|
||||||
|
|
||||||
|
def __del__(self) -> None:
    """Cleanup NVML on destruction.

    Uses getattr because __del__ can run on a partially constructed
    instance: __init__ assigns _nvml_initialized only after creating the
    torch device, so if that raised, a plain attribute access here would
    itself raise AttributeError inside the finalizer.
    """
    if getattr(self, "_nvml_initialized", False):
        try:
            import pynvml

            pynvml.nvmlShutdown()
        except Exception:
            # Best-effort teardown; errors during finalization are ignored.
            pass
|
||||||
487
src/core/model_registry.py
Normal file
487
src/core/model_registry.py
Normal file
@@ -0,0 +1,487 @@
|
|||||||
|
"""Model registry for discovering and managing AudioCraft model adapters."""
|
||||||
|
|
||||||
|
import asyncio
|
||||||
|
import logging
|
||||||
|
import threading
|
||||||
|
import time
|
||||||
|
from contextlib import contextmanager
|
||||||
|
from dataclasses import dataclass, field
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Any, Generator, Optional, Type
|
||||||
|
|
||||||
|
import yaml
|
||||||
|
|
||||||
|
from src.core.base_model import BaseAudioModel, ConditioningType
|
||||||
|
from src.core.gpu_manager import GPUMemoryManager
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class ModelVariantConfig:
    """Configuration for a model variant (one size/checkpoint of a family)."""

    hf_id: str  # Hugging Face model identifier for this checkpoint
    vram_mb: int  # Approximate VRAM footprint; used for admission/eviction planning
    max_duration: float = 30.0  # Longest generation this variant supports (seconds)
    channels: int = 1  # Output channel count
    conditioning: list[str] = field(default_factory=list)  # Supported conditioning input names
    description: str = ""  # Human-readable blurb for UI display
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class ModelFamilyConfig:
    """Configuration for a model family (one top-level entry in the models YAML)."""

    enabled: bool  # Disabled families are skipped at config-load time
    display_name: str  # Name shown in UI listings
    description: str  # Family-level description; a variant's own description takes precedence
    default_variant: str  # Variant used when callers don't specify one
    variants: dict[str, ModelVariantConfig]  # Variant name -> its configuration
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class ModelHandle:
    """Handle for a loaded model with reference counting.

    Tracked by ModelRegistry: ref_count > 0 marks the model as in use and
    blocks unloading; last_accessed drives LRU eviction and idle timeout.
    """

    model: BaseAudioModel  # The loaded adapter instance
    model_id: str  # Model family ID
    variant: str  # Variant name within the family
    loaded_at: float  # Epoch seconds when the model finished loading
    last_accessed: float  # Epoch seconds of most recent use
    ref_count: int = 0  # Number of callers currently inside get_model()

    def touch(self) -> None:
        """Update last accessed time."""
        self.last_accessed = time.time()
|
||||||
|
|
||||||
|
|
||||||
|
class ModelRegistry:
|
||||||
|
"""Central registry for discovering and managing model adapters.
|
||||||
|
|
||||||
|
Handles:
|
||||||
|
- Loading model configurations from YAML
|
||||||
|
- Lazy loading models on demand
|
||||||
|
- LRU eviction when VRAM is constrained
|
||||||
|
- Reference counting to prevent unloading during use
|
||||||
|
- Automatic idle timeout for unused models
|
||||||
|
"""
|
||||||
|
|
||||||
|
def __init__(
    self,
    config_path: Path,
    gpu_manager: GPUMemoryManager,
    max_cached_models: int = 2,
    idle_timeout_minutes: int = 15,
):
    """Initialize the model registry.

    Configuration is loaded eagerly at the end of construction; the
    cleanup thread must be started separately via start_cleanup_thread().

    Args:
        config_path: Path to models.yaml configuration
        gpu_manager: GPU memory manager instance
        max_cached_models: Maximum models to keep loaded
        idle_timeout_minutes: Unload models after this idle time
    """
    self.config_path = config_path
    self.gpu_manager = gpu_manager
    self.max_cached_models = max_cached_models
    self.idle_timeout_seconds = idle_timeout_minutes * 60

    # Model configurations
    self._model_configs: dict[str, ModelFamilyConfig] = {}
    self._default_params: dict[str, Any] = {}

    # Loaded model handles
    self._handles: dict[str, ModelHandle] = {}  # Key: "model_id/variant"
    # Front of the list = least recently used (eviction scans from index 0).
    self._access_order: list[str] = []  # LRU tracking

    # Registered adapter classes
    self._adapter_classes: dict[str, Type[BaseAudioModel]] = {}

    # Threading
    self._lock = threading.RLock()
    self._cleanup_thread: Optional[threading.Thread] = None
    self._stop_cleanup = threading.Event()

    # Load configuration
    self._load_config()
|
||||||
|
|
||||||
|
def _load_config(self) -> None:
    """Load model configurations from YAML file.

    A missing config file is non-fatal (the registry starts empty).
    Families with enabled: false are skipped entirely. Each variant must
    define hf_id and vram_mb — a malformed entry raises KeyError here.
    """
    if not self.config_path.exists():
        logger.warning(f"Model config not found: {self.config_path}")
        return

    with open(self.config_path) as f:
        config = yaml.safe_load(f)

    # Parse model families
    for model_id, model_config in config.get("models", {}).items():
        if not model_config.get("enabled", True):
            continue

        variants = {}
        for variant_name, variant_config in model_config.get("variants", {}).items():
            variants[variant_name] = ModelVariantConfig(
                hf_id=variant_config["hf_id"],
                vram_mb=variant_config["vram_mb"],
                max_duration=variant_config.get("max_duration", 30.0),
                channels=variant_config.get("channels", 1),
                conditioning=variant_config.get("conditioning", []),
                description=variant_config.get("description", ""),
            )

        self._model_configs[model_id] = ModelFamilyConfig(
            enabled=model_config.get("enabled", True),
            display_name=model_config.get("display_name", model_id),
            description=model_config.get("description", ""),
            default_variant=model_config.get("default_variant", "medium"),
            variants=variants,
        )

    # Parse default generation parameters
    self._default_params = config.get("defaults", {}).get("generation", {})

    logger.info(f"Loaded {len(self._model_configs)} model families from config")
|
||||||
|
|
||||||
|
def register_adapter(
    self, model_id: str, adapter_class: Type[BaseAudioModel]
) -> None:
    """Register a model adapter class.

    Registration is what makes a configured family loadable — _load_model
    raises ValueError for families with no registered adapter.
    Re-registering a model_id overwrites the previous class.

    Args:
        model_id: Model family ID (e.g., 'musicgen')
        adapter_class: Adapter class implementing BaseAudioModel
    """
    self._adapter_classes[model_id] = adapter_class
    logger.debug(f"Registered adapter for {model_id}: {adapter_class.__name__}")
|
||||||
|
|
||||||
|
def list_models(self) -> list[dict[str, Any]]:
    """List all available models with their configurations.

    One entry per (family, variant) pair, including live state such as
    whether the variant is currently loaded and whether it would fit in
    the present VRAM budget.

    NOTE(review): reads self._handles without holding self._lock —
    confirm the benign-race is acceptable for UI listing.

    Returns:
        List of model information dictionaries
    """
    models = []

    for model_id, config in self._model_configs.items():
        for variant_name, variant in config.variants.items():
            key = f"{model_id}/{variant_name}"
            handle = self._handles.get(key)

            # Snapshot of whether this variant would fit right now.
            can_load, reason = self.gpu_manager.can_load_model(variant.vram_mb)

            models.append({
                "model_id": model_id,
                "variant": variant_name,
                "display_name": config.display_name,
                # Variant description wins over the family description.
                "description": variant.description or config.description,
                "hf_id": variant.hf_id,
                "vram_mb": variant.vram_mb,
                "max_duration": variant.max_duration,
                "channels": variant.channels,
                "conditioning": variant.conditioning,
                "is_default": variant_name == config.default_variant,
                "is_loaded": handle is not None,
                "can_load": can_load,
                "load_reason": reason,
                "has_adapter": model_id in self._adapter_classes,
            })

    return models
|
||||||
|
|
||||||
|
def get_model_config(
    self, model_id: str, variant: Optional[str] = None
) -> tuple[ModelFamilyConfig, ModelVariantConfig]:
    """Look up the configuration for a model family and one of its variants.

    Args:
        model_id: Model family ID
        variant: Specific variant, or None for the family's default

    Returns:
        Tuple of (family_config, variant_config)

    Raises:
        ValueError: If model or variant not found
    """
    family = self._model_configs.get(model_id)
    if family is None:
        raise ValueError(f"Unknown model: {model_id}")

    chosen = variant if variant else family.default_variant
    variant_config = family.variants.get(chosen)
    if variant_config is None:
        raise ValueError(f"Unknown variant {chosen} for {model_id}")

    return family, variant_config
|
||||||
|
|
||||||
|
def get_loaded_models(self) -> list[dict[str, Any]]:
    """Report metadata for every model currently resident in memory.

    Returns:
        List of loaded model information dictionaries.
    """
    with self._lock:
        report = []
        for handle in self._handles.values():
            report.append({
                "model_id": handle.model_id,
                "variant": handle.variant,
                "loaded_at": handle.loaded_at,
                "last_accessed": handle.last_accessed,
                "ref_count": handle.ref_count,
                "idle_seconds": time.time() - handle.last_accessed,
            })
        return report
|
||||||
|
|
||||||
|
@contextmanager
def get_model(
    self, model_id: str, variant: Optional[str] = None
) -> Generator[BaseAudioModel, None, None]:
    """Get a model, loading it if necessary.

    Context manager that handles reference counting to prevent
    unloading during use. The lock is held only for the bookkeeping at
    entry and exit — NOT while the caller uses the model — so long
    generations do not serialize registry access.

    Args:
        model_id: Model family ID
        variant: Specific variant, or None for default

    Yields:
        Loaded model instance

    Raises:
        ValueError: If model not found or cannot be loaded
        RuntimeError: If VRAM insufficient
    """
    family, variant_config = self.get_model_config(model_id, variant)
    variant = variant or family.default_variant
    key = f"{model_id}/{variant}"

    with self._lock:
        # Get or load model
        if key not in self._handles:
            self._load_model(model_id, variant)

        handle = self._handles[key]
        # Pin the model: nonzero ref_count blocks eviction and idle unload.
        handle.ref_count += 1
        handle.touch()

        # Update LRU order (move key to the most-recently-used end).
        if key in self._access_order:
            self._access_order.remove(key)
        self._access_order.append(key)

    try:
        yield handle.model
    finally:
        # Re-acquire the lock to unpin; runs even if the caller raised.
        with self._lock:
            handle.ref_count -= 1
|
||||||
|
|
||||||
|
def _load_model(self, model_id: str, variant: str) -> None:
    """Load a model into memory.

    Must be called with self._lock held.

    If the VRAM budget is insufficient, idle models are evicted once and
    the budget is re-checked before giving up.

    Args:
        model_id: Model family ID
        variant: Variant to load

    Raises:
        ValueError: If no adapter registered
        RuntimeError: If VRAM insufficient even after eviction
    """
    key = f"{model_id}/{variant}"
    family, variant_config = self.get_model_config(model_id, variant)

    # Check for adapter
    if model_id not in self._adapter_classes:
        raise ValueError(f"No adapter registered for {model_id}")

    # Check VRAM
    can_load, reason = self.gpu_manager.can_load_model(variant_config.vram_mb)
    if not can_load:
        # Try to free memory by evicting models
        self._evict_for_space(variant_config.vram_mb)
        can_load, reason = self.gpu_manager.can_load_model(variant_config.vram_mb)
        if not can_load:
            raise RuntimeError(reason)

    # Create and load model
    logger.info(f"Loading model {key}...")
    adapter_class = self._adapter_classes[model_id]
    model = adapter_class(variant=variant)
    model.load()

    # Register handle
    self._handles[key] = ModelHandle(
        model=model,
        model_id=model_id,
        variant=variant,
        loaded_at=time.time(),
        last_accessed=time.time(),
    )
    self._access_order.append(key)

    # Update GPU status so the ComfyUI coordination file reflects our usage.
    mem = self.gpu_manager.get_memory_info()
    self.gpu_manager.update_status(mem["torch_allocated"], status="working")

    logger.info(f"Model {key} loaded successfully")
|
||||||
|
|
||||||
|
def _evict_for_space(self, needed_mb: int) -> bool:
    """Evict idle models (LRU order) until ``needed_mb`` fits in VRAM.

    Must be called with self._lock held.

    Args:
        needed_mb: VRAM needed, in megabytes

    Returns:
        True if enough space is available after eviction
    """
    budget = self.gpu_manager.get_available_budget()
    deficit = needed_mb - budget.available_mb
    if deficit <= 0:
        return True  # Already enough room; nothing to evict.

    # Walk models least-recently-used first; only models with no active
    # references are safe to drop. `deficit` tracks each variant's
    # *configured* VRAM footprint as a running estimate of what eviction
    # will free.
    for key in list(self._access_order):
        if deficit <= 0:
            break

        handle = self._handles.get(key)
        if handle and handle.ref_count == 0:
            _, variant_config = self.get_model_config(
                handle.model_id, handle.variant
            )
            logger.info(f"Evicting {key} to free {variant_config.vram_mb}MB")
            self._unload_model(key)
            deficit -= variant_config.vram_mb

    # Verify against the *actual* budget after cleanup rather than the
    # config-based estimate: configured vram_mb values are approximate,
    # so the estimate can over- or under-state what eviction really freed.
    # (Also removes the original's `freed` accumulator, which was never read.)
    self.gpu_manager.force_cleanup()
    return self.gpu_manager.get_available_budget().available_mb >= needed_mb
|
||||||
|
|
||||||
|
def _unload_model(self, key: str) -> None:
    """Unload one model and drop its bookkeeping.

    Must be called with self._lock held. Silently no-ops for unknown
    keys; refuses (with a warning) while the model is still referenced.

    Args:
        key: Model key (model_id/variant)
    """
    handle = self._handles.get(key)
    if handle is None:
        return

    if handle.ref_count > 0:
        logger.warning(f"Cannot unload {key}: {handle.ref_count} active references")
        return

    logger.info(f"Unloading model {key}")
    handle.model.unload()
    del self._handles[key]

    if key in self._access_order:
        self._access_order.remove(key)

    # Hand the freed memory back to the driver right away.
    self.gpu_manager.force_cleanup()
|
||||||
|
|
||||||
|
def unload_model(self, model_id: str, variant: Optional[str] = None) -> bool:
    """Manually unload a model.

    Args:
        model_id: Model family ID
        variant: Variant to unload, or None for all variants of the family

    Returns:
        True if at least one model was unloaded
    """
    with self._lock:
        if variant:
            key = f"{model_id}/{variant}"
            if key not in self._handles:
                return False
            self._unload_model(key)
            return True

        # No variant given: unload every loaded variant of this family.
        prefix = f"{model_id}/"
        matches = [k for k in self._handles if k.startswith(prefix)]
        for key in matches:
            self._unload_model(key)
        return bool(matches)
|
||||||
|
|
||||||
|
def preload_model(self, model_id: str, variant: Optional[str] = None) -> bool:
    """Preload a model into memory ahead of first use.

    Args:
        model_id: Model family ID
        variant: Variant to load, or None for the family's default

    Returns:
        True if the model is loaded (already resident or freshly loaded)
    """
    family, _ = self.get_model_config(model_id, variant)
    chosen = variant or family.default_variant
    key = f"{model_id}/{chosen}"

    with self._lock:
        if key in self._handles:
            return True  # Already loaded

        try:
            self._load_model(model_id, chosen)
        except Exception as e:
            logger.error(f"Failed to preload {key}: {e}")
            return False
        return True
|
||||||
|
|
||||||
|
def start_cleanup_thread(self) -> None:
    """Start background thread for idle model cleanup.

    Idempotent: a second call while the thread exists is a no-op. The
    thread is a daemon so it never blocks interpreter shutdown.
    """
    if self._cleanup_thread is not None:
        return

    def cleanup_loop():
        # Event.wait doubles as an interruptible sleep so that
        # stop_cleanup_thread() can wake the loop immediately.
        while not self._stop_cleanup.is_set():
            self._cleanup_idle_models()
            self._stop_cleanup.wait(60)  # Check every minute

    self._cleanup_thread = threading.Thread(target=cleanup_loop, daemon=True)
    self._cleanup_thread.start()
    logger.info("Started model cleanup thread")
|
||||||
|
|
||||||
|
def stop_cleanup_thread(self) -> None:
    """Stop the background cleanup thread, waiting up to 5s for it to exit."""
    worker = self._cleanup_thread
    if worker is None:
        return

    self._stop_cleanup.set()
    worker.join(timeout=5)
    self._cleanup_thread = None
    # Reset the event so a later start_cleanup_thread() call works again.
    self._stop_cleanup.clear()
|
||||||
|
|
||||||
|
def _cleanup_idle_models(self) -> None:
    """Unload models that have been idle too long.

    Models with active references are skipped. Iterates over a list()
    snapshot because _unload_model mutates self._handles mid-loop.
    """
    with self._lock:
        now = time.time()
        for key, handle in list(self._handles.items()):
            idle_time = now - handle.last_accessed
            if idle_time > self.idle_timeout_seconds and handle.ref_count == 0:
                logger.info(
                    f"Unloading idle model {key} (idle for {idle_time/60:.1f} min)"
                )
                self._unload_model(key)
|
||||||
|
|
||||||
|
def get_default_params(self) -> dict[str, Any]:
    """Get default generation parameters.

    Returns:
        A shallow copy of the defaults, so callers can mutate it freely.
    """
    return dict(self._default_params)
|
||||||
|
|
||||||
|
def __del__(self) -> None:
    """Best-effort cleanup on destruction.

    __del__ can run on a partially constructed instance (if __init__
    raised before the threading attributes were assigned), in which case
    stop_cleanup_thread() would raise AttributeError; exceptions escaping
    a finalizer are only printed to stderr, so swallow them instead.
    """
    try:
        self.stop_cleanup_thread()
    except Exception:
        pass
|
||||||
297
src/core/oom_handler.py
Normal file
297
src/core/oom_handler.py
Normal file
@@ -0,0 +1,297 @@
|
|||||||
|
"""OOM (Out of Memory) handling and recovery strategies."""
|
||||||
|
|
||||||
|
import functools
|
||||||
|
import gc
|
||||||
|
import logging
|
||||||
|
import time
|
||||||
|
from typing import Any, Callable, Optional, ParamSpec, TypeVar
|
||||||
|
|
||||||
|
import torch
|
||||||
|
|
||||||
|
from src.core.gpu_manager import GPUMemoryManager
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
P = ParamSpec("P")
|
||||||
|
R = TypeVar("R")
|
||||||
|
|
||||||
|
|
||||||
|
class OOMRecoveryError(Exception):
    """Raised when OOM recovery fails after all recovery strategies are exhausted."""
|
||||||
|
|
||||||
|
|
||||||
|
class OOMHandler:
|
||||||
|
"""Handles CUDA Out of Memory errors with multi-level recovery strategies.
|
||||||
|
|
||||||
|
Recovery levels:
|
||||||
|
1. Clear PyTorch CUDA cache
|
||||||
|
2. Evict unused models from registry
|
||||||
|
3. Request ComfyUI to yield VRAM
|
||||||
|
"""
|
||||||
|
|
||||||
|
def __init__(
    self,
    gpu_manager: GPUMemoryManager,
    model_registry: Optional[Any] = None,  # Avoid circular import
    max_retries: int = 3,
    retry_delay: float = 0.5,
):
    """Initialize OOM handler.

    The registry is optional here so gpu/oom modules can be constructed
    before the registry exists; wire it up later via set_model_registry().

    Args:
        gpu_manager: GPU memory manager instance
        model_registry: Optional model registry for eviction
        max_retries: Maximum recovery attempts
        retry_delay: Delay between retries in seconds
    """
    self.gpu_manager = gpu_manager
    self.model_registry = model_registry
    self.max_retries = max_retries
    self.retry_delay = retry_delay

    # Track OOM events for monitoring
    self._oom_count = 0
    self._last_oom_time: Optional[float] = None  # Epoch seconds of last OOM, if any
|
||||||
|
|
||||||
|
@property
def oom_count(self) -> int:
    """Number of CUDA OOM events this handler has caught since construction."""
    return self._oom_count
|
||||||
|
|
||||||
|
def set_model_registry(self, registry: Any) -> None:
    """Set model registry (to avoid circular import at init time).

    Enables the model-eviction recovery strategy once the registry exists.
    """
    self.model_registry = registry
|
||||||
|
|
||||||
|
def with_oom_recovery(self, func: Callable[P, R]) -> Callable[P, R]:
    """Decorator that wraps function with OOM recovery logic.

    The wrapped function is attempted up to max_retries + 1 times. Each
    CUDA OOM increments the handler's counters and triggers an escalating
    recovery strategy before the next attempt. Non-OOM exceptions
    propagate unchanged.

    Usage:
        @oom_handler.with_oom_recovery
        def generate_audio(...):
            ...

    Args:
        func: Function to wrap

    Returns:
        Wrapped function with OOM recovery

    Raises:
        OOMRecoveryError: (from the wrapper) when every attempt OOMs;
            the last OutOfMemoryError is attached as __cause__.
    """

    @functools.wraps(func)
    def wrapper(*args: P.args, **kwargs: P.kwargs) -> R:
        last_exception = None

        for attempt in range(self.max_retries + 1):
            try:
                if attempt > 0:
                    logger.info(f"Retry attempt {attempt}/{self.max_retries}")
                    # Brief pause so freed memory settles before retrying.
                    time.sleep(self.retry_delay)

                return func(*args, **kwargs)

            except torch.cuda.OutOfMemoryError as e:
                last_exception = e
                self._oom_count += 1
                self._last_oom_time = time.time()

                logger.warning(f"CUDA OOM detected (attempt {attempt + 1}): {e}")

                if attempt < self.max_retries:
                    # Escalate: attempt index selects how far down the
                    # strategy list to go.
                    self._execute_recovery_strategy(attempt)
                else:
                    logger.error(
                        f"OOM recovery failed after {self.max_retries} attempts"
                    )

        # Only reached when every attempt raised OutOfMemoryError.
        raise OOMRecoveryError(
            f"OOM recovery failed after {self.max_retries} attempts"
        ) from last_exception

    return wrapper
|
||||||
|
|
||||||
|
def _execute_recovery_strategy(self, level: int) -> None:
|
||||||
|
"""Execute recovery strategy based on severity level.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
level: Recovery level (0-2)
|
||||||
|
"""
|
||||||
|
strategies = [
|
||||||
|
self._strategy_clear_cache,
|
||||||
|
self._strategy_evict_models,
|
||||||
|
self._strategy_request_comfyui_yield,
|
||||||
|
]
|
||||||
|
|
||||||
|
# Execute all strategies up to and including current level
|
||||||
|
for i in range(min(level + 1, len(strategies))):
|
||||||
|
logger.info(f"Executing recovery strategy {i + 1}: {strategies[i].__name__}")
|
||||||
|
strategies[i]()
|
||||||
|
|
||||||
|
def _strategy_clear_cache(self) -> None:
|
||||||
|
"""Level 1: Clear PyTorch CUDA cache.
|
||||||
|
|
||||||
|
This is the fastest and least disruptive recovery strategy.
|
||||||
|
Clears cached memory that PyTorch holds for future allocations.
|
||||||
|
"""
|
||||||
|
logger.info("Clearing CUDA cache...")
|
||||||
|
|
||||||
|
gc.collect()
|
||||||
|
torch.cuda.empty_cache()
|
||||||
|
torch.cuda.synchronize()
|
||||||
|
|
||||||
|
# Reset peak memory stats for monitoring
|
||||||
|
torch.cuda.reset_peak_memory_stats()
|
||||||
|
|
||||||
|
freed = self.gpu_manager.force_cleanup()
|
||||||
|
logger.info(f"Cache cleared, freed approximately {freed}MB")
|
||||||
|
|
||||||
|
def _strategy_evict_models(self) -> None:
|
||||||
|
"""Level 2: Evict non-essential models from registry.
|
||||||
|
|
||||||
|
Unloads all models that don't have active references,
|
||||||
|
freeing their VRAM for the current operation.
|
||||||
|
"""
|
||||||
|
if self.model_registry is None:
|
||||||
|
logger.warning("No model registry available for eviction")
|
||||||
|
self._strategy_clear_cache()
|
||||||
|
return
|
||||||
|
|
||||||
|
logger.info("Evicting unused models...")
|
||||||
|
|
||||||
|
# Get list of loaded models
|
||||||
|
loaded = self.model_registry.get_loaded_models()
|
||||||
|
evicted = []
|
||||||
|
|
||||||
|
for model_info in loaded:
|
||||||
|
# Only evict models with no active references
|
||||||
|
if model_info["ref_count"] == 0:
|
||||||
|
model_id = model_info["model_id"]
|
||||||
|
variant = model_info["variant"]
|
||||||
|
logger.info(f"Evicting {model_id}/{variant}")
|
||||||
|
self.model_registry.unload_model(model_id, variant)
|
||||||
|
evicted.append(f"{model_id}/{variant}")
|
||||||
|
|
||||||
|
# Clear cache after eviction
|
||||||
|
self._strategy_clear_cache()
|
||||||
|
|
||||||
|
logger.info(f"Evicted {len(evicted)} model(s): {evicted}")
|
||||||
|
|
||||||
|
def _strategy_request_comfyui_yield(self) -> None:
|
||||||
|
"""Level 3: Request ComfyUI to yield VRAM.
|
||||||
|
|
||||||
|
Uses the coordination protocol to ask ComfyUI to
|
||||||
|
temporarily release GPU memory.
|
||||||
|
"""
|
||||||
|
logger.info("Requesting ComfyUI to yield VRAM...")
|
||||||
|
|
||||||
|
# First, evict our own models
|
||||||
|
self._strategy_evict_models()
|
||||||
|
|
||||||
|
# Calculate how much VRAM we need
|
||||||
|
budget = self.gpu_manager.get_available_budget()
|
||||||
|
needed = max(4096, budget.total_mb // 4) # Request at least 4GB or 25% of total
|
||||||
|
|
||||||
|
# Request priority from ComfyUI
|
||||||
|
success = self.gpu_manager.request_priority(needed, timeout=15.0)
|
||||||
|
|
||||||
|
if success:
|
||||||
|
logger.info("ComfyUI yielded VRAM successfully")
|
||||||
|
else:
|
||||||
|
logger.warning("ComfyUI did not yield VRAM within timeout")
|
||||||
|
|
||||||
|
# Final cache clear
|
||||||
|
self._strategy_clear_cache()
|
||||||
|
|
||||||
|
def recover_from_oom(self, level: int = 0) -> bool:
|
||||||
|
"""Manually trigger OOM recovery.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
level: Recovery level to execute (0-2)
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
True if recovery was successful (memory was freed)
|
||||||
|
"""
|
||||||
|
before = self.gpu_manager.get_memory_info()
|
||||||
|
|
||||||
|
self._execute_recovery_strategy(level)
|
||||||
|
|
||||||
|
after = self.gpu_manager.get_memory_info()
|
||||||
|
freed = before["used"] - after["used"]
|
||||||
|
|
||||||
|
logger.info(f"Manual recovery freed {freed}MB")
|
||||||
|
return freed > 0
|
||||||
|
|
||||||
|
def check_memory_for_operation(self, required_mb: int) -> bool:
|
||||||
|
"""Check if there's enough memory for an operation.
|
||||||
|
|
||||||
|
If not enough, attempts recovery strategies.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
required_mb: Memory required in megabytes
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
True if enough memory is available (possibly after recovery)
|
||||||
|
"""
|
||||||
|
budget = self.gpu_manager.get_available_budget()
|
||||||
|
|
||||||
|
if budget.available_mb >= required_mb:
|
||||||
|
return True
|
||||||
|
|
||||||
|
logger.info(
|
||||||
|
f"Need {required_mb}MB but only {budget.available_mb}MB available. "
|
||||||
|
"Attempting recovery..."
|
||||||
|
)
|
||||||
|
|
||||||
|
# Try progressively more aggressive recovery
|
||||||
|
for level in range(3):
|
||||||
|
self._execute_recovery_strategy(level)
|
||||||
|
budget = self.gpu_manager.get_available_budget()
|
||||||
|
|
||||||
|
if budget.available_mb >= required_mb:
|
||||||
|
logger.info(f"Recovery successful at level {level + 1}")
|
||||||
|
return True
|
||||||
|
|
||||||
|
logger.error(
|
||||||
|
f"Could not free enough memory. Need {required_mb}MB, "
|
||||||
|
f"have {budget.available_mb}MB"
|
||||||
|
)
|
||||||
|
return False
|
||||||
|
|
||||||
|
def get_stats(self) -> dict[str, Any]:
|
||||||
|
"""Get OOM handling statistics.
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Dictionary with OOM stats
|
||||||
|
"""
|
||||||
|
return {
|
||||||
|
"oom_count": self._oom_count,
|
||||||
|
"last_oom_time": self._last_oom_time,
|
||||||
|
"max_retries": self.max_retries,
|
||||||
|
"has_registry": self.model_registry is not None,
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
|
# Module-level convenience function
|
||||||
|
def oom_safe(
    gpu_manager: GPUMemoryManager,
    model_registry: Optional[Any] = None,
    max_retries: int = 3,
    retry_delay: float = 0.5,
) -> Callable[[Callable[P, R]], Callable[P, R]]:
    """Decorator factory for OOM-safe functions.

    Usage:
        @oom_safe(gpu_manager, model_registry)
        def generate_audio(...):
            ...

    Args:
        gpu_manager: GPU memory manager
        model_registry: Optional model registry for eviction
        max_retries: Maximum recovery attempts
        retry_delay: Delay between retries in seconds. Forwarded to
            OOMHandler; the default matches OOMHandler's own default, so
            existing callers are unaffected.

    Returns:
        Decorator function
    """
    # Fix: previously retry_delay could not be configured through this
    # convenience factory even though OOMHandler supports it.
    handler = OOMHandler(gpu_manager, model_registry, max_retries, retry_delay)
    return handler.with_oom_recovery
|
||||||
84
src/main.py
Normal file
84
src/main.py
Normal file
@@ -0,0 +1,84 @@
|
|||||||
|
"""AudioCraft Studio - Main Application Entry Point."""
|
||||||
|
|
||||||
|
import asyncio
|
||||||
|
import logging
|
||||||
|
import sys
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
# Add project root to path for imports
|
||||||
|
sys.path.insert(0, str(Path(__file__).parent.parent))
|
||||||
|
|
||||||
|
from config.settings import get_settings
|
||||||
|
from src.core.gpu_manager import GPUMemoryManager
|
||||||
|
from src.core.model_registry import ModelRegistry
|
||||||
|
from src.storage.database import Database
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
async def init_app():
    """Initialize application components.

    Configures logging, creates required directories, then builds the GPU
    manager, model registry, and database connection in dependency order.

    Returns:
        Dict with keys "settings", "gpu_manager", "registry", "database".
        The caller is responsible for closing the database when done.
    """
    settings = get_settings()

    # Configure logging
    # NOTE(review): assumes settings.log_level is a valid logging level name
    # (e.g. "INFO"); an invalid value raises AttributeError here — confirm
    # settings validation upstream.
    logging.basicConfig(
        level=getattr(logging, settings.log_level),
        format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
    )

    # Ensure directories exist
    settings.ensure_directories()

    # Initialize GPU manager (reserves headroom for a co-located ComfyUI)
    gpu_manager = GPUMemoryManager(
        comfyui_reserve_gb=settings.comfyui_reserve_gb,
        safety_buffer_gb=settings.safety_buffer_gb,
    )

    # Initialize model registry (registry borrows the GPU manager for
    # VRAM-aware load/unload decisions)
    registry = ModelRegistry(
        config_path=settings.models_config,
        gpu_manager=gpu_manager,
        max_cached_models=settings.max_cached_models,
        idle_timeout_minutes=settings.idle_unload_minutes,
    )

    # Initialize database
    db = Database(settings.database_path)
    await db.connect()

    logger.info("AudioCraft Studio initialized")
    logger.info(f"GPU Status: {gpu_manager.get_status()}")
    logger.info(f"Available models: {len(registry.list_models())}")

    return {
        "settings": settings,
        "gpu_manager": gpu_manager,
        "registry": registry,
        "database": db,
    }
|
||||||
|
|
||||||
|
|
||||||
|
def main():
    """Main entry point: print the roadmap banner and smoke-test init_app."""
    print("AudioCraft Studio - Starting...")
    print("Phase 1 core infrastructure is complete.")

    print("\nTo continue implementation:")
    remaining_phases = (
        " - Phase 2: Model adapters (musicgen, audiogen, magnet, style, jasco)",
        " - Phase 3: Services layer (generation, batch, project)",
        " - Phase 4: Gradio UI",
        " - Phase 5: REST API",
        " - Phase 6: Deployment",
    )
    for phase in remaining_phases:
        print(phase)

    # Quick initialization test: bring components up, report, tear down.
    async def test_init():
        components = await init_app()
        print(f"\nDatabase path: {components['settings'].database_path}")
        print(f"GPU status: {components['gpu_manager'].get_status()}")
        await components["database"].close()

    asyncio.run(test_init())
|
||||||
|
|
||||||
|
|
||||||
|
if __name__ == "__main__":
|
||||||
|
main()
|
||||||
32
src/models/__init__.py
Normal file
32
src/models/__init__.py
Normal file
@@ -0,0 +1,32 @@
|
|||||||
|
"""AudioCraft model adapters.
|
||||||
|
|
||||||
|
This module contains adapters that wrap AudioCraft's models with a
|
||||||
|
consistent interface for the application.
|
||||||
|
"""
|
||||||
|
|
||||||
|
from src.models.musicgen.adapter import MusicGenAdapter
|
||||||
|
from src.models.audiogen.adapter import AudioGenAdapter
|
||||||
|
from src.models.magnet.adapter import MAGNeTAdapter
|
||||||
|
from src.models.musicgen_style.adapter import MusicGenStyleAdapter
|
||||||
|
from src.models.jasco.adapter import JASCOAdapter
|
||||||
|
|
||||||
|
__all__ = [
|
||||||
|
"MusicGenAdapter",
|
||||||
|
"AudioGenAdapter",
|
||||||
|
"MAGNeTAdapter",
|
||||||
|
"MusicGenStyleAdapter",
|
||||||
|
"JASCOAdapter",
|
||||||
|
]
|
||||||
|
|
||||||
|
|
||||||
|
def register_all_adapters(registry) -> None:
    """Register all model adapters with the registry.

    Args:
        registry: ModelRegistry instance to register adapters with
    """
    # Mapping of registry id -> adapter class; insertion order preserves
    # the original registration order.
    adapters = {
        "musicgen": MusicGenAdapter,
        "audiogen": AudioGenAdapter,
        "magnet": MAGNeTAdapter,
        "musicgen-style": MusicGenStyleAdapter,
        "jasco": JASCOAdapter,
    }
    for model_id, adapter_cls in adapters.items():
        registry.register_adapter(model_id, adapter_cls)
|
||||||
5
src/models/audiogen/__init__.py
Normal file
5
src/models/audiogen/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""AudioGen model adapter."""
|
||||||
|
|
||||||
|
from src.models.audiogen.adapter import AudioGenAdapter
|
||||||
|
|
||||||
|
__all__ = ["AudioGenAdapter"]
|
||||||
203
src/models/audiogen/adapter.py
Normal file
203
src/models/audiogen/adapter.py
Normal file
@@ -0,0 +1,203 @@
|
|||||||
|
"""AudioGen model adapter for text-to-sound effects generation."""
|
||||||
|
|
||||||
|
import gc
|
||||||
|
import logging
|
||||||
|
import random
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
import torch
|
||||||
|
|
||||||
|
from src.core.base_model import (
|
||||||
|
BaseAudioModel,
|
||||||
|
ConditioningType,
|
||||||
|
GenerationRequest,
|
||||||
|
GenerationResult,
|
||||||
|
)
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class AudioGenAdapter(BaseAudioModel):
    """Adapter for Facebook's AudioGen model.

    Generates sound effects and environmental audio from text descriptions.
    Optimized for non-musical audio like sound effects, ambiences, and foley.
    """

    # Per-variant metadata: HF checkpoint id, VRAM estimate, duration cap,
    # and channel count, used for validation and memory budgeting.
    VARIANTS = {
        "medium": {
            "hf_id": "facebook/audiogen-medium",
            "vram_mb": 5000,
            "max_duration": 10,
            "channels": 1,
        },
    }

    def __init__(self, variant: str = "medium"):
        """Initialize AudioGen adapter.

        Args:
            variant: Model variant (currently only 'medium' available)

        Raises:
            ValueError: If the variant name is not in VARIANTS.
        """
        if variant not in self.VARIANTS:
            raise ValueError(
                f"Unknown AudioGen variant: {variant}. "
                f"Available: {list(self.VARIANTS.keys())}"
            )

        self._variant = variant
        self._config = self.VARIANTS[variant]
        # Model and device stay None until load() succeeds.
        self._model = None
        self._device: Optional[torch.device] = None

    @property
    def model_id(self) -> str:
        """Stable adapter identifier."""
        return "audiogen"

    @property
    def variant(self) -> str:
        """Variant name selected at construction time."""
        return self._variant

    @property
    def display_name(self) -> str:
        """Human-readable name for UI display."""
        return f"AudioGen ({self._variant})"

    @property
    def description(self) -> str:
        """Short description of what this model does."""
        return "Text-to-sound effects generation"

    @property
    def vram_estimate_mb(self) -> int:
        """Estimated VRAM usage in MB for this variant."""
        return self._config["vram_mb"]

    @property
    def max_duration(self) -> float:
        """Maximum generation duration in seconds for this variant."""
        return self._config["max_duration"]

    @property
    def sample_rate(self) -> int:
        """Output sample rate in Hz; reads the live model when loaded."""
        if self._model is not None:
            return self._model.sample_rate
        return 16000  # AudioGen default sample rate

    @property
    def supports_conditioning(self) -> list[ConditioningType]:
        """Conditioning types this adapter accepts (text only)."""
        return [ConditioningType.TEXT]

    @property
    def is_loaded(self) -> bool:
        """Whether the underlying model is currently in memory."""
        return self._model is not None

    @property
    def device(self) -> Optional[torch.device]:
        """Device the model is loaded on, or None when unloaded."""
        return self._device

    def load(self, device: str = "cuda") -> None:
        """Load the AudioGen model.

        No-op (with a warning) if already loaded. On failure, state is reset
        to unloaded and a RuntimeError is raised.
        """
        if self._model is not None:
            logger.warning(f"AudioGen {self._variant} already loaded")
            return

        logger.info(f"Loading AudioGen {self._variant} from {self._config['hf_id']}...")

        try:
            # Imported lazily so the adapter can be constructed without
            # audiocraft installed.
            from audiocraft.models import AudioGen

            self._device = torch.device(device)
            self._model = AudioGen.get_pretrained(self._config["hf_id"])
            self._model.to(self._device)

            logger.info(
                f"AudioGen {self._variant} loaded successfully "
                f"(sample_rate={self._model.sample_rate})"
            )

        except Exception as e:
            # Roll back to a clean unloaded state before re-raising.
            self._model = None
            self._device = None
            logger.error(f"Failed to load AudioGen {self._variant}: {e}")
            raise RuntimeError(f"Failed to load AudioGen: {e}") from e

    def unload(self) -> None:
        """Unload the model and free memory."""
        if self._model is None:
            return

        logger.info(f"Unloading AudioGen {self._variant}...")

        del self._model
        self._model = None
        self._device = None

        # Drop Python references first, then release cached CUDA blocks.
        gc.collect()
        if torch.cuda.is_available():
            torch.cuda.empty_cache()

    def generate(self, request: GenerationRequest) -> GenerationResult:
        """Generate sound effects from text prompts.

        Requires a prior successful load(); calling while unloaded will fail
        with AttributeError on self._model.

        Args:
            request: Generation parameters including prompts

        Returns:
            GenerationResult with audio tensor (on CPU) and metadata,
            including the seed actually used.
        """
        self.validate_request(request)

        # Set random seed (generate one when the caller did not pin it, so
        # the result is still reproducible from the returned metadata).
        seed = request.seed if request.seed is not None else random.randint(0, 2**32 - 1)
        torch.manual_seed(seed)
        if torch.cuda.is_available():
            torch.cuda.manual_seed(seed)

        # Configure generation
        self._model.set_generation_params(
            duration=request.duration,
            temperature=request.temperature,
            top_k=request.top_k,
            top_p=request.top_p,
            cfg_coef=request.cfg_coef,
        )

        logger.info(
            f"Generating {len(request.prompts)} sound effect(s) with AudioGen "
            f"(duration={request.duration}s)"
        )

        # Generate audio (inference_mode avoids autograd bookkeeping)
        with torch.inference_mode():
            audio = self._model.generate(request.prompts)

        # Actual duration may differ slightly from the requested one.
        actual_duration = audio.shape[-1] / self.sample_rate

        logger.info(
            f"Generated {audio.shape[0]} sample(s), "
            f"duration={actual_duration:.2f}s"
        )

        return GenerationResult(
            audio=audio.cpu(),
            sample_rate=self.sample_rate,
            duration=actual_duration,
            model_id=self.model_id,
            variant=self._variant,
            parameters={
                "duration": request.duration,
                "temperature": request.temperature,
                "top_k": request.top_k,
                "top_p": request.top_p,
                "cfg_coef": request.cfg_coef,
                "prompts": request.prompts,
            },
            seed=seed,
        )

    def get_default_params(self) -> dict[str, Any]:
        """Get default generation parameters."""
        return {
            "duration": 5.0,
            "temperature": 1.0,
            "top_k": 250,
            "top_p": 0.0,
            "cfg_coef": 3.0,
        }
|
||||||
5
src/models/jasco/__init__.py
Normal file
5
src/models/jasco/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""JASCO model adapter."""
|
||||||
|
|
||||||
|
from src.models.jasco.adapter import JASCOAdapter
|
||||||
|
|
||||||
|
__all__ = ["JASCOAdapter"]
|
||||||
348
src/models/jasco/adapter.py
Normal file
348
src/models/jasco/adapter.py
Normal file
@@ -0,0 +1,348 @@
|
|||||||
|
"""JASCO model adapter for chord and drum-conditioned music generation."""
|
||||||
|
|
||||||
|
import gc
|
||||||
|
import logging
|
||||||
|
import random
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
import torch
|
||||||
|
|
||||||
|
from src.core.base_model import (
|
||||||
|
BaseAudioModel,
|
||||||
|
ConditioningType,
|
||||||
|
GenerationRequest,
|
||||||
|
GenerationResult,
|
||||||
|
)
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class JASCOAdapter(BaseAudioModel):
    """Adapter for Facebook's JASCO model.

    JASCO (Joint Audio and Symbolic Conditioning) enables music generation
    with control over chord progressions and drum patterns alongside text.
    """

    # Per-variant metadata: HF checkpoint id, VRAM estimate, duration cap,
    # and channel count.
    VARIANTS = {
        "chords-drums-400M": {
            "hf_id": "facebook/jasco-chords-drums-400M",
            "vram_mb": 2000,
            "max_duration": 10,
            "channels": 1,
        },
        "chords-drums-1B": {
            "hf_id": "facebook/jasco-chords-drums-1B",
            "vram_mb": 4000,
            "max_duration": 10,
            "channels": 1,
        },
    }

    # Common chord types for validation
    # NOTE(review): not referenced anywhere in this class's visible code —
    # either wire it into chord validation or remove it.
    VALID_CHORD_TYPES = [
        "maj", "min", "dim", "aug", "7", "maj7", "min7", "dim7",
        "sus2", "sus4", "add9", "6", "min6", "9", "min9", "maj9",
    ]

    def __init__(self, variant: str = "chords-drums-400M"):
        """Initialize JASCO adapter.

        Args:
            variant: Model variant to use

        Raises:
            ValueError: If the variant name is not in VARIANTS.
        """
        if variant not in self.VARIANTS:
            raise ValueError(
                f"Unknown JASCO variant: {variant}. "
                f"Available: {list(self.VARIANTS.keys())}"
            )

        self._variant = variant
        self._config = self.VARIANTS[variant]
        # Model and device stay None until load() succeeds.
        self._model = None
        self._device: Optional[torch.device] = None

    @property
    def model_id(self) -> str:
        """Stable adapter identifier."""
        return "jasco"

    @property
    def variant(self) -> str:
        """Variant name selected at construction time."""
        return self._variant

    @property
    def display_name(self) -> str:
        """Human-readable name for UI display."""
        return f"JASCO ({self._variant})"

    @property
    def description(self) -> str:
        """Short description of what this model does."""
        return "Chord and drum-conditioned music generation"

    @property
    def vram_estimate_mb(self) -> int:
        """Estimated VRAM usage in MB for this variant."""
        return self._config["vram_mb"]

    @property
    def max_duration(self) -> float:
        """Maximum generation duration in seconds for this variant."""
        return self._config["max_duration"]

    @property
    def sample_rate(self) -> int:
        """Output sample rate in Hz; reads the live model when loaded."""
        if self._model is not None:
            return self._model.sample_rate
        # Fallback used before load(); presumably JASCO's native rate —
        # confirm against the loaded model.
        return 32000

    @property
    def supports_conditioning(self) -> list[ConditioningType]:
        """Conditioning types this adapter accepts."""
        return [ConditioningType.TEXT, ConditioningType.CHORDS, ConditioningType.DRUMS]

    @property
    def is_loaded(self) -> bool:
        """Whether the underlying model is currently in memory."""
        return self._model is not None

    @property
    def device(self) -> Optional[torch.device]:
        """Device the model is loaded on, or None when unloaded."""
        return self._device

    def load(self, device: str = "cuda") -> None:
        """Load the JASCO model.

        No-op (with a warning) if already loaded. On failure, state is reset
        to unloaded and a RuntimeError is raised.
        """
        if self._model is not None:
            logger.warning(f"JASCO {self._variant} already loaded")
            return

        logger.info(f"Loading JASCO {self._variant} from {self._config['hf_id']}...")

        try:
            # Imported lazily so the adapter can be constructed without
            # audiocraft installed.
            from audiocraft.models import JASCO

            self._device = torch.device(device)
            self._model = JASCO.get_pretrained(self._config["hf_id"])
            self._model.to(self._device)

            logger.info(
                f"JASCO {self._variant} loaded successfully "
                f"(sample_rate={self._model.sample_rate})"
            )

        except Exception as e:
            # Roll back to a clean unloaded state before re-raising.
            self._model = None
            self._device = None
            logger.error(f"Failed to load JASCO {self._variant}: {e}")
            raise RuntimeError(f"Failed to load JASCO: {e}") from e

    def unload(self) -> None:
        """Unload the model and free memory."""
        if self._model is None:
            return

        logger.info(f"Unloading JASCO {self._variant}...")

        del self._model
        self._model = None
        self._device = None

        # Drop Python references first, then release cached CUDA blocks.
        gc.collect()
        if torch.cuda.is_available():
            torch.cuda.empty_cache()

    @staticmethod
    def parse_chord_progression(
        chords: list[dict[str, Any]], duration: float
    ) -> list[tuple[float, float, str]]:
        """Parse chord progression from user input format.

        Args:
            chords: List of chord dictionaries with keys:
                - time: Start time in seconds
                - chord: Chord name (e.g., "C", "Am", "G7")
            duration: Total duration for calculating end times

        Returns:
            List of (start_time, end_time, chord_name) tuples; the last
            chord extends to `duration`. Empty input yields an empty list.

        Example input:
            [
                {"time": 0.0, "chord": "C"},
                {"time": 2.0, "chord": "Am"},
                {"time": 4.0, "chord": "F"},
                {"time": 6.0, "chord": "G"},
            ]
        """
        if not chords:
            return []

        # Sort by time (input order is not trusted)
        sorted_chords = sorted(chords, key=lambda x: x["time"])

        # Build (start, end, chord) tuples
        result = []
        for i, chord_info in enumerate(sorted_chords):
            start = chord_info["time"]
            # End time is either next chord's start or total duration
            if i + 1 < len(sorted_chords):
                end = sorted_chords[i + 1]["time"]
            else:
                end = duration
            result.append((start, end, chord_info["chord"]))

        return result

    @staticmethod
    def create_drum_pattern(
        pattern: str, duration: float, bpm: float = 120.0
    ) -> list[tuple[float, str]]:
        """Create drum events from a pattern string.

        Args:
            pattern: Pattern string (e.g., "kick,snare,kick,snare")
                or "4/4" for common time signature
            duration: Total duration in seconds
            bpm: Beats per minute

        Returns:
            List of (time, drum_type) tuples covering [0, duration).
        """
        beat_duration = 60.0 / bpm
        events = []

        if pattern in ["4/4", "common"]:
            # Standard 4/4 rock pattern: kick on 1, snare on 3, hihat on
            # every eighth-note pair; steps advance by half a beat.
            time = 0.0
            beat = 0
            while time < duration:
                if beat % 4 == 0:
                    events.append((time, "kick"))
                elif beat % 4 == 2:
                    events.append((time, "snare"))
                if beat % 2 == 0:
                    events.append((time, "hihat"))
                time += beat_duration / 2
                beat += 1
        else:
            # Parse comma-separated pattern, cycling through the listed
            # drums one per beat; blank entries are skipped.
            drum_types = pattern.split(",")
            time = 0.0
            idx = 0
            while time < duration:
                drum = drum_types[idx % len(drum_types)].strip()
                if drum:
                    events.append((time, drum))
                time += beat_duration
                idx += 1

        return events

    def generate(self, request: GenerationRequest) -> GenerationResult:
        """Generate music with chord and drum conditioning.

        Requires a prior successful load(); calling while unloaded will fail
        with AttributeError on self._model.

        Args:
            request: Generation parameters with optional conditioning:
                - chords: List of {"time": float, "chord": str} dicts
                - drums: Drum pattern string or list of (time, drum_type)
                - bpm: Beats per minute for drum pattern

        Returns:
            GenerationResult with audio tensor (on CPU) and metadata,
            including the seed actually used.
        """
        self.validate_request(request)

        # Set random seed (generate one when the caller did not pin it, so
        # the result is still reproducible from the returned metadata).
        seed = request.seed if request.seed is not None else random.randint(0, 2**32 - 1)
        torch.manual_seed(seed)
        if torch.cuda.is_available():
            torch.cuda.manual_seed(seed)

        # Configure generation parameters
        self._model.set_generation_params(
            duration=request.duration,
            temperature=request.temperature,
            top_k=request.top_k,
            top_p=request.top_p,
            cfg_coef=request.cfg_coef,
        )

        # Process chord conditioning: accept either the user dict format or
        # pre-built (start, end, chord) tuples.
        chords_input = request.conditioning.get("chords")
        chords_formatted = None
        if chords_input:
            if isinstance(chords_input, list) and len(chords_input) > 0:
                if isinstance(chords_input[0], dict):
                    chords_formatted = self.parse_chord_progression(
                        chords_input, request.duration
                    )
                else:
                    # Already in (start, end, chord) format
                    chords_formatted = chords_input

        # Process drum conditioning: a string is expanded into timed events;
        # anything else is passed through as-is.
        drums_input = request.conditioning.get("drums")
        bpm = request.conditioning.get("bpm", 120.0)
        drums_formatted = None
        if drums_input:
            if isinstance(drums_input, str):
                drums_formatted = self.create_drum_pattern(
                    drums_input, request.duration, bpm
                )
            else:
                drums_formatted = drums_input

        logger.info(
            f"Generating {len(request.prompts)} sample(s) with JASCO "
            f"(duration={request.duration}s, chords={chords_formatted is not None}, "
            f"drums={drums_formatted is not None})"
        )

        with torch.inference_mode():
            # Build conditioning dict for JASCO
            # NOTE(review): assumes the model's generate() accepts `chords`
            # and `drums` keyword arguments — confirm against the installed
            # audiocraft JASCO API.
            conditioning = {}
            if chords_formatted:
                conditioning["chords"] = chords_formatted
            if drums_formatted:
                conditioning["drums"] = drums_formatted

            if conditioning:
                audio = self._model.generate(
                    descriptions=request.prompts,
                    **conditioning,
                )
            else:
                # Generate without symbolic conditioning
                audio = self._model.generate(request.prompts)

        # Actual duration may differ slightly from the requested one.
        actual_duration = audio.shape[-1] / self.sample_rate

        logger.info(
            f"Generated {audio.shape[0]} sample(s), "
            f"duration={actual_duration:.2f}s"
        )

        return GenerationResult(
            audio=audio.cpu(),
            sample_rate=self.sample_rate,
            duration=actual_duration,
            model_id=self.model_id,
            variant=self._variant,
            parameters={
                "duration": request.duration,
                "temperature": request.temperature,
                "top_k": request.top_k,
                "top_p": request.top_p,
                "cfg_coef": request.cfg_coef,
                "prompts": request.prompts,
                "chords": chords_formatted,
                "drums": drums_formatted,
                "bpm": bpm,
            },
            seed=seed,
        )

    def get_default_params(self) -> dict[str, Any]:
        """Get default generation parameters for JASCO."""
        return {
            "duration": 10.0,
            "temperature": 1.0,
            "top_k": 250,
            "top_p": 0.0,
            "cfg_coef": 3.0,
            "bpm": 120.0,
        }
|
||||||
5
src/models/magnet/__init__.py
Normal file
5
src/models/magnet/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""MAGNeT model adapter."""
|
||||||
|
|
||||||
|
from src.models.magnet.adapter import MAGNeTAdapter
|
||||||
|
|
||||||
|
__all__ = ["MAGNeTAdapter"]
|
||||||
253
src/models/magnet/adapter.py
Normal file
253
src/models/magnet/adapter.py
Normal file
@@ -0,0 +1,253 @@
|
|||||||
|
"""MAGNeT model adapter for fast non-autoregressive audio generation."""
|
||||||
|
|
||||||
|
import gc
|
||||||
|
import logging
|
||||||
|
import random
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
import torch
|
||||||
|
|
||||||
|
from src.core.base_model import (
|
||||||
|
BaseAudioModel,
|
||||||
|
ConditioningType,
|
||||||
|
GenerationRequest,
|
||||||
|
GenerationResult,
|
||||||
|
)
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class MAGNeTAdapter(BaseAudioModel):
    """Adapter for Facebook's MAGNeT model.

    MAGNeT (Masked Audio Generation using Non-autoregressive Transformers)
    provides faster generation than autoregressive models like MusicGen.
    Supports both music and sound effect generation.
    """

    # Known variants mapped to their Hugging Face ids and resource estimates.
    # "audio-*" variants target sound effects; the rest target music.
    VARIANTS = {
        "small-10secs": {
            "hf_id": "facebook/magnet-small-10secs",
            "vram_mb": 1500,
            "max_duration": 10,
            "channels": 1,
            "audio_type": "music",
        },
        "medium-10secs": {
            "hf_id": "facebook/magnet-medium-10secs",
            "vram_mb": 5000,
            "max_duration": 10,
            "channels": 1,
            "audio_type": "music",
        },
        "small-30secs": {
            "hf_id": "facebook/magnet-small-30secs",
            "vram_mb": 1800,
            "max_duration": 30,
            "channels": 1,
            "audio_type": "music",
        },
        "medium-30secs": {
            "hf_id": "facebook/magnet-medium-30secs",
            "vram_mb": 6000,
            "max_duration": 30,
            "channels": 1,
            "audio_type": "music",
        },
        "audio-small-10secs": {
            "hf_id": "facebook/audio-magnet-small",
            "vram_mb": 1500,
            "max_duration": 10,
            "channels": 1,
            "audio_type": "sound",
        },
        "audio-medium-10secs": {
            "hf_id": "facebook/audio-magnet-medium",
            "vram_mb": 5000,
            "max_duration": 10,
            "channels": 1,
            "audio_type": "sound",
        },
    }

    def __init__(self, variant: str = "medium-10secs"):
        """Initialize MAGNeT adapter.

        Args:
            variant: Model variant to use

        Raises:
            ValueError: If variant is not one of VARIANTS.
        """
        if variant not in self.VARIANTS:
            raise ValueError(
                f"Unknown MAGNeT variant: {variant}. "
                f"Available: {list(self.VARIANTS.keys())}"
            )

        self._variant = variant
        self._config = self.VARIANTS[variant]
        self._model = None  # lazily created in load()
        self._device: Optional[torch.device] = None

    @property
    def model_id(self) -> str:
        return "magnet"

    @property
    def variant(self) -> str:
        return self._variant

    @property
    def display_name(self) -> str:
        return f"MAGNeT ({self._variant})"

    @property
    def description(self) -> str:
        audio_type = self._config.get("audio_type", "music")
        return f"Fast non-autoregressive {audio_type} generation"

    @property
    def vram_estimate_mb(self) -> int:
        return self._config["vram_mb"]

    @property
    def max_duration(self) -> float:
        return self._config["max_duration"]

    @property
    def sample_rate(self) -> int:
        # Report the real rate once loaded; 32 kHz is the fallback before that.
        if self._model is not None:
            return self._model.sample_rate
        return 32000

    @property
    def supports_conditioning(self) -> list[ConditioningType]:
        return [ConditioningType.TEXT]

    @property
    def is_loaded(self) -> bool:
        return self._model is not None

    @property
    def device(self) -> Optional[torch.device]:
        return self._device

    def load(self, device: str = "cuda") -> None:
        """Load the MAGNeT model.

        Args:
            device: Target device ('cuda', 'cuda:0', 'cpu', etc.)

        Raises:
            RuntimeError: If the model fails to load.
        """
        if self._model is not None:
            logger.warning(f"MAGNeT {self._variant} already loaded")
            return

        logger.info(f"Loading MAGNeT {self._variant} from {self._config['hf_id']}...")

        try:
            from audiocraft.models import MAGNeT

            self._device = torch.device(device)
            # Pass the device to get_pretrained: audiocraft's generation
            # wrappers are not nn.Modules and do not implement `.to()`,
            # so `get_pretrained(...).to(device)` would raise AttributeError.
            self._model = MAGNeT.get_pretrained(
                self._config["hf_id"], device=device
            )

            logger.info(
                f"MAGNeT {self._variant} loaded successfully "
                f"(sample_rate={self._model.sample_rate})"
            )

        except Exception as e:
            # Reset state so a failed load leaves the adapter unloaded.
            self._model = None
            self._device = None
            logger.error(f"Failed to load MAGNeT {self._variant}: {e}")
            raise RuntimeError(f"Failed to load MAGNeT: {e}") from e

    def unload(self) -> None:
        """Unload the model and free memory."""
        if self._model is None:
            return

        logger.info(f"Unloading MAGNeT {self._variant}...")

        del self._model
        self._model = None
        self._device = None

        # Release Python references first, then return cached CUDA blocks.
        gc.collect()
        if torch.cuda.is_available():
            torch.cuda.empty_cache()

    def generate(self, request: GenerationRequest) -> GenerationResult:
        """Generate audio from text prompts using MAGNeT.

        MAGNeT uses a non-autoregressive approach with iterative decoding,
        which is significantly faster than autoregressive models.

        Args:
            request: Generation parameters including prompts

        Returns:
            GenerationResult with audio tensor and metadata
        """
        self.validate_request(request)

        # Set random seed for reproducibility.
        seed = request.seed if request.seed is not None else random.randint(0, 2**32 - 1)
        torch.manual_seed(seed)
        if torch.cuda.is_available():
            # Seed every visible GPU, not just the current one.
            torch.cuda.manual_seed_all(seed)

        # Configure generation parameters.
        # MAGNeT has different parameters than MusicGen: it decodes in four
        # masked-refinement stages, each with its own step count.
        self._model.set_generation_params(
            duration=request.duration,
            temperature=request.temperature,
            top_k=request.top_k,
            top_p=request.top_p,
            cfg_coef=request.cfg_coef,
            # MAGNeT-specific parameters
            decoding_steps=[
                int(request.conditioning.get("decoding_steps_1", 20)),
                int(request.conditioning.get("decoding_steps_2", 10)),
                int(request.conditioning.get("decoding_steps_3", 10)),
                int(request.conditioning.get("decoding_steps_4", 10)),
            ],
            span_arrangement=request.conditioning.get("span_arrangement", "nonoverlap"),
        )

        logger.info(
            f"Generating {len(request.prompts)} sample(s) with MAGNeT {self._variant} "
            f"(duration={request.duration}s)"
        )

        # Generate audio (no grads needed for pure inference).
        with torch.inference_mode():
            audio = self._model.generate(request.prompts)

        actual_duration = audio.shape[-1] / self.sample_rate

        logger.info(
            f"Generated {audio.shape[0]} sample(s), "
            f"duration={actual_duration:.2f}s"
        )

        return GenerationResult(
            audio=audio.cpu(),
            sample_rate=self.sample_rate,
            duration=actual_duration,
            model_id=self.model_id,
            variant=self._variant,
            parameters={
                "duration": request.duration,
                "temperature": request.temperature,
                "top_k": request.top_k,
                "top_p": request.top_p,
                "cfg_coef": request.cfg_coef,
                "prompts": request.prompts,
            },
            seed=seed,
        )

    def get_default_params(self) -> dict[str, Any]:
        """Get default generation parameters for MAGNeT."""
        return {
            "duration": 10.0,
            "temperature": 3.0,  # MAGNeT works better with higher temperature
            "top_k": 0,  # Use top_p instead for MAGNeT
            "top_p": 0.9,
            "cfg_coef": 3.0,
        }
|
||||||
5
src/models/musicgen/__init__.py
Normal file
5
src/models/musicgen/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""MusicGen model adapter."""
|
||||||
|
|
||||||
|
from src.models.musicgen.adapter import MusicGenAdapter
|
||||||
|
|
||||||
|
__all__ = ["MusicGenAdapter"]
|
||||||
290
src/models/musicgen/adapter.py
Normal file
290
src/models/musicgen/adapter.py
Normal file
@@ -0,0 +1,290 @@
|
|||||||
|
"""MusicGen model adapter for text-to-music generation."""
|
||||||
|
|
||||||
|
import gc
|
||||||
|
import logging
|
||||||
|
import random
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
import torch
|
||||||
|
|
||||||
|
from src.core.base_model import (
|
||||||
|
BaseAudioModel,
|
||||||
|
ConditioningType,
|
||||||
|
GenerationRequest,
|
||||||
|
GenerationResult,
|
||||||
|
)
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class MusicGenAdapter(BaseAudioModel):
    """Adapter for Facebook's MusicGen model.

    Supports text-to-music generation with optional melody conditioning.
    Available variants: small, medium, large, melody, and stereo versions.
    """

    # Variant configurations: Hugging Face id, VRAM estimate, duration cap,
    # channel count, and any extra conditioning modes beyond text.
    VARIANTS = {
        "small": {
            "hf_id": "facebook/musicgen-small",
            "vram_mb": 1500,
            "max_duration": 30,
            "channels": 1,
            "conditioning": [],
        },
        "medium": {
            "hf_id": "facebook/musicgen-medium",
            "vram_mb": 5000,
            "max_duration": 30,
            "channels": 1,
            "conditioning": [],
        },
        "large": {
            "hf_id": "facebook/musicgen-large",
            "vram_mb": 10000,
            "max_duration": 30,
            "channels": 1,
            "conditioning": [],
        },
        "melody": {
            "hf_id": "facebook/musicgen-melody",
            "vram_mb": 5000,
            "max_duration": 30,
            "channels": 1,
            "conditioning": [ConditioningType.MELODY],
        },
        "stereo-small": {
            "hf_id": "facebook/musicgen-stereo-small",
            "vram_mb": 1800,
            "max_duration": 30,
            "channels": 2,
            "conditioning": [],
        },
        "stereo-medium": {
            "hf_id": "facebook/musicgen-stereo-medium",
            "vram_mb": 6000,
            "max_duration": 30,
            "channels": 2,
            "conditioning": [],
        },
        "stereo-large": {
            "hf_id": "facebook/musicgen-stereo-large",
            "vram_mb": 12000,
            "max_duration": 30,
            "channels": 2,
            "conditioning": [],
        },
        "stereo-melody": {
            "hf_id": "facebook/musicgen-stereo-melody",
            "vram_mb": 6000,
            "max_duration": 30,
            "channels": 2,
            "conditioning": [ConditioningType.MELODY],
        },
    }

    def __init__(self, variant: str = "medium"):
        """Initialize MusicGen adapter.

        Args:
            variant: Model variant to use (small, medium, large, melody, etc.)

        Raises:
            ValueError: If variant is not recognized
        """
        if variant not in self.VARIANTS:
            raise ValueError(
                f"Unknown MusicGen variant: {variant}. "
                f"Available: {list(self.VARIANTS.keys())}"
            )

        self._variant = variant
        self._config = self.VARIANTS[variant]
        self._model = None  # lazily created in load()
        self._device: Optional[torch.device] = None

    @property
    def model_id(self) -> str:
        return "musicgen"

    @property
    def variant(self) -> str:
        return self._variant

    @property
    def display_name(self) -> str:
        return f"MusicGen ({self._variant})"

    @property
    def description(self) -> str:
        if "melody" in self._variant:
            return "Text-to-music with melody conditioning"
        elif "stereo" in self._variant:
            return "Stereo text-to-music generation"
        return "Text-to-music generation"

    @property
    def vram_estimate_mb(self) -> int:
        return self._config["vram_mb"]

    @property
    def max_duration(self) -> float:
        return self._config["max_duration"]

    @property
    def sample_rate(self) -> int:
        if self._model is not None:
            return self._model.sample_rate
        return 32000  # Default MusicGen sample rate

    @property
    def supports_conditioning(self) -> list[ConditioningType]:
        # Text is always supported; melody variants add MELODY.
        return [ConditioningType.TEXT] + self._config["conditioning"]

    @property
    def is_loaded(self) -> bool:
        return self._model is not None

    @property
    def device(self) -> Optional[torch.device]:
        return self._device

    def load(self, device: str = "cuda") -> None:
        """Load the MusicGen model.

        Args:
            device: Target device ('cuda', 'cuda:0', 'cpu', etc.)

        Raises:
            RuntimeError: If the model fails to load.
        """
        if self._model is not None:
            logger.warning(f"MusicGen {self._variant} already loaded")
            return

        logger.info(f"Loading MusicGen {self._variant} from {self._config['hf_id']}...")

        try:
            from audiocraft.models import MusicGen

            self._device = torch.device(device)
            # Pass the device to get_pretrained: audiocraft's MusicGen wrapper
            # is not an nn.Module and does not implement `.to()`, so
            # `get_pretrained(...).to(device)` would raise AttributeError.
            self._model = MusicGen.get_pretrained(
                self._config["hf_id"], device=device
            )

            logger.info(
                f"MusicGen {self._variant} loaded successfully "
                f"(sample_rate={self._model.sample_rate})"
            )

        except Exception as e:
            # Reset state so a failed load leaves the adapter unloaded.
            self._model = None
            self._device = None
            logger.error(f"Failed to load MusicGen {self._variant}: {e}")
            raise RuntimeError(f"Failed to load MusicGen: {e}") from e

    def unload(self) -> None:
        """Unload the model and free memory."""
        if self._model is None:
            return

        logger.info(f"Unloading MusicGen {self._variant}...")

        del self._model
        self._model = None
        self._device = None

        # Release Python references first, then return cached CUDA blocks.
        gc.collect()
        if torch.cuda.is_available():
            torch.cuda.empty_cache()

    def generate(self, request: GenerationRequest) -> GenerationResult:
        """Generate music from text prompts.

        Args:
            request: Generation parameters including prompts

        Returns:
            GenerationResult with audio tensor and metadata

        Raises:
            RuntimeError: If model not loaded
            ValueError: If request is invalid
        """
        self.validate_request(request)

        # Set random seed for reproducibility
        seed = request.seed if request.seed is not None else random.randint(0, 2**32 - 1)
        torch.manual_seed(seed)
        if torch.cuda.is_available():
            # Seed every visible GPU, not just the current one.
            torch.cuda.manual_seed_all(seed)

        # Configure generation parameters
        self._model.set_generation_params(
            duration=request.duration,
            temperature=request.temperature,
            top_k=request.top_k,
            top_p=request.top_p,
            cfg_coef=request.cfg_coef,
        )

        logger.info(
            f"Generating {len(request.prompts)} sample(s) with MusicGen {self._variant} "
            f"(duration={request.duration}s, temp={request.temperature})"
        )

        # Generate audio (no grads needed for pure inference).
        with torch.inference_mode():
            melody_audio = request.conditioning.get("melody")
            melody_sr = request.conditioning.get("melody_sr", self.sample_rate)

            if melody_audio is not None and ConditioningType.MELODY in self.supports_conditioning:
                # Melody-conditioned generation
                if isinstance(melody_audio, str):
                    # Load from file path
                    import torchaudio

                    melody_tensor, melody_sr = torchaudio.load(melody_audio)
                    melody_tensor = melody_tensor.to(self._device)
                else:
                    # as_tensor avoids the copy (and the UserWarning) that
                    # torch.tensor() emits when the input is already a tensor.
                    melody_tensor = torch.as_tensor(melody_audio).to(self._device)

                audio = self._model.generate_with_chroma(
                    descriptions=request.prompts,
                    # generate_with_chroma expects a batch dimension.
                    melody_wavs=melody_tensor.unsqueeze(0) if melody_tensor.dim() == 1 else melody_tensor,
                    melody_sample_rate=melody_sr,
                )
            else:
                # Standard text-to-music generation
                audio = self._model.generate(request.prompts)

        # audio shape: [batch, channels, samples]
        actual_duration = audio.shape[-1] / self.sample_rate

        logger.info(
            f"Generated {audio.shape[0]} sample(s), "
            f"duration={actual_duration:.2f}s, shape={audio.shape}"
        )

        return GenerationResult(
            audio=audio.cpu(),
            sample_rate=self.sample_rate,
            duration=actual_duration,
            model_id=self.model_id,
            variant=self._variant,
            parameters={
                "duration": request.duration,
                "temperature": request.temperature,
                "top_k": request.top_k,
                "top_p": request.top_p,
                "cfg_coef": request.cfg_coef,
                "prompts": request.prompts,
            },
            seed=seed,
        )

    def get_default_params(self) -> dict[str, Any]:
        """Get default generation parameters."""
        return {
            "duration": 10.0,
            "temperature": 1.0,
            "top_k": 250,
            "top_p": 0.0,
            "cfg_coef": 3.0,
        }
|
||||||
5
src/models/musicgen_style/__init__.py
Normal file
5
src/models/musicgen_style/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""MusicGen Style model adapter."""
|
||||||
|
|
||||||
|
from src.models.musicgen_style.adapter import MusicGenStyleAdapter
|
||||||
|
|
||||||
|
__all__ = ["MusicGenStyleAdapter"]
|
||||||
277
src/models/musicgen_style/adapter.py
Normal file
277
src/models/musicgen_style/adapter.py
Normal file
@@ -0,0 +1,277 @@
|
|||||||
|
"""MusicGen Style model adapter for style-conditioned music generation."""
|
||||||
|
|
||||||
|
import gc
|
||||||
|
import logging
|
||||||
|
import random
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
import torch
|
||||||
|
import torchaudio
|
||||||
|
|
||||||
|
from src.core.base_model import (
|
||||||
|
BaseAudioModel,
|
||||||
|
ConditioningType,
|
||||||
|
GenerationRequest,
|
||||||
|
GenerationResult,
|
||||||
|
)
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class MusicGenStyleAdapter(BaseAudioModel):
    """Adapter for Facebook's MusicGen Style model.

    Generates music conditioned on both text and a style reference audio.
    Extracts style features from the reference and applies them to new generations.
    """

    # Only one public checkpoint exists for the style model.
    VARIANTS = {
        "medium": {
            "hf_id": "facebook/musicgen-style",
            "vram_mb": 5000,
            "max_duration": 30,
            "channels": 1,
        },
    }

    def __init__(self, variant: str = "medium"):
        """Initialize MusicGen Style adapter.

        Args:
            variant: Model variant (currently only 'medium' available)

        Raises:
            ValueError: If variant is not recognized.
        """
        if variant not in self.VARIANTS:
            raise ValueError(
                f"Unknown MusicGen Style variant: {variant}. "
                f"Available: {list(self.VARIANTS.keys())}"
            )

        self._variant = variant
        self._config = self.VARIANTS[variant]
        self._model = None  # lazily created in load()
        self._device: Optional[torch.device] = None

    @property
    def model_id(self) -> str:
        return "musicgen-style"

    @property
    def variant(self) -> str:
        return self._variant

    @property
    def display_name(self) -> str:
        return f"MusicGen Style ({self._variant})"

    @property
    def description(self) -> str:
        return "Style-conditioned music generation from reference audio"

    @property
    def vram_estimate_mb(self) -> int:
        return self._config["vram_mb"]

    @property
    def max_duration(self) -> float:
        return self._config["max_duration"]

    @property
    def sample_rate(self) -> int:
        if self._model is not None:
            return self._model.sample_rate
        return 32000

    @property
    def supports_conditioning(self) -> list[ConditioningType]:
        return [ConditioningType.TEXT, ConditioningType.STYLE]

    @property
    def is_loaded(self) -> bool:
        return self._model is not None

    @property
    def device(self) -> Optional[torch.device]:
        return self._device

    def load(self, device: str = "cuda") -> None:
        """Load the MusicGen Style model.

        Args:
            device: Target device ('cuda', 'cuda:0', 'cpu', etc.)

        Raises:
            RuntimeError: If the model fails to load.
        """
        if self._model is not None:
            logger.warning(f"MusicGen Style {self._variant} already loaded")
            return

        logger.info(f"Loading MusicGen Style {self._variant}...")

        try:
            from audiocraft.models import MusicGen

            self._device = torch.device(device)
            # Pass the device to get_pretrained: audiocraft's MusicGen wrapper
            # is not an nn.Module and does not implement `.to()`, so
            # `get_pretrained(...).to(device)` would raise AttributeError.
            self._model = MusicGen.get_pretrained(
                self._config["hf_id"], device=device
            )

            logger.info(
                f"MusicGen Style {self._variant} loaded successfully "
                f"(sample_rate={self._model.sample_rate})"
            )

        except Exception as e:
            # Reset state so a failed load leaves the adapter unloaded.
            self._model = None
            self._device = None
            logger.error(f"Failed to load MusicGen Style {self._variant}: {e}")
            raise RuntimeError(f"Failed to load MusicGen Style: {e}") from e

    def unload(self) -> None:
        """Unload the model and free memory."""
        if self._model is None:
            return

        logger.info(f"Unloading MusicGen Style {self._variant}...")

        del self._model
        self._model = None
        self._device = None

        # Release Python references first, then return cached CUDA blocks.
        gc.collect()
        if torch.cuda.is_available():
            torch.cuda.empty_cache()

    def _load_style_audio(
        self, style_input: Any, target_sr: int
    ) -> tuple[torch.Tensor, int]:
        """Load and prepare style reference audio.

        Args:
            style_input: File path, tensor, or numpy array
            target_sr: Target sample rate

        Returns:
            Tuple of (audio_tensor, sample_rate)
        """
        if isinstance(style_input, str):
            # Load from file and resample to the model rate if needed.
            audio, sr = torchaudio.load(style_input)
            if sr != target_sr:
                audio = torchaudio.functional.resample(audio, sr, target_sr)
            return audio.to(self._device), target_sr
        elif isinstance(style_input, torch.Tensor):
            # NOTE(review): tensor/array inputs are assumed to already be at
            # target_sr — no resampling is possible without a declared rate.
            return style_input.to(self._device), target_sr
        else:
            # Assume numpy array
            return torch.as_tensor(style_input).to(self._device), target_sr

    def generate(self, request: GenerationRequest) -> GenerationResult:
        """Generate music conditioned on text and style reference.

        Args:
            request: Generation parameters including prompts and style conditioning

        Returns:
            GenerationResult with audio tensor and metadata

        Note:
            Style conditioning requires 'style' in request.conditioning with either:
            - File path to audio
            - Audio tensor
            - Numpy array
        """
        self.validate_request(request)

        # Set random seed for reproducibility.
        seed = request.seed if request.seed is not None else random.randint(0, 2**32 - 1)
        torch.manual_seed(seed)
        if torch.cuda.is_available():
            # Seed every visible GPU, not just the current one.
            torch.cuda.manual_seed_all(seed)

        # Get style conditioning parameters
        style_audio = request.conditioning.get("style")
        eval_q = request.conditioning.get("eval_q", 3)
        excerpt_length = request.conditioning.get("excerpt_length", 3.0)

        # Configure generation parameters
        self._model.set_generation_params(
            duration=request.duration,
            temperature=request.temperature,
            top_k=request.top_k,
            top_p=request.top_p,
            cfg_coef=request.cfg_coef,
        )

        logger.info(
            f"Generating {len(request.prompts)} sample(s) with MusicGen Style "
            f"(duration={request.duration}s, style_conditioned={style_audio is not None})"
        )

        with torch.inference_mode():
            if style_audio is not None:
                # Load style reference
                style_tensor, style_sr = self._load_style_audio(
                    style_audio, self.sample_rate
                )

                # Ensure proper shape [batch, channels, samples]
                if style_tensor.dim() == 1:
                    style_tensor = style_tensor.unsqueeze(0).unsqueeze(0)
                elif style_tensor.dim() == 2:
                    style_tensor = style_tensor.unsqueeze(0)

                # Set style conditioner parameters (only on checkpoints that
                # expose the style conditioner API).
                if hasattr(self._model, 'set_style_conditioner_params'):
                    self._model.set_style_conditioner_params(
                        eval_q=eval_q,
                        excerpt_length=excerpt_length,
                    )

                # Generate with style conditioning.
                # Expand style to match number of prompts if needed.
                if style_tensor.shape[0] == 1 and len(request.prompts) > 1:
                    style_tensor = style_tensor.expand(len(request.prompts), -1, -1)

                audio = self._model.generate_with_chroma(
                    descriptions=request.prompts,
                    melody_wavs=style_tensor,
                    melody_sample_rate=style_sr,
                )
            else:
                # Generate without style (falls back to standard MusicGen behavior)
                logger.warning(
                    "No style reference provided, generating without style conditioning"
                )
                audio = self._model.generate(request.prompts)

        actual_duration = audio.shape[-1] / self.sample_rate

        logger.info(
            f"Generated {audio.shape[0]} sample(s), "
            f"duration={actual_duration:.2f}s"
        )

        return GenerationResult(
            audio=audio.cpu(),
            sample_rate=self.sample_rate,
            duration=actual_duration,
            model_id=self.model_id,
            variant=self._variant,
            parameters={
                "duration": request.duration,
                "temperature": request.temperature,
                "top_k": request.top_k,
                "top_p": request.top_p,
                "cfg_coef": request.cfg_coef,
                "prompts": request.prompts,
                "style_conditioned": style_audio is not None,
                "eval_q": eval_q,
                "excerpt_length": excerpt_length,
            },
            seed=seed,
        )

    def get_default_params(self) -> dict[str, Any]:
        """Get default generation parameters for MusicGen Style."""
        return {
            "duration": 10.0,
            "temperature": 1.0,
            "top_k": 250,
            "top_p": 0.0,
            "cfg_coef": 3.0,
            "eval_q": 3,
            "excerpt_length": 3.0,
        }
|
||||||
13
src/services/__init__.py
Normal file
13
src/services/__init__.py
Normal file
@@ -0,0 +1,13 @@
|
|||||||
|
"""Services layer for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.services.generation_service import GenerationService
|
||||||
|
from src.services.batch_processor import BatchProcessor, GenerationJob, JobStatus
|
||||||
|
from src.services.project_service import ProjectService
|
||||||
|
|
||||||
|
__all__ = [
|
||||||
|
"GenerationService",
|
||||||
|
"BatchProcessor",
|
||||||
|
"GenerationJob",
|
||||||
|
"JobStatus",
|
||||||
|
"ProjectService",
|
||||||
|
]
|
||||||
397
src/services/batch_processor.py
Normal file
397
src/services/batch_processor.py
Normal file
@@ -0,0 +1,397 @@
|
|||||||
|
"""Batch processor for queued audio generation jobs."""
|
||||||
|
|
||||||
|
import asyncio
|
||||||
|
import logging
|
||||||
|
import time
|
||||||
|
import uuid
|
||||||
|
from dataclasses import dataclass, field
|
||||||
|
from datetime import datetime
|
||||||
|
from enum import Enum
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class JobStatus(str, Enum):
    """Status of a generation job.

    Subclasses ``str`` so member values compare/serialize as plain strings
    (``to_dict`` emits ``self.status.value`` in API responses).
    """

    PENDING = "pending"        # queued, not yet picked up
    PROCESSING = "processing"  # currently being generated
    COMPLETED = "completed"    # finished successfully
    FAILED = "failed"          # ended with an error (see GenerationJob.error)
    CANCELLED = "cancelled"    # cancelled before completion
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class GenerationJob:
    """A queued generation job.

    Immutable request fields come first (no defaults); mutable status and
    result fields follow with defaults so the batch processor can update
    them in place as the job advances.
    """

    # Request (set once at creation)
    id: str                       # "job_<12 hex chars>", assigned by create()
    model_id: str                 # adapter id, e.g. "musicgen"
    variant: Optional[str]        # adapter variant, e.g. "medium"
    prompts: list[str]
    parameters: dict[str, Any]    # duration/temperature/top_k/top_p/cfg_coef/seed
    conditioning: dict[str, Any]  # model-specific extras (melody, style, ...)
    project_id: Optional[str]
    preset_used: Optional[str]
    tags: list[str]

    # Status tracking
    status: JobStatus = JobStatus.PENDING
    progress: float = 0.0         # 0.0 .. 1.0 fraction — TODO confirm scale against processor
    progress_message: str = ""
    # NOTE(review): datetime.utcnow is deprecated since Python 3.12; switching
    # to datetime.now(timezone.utc) would make isoformat() emit an offset,
    # which may affect API consumers — confirm before changing.
    created_at: datetime = field(default_factory=datetime.utcnow)
    started_at: Optional[datetime] = None
    completed_at: Optional[datetime] = None

    # Results
    result_id: Optional[str] = None  # Generation ID if completed
    audio_path: Optional[str] = None
    error: Optional[str] = None      # failure reason when status is FAILED

    @classmethod
    def create(
        cls,
        model_id: str,
        variant: Optional[str],
        prompts: list[str],
        duration: float = 10.0,
        temperature: float = 1.0,
        top_k: int = 250,
        top_p: float = 0.0,
        cfg_coef: float = 3.0,
        seed: Optional[int] = None,
        conditioning: Optional[dict[str, Any]] = None,
        project_id: Optional[str] = None,
        preset_used: Optional[str] = None,
        tags: Optional[list[str]] = None,
    ) -> "GenerationJob":
        """Create a new generation job.

        Packs the flat generation arguments into the `parameters` dict and
        assigns a fresh random job id; all status fields keep their defaults.
        """
        return cls(
            id=f"job_{uuid.uuid4().hex[:12]}",
            model_id=model_id,
            variant=variant,
            prompts=prompts,
            parameters={
                "duration": duration,
                "temperature": temperature,
                "top_k": top_k,
                "top_p": top_p,
                "cfg_coef": cfg_coef,
                "seed": seed,
            },
            conditioning=conditioning or {},
            project_id=project_id,
            preset_used=preset_used,
            tags=tags or [],
        )

    def to_dict(self) -> dict[str, Any]:
        """Convert job to dictionary for API responses.

        Datetimes are ISO-8601 strings (or None); status is its string value.
        Note: `conditioning`, `project_id`, `preset_used`, and `tags` are
        intentionally not included in the response payload.
        """
        return {
            "id": self.id,
            "model_id": self.model_id,
            "variant": self.variant,
            "prompts": self.prompts,
            "parameters": self.parameters,
            "status": self.status.value,
            "progress": self.progress,
            "progress_message": self.progress_message,
            "created_at": self.created_at.isoformat(),
            "started_at": self.started_at.isoformat() if self.started_at else None,
            "completed_at": self.completed_at.isoformat() if self.completed_at else None,
            "result_id": self.result_id,
            "audio_path": self.audio_path,
            "error": self.error,
        }
|
||||||
|
|
||||||
|
|
||||||
|
class BatchProcessor:
    """Manages a queue of generation jobs.

    Features:
    - Async job queue with configurable concurrency
    - Progress tracking and callbacks
    - Job cancellation
    - Priority support (future enhancement)
    """

    def __init__(
        self,
        generation_service: Any,  # Avoid circular import
        max_queue_size: int = 100,
        max_concurrent: int = 1,  # GPU operations should be serialized
    ):
        """Initialize batch processor.

        Args:
            generation_service: GenerationService instance
            max_queue_size: Maximum jobs in queue
            max_concurrent: Maximum concurrent generations (usually 1 for GPU)
        """
        self.generation_service = generation_service
        self.max_queue_size = max_queue_size
        self.max_concurrent = max_concurrent

        # Job tracking: _jobs holds every submitted job (including finished
        # ones until cleanup_completed runs); _queue holds only pending IDs.
        self._jobs: dict[str, GenerationJob] = {}
        self._queue: asyncio.Queue[str] = asyncio.Queue(maxsize=max_queue_size)

        # Processing control
        self._workers: list[asyncio.Task] = []
        self._running = False
        self._lock = asyncio.Lock()

        # Callbacks (fan-out lists; each callback is isolated from the others
        # by a per-callback try/except in _process_job)
        self._on_job_complete: list[Callable[[GenerationJob], None]] = []
        self._on_job_failed: list[Callable[[GenerationJob], None]] = []
        self._on_progress: list[Callable[[GenerationJob], None]] = []

    async def start(self) -> None:
        """Start the batch processor workers. Idempotent: a second call is a no-op."""
        if self._running:
            return

        self._running = True

        # Start worker tasks
        for i in range(self.max_concurrent):
            worker = asyncio.create_task(self._worker_loop(i))
            self._workers.append(worker)

        logger.info(f"Batch processor started with {self.max_concurrent} worker(s)")

    async def stop(self) -> None:
        """Stop the batch processor.

        Cancels the worker tasks and waits for them to exit. Jobs still in
        the queue are NOT drained first; they remain PENDING in ``_jobs``
        (a job mid-generation is interrupted by the task cancellation).
        """
        if not self._running:
            return

        self._running = False

        # Cancel workers
        for worker in self._workers:
            worker.cancel()

        # Wait for workers to finish; return_exceptions swallows the
        # CancelledError each cancelled task raises.
        await asyncio.gather(*self._workers, return_exceptions=True)
        self._workers.clear()

        logger.info("Batch processor stopped")

    async def submit(self, job: GenerationJob) -> GenerationJob:
        """Submit a job to the queue.

        Args:
            job: Job to submit

        Returns:
            The submitted job with ID

        Raises:
            RuntimeError: If queue is full

        Note: capacity is checked against len(_jobs), which also counts
        completed/failed jobs until cleanup_completed() prunes them — so a
        "full" queue may contain finished work. This same check guarantees
        the bounded _queue.put below cannot block while the lock is held.
        """
        async with self._lock:
            if len(self._jobs) >= self.max_queue_size:
                raise RuntimeError(
                    f"Queue full (max {self.max_queue_size} jobs). "
                    "Please wait for jobs to complete."
                )

            self._jobs[job.id] = job
            await self._queue.put(job.id)

        logger.info(f"Job {job.id} submitted to queue (position: {self._queue.qsize()})")
        return job

    async def cancel(self, job_id: str) -> bool:
        """Cancel a pending job.

        Args:
            job_id: ID of job to cancel

        Returns:
            True if job was cancelled, False if not found or already processing

        The job's ID stays in the queue; the worker loop skips CANCELLED
        jobs when it dequeues them.
        """
        async with self._lock:
            job = self._jobs.get(job_id)
            if job is None:
                return False

            if job.status != JobStatus.PENDING:
                logger.warning(f"Cannot cancel job {job_id} with status {job.status}")
                return False

            job.status = JobStatus.CANCELLED
            # NOTE(review): naive UTC timestamp via utcnow() — deprecated in
            # Python 3.12; kept for consistency with the rest of the module.
            job.completed_at = datetime.utcnow()

        logger.info(f"Job {job_id} cancelled")
        return True

    def get_job(self, job_id: str) -> Optional[GenerationJob]:
        """Get a job by ID."""
        return self._jobs.get(job_id)

    def get_queue_status(self) -> dict[str, Any]:
        """Get current queue status: queue depth, totals, and a per-status histogram."""
        jobs_by_status = {}
        for job in self._jobs.values():
            status = job.status.value
            jobs_by_status[status] = jobs_by_status.get(status, 0) + 1

        return {
            "queue_size": self._queue.qsize(),
            "total_jobs": len(self._jobs),
            "jobs_by_status": jobs_by_status,
            "running": self._running,
            "max_queue_size": self.max_queue_size,
        }

    def list_jobs(
        self,
        status: Optional[JobStatus] = None,
        limit: int = 50,
    ) -> list[GenerationJob]:
        """List jobs with optional status filter.

        Args:
            status: Filter by status
            limit: Maximum jobs to return

        Returns:
            List of jobs ordered by creation time (newest first)
        """
        jobs = list(self._jobs.values())

        if status:
            jobs = [j for j in jobs if j.status == status]

        # Sort by created_at descending
        jobs.sort(key=lambda j: j.created_at, reverse=True)

        return jobs[:limit]

    def cleanup_completed(self, max_age_hours: float = 24.0) -> int:
        """Remove old completed/failed jobs from memory.

        Args:
            max_age_hours: Remove jobs older than this

        Returns:
            Number of jobs removed
        """
        # Compare epoch seconds rather than datetimes to keep the math simple.
        cutoff = datetime.utcnow().timestamp() - (max_age_hours * 3600)
        removed = 0

        # list() snapshot allows deletion while iterating.
        for job_id, job in list(self._jobs.items()):
            if job.status in (JobStatus.COMPLETED, JobStatus.FAILED, JobStatus.CANCELLED):
                if job.completed_at and job.completed_at.timestamp() < cutoff:
                    del self._jobs[job_id]
                    removed += 1

        if removed:
            logger.info(f"Cleaned up {removed} old jobs")

        return removed

    async def _worker_loop(self, worker_id: int) -> None:
        """Worker loop that processes jobs from queue.

        Polls with a 1s timeout so the loop re-checks ``self._running``
        regularly and can exit cleanly even when the queue is idle.
        """
        logger.debug(f"Worker {worker_id} started")

        while self._running:
            try:
                # Wait for job with timeout
                try:
                    job_id = await asyncio.wait_for(
                        self._queue.get(), timeout=1.0
                    )
                except asyncio.TimeoutError:
                    continue

                # Skip jobs that were cancelled (or pruned) after being queued.
                job = self._jobs.get(job_id)
                if job is None or job.status == JobStatus.CANCELLED:
                    continue

                await self._process_job(job)

            except asyncio.CancelledError:
                break
            except Exception as e:
                # Never let a single bad job kill the worker.
                logger.error(f"Worker {worker_id} error: {e}")

        logger.debug(f"Worker {worker_id} stopped")

    async def _process_job(self, job: GenerationJob) -> None:
        """Process a single generation job.

        Transitions the job PENDING -> PROCESSING -> COMPLETED/FAILED,
        relays progress to registered callbacks, and records the result
        (or error) on the job object itself.
        """
        logger.info(f"Processing job {job.id}: {job.model_id}/{job.variant}")

        job.status = JobStatus.PROCESSING
        job.started_at = datetime.utcnow()

        def progress_callback(progress: float, message: str) -> None:
            # Mirror progress onto the job, then fan out; a failing callback
            # must not break the others or the generation itself.
            job.progress = progress
            job.progress_message = message
            for callback in self._on_progress:
                try:
                    callback(job)
                except Exception as e:
                    logger.error(f"Progress callback error: {e}")

        try:
            result, generation = await self.generation_service.generate(
                model_id=job.model_id,
                variant=job.variant,
                prompts=job.prompts,
                duration=job.parameters.get("duration", 10.0),
                temperature=job.parameters.get("temperature", 1.0),
                top_k=job.parameters.get("top_k", 250),
                top_p=job.parameters.get("top_p", 0.0),
                cfg_coef=job.parameters.get("cfg_coef", 3.0),
                seed=job.parameters.get("seed"),
                conditioning=job.conditioning,
                project_id=job.project_id,
                preset_used=job.preset_used,
                tags=job.tags,
                progress_callback=progress_callback,
            )

            job.status = JobStatus.COMPLETED
            job.result_id = generation.id
            job.audio_path = generation.audio_path
            job.completed_at = datetime.utcnow()
            job.progress = 1.0
            job.progress_message = "Complete"

            logger.info(f"Job {job.id} completed: {generation.id}")

            for callback in self._on_job_complete:
                try:
                    callback(job)
                except Exception as e:
                    logger.error(f"Completion callback error: {e}")

        except Exception as e:
            job.status = JobStatus.FAILED
            job.error = str(e)
            job.completed_at = datetime.utcnow()

            logger.error(f"Job {job.id} failed: {e}")

            for callback in self._on_job_failed:
                try:
                    callback(job)
                except Exception as e2:
                    logger.error(f"Failure callback error: {e2}")

    # Callback registration

    def on_job_complete(self, callback: Callable[[GenerationJob], None]) -> None:
        """Register callback for job completion."""
        self._on_job_complete.append(callback)

    def on_job_failed(self, callback: Callable[[GenerationJob], None]) -> None:
        """Register callback for job failure."""
        self._on_job_failed.append(callback)

    def on_progress(self, callback: Callable[[GenerationJob], None]) -> None:
        """Register callback for progress updates."""
        self._on_progress.append(callback)
|
||||||
322
src/services/generation_service.py
Normal file
322
src/services/generation_service.py
Normal file
@@ -0,0 +1,322 @@
|
|||||||
|
"""Generation service for orchestrating audio generation."""
|
||||||
|
|
||||||
|
import logging
|
||||||
|
import time
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
import soundfile as sf
|
||||||
|
import torch
|
||||||
|
|
||||||
|
from src.core.base_model import GenerationRequest, GenerationResult
|
||||||
|
from src.core.gpu_manager import GPUMemoryManager
|
||||||
|
from src.core.model_registry import ModelRegistry
|
||||||
|
from src.core.oom_handler import OOMHandler
|
||||||
|
from src.storage.database import Database, Generation
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class GenerationService:
    """Orchestrates audio generation across all models.

    Handles:
    - Model selection and loading
    - Generation execution with OOM recovery
    - Result saving and database recording
    - Progress callbacks for UI updates
    """

    def __init__(
        self,
        registry: ModelRegistry,
        gpu_manager: GPUMemoryManager,
        database: Database,
        output_dir: Path,
    ):
        """Initialize generation service.

        Args:
            registry: Model registry for model access
            gpu_manager: GPU memory manager
            database: Database for storing generation records
            output_dir: Directory for saving generated audio
        """
        self.registry = registry
        self.gpu_manager = gpu_manager
        self.database = database
        self.output_dir = Path(output_dir)
        self.output_dir.mkdir(parents=True, exist_ok=True)

        # OOM handler
        self.oom_handler = OOMHandler(gpu_manager, registry)

        # Statistics (in-memory only; reset on process restart)
        self._generation_count = 0
        self._total_duration_generated = 0.0

    async def generate(
        self,
        model_id: str,
        variant: Optional[str],
        prompts: list[str],
        duration: float = 10.0,
        temperature: float = 1.0,
        top_k: int = 250,
        top_p: float = 0.0,
        cfg_coef: float = 3.0,
        seed: Optional[int] = None,
        conditioning: Optional[dict[str, Any]] = None,
        project_id: Optional[str] = None,
        preset_used: Optional[str] = None,
        tags: Optional[list[str]] = None,
        progress_callback: Optional[Callable[[float, str], None]] = None,
    ) -> tuple[GenerationResult, Generation]:
        """Generate audio and save to database.

        Args:
            model_id: Model family to use
            variant: Model variant (None for default)
            prompts: Text prompts for generation
            duration: Target duration in seconds
            temperature: Sampling temperature
            top_k: Top-k sampling parameter
            top_p: Nucleus sampling parameter
            cfg_coef: Classifier-free guidance coefficient
            seed: Random seed for reproducibility
            conditioning: Optional conditioning data (melody, style, chords, etc.)
            project_id: Optional project to associate with
            preset_used: Name of preset used (for metadata)
            tags: Optional tags for organization
            progress_callback: Optional callback for progress updates

        Returns:
            Tuple of (GenerationResult, Generation database record)

        Raises:
            ValueError: If model not found or parameters invalid
            RuntimeError: If generation fails
        """
        start_time = time.time()

        # Report progress
        if progress_callback:
            progress_callback(0.0, "Preparing generation...")

        # Build generation request
        request = GenerationRequest(
            prompts=prompts,
            duration=duration,
            temperature=temperature,
            top_k=top_k,
            top_p=top_p,
            cfg_coef=cfg_coef,
            seed=seed,
            conditioning=conditioning or {},
        )

        # Get model configuration
        family_config, variant_config = self.registry.get_model_config(model_id, variant)
        actual_variant = variant or family_config.default_variant

        # Check VRAM availability
        if progress_callback:
            progress_callback(0.1, "Checking GPU memory...")

        can_load, reason = self.gpu_manager.can_load_model(variant_config.vram_mb)
        if not can_load:
            # Try OOM recovery: check_memory_for_operation gets a chance to
            # free memory before we give up.
            if not self.oom_handler.check_memory_for_operation(variant_config.vram_mb):
                raise RuntimeError(f"Insufficient GPU memory: {reason}")

        # Generate with OOM recovery wrapper
        if progress_callback:
            progress_callback(0.2, f"Loading {model_id}/{actual_variant}...")

        # NOTE(review): do_generation is synchronous and called directly from
        # this coroutine, so the event loop is blocked for the duration of the
        # model run — confirm this is acceptable (e.g. single worker) or move
        # it to a thread executor.
        @self.oom_handler.with_oom_recovery
        def do_generation() -> GenerationResult:
            with self.registry.get_model(model_id, actual_variant) as model:
                if progress_callback:
                    progress_callback(0.4, "Generating audio...")
                return model.generate(request)

        result = do_generation()

        if progress_callback:
            progress_callback(0.8, "Saving audio...")

        # Save audio file
        audio_path = self._save_audio(result)

        # Create database record; multiple prompts are joined with newlines
        # (regenerate() splits them back apart on "\n").
        generation = Generation.create(
            model=model_id,
            variant=actual_variant,
            prompt=prompts[0] if len(prompts) == 1 else "\n".join(prompts),
            parameters={
                "duration": duration,
                "temperature": temperature,
                "top_k": top_k,
                "top_p": top_p,
                "cfg_coef": cfg_coef,
            },
            project_id=project_id,
            preset_used=preset_used,
            conditioning=conditioning,
            audio_path=str(audio_path),
            duration_seconds=result.duration,
            sample_rate=result.sample_rate,
            tags=tags or [],
            seed=result.seed,
        )

        # Save to database
        await self.database.create_generation(generation)

        # Update statistics
        self._generation_count += 1
        self._total_duration_generated += result.duration

        elapsed = time.time() - start_time
        logger.info(
            f"Generation complete: {model_id}/{actual_variant}, "
            f"duration={result.duration:.1f}s, elapsed={elapsed:.1f}s"
        )

        if progress_callback:
            progress_callback(1.0, "Complete!")

        return result, generation

    def _save_audio(self, result: GenerationResult) -> Path:
        """Save generated audio to file.

        Args:
            result: Generation result with audio tensor

        Returns:
            Path to saved audio file
        """
        # Generate unique filename (millisecond timestamp keeps names unique
        # for sequential saves)
        timestamp = int(time.time() * 1000)
        filename = f"{result.model_id}_{result.variant}_{timestamp}.wav"
        filepath = self.output_dir / filename

        # Convert tensor to numpy and save
        # NOTE(review): .numpy() requires a CPU tensor — torch raises for CUDA
        # tensors; confirm the model/result pipeline moves audio to CPU first.
        audio = result.audio.numpy()

        # Handle batch dimension - save first sample if batched
        # (remaining batch items are silently dropped here)
        if audio.ndim == 3:
            audio = audio[0]  # [channels, samples]

        # Transpose to [samples, channels] for soundfile
        if audio.ndim == 2:
            audio = audio.T

        sf.write(filepath, audio, result.sample_rate)

        logger.debug(f"Saved audio to {filepath}")
        return filepath

    async def regenerate(
        self,
        generation_id: str,
        new_seed: Optional[int] = None,
        progress_callback: Optional[Callable[[float, str], None]] = None,
    ) -> tuple[GenerationResult, Generation]:
        """Regenerate audio using parameters from existing generation.

        Args:
            generation_id: ID of generation to regenerate
            new_seed: Optional new seed (uses original if None)
            progress_callback: Optional progress callback

        Returns:
            Tuple of (GenerationResult, new Generation record)

        Raises:
            ValueError: If generation not found
        """
        # Load original generation
        original = await self.database.get_generation(generation_id)
        if original is None:
            raise ValueError(f"Generation not found: {generation_id}")

        # Parse prompts (inverse of the "\n".join done in generate())
        prompts = original.prompt.split("\n") if "\n" in original.prompt else [original.prompt]

        # Regenerate with same or new seed
        return await self.generate(
            model_id=original.model,
            variant=original.variant,
            prompts=prompts,
            duration=original.parameters.get("duration", 10.0),
            temperature=original.parameters.get("temperature", 1.0),
            top_k=original.parameters.get("top_k", 250),
            top_p=original.parameters.get("top_p", 0.0),
            cfg_coef=original.parameters.get("cfg_coef", 3.0),
            seed=new_seed if new_seed is not None else original.seed,
            conditioning=original.conditioning,
            project_id=original.project_id,
            preset_used=original.preset_used,
            tags=original.tags,
            progress_callback=progress_callback,
        )

    def get_stats(self) -> dict[str, Any]:
        """Get generation statistics.

        Returns:
            Dictionary with generation stats
        """
        return {
            "generation_count": self._generation_count,
            "total_duration_generated": self._total_duration_generated,
            "oom_stats": self.oom_handler.get_stats(),
        }

    def estimate_generation_time(
        self, model_id: str, variant: Optional[str], duration: float
    ) -> float:
        """Estimate generation time for given parameters.

        Args:
            model_id: Model family
            variant: Model variant
            duration: Target audio duration

        Returns:
            Estimated generation time in seconds
        """
        # Rough estimates based on model type and RTX 4090
        # These are approximations and vary based on many factors
        estimates = {
            "musicgen": {
                "small": 0.8,  # seconds per second of audio
                "medium": 1.5,
                "large": 3.0,
                "melody": 1.8,
            },
            "audiogen": {
                "medium": 1.5,
            },
            "magnet": {
                "small-10secs": 0.3,  # Non-autoregressive is faster
                "medium-10secs": 0.5,
                "small-30secs": 0.3,
                "medium-30secs": 0.5,
            },
            "musicgen-style": {
                "medium": 1.8,
            },
            "jasco": {
                "chords-drums-400M": 1.0,
                "chords-drums-1B": 1.5,
            },
        }

        family_config, _ = self.registry.get_model_config(model_id, variant)
        actual_variant = variant or family_config.default_variant

        # Unknown model/variant pairs fall back to a conservative 2.0x ratio.
        ratio = estimates.get(model_id, {}).get(actual_variant, 2.0)
        return duration * ratio + 5.0  # Add 5s for model loading overhead
|
||||||
395
src/services/project_service.py
Normal file
395
src/services/project_service.py
Normal file
@@ -0,0 +1,395 @@
|
|||||||
|
"""Project service for managing projects and generations."""
|
||||||
|
|
||||||
|
import logging
|
||||||
|
import shutil
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
from src.storage.database import Database, Generation, Project, Preset
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
class ProjectService:
|
||||||
|
"""Service for managing projects, generations, and presets.
|
||||||
|
|
||||||
|
Provides a high-level API for project organization and
|
||||||
|
generation history management.
|
||||||
|
"""
|
||||||
|
|
||||||
|
def __init__(self, database: Database, output_dir: Path):
|
||||||
|
"""Initialize project service.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
database: Database instance
|
||||||
|
output_dir: Directory where audio files are stored
|
||||||
|
"""
|
||||||
|
self.database = database
|
||||||
|
self.output_dir = Path(output_dir)
|
||||||
|
|
||||||
|
# Project Operations
|
||||||
|
|
||||||
|
async def create_project(
|
||||||
|
self, name: str, description: str = ""
|
||||||
|
) -> Project:
|
||||||
|
"""Create a new project.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
name: Project name
|
||||||
|
description: Optional description
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Created project
|
||||||
|
"""
|
||||||
|
project = Project.create(name, description)
|
||||||
|
await self.database.create_project(project)
|
||||||
|
logger.info(f"Created project: {project.id} ({name})")
|
||||||
|
return project
|
||||||
|
|
||||||
|
async def get_project(self, project_id: str) -> Optional[Project]:
|
||||||
|
"""Get a project by ID."""
|
||||||
|
return await self.database.get_project(project_id)
|
||||||
|
|
||||||
|
async def list_projects(
|
||||||
|
self, limit: int = 100, offset: int = 0
|
||||||
|
) -> list[Project]:
|
||||||
|
"""List all projects."""
|
||||||
|
return await self.database.list_projects(limit, offset)
|
||||||
|
|
||||||
|
async def update_project(
|
||||||
|
self,
|
||||||
|
project_id: str,
|
||||||
|
name: Optional[str] = None,
|
||||||
|
description: Optional[str] = None,
|
||||||
|
) -> Optional[Project]:
|
||||||
|
"""Update a project.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
project_id: Project ID
|
||||||
|
name: New name (None to keep current)
|
||||||
|
description: New description (None to keep current)
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Updated project, or None if not found
|
||||||
|
"""
|
||||||
|
project = await self.database.get_project(project_id)
|
||||||
|
if project is None:
|
||||||
|
return None
|
||||||
|
|
||||||
|
if name is not None:
|
||||||
|
project.name = name
|
||||||
|
if description is not None:
|
||||||
|
project.description = description
|
||||||
|
|
||||||
|
await self.database.update_project(project)
|
||||||
|
logger.info(f"Updated project: {project_id}")
|
||||||
|
return project
|
||||||
|
|
||||||
|
async def delete_project(
|
||||||
|
self, project_id: str, delete_files: bool = False
|
||||||
|
) -> bool:
|
||||||
|
"""Delete a project.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
project_id: Project ID
|
||||||
|
delete_files: If True, also delete associated audio files
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
True if deleted
|
||||||
|
"""
|
||||||
|
if delete_files:
|
||||||
|
# Get all generations and delete their files
|
||||||
|
generations = await self.database.list_generations(project_id=project_id)
|
||||||
|
for gen in generations:
|
||||||
|
if gen.audio_path:
|
||||||
|
try:
|
||||||
|
Path(gen.audio_path).unlink(missing_ok=True)
|
||||||
|
except Exception as e:
|
||||||
|
logger.warning(f"Failed to delete {gen.audio_path}: {e}")
|
||||||
|
|
||||||
|
result = await self.database.delete_project(project_id)
|
||||||
|
if result:
|
||||||
|
logger.info(f"Deleted project: {project_id}")
|
||||||
|
return result
|
||||||
|
|
||||||
|
async def get_project_stats(self, project_id: str) -> dict[str, Any]:
|
||||||
|
"""Get statistics for a project.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
project_id: Project ID
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Dictionary with project statistics
|
||||||
|
"""
|
||||||
|
generations = await self.database.list_generations(
|
||||||
|
project_id=project_id, limit=10000
|
||||||
|
)
|
||||||
|
|
||||||
|
total_duration = sum(g.duration_seconds or 0 for g in generations)
|
||||||
|
models_used = {}
|
||||||
|
for gen in generations:
|
||||||
|
key = f"{gen.model}/{gen.variant}"
|
||||||
|
models_used[key] = models_used.get(key, 0) + 1
|
||||||
|
|
||||||
|
return {
|
||||||
|
"generation_count": len(generations),
|
||||||
|
"total_duration_seconds": total_duration,
|
||||||
|
"models_used": models_used,
|
||||||
|
}
|
||||||
|
|
||||||
|
# Generation Operations
|
||||||
|
|
||||||
|
async def get_generation(self, generation_id: str) -> Optional[Generation]:
|
||||||
|
"""Get a generation by ID."""
|
||||||
|
return await self.database.get_generation(generation_id)
|
||||||
|
|
||||||
|
async def list_generations(
|
||||||
|
self,
|
||||||
|
project_id: Optional[str] = None,
|
||||||
|
model: Optional[str] = None,
|
||||||
|
search: Optional[str] = None,
|
||||||
|
limit: int = 100,
|
||||||
|
offset: int = 0,
|
||||||
|
) -> list[Generation]:
|
||||||
|
"""List generations with optional filters.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
project_id: Filter by project
|
||||||
|
model: Filter by model family
|
||||||
|
search: Search in prompts, names, and tags
|
||||||
|
limit: Maximum results
|
||||||
|
offset: Pagination offset
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
List of generations
|
||||||
|
"""
|
||||||
|
return await self.database.list_generations(
|
||||||
|
project_id=project_id,
|
||||||
|
model=model,
|
||||||
|
search=search,
|
||||||
|
limit=limit,
|
||||||
|
offset=offset,
|
||||||
|
)
|
||||||
|
|
||||||
|
async def update_generation(
|
||||||
|
self,
|
||||||
|
generation_id: str,
|
||||||
|
name: Optional[str] = None,
|
||||||
|
tags: Optional[list[str]] = None,
|
||||||
|
notes: Optional[str] = None,
|
||||||
|
project_id: Optional[str] = None,
|
||||||
|
) -> Optional[Generation]:
|
||||||
|
"""Update a generation's metadata.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
generation_id: Generation ID
|
||||||
|
name: New name
|
||||||
|
tags: New tags (replaces existing)
|
||||||
|
notes: New notes
|
||||||
|
project_id: Move to different project
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Updated generation, or None if not found
|
||||||
|
"""
|
||||||
|
generation = await self.database.get_generation(generation_id)
|
||||||
|
if generation is None:
|
||||||
|
return None
|
||||||
|
|
||||||
|
if name is not None:
|
||||||
|
generation.name = name
|
||||||
|
if tags is not None:
|
||||||
|
generation.tags = tags
|
||||||
|
if notes is not None:
|
||||||
|
generation.notes = notes
|
||||||
|
if project_id is not None:
|
||||||
|
generation.project_id = project_id
|
||||||
|
|
||||||
|
await self.database.update_generation(generation)
|
||||||
|
logger.info(f"Updated generation: {generation_id}")
|
||||||
|
return generation
|
||||||
|
|
||||||
|
async def delete_generation(
|
||||||
|
self, generation_id: str, delete_file: bool = True
|
||||||
|
) -> bool:
|
||||||
|
"""Delete a generation.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
generation_id: Generation ID
|
||||||
|
delete_file: If True, also delete audio file
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
True if deleted
|
||||||
|
"""
|
||||||
|
if delete_file:
|
||||||
|
generation = await self.database.get_generation(generation_id)
|
||||||
|
if generation and generation.audio_path:
|
||||||
|
try:
|
||||||
|
Path(generation.audio_path).unlink(missing_ok=True)
|
||||||
|
except Exception as e:
|
||||||
|
logger.warning(f"Failed to delete audio file: {e}")
|
||||||
|
|
||||||
|
result = await self.database.delete_generation(generation_id)
|
||||||
|
if result:
|
||||||
|
logger.info(f"Deleted generation: {generation_id}")
|
||||||
|
return result
|
||||||
|
|
||||||
|
async def move_generations_to_project(
|
||||||
|
self, generation_ids: list[str], project_id: Optional[str]
|
||||||
|
) -> int:
|
||||||
|
"""Move multiple generations to a project.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
generation_ids: List of generation IDs
|
||||||
|
project_id: Target project ID (None to unlink)
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Number of generations moved
|
||||||
|
"""
|
||||||
|
moved = 0
|
||||||
|
for gen_id in generation_ids:
|
||||||
|
result = await self.update_generation(gen_id, project_id=project_id)
|
||||||
|
if result:
|
||||||
|
moved += 1
|
||||||
|
|
||||||
|
logger.info(f"Moved {moved} generations to project {project_id}")
|
||||||
|
return moved
|
||||||
|
|
||||||
|
# Preset Operations
|
||||||
|
|
||||||
|
async def create_preset(
    self,
    model: str,
    name: str,
    parameters: dict[str, Any],
    description: str = "",
) -> Preset:
    """Persist a new user-defined (non-builtin) preset.

    Args:
        model: Model family this preset is for
        name: Preset name
        parameters: Generation parameters
        description: Optional description

    Returns:
        Created preset
    """
    new_preset = Preset.create(model, name, parameters, description)
    await self.database.create_preset(new_preset)
    logger.info(f"Created preset: {new_preset.id} ({name}) for {model}")
    return new_preset
|
||||||
|
|
||||||
|
async def list_presets(
    self, model: Optional[str] = None, include_builtin: bool = True
) -> list[Preset]:
    """Fetch presets, optionally restricted to one model family.

    Args:
        model: Filter by model family
        include_builtin: Include built-in presets

    Returns:
        List of presets
    """
    # Pure delegation; filtering happens in the database layer.
    return await self.database.list_presets(model, include_builtin)


async def get_preset(self, preset_id: str) -> Optional[Preset]:
    """Look up a single preset by its ID (None when absent)."""
    return await self.database.get_preset(preset_id)
|
||||||
|
|
||||||
|
async def delete_preset(self, preset_id: str) -> bool:
    """Delete a custom preset.

    Note: Built-in presets cannot be deleted (enforced by the database
    layer's WHERE clause).

    Args:
        preset_id: Preset ID

    Returns:
        True if deleted
    """
    removed = await self.database.delete_preset(preset_id)
    if removed:
        logger.info(f"Deleted preset: {preset_id}")
    return removed
|
||||||
|
|
||||||
|
# Export Operations
|
||||||
|
|
||||||
|
async def export_project(
    self, project_id: str, output_path: Path, include_metadata: bool = True
) -> Path:
    """Export a project as a ZIP archive.

    The archive contains every existing audio file of the project's
    generations, plus (optionally) a metadata.json describing the project
    and each generation.

    Args:
        project_id: Project ID
        output_path: Output ZIP file path
        include_metadata: Include JSON metadata file

    Returns:
        Path to created ZIP file

    Raises:
        ValueError: If the project does not exist.
    """
    import json
    import shutil
    import tempfile
    import zipfile

    project = await self.database.get_project(project_id)
    if project is None:
        raise ValueError(f"Project not found: {project_id}")

    generations = await self.database.list_generations(
        project_id=project_id, limit=10000
    )

    with tempfile.TemporaryDirectory() as tmpdir:
        tmppath = Path(tmpdir)

        # Copy audio files into the staging dir. Basenames are de-duplicated:
        # previously two generations whose audio files shared a filename
        # silently overwrote each other in the archive.
        archive_names: dict[str, str] = {}  # generation id -> name in the ZIP
        used_names: set[str] = set()
        for gen in generations:
            if gen.audio_path and Path(gen.audio_path).exists():
                src = Path(gen.audio_path)
                candidate = src.name
                suffix = 1
                while candidate in used_names:
                    candidate = f"{src.stem}_{suffix}{src.suffix}"
                    suffix += 1
                used_names.add(candidate)
                archive_names[gen.id] = candidate
                shutil.copy2(src, tmppath / candidate)

        # Optional JSON manifest describing project + generations.
        if include_metadata:
            metadata = {
                "project": {
                    "id": project.id,
                    "name": project.name,
                    "description": project.description,
                    "created_at": project.created_at.isoformat(),
                },
                "generations": [
                    {
                        "id": g.id,
                        "model": g.model,
                        "variant": g.variant,
                        "prompt": g.prompt,
                        "parameters": g.parameters,
                        "duration": g.duration_seconds,
                        # Name of the copied file inside the archive; falls
                        # back to the original basename for missing files.
                        "audio_file": archive_names.get(
                            g.id,
                            Path(g.audio_path).name if g.audio_path else None,
                        ),
                        "created_at": g.created_at.isoformat(),
                        "tags": g.tags,
                        "seed": g.seed,
                    }
                    for g in generations
                ],
            }

            metadata_path = tmppath / "metadata.json"
            metadata_path.write_text(json.dumps(metadata, indent=2))

        # Bundle the staging dir into the requested ZIP.
        output_path = Path(output_path)
        with zipfile.ZipFile(output_path, "w", zipfile.ZIP_DEFLATED) as zf:
            for file in tmppath.iterdir():
                zf.write(file, file.name)

    logger.info(f"Exported project {project_id} to {output_path}")
    return output_path
|
||||||
|
|
||||||
|
# Statistics
|
||||||
|
|
||||||
|
async def get_stats(self) -> dict[str, Any]:
    """Return aggregate statistics computed by the backing database."""
    return await self.database.get_stats()
|
||||||
5
src/storage/__init__.py
Normal file
5
src/storage/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""Storage module for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.storage.database import Database, Generation, Project, Preset
|
||||||
|
|
||||||
|
__all__ = ["Database", "Generation", "Project", "Preset"]
|
||||||
550
src/storage/database.py
Normal file
550
src/storage/database.py
Normal file
@@ -0,0 +1,550 @@
|
|||||||
|
"""SQLite database for projects, generations, and presets."""
|
||||||
|
|
||||||
|
import json
|
||||||
|
import logging
|
||||||
|
import uuid
|
||||||
|
from dataclasses import dataclass, field
|
||||||
|
from datetime import datetime
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
import aiosqlite
|
||||||
|
|
||||||
|
logger = logging.getLogger(__name__)
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class Project:
    """Project entity for organizing generations."""

    id: str  # "proj_" + 12 hex chars, assigned by create()
    name: str
    created_at: datetime
    updated_at: datetime
    description: str = ""

    @classmethod
    def create(cls, name: str, description: str = "") -> "Project":
        """Build a fresh project with a random ID and matching timestamps."""
        timestamp = datetime.utcnow()
        return cls(
            id=f"proj_{uuid.uuid4().hex[:12]}",
            name=name,
            description=description,
            created_at=timestamp,
            updated_at=timestamp,
        )
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class Generation:
    """Audio generation record."""

    # Identity / provenance
    id: str  # "gen_" + 12 hex chars, assigned by create()
    project_id: Optional[str]  # owning project, or None when unlinked
    model: str
    variant: str
    prompt: str
    parameters: dict[str, Any]
    created_at: datetime
    # Output metadata (filled once audio has been produced)
    audio_path: Optional[str] = None
    duration_seconds: Optional[float] = None
    sample_rate: Optional[int] = None
    # Optional extras
    preset_used: Optional[str] = None
    conditioning: dict[str, Any] = field(default_factory=dict)
    name: Optional[str] = None
    tags: list[str] = field(default_factory=list)
    notes: Optional[str] = None
    seed: Optional[int] = None

    @classmethod
    def create(
        cls,
        model: str,
        variant: str,
        prompt: str,
        parameters: dict[str, Any],
        project_id: Optional[str] = None,
        **kwargs,
    ) -> "Generation":
        """Create a new generation record with a random ID and current UTC time."""
        return cls(
            id=f"gen_{uuid.uuid4().hex[:12]}",
            created_at=datetime.utcnow(),
            model=model,
            variant=variant,
            prompt=prompt,
            parameters=parameters,
            project_id=project_id,
            **kwargs,
        )
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class Preset:
    """Generation parameter preset."""

    id: str  # "preset_" + 12 hex chars for custom presets
    model: str  # model family the preset applies to
    parameters: dict[str, Any]
    name: str
    created_at: datetime
    description: str = ""
    is_builtin: bool = False  # built-ins cannot be deleted

    @classmethod
    def create(
        cls,
        model: str,
        name: str,
        parameters: dict[str, Any],
        description: str = "",
    ) -> "Preset":
        """Create a new custom (non-builtin) preset with a random ID."""
        return cls(
            id=f"preset_{uuid.uuid4().hex[:12]}",
            created_at=datetime.utcnow(),
            model=model,
            name=name,
            parameters=parameters,
            description=description,
            is_builtin=False,
        )
|
||||||
|
|
||||||
|
|
||||||
|
class Database:
    """Async SQLite database for AudioCraft Studio.

    Handles storage of projects, generations, and presets. A single
    aiosqlite connection is opened by connect() and shared by all methods.
    """

    # Idempotent schema (IF NOT EXISTS throughout), executed on every connect().
    SCHEMA = """
    CREATE TABLE IF NOT EXISTS projects (
        id TEXT PRIMARY KEY,
        name TEXT NOT NULL,
        description TEXT DEFAULT '',
        created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
        updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
    );

    CREATE TABLE IF NOT EXISTS generations (
        id TEXT PRIMARY KEY,
        project_id TEXT REFERENCES projects(id) ON DELETE SET NULL,
        model TEXT NOT NULL,
        variant TEXT NOT NULL,
        prompt TEXT NOT NULL,
        parameters JSON NOT NULL,
        preset_used TEXT,
        conditioning JSON,
        audio_path TEXT,
        duration_seconds REAL,
        sample_rate INTEGER,
        created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
        name TEXT,
        tags JSON,
        notes TEXT,
        seed INTEGER
    );

    CREATE TABLE IF NOT EXISTS presets (
        id TEXT PRIMARY KEY,
        model TEXT NOT NULL,
        name TEXT NOT NULL,
        description TEXT DEFAULT '',
        parameters JSON NOT NULL,
        is_builtin BOOLEAN DEFAULT FALSE,
        created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
    );

    CREATE INDEX IF NOT EXISTS idx_generations_project ON generations(project_id);
    CREATE INDEX IF NOT EXISTS idx_generations_created ON generations(created_at DESC);
    CREATE INDEX IF NOT EXISTS idx_generations_model ON generations(model);
    CREATE INDEX IF NOT EXISTS idx_presets_model ON presets(model);
    """

    def __init__(self, db_path: Path):
        """Initialize database.

        Args:
            db_path: Path to SQLite database file (parent dirs are created
                lazily by connect()).
        """
        self.db_path = db_path
        self._connection: Optional[aiosqlite.Connection] = None

    async def connect(self) -> None:
        """Open the connection, enable FK enforcement, and apply the schema."""
        self.db_path.parent.mkdir(parents=True, exist_ok=True)
        self._connection = await aiosqlite.connect(self.db_path)
        self._connection.row_factory = aiosqlite.Row

        # SQLite disables foreign-key enforcement per connection by default;
        # without this pragma the ON DELETE SET NULL action declared on
        # generations.project_id never fires, leaving dangling project ids
        # behind after a project is deleted.
        await self._connection.execute("PRAGMA foreign_keys = ON")

        # Initialize schema
        await self._connection.executescript(self.SCHEMA)
        await self._connection.commit()

        logger.info(f"Database connected: {self.db_path}")

    async def close(self) -> None:
        """Close database connection (safe to call when already closed)."""
        if self._connection:
            await self._connection.close()
            self._connection = None

    @property
    def conn(self) -> aiosqlite.Connection:
        """Active connection; raises RuntimeError if connect() was not called."""
        if not self._connection:
            raise RuntimeError("Database not connected")
        return self._connection
|
||||||
|
|
||||||
|
# Project Methods
|
||||||
|
|
||||||
|
async def create_project(self, project: Project) -> Project:
    """Insert a new project row; returns the same object for chaining."""
    await self.conn.execute(
        """
        INSERT INTO projects (id, name, description, created_at, updated_at)
        VALUES (?, ?, ?, ?, ?)
        """,
        (
            project.id,
            project.name,
            project.description,
            project.created_at.isoformat(),
            project.updated_at.isoformat(),
        ),
    )
    await self.conn.commit()
    return project


async def get_project(self, project_id: str) -> Optional[Project]:
    """Get a project by ID, or None when it does not exist."""
    async with self.conn.execute(
        "SELECT * FROM projects WHERE id = ?", (project_id,)
    ) as cursor:
        record = await cursor.fetchone()
    if record is None:
        return None
    return Project(
        id=record["id"],
        name=record["name"],
        description=record["description"] or "",
        created_at=datetime.fromisoformat(record["created_at"]),
        updated_at=datetime.fromisoformat(record["updated_at"]),
    )


async def list_projects(
    self, limit: int = 100, offset: int = 0
) -> list[Project]:
    """List all projects, most recently updated first."""
    async with self.conn.execute(
        """
        SELECT * FROM projects
        ORDER BY updated_at DESC
        LIMIT ? OFFSET ?
        """,
        (limit, offset),
    ) as cursor:
        records = await cursor.fetchall()
    projects: list[Project] = []
    for record in records:
        projects.append(
            Project(
                id=record["id"],
                name=record["name"],
                description=record["description"] or "",
                created_at=datetime.fromisoformat(record["created_at"]),
                updated_at=datetime.fromisoformat(record["updated_at"]),
            )
        )
    return projects


async def update_project(self, project: Project) -> None:
    """Persist name/description changes; bumps updated_at (mutates `project`)."""
    project.updated_at = datetime.utcnow()
    await self.conn.execute(
        """
        UPDATE projects SET name = ?, description = ?, updated_at = ?
        WHERE id = ?
        """,
        (project.name, project.description, project.updated_at.isoformat(), project.id),
    )
    await self.conn.commit()


async def delete_project(self, project_id: str) -> bool:
    """Delete a project (generations are kept but unlinked)."""
    cursor = await self.conn.execute(
        "DELETE FROM projects WHERE id = ?", (project_id,)
    )
    await self.conn.commit()
    return cursor.rowcount > 0
|
||||||
|
|
||||||
|
# Generation Methods
|
||||||
|
|
||||||
|
async def create_generation(self, generation: Generation) -> Generation:
    """Insert a new generation record.

    JSON-typed columns (parameters, conditioning, tags) are serialized here
    and deserialized again in _row_to_generation. When the generation is
    linked to a project, that project's updated_at is bumped in the same
    transaction so "recently active" project ordering stays correct.
    """
    await self.conn.execute(
        """
        INSERT INTO generations (
            id, project_id, model, variant, prompt, parameters,
            preset_used, conditioning, audio_path, duration_seconds,
            sample_rate, created_at, name, tags, notes, seed
        ) VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)
        """,
        (
            generation.id,
            generation.project_id,
            generation.model,
            generation.variant,
            generation.prompt,
            json.dumps(generation.parameters),
            generation.preset_used,
            json.dumps(generation.conditioning),
            generation.audio_path,
            generation.duration_seconds,
            generation.sample_rate,
            generation.created_at.isoformat(),
            generation.name,
            json.dumps(generation.tags),
            generation.notes,
            generation.seed,
        ),
    )

    # Touch the owning project BEFORE committing so the insert and the
    # timestamp bump land atomically (the previous double-commit could leave
    # a generation inserted but its project's updated_at stale on failure).
    if generation.project_id:
        await self.conn.execute(
            "UPDATE projects SET updated_at = ? WHERE id = ?",
            (datetime.utcnow().isoformat(), generation.project_id),
        )
    await self.conn.commit()

    return generation


async def get_generation(self, generation_id: str) -> Optional[Generation]:
    """Get a generation by ID, or None when absent."""
    async with self.conn.execute(
        "SELECT * FROM generations WHERE id = ?", (generation_id,)
    ) as cursor:
        row = await cursor.fetchone()
        if row:
            return self._row_to_generation(row)
        return None


async def list_generations(
    self,
    project_id: Optional[str] = None,
    model: Optional[str] = None,
    limit: int = 100,
    offset: int = 0,
    search: Optional[str] = None,
) -> list[Generation]:
    """List generations, newest first, with optional filters.

    Args:
        project_id: Restrict to one project.
        model: Restrict to one model family.
        limit / offset: Pagination window.
        search: Case-insensitive substring match against prompt, name, or the
            raw JSON-encoded tags column (a LIKE over serialized JSON, so it
            may also match JSON punctuation — acceptable for a UI search box).
    """
    conditions = []
    params = []

    if project_id:
        conditions.append("project_id = ?")
        params.append(project_id)

    if model:
        conditions.append("model = ?")
        params.append(model)

    if search:
        conditions.append("(prompt LIKE ? OR name LIKE ? OR tags LIKE ?)")
        search_pattern = f"%{search}%"
        params.extend([search_pattern, search_pattern, search_pattern])

    where_clause = " AND ".join(conditions) if conditions else "1=1"

    # where_clause is built only from fixed fragments above; user input goes
    # exclusively through bound parameters, so this f-string is injection-safe.
    async with self.conn.execute(
        f"""
        SELECT * FROM generations
        WHERE {where_clause}
        ORDER BY created_at DESC
        LIMIT ? OFFSET ?
        """,
        (*params, limit, offset),
    ) as cursor:
        rows = await cursor.fetchall()
        return [self._row_to_generation(row) for row in rows]


async def update_generation(self, generation: Generation) -> None:
    """Update a generation record.

    Only the mutable columns (project_id, name, tags, notes, audio_path,
    duration_seconds, sample_rate) are written; model/variant/prompt/
    parameters are treated as immutable after insert.
    """
    await self.conn.execute(
        """
        UPDATE generations SET
            project_id = ?, name = ?, tags = ?, notes = ?,
            audio_path = ?, duration_seconds = ?, sample_rate = ?
        WHERE id = ?
        """,
        (
            generation.project_id,
            generation.name,
            json.dumps(generation.tags),
            generation.notes,
            generation.audio_path,
            generation.duration_seconds,
            generation.sample_rate,
            generation.id,
        ),
    )
    await self.conn.commit()


async def delete_generation(self, generation_id: str) -> bool:
    """Delete a generation record; True when a row was removed."""
    result = await self.conn.execute(
        "DELETE FROM generations WHERE id = ?", (generation_id,)
    )
    await self.conn.commit()
    return result.rowcount > 0


async def count_generations(
    self, project_id: Optional[str] = None, model: Optional[str] = None
) -> int:
    """Count generations matching the optional project/model filters."""
    conditions = []
    params = []

    if project_id:
        conditions.append("project_id = ?")
        params.append(project_id)

    if model:
        conditions.append("model = ?")
        params.append(model)

    where_clause = " AND ".join(conditions) if conditions else "1=1"

    async with self.conn.execute(
        f"SELECT COUNT(*) FROM generations WHERE {where_clause}",
        params,
    ) as cursor:
        row = await cursor.fetchone()
        return row[0] if row else 0
|
||||||
|
|
||||||
|
def _row_to_generation(self, row: aiosqlite.Row) -> Generation:
    """Convert database row to Generation object, decoding JSON columns."""
    # Nullable JSON columns decode to their empty-container defaults.
    raw_conditioning = row["conditioning"]
    raw_tags = row["tags"]
    return Generation(
        id=row["id"],
        project_id=row["project_id"],
        model=row["model"],
        variant=row["variant"],
        prompt=row["prompt"],
        parameters=json.loads(row["parameters"]),
        created_at=datetime.fromisoformat(row["created_at"]),
        preset_used=row["preset_used"],
        conditioning=json.loads(raw_conditioning) if raw_conditioning else {},
        audio_path=row["audio_path"],
        duration_seconds=row["duration_seconds"],
        sample_rate=row["sample_rate"],
        name=row["name"],
        tags=json.loads(raw_tags) if raw_tags else [],
        notes=row["notes"],
        seed=row["seed"],
    )
|
||||||
|
|
||||||
|
# Preset Methods
|
||||||
|
|
||||||
|
async def create_preset(self, preset: Preset) -> Preset:
    """Insert a preset row; parameters are stored as JSON text."""
    await self.conn.execute(
        """
        INSERT INTO presets (id, model, name, description, parameters, is_builtin, created_at)
        VALUES (?, ?, ?, ?, ?, ?, ?)
        """,
        (
            preset.id,
            preset.model,
            preset.name,
            preset.description,
            json.dumps(preset.parameters),
            preset.is_builtin,
            preset.created_at.isoformat(),
        ),
    )
    await self.conn.commit()
    return preset


async def get_preset(self, preset_id: str) -> Optional[Preset]:
    """Get a preset by ID, or None when absent."""
    async with self.conn.execute(
        "SELECT * FROM presets WHERE id = ?", (preset_id,)
    ) as cursor:
        record = await cursor.fetchone()
    return self._row_to_preset(record) if record else None


async def list_presets(
    self, model: Optional[str] = None, include_builtin: bool = True
) -> list[Preset]:
    """List presets: built-ins first, then alphabetical by name."""
    filters = []
    args: list = []

    if model:
        filters.append("model = ?")
        args.append(model)

    if not include_builtin:
        filters.append("is_builtin = FALSE")

    where_sql = " AND ".join(filters) if filters else "1=1"

    # where_sql only contains fixed fragments; values go through bindings.
    async with self.conn.execute(
        f"""
        SELECT * FROM presets
        WHERE {where_sql}
        ORDER BY is_builtin DESC, name ASC
        """,
        args,
    ) as cursor:
        records = await cursor.fetchall()
    return [self._row_to_preset(record) for record in records]


async def delete_preset(self, preset_id: str) -> bool:
    """Delete a preset; built-ins are protected by the WHERE clause."""
    cursor = await self.conn.execute(
        "DELETE FROM presets WHERE id = ? AND is_builtin = FALSE",
        (preset_id,),
    )
    await self.conn.commit()
    return cursor.rowcount > 0
|
||||||
|
|
||||||
|
def _row_to_preset(self, row: aiosqlite.Row) -> Preset:
    """Convert database row to Preset object, decoding the JSON parameters."""
    return Preset(
        id=row["id"],
        model=row["model"],
        name=row["name"],
        parameters=json.loads(row["parameters"]),
        description=row["description"] or "",
        created_at=datetime.fromisoformat(row["created_at"]),
        # SQLite stores booleans as 0/1 integers; normalize to bool.
        is_builtin=bool(row["is_builtin"]),
    )
|
||||||
|
|
||||||
|
# Utility Methods
|
||||||
|
|
||||||
|
async def get_stats(self) -> dict[str, Any]:
    """Get database statistics: row counts plus a per-model breakdown."""
    stats: dict[str, Any] = {}

    # Table names come from a fixed tuple, so the f-string query is safe.
    for table in ("projects", "generations", "presets"):
        async with self.conn.execute(f"SELECT COUNT(*) FROM {table}") as cursor:
            row = await cursor.fetchone()
            stats[table] = row[0] if row else 0

    async with self.conn.execute(
        "SELECT model, COUNT(*) as count FROM generations GROUP BY model"
    ) as cursor:
        rows = await cursor.fetchall()
        stats["generations_by_model"] = {row["model"]: row["count"] for row in rows}

    return stats
|
||||||
5
src/ui/__init__.py
Normal file
5
src/ui/__init__.py
Normal file
@@ -0,0 +1,5 @@
|
|||||||
|
"""Gradio UI for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.ui.app import create_app
|
||||||
|
|
||||||
|
__all__ = ["create_app"]
|
||||||
355
src/ui/app.py
Normal file
355
src/ui/app.py
Normal file
@@ -0,0 +1,355 @@
|
|||||||
|
"""Main Gradio application for AudioCraft Studio."""
|
||||||
|
|
||||||
|
import asyncio
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Optional
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
from src.ui.theme import create_audiocraft_theme, get_custom_css
|
||||||
|
from src.ui.state import UIState, DEFAULT_PRESETS, PROMPT_SUGGESTIONS
|
||||||
|
from src.ui.components.vram_monitor import create_vram_monitor
|
||||||
|
from src.ui.tabs import (
|
||||||
|
create_dashboard_tab,
|
||||||
|
create_musicgen_tab,
|
||||||
|
create_audiogen_tab,
|
||||||
|
create_magnet_tab,
|
||||||
|
create_style_tab,
|
||||||
|
create_jasco_tab,
|
||||||
|
)
|
||||||
|
from src.ui.pages import create_projects_page, create_settings_page
|
||||||
|
|
||||||
|
from config.settings import get_settings
|
||||||
|
|
||||||
|
|
||||||
|
class AudioCraftApp:
|
||||||
|
"""Main AudioCraft Studio Gradio application."""
|
||||||
|
|
||||||
|
def __init__(
    self,
    generation_service: Any = None,
    batch_processor: Any = None,
    project_service: Any = None,
    gpu_manager: Any = None,
    model_registry: Any = None,
):
    """Wire the UI up against its backing services.

    Args:
        generation_service: Service for handling generations
        batch_processor: Service for batch/queue processing
        project_service: Service for project management
        gpu_manager: GPU memory manager
        model_registry: Model registry for loading/unloading

    All services are optional; when one is missing, the corresponding UI
    feature degrades to a stub/default response.
    """
    self.settings = get_settings()

    # Backing services (any of these may be None).
    self.generation_service = generation_service
    self.batch_processor = batch_processor
    self.project_service = project_service
    self.gpu_manager = gpu_manager
    self.model_registry = model_registry

    # UI-local state; the Blocks app is assembled lazily by build().
    self.ui_state = UIState()
    self.app: Optional[gr.Blocks] = None
|
||||||
|
|
||||||
|
def _get_queue_status(self) -> dict[str, Any]:
|
||||||
|
"""Get current queue status."""
|
||||||
|
if self.batch_processor:
|
||||||
|
return {
|
||||||
|
"queue_size": len(self.batch_processor.queue),
|
||||||
|
"active_jobs": self.batch_processor.active_count,
|
||||||
|
"completed_today": self.batch_processor.completed_count,
|
||||||
|
}
|
||||||
|
return {"queue_size": 0, "active_jobs": 0, "completed_today": 0}
|
||||||
|
|
||||||
|
def _get_recent_generations(self, limit: int = 5) -> list[dict[str, Any]]:
|
||||||
|
"""Get recent generations."""
|
||||||
|
if self.project_service:
|
||||||
|
try:
|
||||||
|
return asyncio.run(self.project_service.get_recent_generations(limit))
|
||||||
|
except Exception:
|
||||||
|
pass
|
||||||
|
return []
|
||||||
|
|
||||||
|
def _get_gpu_status(self) -> dict[str, Any]:
|
||||||
|
"""Get GPU memory status."""
|
||||||
|
if self.gpu_manager:
|
||||||
|
return {
|
||||||
|
"used_gb": self.gpu_manager.get_used_memory() / 1024**3,
|
||||||
|
"total_gb": self.gpu_manager.total_memory / 1024**3,
|
||||||
|
"utilization_percent": self.gpu_manager.get_utilization(),
|
||||||
|
"available_gb": self.gpu_manager.get_available_memory() / 1024**3,
|
||||||
|
}
|
||||||
|
return {"used_gb": 0, "total_gb": 24, "utilization_percent": 0, "available_gb": 24}
|
||||||
|
|
||||||
|
async def _generate(self, **kwargs) -> tuple[Any, Any]:
|
||||||
|
"""Generate audio using the generation service."""
|
||||||
|
if self.generation_service:
|
||||||
|
return await self.generation_service.generate(**kwargs)
|
||||||
|
raise RuntimeError("Generation service not configured")
|
||||||
|
|
||||||
|
def _add_to_queue(self, **kwargs) -> Any:
|
||||||
|
"""Add generation job to queue."""
|
||||||
|
if self.batch_processor:
|
||||||
|
return self.batch_processor.add_job(**kwargs)
|
||||||
|
raise RuntimeError("Batch processor not configured")
|
||||||
|
|
||||||
|
def _get_projects(self) -> list[dict]:
|
||||||
|
"""Get all projects."""
|
||||||
|
if self.project_service:
|
||||||
|
try:
|
||||||
|
return asyncio.run(self.project_service.list_projects())
|
||||||
|
except Exception:
|
||||||
|
pass
|
||||||
|
return []
|
||||||
|
|
||||||
|
def _get_generations(self, project_id: str, limit: int, offset: int) -> list[dict]:
|
||||||
|
"""Get generations for a project."""
|
||||||
|
if self.project_service:
|
||||||
|
try:
|
||||||
|
return asyncio.run(
|
||||||
|
self.project_service.list_generations(project_id, limit, offset)
|
||||||
|
)
|
||||||
|
except Exception:
|
||||||
|
pass
|
||||||
|
return []
|
||||||
|
|
||||||
|
def _delete_generation(self, generation_id: str) -> bool:
|
||||||
|
"""Delete a generation."""
|
||||||
|
if self.project_service:
|
||||||
|
try:
|
||||||
|
asyncio.run(self.project_service.delete_generation(generation_id))
|
||||||
|
return True
|
||||||
|
except Exception:
|
||||||
|
pass
|
||||||
|
return False
|
||||||
|
|
||||||
|
def _export_project(self, project_id: str) -> str:
|
||||||
|
"""Export project as ZIP."""
|
||||||
|
if self.project_service:
|
||||||
|
return asyncio.run(self.project_service.export_project_zip(project_id))
|
||||||
|
raise RuntimeError("Project service not configured")
|
||||||
|
|
||||||
|
def _create_project(self, name: str, description: str) -> dict:
|
||||||
|
"""Create a new project."""
|
||||||
|
if self.project_service:
|
||||||
|
return asyncio.run(self.project_service.create_project(name, description))
|
||||||
|
raise RuntimeError("Project service not configured")
|
||||||
|
|
||||||
|
def _get_app_settings(self) -> dict:
|
||||||
|
"""Get application settings."""
|
||||||
|
return {
|
||||||
|
"output_dir": str(self.settings.output_dir),
|
||||||
|
"default_format": self.settings.default_format,
|
||||||
|
"sample_rate": self.settings.sample_rate,
|
||||||
|
"normalize_audio": self.settings.normalize_audio,
|
||||||
|
"theme_mode": "Dark",
|
||||||
|
"show_advanced": False,
|
||||||
|
"auto_play": True,
|
||||||
|
"comfyui_reserve_gb": self.settings.comfyui_reserve_gb,
|
||||||
|
"idle_timeout_minutes": self.settings.idle_unload_minutes,
|
||||||
|
"max_loaded_models": self.settings.max_loaded_models,
|
||||||
|
"musicgen_variant": "medium",
|
||||||
|
"musicgen_duration": 10,
|
||||||
|
"audiogen_duration": 5,
|
||||||
|
"magnet_variant": "medium",
|
||||||
|
"magnet_decoding_steps": 20,
|
||||||
|
"api_enabled": self.settings.api_enabled,
|
||||||
|
"api_port": self.settings.api_port,
|
||||||
|
"rate_limit": self.settings.api_rate_limit,
|
||||||
|
"max_batch_size": self.settings.max_batch_size,
|
||||||
|
"max_queue_size": self.settings.max_queue_size,
|
||||||
|
"max_workers": 1,
|
||||||
|
"priority_queue": False,
|
||||||
|
}
|
||||||
|
|
||||||
|
def _update_app_settings(self, settings: dict) -> bool:
|
||||||
|
"""Update application settings."""
|
||||||
|
# In a real implementation, this would persist settings
|
||||||
|
# For now, just return success
|
||||||
|
return True
|
||||||
|
|
||||||
|
def _clear_cache(self) -> bool:
|
||||||
|
"""Clear model cache."""
|
||||||
|
if self.model_registry:
|
||||||
|
try:
|
||||||
|
self.model_registry.clear_cache()
|
||||||
|
return True
|
||||||
|
except Exception:
|
||||||
|
pass
|
||||||
|
return False
|
||||||
|
|
||||||
|
def _unload_all_models(self) -> bool:
|
||||||
|
"""Unload all models from memory."""
|
||||||
|
if self.model_registry:
|
||||||
|
try:
|
||||||
|
asyncio.run(self.model_registry.unload_all())
|
||||||
|
return True
|
||||||
|
except Exception:
|
||||||
|
pass
|
||||||
|
return False
|
||||||
|
|
||||||
|
def build(self) -> gr.Blocks:
    """Assemble the complete Gradio UI and return the Blocks app.

    Layout: a header row (title + VRAM monitor), one tab per feature
    (dashboard, the five model families, projects, settings), and a
    footer. All created sub-component dicts are stored in
    ``self.components`` and the built app in ``self.app``.
    """
    theme = create_audiocraft_theme()
    css = get_custom_css()

    with gr.Blocks(
        theme=theme,
        css=css,
        title="AudioCraft Studio",
        analytics_enabled=False,  # no usage telemetry
    ) as app:
        # Header with VRAM monitor
        with gr.Row():
            with gr.Column(scale=4):
                gr.Markdown("# AudioCraft Studio")
            with gr.Column(scale=1):
                # NOTE(review): create_vram_monitor (in
                # src/ui/components/vram_monitor.py) is defined with
                # parameters (get_gpu_status, get_loaded_models,
                # unload_model, load_model); the keywords used here
                # (get_status_fn, update_interval) do not match that
                # signature — confirm which helper is intended.
                vram_monitor = create_vram_monitor(
                    get_status_fn=self._get_gpu_status,
                    update_interval=5,
                )

        # Main tabs
        with gr.Tabs() as main_tabs:  # handle currently unused
            # Dashboard
            with gr.TabItem("Dashboard", id="dashboard"):
                dashboard = create_dashboard_tab(
                    get_queue_status=self._get_queue_status,
                    get_recent_generations=self._get_recent_generations,
                    get_gpu_status=self._get_gpu_status,
                )

            # Model tabs — each receives the same generate / enqueue hooks.
            with gr.TabItem("MusicGen", id="musicgen"):
                musicgen = create_musicgen_tab(
                    generate_fn=self._generate,
                    add_to_queue_fn=self._add_to_queue,
                )

            with gr.TabItem("AudioGen", id="audiogen"):
                audiogen = create_audiogen_tab(
                    generate_fn=self._generate,
                    add_to_queue_fn=self._add_to_queue,
                )

            with gr.TabItem("MAGNeT", id="magnet"):
                magnet = create_magnet_tab(
                    generate_fn=self._generate,
                    add_to_queue_fn=self._add_to_queue,
                )

            with gr.TabItem("Style", id="style"):
                style = create_style_tab(
                    generate_fn=self._generate,
                    add_to_queue_fn=self._add_to_queue,
                )

            with gr.TabItem("JASCO", id="jasco"):
                jasco = create_jasco_tab(
                    generate_fn=self._generate,
                    add_to_queue_fn=self._add_to_queue,
                )

            # Projects
            with gr.TabItem("Projects", id="projects"):
                projects = create_projects_page(
                    get_projects=self._get_projects,
                    get_generations=self._get_generations,
                    delete_generation=self._delete_generation,
                    export_project=self._export_project,
                    create_project=self._create_project,
                )

            # Settings
            with gr.TabItem("Settings", id="settings"):
                settings = create_settings_page(
                    get_settings=self._get_app_settings,
                    update_settings=self._update_app_settings,
                    get_gpu_info=self._get_gpu_status,
                    clear_cache=self._clear_cache,
                    unload_all_models=self._unload_all_models,
                )

        # Footer
        gr.Markdown("---")
        gr.Markdown(
            "AudioCraft Studio | "
            "[Documentation](https://github.com/facebookresearch/audiocraft) | "
            "Powered by Meta AudioCraft"
        )

        # Store component references so event wiring elsewhere can reach them.
        self.components = {
            "vram_monitor": vram_monitor,
            "dashboard": dashboard,
            "musicgen": musicgen,
            "audiogen": audiogen,
            "magnet": magnet,
            "style": style,
            "jasco": jasco,
            "projects": projects,
            "settings": settings,
        }

    self.app = app
    return app
def launch(
    self,
    server_name: Optional[str] = None,
    server_port: Optional[int] = None,
    share: bool = False,
    **kwargs,
) -> None:
    """Build the app if needed, then start the Gradio server.

    Args:
        server_name: Hostname to bind; falls back to ``settings.host``.
        server_port: Port to bind; falls back to ``settings.gradio_port``.
        share: Whether to create a public Gradio share link.
        **kwargs: Passed through to ``gr.Blocks.launch()``.
    """
    if self.app is None:
        self.build()

    # Resolve bind address, preferring explicit arguments.
    host = server_name if server_name else self.settings.host
    port = server_port if server_port else self.settings.gradio_port

    self.app.launch(
        server_name=host,
        server_port=port,
        share=share,
        show_error=True,  # surface handler exceptions in the UI
        **kwargs,
    )
def create_app(
    generation_service: Any = None,
    batch_processor: Any = None,
    project_service: Any = None,
    gpu_manager: Any = None,
    model_registry: Any = None,
) -> AudioCraftApp:
    """Factory for a fully wired (but not yet launched) AudioCraftApp.

    Args:
        generation_service: Service handling individual generations
        batch_processor: Service for batch/queue processing
        project_service: Service for project management
        gpu_manager: GPU memory manager
        model_registry: Model registry for loading/unloading

    Returns:
        Configured AudioCraftApp instance
    """
    services = dict(
        generation_service=generation_service,
        batch_processor=batch_processor,
        project_service=project_service,
        gpu_manager=gpu_manager,
        model_registry=model_registry,
    )
    return AudioCraftApp(**services)
# Standalone launch for development/testing
if __name__ == "__main__":
    # All services default to None here (see create_app), so this is a
    # UI-only smoke launch with no generation backend attached.
    app = create_app()
    app.launch()
13
src/ui/components/__init__.py
Normal file
13
src/ui/components/__init__.py
Normal file
@@ -0,0 +1,13 @@
|
|||||||
|
"""Reusable UI components for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.ui.components.vram_monitor import create_vram_monitor
|
||||||
|
from src.ui.components.audio_player import create_audio_player
|
||||||
|
from src.ui.components.preset_selector import create_preset_selector
|
||||||
|
from src.ui.components.generation_params import create_generation_params
|
||||||
|
|
||||||
|
__all__ = [
|
||||||
|
"create_vram_monitor",
|
||||||
|
"create_audio_player",
|
||||||
|
"create_preset_selector",
|
||||||
|
"create_generation_params",
|
||||||
|
]
|
||||||
178
src/ui/components/audio_player.py
Normal file
178
src/ui/components/audio_player.py
Normal file
@@ -0,0 +1,178 @@
|
|||||||
|
"""Audio player component with waveform visualization."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from pathlib import Path
|
||||||
|
from typing import Any, Optional, Callable
|
||||||
|
|
||||||
|
|
||||||
|
def create_audio_player(
    label: str = "Generated Audio",
    show_waveform: bool = True,
    show_download: bool = True,
    show_info: bool = True,
) -> dict[str, Any]:
    """Create audio player component with optional waveform.

    Components disabled by the flags are returned as None, so callers
    must check for presence before wiring events.

    Args:
        label: Label for the audio component
        show_waveform: Show waveform image
        show_download: Show download buttons
        show_info: Show audio info (duration, sample rate, seed)

    Returns:
        Dictionary with component references (values may be None)
    """

    with gr.Group():
        # Audio player
        audio_output = gr.Audio(
            label=label,
            type="filepath",  # handlers exchange file paths, not arrays
            interactive=False,
            show_download_button=show_download,
        )

        # Waveform visualization — hidden until an image is supplied
        # (e.g. by update_audio_player).
        if show_waveform:
            waveform_image = gr.Image(
                label="Waveform",
                type="filepath",
                interactive=False,
                height=100,
                visible=False,
            )
        else:
            waveform_image = None

        # Audio info row (duration / sample rate / seed)
        if show_info:
            with gr.Row():
                duration_text = gr.Textbox(
                    label="Duration",
                    value="",
                    interactive=False,
                    scale=1,
                )
                sample_rate_text = gr.Textbox(
                    label="Sample Rate",
                    value="",
                    interactive=False,
                    scale=1,
                )
                seed_text = gr.Textbox(
                    label="Seed",
                    value="",
                    interactive=False,
                    scale=1,
                )
        else:
            duration_text = None
            sample_rate_text = None
            seed_text = None

        # Download buttons — no click handlers attached here; callers
        # wire the returned buttons themselves.
        if show_download:
            with gr.Row():
                download_wav = gr.Button("Download WAV", size="sm")
                download_mp3 = gr.Button("Download MP3", size="sm")
                download_flac = gr.Button("Download FLAC", size="sm")
        else:
            download_wav = download_mp3 = download_flac = None

    return {
        "audio": audio_output,
        "waveform": waveform_image,
        "duration": duration_text,
        "sample_rate": sample_rate_text,
        "seed": seed_text,
        "download_wav": download_wav,
        "download_mp3": download_mp3,
        "download_flac": download_flac,
    }
def update_audio_player(
    audio_path: Optional[str],
    duration: Optional[float] = None,
    sample_rate: Optional[int] = None,
    seed: Optional[int] = None,
    waveform_path: Optional[str] = None,
) -> tuple:
    """Build the update tuple for the audio-player components.

    Args:
        audio_path: Path to audio file (or None to clear)
        duration: Audio duration in seconds
        sample_rate: Audio sample rate in Hz
        seed: Generation seed
        waveform_path: Path to waveform image, if one was rendered

    Returns:
        Tuple of (audio, waveform update, duration text, sample-rate
        text, seed text), matching the component order produced by
        ``create_audio_player``.
    """
    # Use explicit None checks so legitimate zero values are still
    # displayed; the previous truthiness tests (`if duration`) hid a
    # 0.0 duration and a 0 sample rate (seed already did this right).
    duration_str = f"{duration:.2f}s" if duration is not None else ""
    sample_rate_str = f"{sample_rate} Hz" if sample_rate is not None else ""
    seed_str = str(seed) if seed is not None else ""

    # The waveform image is only made visible when a path was provided.
    waveform_update = gr.update(value=waveform_path, visible=waveform_path is not None)

    return (
        audio_path,
        waveform_update,
        duration_str,
        sample_rate_str,
        seed_str,
    )
def create_generation_output() -> dict[str, Any]:
    """Create the output section shared by the generation tabs.

    Combines a status line, a progress slider (hidden until used), the
    audio player, a collapsible JSON view of the generation parameters,
    and action buttons. No event handlers are attached here.

    Returns:
        Dictionary with component references
    """
    with gr.Group():
        gr.Markdown("### Output")

        # Status/progress
        with gr.Row():
            status_text = gr.Markdown("Ready to generate")
            progress_bar = gr.Slider(
                minimum=0,
                maximum=100,
                value=0,
                label="Progress",
                interactive=False,
                visible=False,  # revealed by generation handlers
            )

        # Audio player (all optional sub-components enabled)
        player = create_audio_player(
            label="Generated Audio",
            show_waveform=True,
            show_download=True,
            show_info=True,
        )

        # Generation metadata, collapsed by default
        with gr.Accordion("Generation Details", open=False):
            generation_info = gr.JSON(
                label="Parameters",
                value={},
            )

        # Actions — callers wire these buttons themselves
        with gr.Row():
            save_btn = gr.Button("Save to Project", variant="secondary")
            regenerate_btn = gr.Button("Regenerate", variant="secondary")
            add_queue_btn = gr.Button("Add to Queue", variant="secondary")

    return {
        "status": status_text,
        "progress": progress_bar,
        "player": player,
        "info": generation_info,
        "save_btn": save_btn,
        "regenerate_btn": regenerate_btn,
        "add_queue_btn": add_queue_btn,
    }
199
src/ui/components/generation_params.py
Normal file
199
src/ui/components/generation_params.py
Normal file
@@ -0,0 +1,199 @@
|
|||||||
|
"""Generation parameters component."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
|
||||||
|
def create_generation_params(
    model_id: str,
    show_advanced: bool = False,
    max_duration: float = 30.0,
) -> dict[str, Any]:
    """Create the shared generation-parameter panel for a model tab.

    Args:
        model_id: Model family used to pick per-model defaults
        show_advanced: Whether the advanced accordion starts open
        max_duration: Upper bound of the duration slider

    Returns:
        Dictionary with component references
    """
    # Per-model default sampling parameters. Unknown model ids fall
    # back to the musicgen defaults.
    defaults = {
        "musicgen": {"duration": 10, "temperature": 1.0, "top_k": 250, "top_p": 0.0, "cfg_coef": 3.0},
        "audiogen": {"duration": 5, "temperature": 1.0, "top_k": 250, "top_p": 0.0, "cfg_coef": 3.0},
        "magnet": {"duration": 10, "temperature": 3.0, "top_k": 0, "top_p": 0.9, "cfg_coef": 3.0},
        "musicgen-style": {"duration": 10, "temperature": 1.0, "top_k": 250, "top_p": 0.0, "cfg_coef": 3.0},
        "jasco": {"duration": 10, "temperature": 1.0, "top_k": 250, "top_p": 0.0, "cfg_coef": 3.0},
    }

    d = defaults.get(model_id, defaults["musicgen"])

    with gr.Group():
        # Basic parameters (always visible)
        duration_slider = gr.Slider(
            minimum=1,
            maximum=max_duration,
            value=d["duration"],
            step=1,
            label="Duration (seconds)",
            info="Length of audio to generate",
        )

        # Advanced parameters (expandable)
        with gr.Accordion("Advanced Parameters", open=show_advanced):
            with gr.Row():
                temperature_slider = gr.Slider(
                    minimum=0.0,
                    maximum=2.0,
                    value=d["temperature"],
                    step=0.05,
                    label="Temperature",
                    info="Higher = more random, lower = more deterministic",
                )
                cfg_slider = gr.Slider(
                    minimum=1.0,
                    maximum=10.0,
                    value=d["cfg_coef"],
                    step=0.5,
                    label="CFG Coefficient",
                    info="Classifier-free guidance strength",
                )

            with gr.Row():
                top_k_slider = gr.Slider(
                    minimum=0,
                    maximum=500,
                    value=d["top_k"],
                    step=10,
                    label="Top-K",
                    info="Token selection limit (0 = disabled)",
                )
                top_p_slider = gr.Slider(
                    minimum=0.0,
                    maximum=1.0,
                    value=d["top_p"],
                    step=0.05,
                    label="Top-P (Nucleus)",
                    info="Cumulative probability threshold (0 = disabled)",
                )

            with gr.Row():
                seed_input = gr.Number(
                    value=None,
                    label="Seed",
                    info="Random seed for reproducibility (leave empty for random)",
                    precision=0,  # integer seeds only
                )
                use_random_seed = gr.Checkbox(
                    value=True,
                    label="Random Seed",
                )

        # Reset button
        reset_btn = gr.Button("Reset to Defaults", size="sm", variant="secondary")

    def reset_params():
        """Reset all parameters to defaults."""
        # Order must match the outputs list of reset_btn.click below.
        return (
            d["duration"],
            d["temperature"],
            d["cfg_coef"],
            d["top_k"],
            d["top_p"],
            None,
            True,
        )

    reset_btn.click(
        fn=reset_params,
        outputs=[
            duration_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
            use_random_seed,
        ],
    )

    # Link random seed checkbox to seed input
    def toggle_seed(use_random: bool, current_seed: Optional[int]):
        # When randomizing, clear and lock the manual seed field;
        # otherwise unlock it (keeping whatever value it holds).
        if use_random:
            return gr.update(value=None, interactive=False)
        return gr.update(interactive=True)

    use_random_seed.change(
        fn=toggle_seed,
        inputs=[use_random_seed, seed_input],
        outputs=[seed_input],
    )

    return {
        "duration": duration_slider,
        "temperature": temperature_slider,
        "cfg_coef": cfg_slider,
        "top_k": top_k_slider,
        "top_p": top_p_slider,
        "seed": seed_input,
        "use_random_seed": use_random_seed,
        "reset_btn": reset_btn,
    }
def create_model_variant_selector(
    model_id: str,
    variants: list[dict[str, Any]],
    default_variant: str = "medium",
) -> dict[str, Any]:
    """Create a dropdown for choosing a model variant.

    Args:
        model_id: Model family ID (accepted for interface symmetry with
            the other component factories; not used in the layout)
        variants: Variant configs; each may carry "name"/"id",
            "vram_mb" and "description" keys
        default_variant: Variant pre-selected in the dropdown

    Returns:
        Dictionary with "dropdown", "info" and "variants" references
    """

    def describe(variant_name: str) -> str:
        """Return the description of the named variant ('' if unknown)."""
        for v in variants:
            if v.get("name", v.get("id")) == variant_name:
                return v.get("description", "")
        return ""

    # Build (label, value) choices; the label shows estimated VRAM in GB.
    # (Removed an unused `desc` local that was fetched but never shown.)
    choices = []
    for v in variants:
        name = v.get("name", v.get("id", "unknown"))
        vram = v.get("vram_mb", 0)
        label = f"{name} ({vram/1024:.1f}GB)"
        choices.append((label, name))

    with gr.Group():
        variant_dropdown = gr.Dropdown(
            label="Model Variant",
            choices=choices,
            value=default_variant,
            interactive=True,
        )

        # Show the default variant's description immediately instead of
        # starting blank (previously it only appeared after a change).
        variant_info = gr.Markdown(
            value=describe(default_variant),
            visible=True,
        )

    variant_dropdown.change(
        fn=describe,
        inputs=[variant_dropdown],
        outputs=[variant_info],
    )

    return {
        "dropdown": variant_dropdown,
        "info": variant_info,
        "variants": variants,
    }
103
src/ui/components/preset_selector.py
Normal file
103
src/ui/components/preset_selector.py
Normal file
@@ -0,0 +1,103 @@
|
|||||||
|
"""Preset selector component."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
from src.ui.state import DEFAULT_PRESETS
|
||||||
|
|
||||||
|
|
||||||
|
def create_preset_selector(
    model_id: str,
    on_preset_select: Optional[Callable[[dict], None]] = None,
) -> dict[str, Any]:
    """Create the preset dropdown for a model tab.

    Args:
        model_id: Model family ID used to look up DEFAULT_PRESETS
        on_preset_select: Callback when preset is selected.
            NOTE(review): accepted but never invoked in this function —
            confirm whether callers wire it themselves or it is dead.

    Returns:
        Dictionary with the dropdown, description, preset list and two
        helper callables ("get_preset", "on_change") for callers to wire.
    """
    presets = DEFAULT_PRESETS.get(model_id, [])

    # Create preset choices; a trailing "Custom" entry always exists.
    choices = [(p["name"], p["id"]) for p in presets]
    choices.append(("Custom", "custom"))

    def get_preset_by_id(preset_id: str) -> Optional[dict]:
        """Get preset data by ID (None if not found)."""
        for p in presets:
            if p["id"] == preset_id:
                return p
        return None

    def on_change(preset_id: str):
        """Handle preset selection change.

        Returns a visibility update (for a custom-params area) plus the
        selected preset's parameter dict ({} for "custom"/unknown ids).
        NOTE(review): exposed via the result dict but not attached to
        the dropdown here — callers must wire it.
        """
        if preset_id == "custom":
            return gr.update(visible=True), {}

        preset = get_preset_by_id(preset_id)
        if preset:
            return gr.update(visible=False), preset.get("parameters", {})

        # Unknown id: behave like "custom".
        return gr.update(visible=True), {}

    with gr.Group():
        preset_dropdown = gr.Dropdown(
            label="Preset",
            choices=choices,
            value=presets[0]["id"] if presets else "custom",
            interactive=True,
        )

        # Shows the first preset's description initially; not updated on
        # change within this function.
        preset_description = gr.Markdown(
            value=presets[0]["description"] if presets else "",
            visible=True,
        )

    return {
        "dropdown": preset_dropdown,
        "description": preset_description,
        "presets": presets,
        "get_preset": get_preset_by_id,
        "on_change": on_change,
    }
def create_preset_chips(
    model_id: str,
    on_select: Callable[[str], None],
) -> dict[str, Any]:
    """Create preset shortcuts as a row of small buttons ("chips").

    Args:
        model_id: Model family ID used to look up DEFAULT_PRESETS
        on_select: Callback when a preset chip is clicked.
            NOTE(review): accepted but no click handlers are attached
            here — callers must wire the returned buttons themselves.

    Returns:
        Dictionary with "buttons" (list of (button, preset) pairs),
        "custom_btn" and the raw "presets" list
    """
    presets = DEFAULT_PRESETS.get(model_id, [])

    with gr.Row():
        buttons = []
        for preset in presets:
            btn = gr.Button(
                preset["name"],
                size="sm",
                variant="secondary",
            )
            buttons.append((btn, preset))

        # Trailing "Custom" chip, always present.
        custom_btn = gr.Button(
            "Custom",
            size="sm",
            variant="secondary",
        )

    return {
        "buttons": buttons,
        "custom_btn": custom_btn,
        "presets": presets,
    }
151
src/ui/components/vram_monitor.py
Normal file
151
src/ui/components/vram_monitor.py
Normal file
@@ -0,0 +1,151 @@
|
|||||||
|
"""VRAM monitor component for GPU memory tracking."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
|
||||||
|
def create_vram_monitor(
    get_gpu_status: Callable[[], dict[str, Any]],
    get_loaded_models: Callable[[], list[dict[str, Any]]],
    unload_model: Callable[[str, str], bool],
    load_model: Callable[[str, str], bool],
) -> dict[str, Any]:
    """Create the GPU-memory / loaded-models monitor panel.

    Args:
        get_gpu_status: Returns a dict read here via the keys
            "used_gb", "total_gb", "utilization_percent" and "device"
        get_loaded_models: Returns dicts read via "model_id",
            "variant" and "idle_seconds"
        unload_model: Unloads (model_id, variant); returns success
        load_model: Loads (model_id, variant).
            NOTE(review): accepted but not wired to any control in this
            panel — confirm whether a "Load" button was intended.

    Returns:
        Dictionary with component references plus ``refresh_fn``.

    NOTE(review): the app builder calls this function with
    ``get_status_fn=...`` / ``update_interval=...`` keywords, which do
    not match this signature — confirm which interface is intended.
    """

    def refresh_status():
        """Recompute (vram text, utilization %, models text, status line)."""
        status = get_gpu_status()
        loaded = get_loaded_models()

        # Format VRAM bar
        used_gb = status.get("used_gb", 0)
        total_gb = status.get("total_gb", 24)
        util_pct = status.get("utilization_percent", 0)

        vram_text = f"{used_gb:.1f} / {total_gb:.1f} GB ({util_pct:.0f}%)"

        # Format loaded models list
        if loaded:
            models_text = "\n".join([
                f"• {m['model_id']}/{m['variant']} "
                f"(idle: {m['idle_seconds']:.0f}s)"
                for m in loaded
            ])
        else:
            models_text = "No models loaded"

        # Traffic-light indicator based on utilization thresholds.
        if util_pct > 90:
            status_color = "🔴"
        elif util_pct > 75:
            status_color = "🟡"
        else:
            status_color = "🟢"

        status_text = f"{status_color} GPU: {status.get('device', 'N/A')}"

        return vram_text, util_pct, models_text, status_text

    def handle_unload(model_selection: str):
        """Unload the "model_id/variant" selection; return message + refreshed display."""
        if not model_selection or "/" not in model_selection:
            return "Select a model to unload", *refresh_status()

        parts = model_selection.split("/")
        model_id, variant = parts[0], parts[1]

        success = unload_model(model_id, variant)
        if success:
            msg = f"Unloaded {model_id}/{variant}"
        else:
            msg = f"Failed to unload {model_id}/{variant}"

        return msg, *refresh_status()

    with gr.Group():
        gr.Markdown("### GPU Memory")

        status_text = gr.Markdown("🟢 GPU: Checking...")

        with gr.Row():
            vram_display = gr.Textbox(
                label="VRAM Usage",
                value="Loading...",
                interactive=False,
                scale=3,
            )
            refresh_btn = gr.Button("🔄", scale=1, min_width=50)

        # Non-interactive slider reused as a utilization bar (0-100).
        vram_slider = gr.Slider(
            minimum=0,
            maximum=100,
            value=0,
            label="",
            interactive=False,
            visible=True,
        )

        gr.Markdown("### Loaded Models")

        models_display = gr.Textbox(
            label="",
            value="No models loaded",
            interactive=False,
            lines=4,
            max_lines=6,
        )

        with gr.Row():
            model_selector = gr.Dropdown(
                label="Select Model",
                choices=[],  # filled by update_model_choices after refresh
                interactive=True,
                scale=3,
            )
            unload_btn = gr.Button("Unload", variant="secondary", scale=1)

        unload_status = gr.Markdown("")

    # Event handlers
    def update_model_choices():
        # Rebuild the dropdown from the currently loaded models.
        loaded = get_loaded_models()
        choices = [f"{m['model_id']}/{m['variant']}" for m in loaded]
        return gr.update(choices=choices, value=None)

    refresh_btn.click(
        fn=refresh_status,
        outputs=[vram_display, vram_slider, models_display, status_text],
    ).then(
        fn=update_model_choices,
        outputs=[model_selector],
    )

    unload_btn.click(
        fn=handle_unload,
        inputs=[model_selector],
        outputs=[unload_status, vram_display, vram_slider, models_display, status_text],
    ).then(
        fn=update_model_choices,
        outputs=[model_selector],
    )

    return {
        "vram_display": vram_display,
        "vram_slider": vram_slider,
        "models_display": models_display,
        "status_text": status_text,
        "model_selector": model_selector,
        "refresh_btn": refresh_btn,
        "unload_btn": unload_btn,
        "refresh_fn": refresh_status,
    }
9
src/ui/pages/__init__.py
Normal file
9
src/ui/pages/__init__.py
Normal file
@@ -0,0 +1,9 @@
|
|||||||
|
"""UI pages for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.ui.pages.projects_page import create_projects_page
|
||||||
|
from src.ui.pages.settings_page import create_settings_page
|
||||||
|
|
||||||
|
__all__ = [
|
||||||
|
"create_projects_page",
|
||||||
|
"create_settings_page",
|
||||||
|
]
|
||||||
374
src/ui/pages/projects_page.py
Normal file
374
src/ui/pages/projects_page.py
Normal file
@@ -0,0 +1,374 @@
|
|||||||
|
"""Projects page for managing generations and history."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
from datetime import datetime
|
||||||
|
|
||||||
|
|
||||||
|
def create_projects_page(
|
||||||
|
get_projects: Callable[[], list[dict]],
|
||||||
|
get_generations: Callable[[str, int, int], list[dict]],
|
||||||
|
delete_generation: Callable[[str], bool],
|
||||||
|
export_project: Callable[[str], str],
|
||||||
|
create_project: Callable[[str, str], dict],
|
||||||
|
) -> dict[str, Any]:
|
||||||
|
"""Create projects management page.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
get_projects: Function to get all projects
|
||||||
|
get_generations: Function to get generations (project_id, limit, offset)
|
||||||
|
delete_generation: Function to delete a generation
|
||||||
|
export_project: Function to export project as ZIP
|
||||||
|
create_project: Function to create new project
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Dictionary with component references
|
||||||
|
"""
|
||||||
|
|
||||||
|
with gr.Column():
|
||||||
|
gr.Markdown("# Projects")
|
||||||
|
gr.Markdown("Browse and manage your generations")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
# Left sidebar - project list
|
||||||
|
with gr.Column(scale=1):
|
||||||
|
gr.Markdown("### Projects")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
new_project_name = gr.Textbox(
|
||||||
|
placeholder="New project name...",
|
||||||
|
show_label=False,
|
||||||
|
scale=3,
|
||||||
|
)
|
||||||
|
new_project_btn = gr.Button("+", size="sm", scale=1)
|
||||||
|
|
||||||
|
project_list = gr.Dataframe(
|
||||||
|
headers=["ID", "Name", "Count"],
|
||||||
|
datatype=["str", "str", "number"],
|
||||||
|
col_count=(3, "fixed"),
|
||||||
|
interactive=False,
|
||||||
|
height=400,
|
||||||
|
)
|
||||||
|
|
||||||
|
refresh_projects_btn = gr.Button("Refresh Projects", size="sm")
|
||||||
|
|
||||||
|
# Main content - generations
|
||||||
|
with gr.Column(scale=3):
|
||||||
|
# Selected project info
|
||||||
|
selected_project_id = gr.State(value=None)
|
||||||
|
selected_project_name = gr.Markdown("### Select a project")
|
||||||
|
|
||||||
|
# Filters
|
||||||
|
with gr.Row():
|
||||||
|
model_filter = gr.Dropdown(
|
||||||
|
label="Model",
|
||||||
|
choices=[
|
||||||
|
("All", "all"),
|
||||||
|
("MusicGen", "musicgen"),
|
||||||
|
("AudioGen", "audiogen"),
|
||||||
|
("MAGNeT", "magnet"),
|
||||||
|
("Style", "musicgen-style"),
|
||||||
|
("JASCO", "jasco"),
|
||||||
|
],
|
||||||
|
value="all",
|
||||||
|
scale=1,
|
||||||
|
)
|
||||||
|
sort_by = gr.Dropdown(
|
||||||
|
label="Sort By",
|
||||||
|
choices=[
|
||||||
|
("Newest First", "newest"),
|
||||||
|
("Oldest First", "oldest"),
|
||||||
|
("Duration (Long)", "duration_desc"),
|
||||||
|
("Duration (Short)", "duration_asc"),
|
||||||
|
],
|
||||||
|
value="newest",
|
||||||
|
scale=1,
|
||||||
|
)
|
||||||
|
search_input = gr.Textbox(
|
||||||
|
label="Search Prompts",
|
||||||
|
placeholder="Search...",
|
||||||
|
scale=2,
|
||||||
|
)
|
||||||
|
|
||||||
|
# Generations grid
|
||||||
|
generations_gallery = gr.Gallery(
|
||||||
|
label="Generations",
|
||||||
|
columns=3,
|
||||||
|
rows=3,
|
||||||
|
height=400,
|
||||||
|
object_fit="contain",
|
||||||
|
show_label=False,
|
||||||
|
)
|
||||||
|
|
||||||
|
# Pagination
|
||||||
|
with gr.Row():
|
||||||
|
prev_page_btn = gr.Button("← Previous", size="sm")
|
||||||
|
page_info = gr.Markdown("Page 1 of 1")
|
||||||
|
next_page_btn = gr.Button("Next →", size="sm")
|
||||||
|
|
||||||
|
current_page = gr.State(value=1)
|
||||||
|
total_pages = gr.State(value=1)
|
||||||
|
|
||||||
|
# Selected generation details
|
||||||
|
gr.Markdown("---")
|
||||||
|
gr.Markdown("### Generation Details")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
with gr.Column(scale=2):
|
||||||
|
selected_audio = gr.Audio(
|
||||||
|
label="Audio",
|
||||||
|
interactive=False,
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Column(scale=2):
|
||||||
|
selected_prompt = gr.Textbox(
|
||||||
|
label="Prompt",
|
||||||
|
interactive=False,
|
||||||
|
lines=2,
|
||||||
|
)
|
||||||
|
with gr.Row():
|
||||||
|
selected_model = gr.Textbox(
|
||||||
|
label="Model",
|
||||||
|
interactive=False,
|
||||||
|
)
|
||||||
|
selected_duration = gr.Textbox(
|
||||||
|
label="Duration",
|
||||||
|
interactive=False,
|
||||||
|
)
|
||||||
|
with gr.Row():
|
||||||
|
selected_seed = gr.Textbox(
|
||||||
|
label="Seed",
|
||||||
|
interactive=False,
|
||||||
|
)
|
||||||
|
selected_date = gr.Textbox(
|
||||||
|
label="Created",
|
||||||
|
interactive=False,
|
||||||
|
)
|
||||||
|
|
||||||
|
# Action buttons
|
||||||
|
with gr.Row():
|
||||||
|
regenerate_btn = gr.Button("Regenerate", variant="secondary")
|
||||||
|
download_btn = gr.Button("Download", variant="secondary")
|
||||||
|
delete_btn = gr.Button("Delete", variant="stop")
|
||||||
|
export_project_btn = gr.Button("Export Project", variant="secondary")
|
||||||
|
|
||||||
|
# Event handlers
|
||||||
|
|
||||||
|
def load_projects():
    """Fetch every project and shape it into dataframe rows.

    Returns:
        List of ``[id, name, generation_count]`` rows for the project list
        dataframe. Missing fields fall back to "", "Untitled", and 0.
    """
    return [
        [proj.get("id", ""), proj.get("name", "Untitled"), proj.get("generation_count", 0)]
        for proj in get_projects()
    ]
|
||||||
|
|
||||||
|
def on_project_select(evt: gr.SelectData, df):
    """Resolve a dataframe click to the selected project.

    Args:
        evt: Gradio select event; ``evt.index`` is (row, col) for dataframes.
        df: Current rows of the project dataframe ([id, name, count]).

    Returns:
        Tuple of (project_id or None, markdown heading for the project name).
    """
    unselected = (None, "### Select a project")
    if evt.index is None:
        return unselected
    row_idx = evt.index[0]
    if row_idx >= len(df):
        # Stale event pointing past the current data — treat as no selection.
        return unselected
    return df[row_idx][0], f"### {df[row_idx][1]}"
|
||||||
|
|
||||||
|
def load_generations(project_id, page, model, sort, search):
    """Load one page of generations for the selected project.

    Args:
        project_id: Selected project id, or falsy when nothing is selected.
        page: 1-based page number.
        model: Model filter value ("all" disables filtering).
        sort: Sort key ("oldest", "duration_desc", "duration_asc",
            anything else keeps the DB order).
        search: Case-insensitive substring match against the prompt.

    Returns:
        Tuple of (gallery items, page-info markdown, current page, total pages).
    """
    if not project_id:
        return [], "Page 0 of 0", 1, 1

    limit = 9  # 3x3 grid
    offset = (page - 1) * limit

    # Fetch one extra record so we can tell whether another page exists.
    gens = get_generations(project_id, limit + 1, offset)

    # Check if there are more pages
    has_more = len(gens) > limit
    gens = gens[:limit]

    # Filter by model if needed
    # NOTE(review): model/search filters run AFTER the page slice, so a
    # filtered page can show fewer than 9 items even when more matches
    # exist on later pages; filtering should ideally happen in the query.
    if model != "all":
        gens = [g for g in gens if g.get("model") == model]

    # Filter by search
    if search:
        search_lower = search.lower()
        gens = [g for g in gens if search_lower in g.get("prompt", "").lower()]

    # Sort (only within the current page, for the same reason as above)
    if sort == "oldest":
        gens = sorted(gens, key=lambda x: x.get("created_at", ""))
    elif sort == "duration_desc":
        gens = sorted(gens, key=lambda x: x.get("duration_seconds", 0), reverse=True)
    elif sort == "duration_asc":
        gens = sorted(gens, key=lambda x: x.get("duration_seconds", 0))
    # Default is newest first (presumably already sorted from DB — confirm)

    # Build gallery items (using waveform images if available)
    gallery_items = []
    for g in gens:
        waveform = g.get("waveform_path")
        if waveform:
            gallery_items.append((waveform, g.get("prompt", "")[:50]))
        else:
            # Placeholder: no waveform image rendered for this generation.
            gallery_items.append((None, g.get("prompt", "")[:50]))

    # Calculate total pages (estimate)
    # NOTE(review): this is only a lower bound — "+1" merely acknowledges
    # that at least one more record exists when has_more is True.
    total = offset + len(gens) + (1 if has_more else 0)
    total_p = max(1, (total + limit - 1) // limit)

    return gallery_items, f"Page {page} of {total_p}", page, total_p
|
||||||
|
|
||||||
|
def on_generation_select(evt: gr.SelectData, project_id):
    """Populate the detail panel when a gallery item is clicked.

    Args:
        evt: Gradio select event; for a gallery, ``evt.index`` is the flat
            item index within the displayed grid.
        project_id: Currently selected project id.

    Returns:
        Tuple of (audio_path, prompt, model, duration text, seed text,
        created-at text) — all empty/None when nothing can be resolved.
    """
    if evt.index is None or not project_id:
        return None, "", "", "", "", ""

    # Get generations again to find the selected one.
    # NOTE(review): this re-fetches up to 100 records in DB order, but the
    # gallery may have been paginated, filtered, and re-sorted by
    # load_generations — so evt.index can point at a different record than
    # the one displayed. Also silently caps lookup at 100 generations.
    gens = get_generations(project_id, 100, 0)
    if evt.index < len(gens):
        gen = gens[evt.index]
        return (
            gen.get("audio_path"),
            gen.get("prompt", ""),
            gen.get("model", ""),
            f"{gen.get('duration_seconds', 0):.1f}s",
            str(gen.get("seed", "")),
            # Trim ISO timestamp to "YYYY-MM-DDTHH:MM:SS" — assumes ISO-8601
            # strings from the DB; TODO confirm.
            gen.get("created_at", "")[:19] if gen.get("created_at") else "",
        )

    return None, "", "", "", "", ""
|
||||||
|
|
||||||
|
def do_create_project(name):
    """Create a new project and refresh the project list.

    Args:
        name: Project name from the textbox; surrounding whitespace is
            stripped before creation.

    Returns:
        Tuple of (project-list rows or gr.update() no-op, status message).
    """
    if not name.strip():
        # Leave the dataframe untouched when the name box is blank.
        return gr.update(), "Please enter a project name"

    # The created record itself is not needed here; the refreshed list is
    # re-read from the backend below. (Previously bound to an unused local.)
    create_project(name.strip(), "")
    projects_data = load_projects()
    return projects_data, f"Created project: {name}"
|
||||||
|
|
||||||
|
def do_delete_generation(project_id, audio_path):
    """Delete the generation whose audio file matches the current selection.

    The gallery only exposes the audio path, so the record is located by
    scanning the project's generations for a matching ``audio_path``.

    Returns:
        Status message string for the UI.
    """
    if not audio_path:
        return "No generation selected"

    for record in get_generations(project_id, 100, 0):
        if record.get("audio_path") != audio_path:
            continue
        deleted = delete_generation(record.get("id"))
        return "Generation deleted" if deleted else "Failed to delete"

    return "Generation not found"
|
||||||
|
|
||||||
|
def do_export_project(project_id):
    """Bundle the selected project into a ZIP archive.

    Returns:
        Status message containing the archive path, or an error description.
    """
    if not project_id:
        return "No project selected"

    try:
        return f"Exported to: {export_project(project_id)}"
    except Exception as exc:
        # Surface the failure in the UI rather than crashing the callback.
        return f"Export failed: {exc}"
|
||||||
|
|
||||||
|
# Wire up events
|
||||||
|
|
||||||
|
refresh_projects_btn.click(
|
||||||
|
fn=load_projects,
|
||||||
|
outputs=[project_list],
|
||||||
|
)
|
||||||
|
|
||||||
|
project_list.select(
|
||||||
|
fn=on_project_select,
|
||||||
|
inputs=[project_list],
|
||||||
|
outputs=[selected_project_id, selected_project_name],
|
||||||
|
).then(
|
||||||
|
fn=load_generations,
|
||||||
|
inputs=[selected_project_id, current_page, model_filter, sort_by, search_input],
|
||||||
|
outputs=[generations_gallery, page_info, current_page, total_pages],
|
||||||
|
)
|
||||||
|
|
||||||
|
# Filter changes reload generations
|
||||||
|
for component in [model_filter, sort_by, search_input]:
|
||||||
|
component.change(
|
||||||
|
fn=load_generations,
|
||||||
|
inputs=[selected_project_id, current_page, model_filter, sort_by, search_input],
|
||||||
|
outputs=[generations_gallery, page_info, current_page, total_pages],
|
||||||
|
)
|
||||||
|
|
||||||
|
# Pagination
|
||||||
|
def go_prev(page, total):
    """Step one page back, clamped at page 1 (total unused, kept for symmetry)."""
    return page - 1 if page > 1 else 1
|
||||||
|
|
||||||
|
def go_next(page, total):
    """Step one page forward, clamped at the last page."""
    return page + 1 if page < total else total
|
||||||
|
|
||||||
|
prev_page_btn.click(
|
||||||
|
fn=go_prev,
|
||||||
|
inputs=[current_page, total_pages],
|
||||||
|
outputs=[current_page],
|
||||||
|
).then(
|
||||||
|
fn=load_generations,
|
||||||
|
inputs=[selected_project_id, current_page, model_filter, sort_by, search_input],
|
||||||
|
outputs=[generations_gallery, page_info, current_page, total_pages],
|
||||||
|
)
|
||||||
|
|
||||||
|
next_page_btn.click(
|
||||||
|
fn=go_next,
|
||||||
|
inputs=[current_page, total_pages],
|
||||||
|
outputs=[current_page],
|
||||||
|
).then(
|
||||||
|
fn=load_generations,
|
||||||
|
inputs=[selected_project_id, current_page, model_filter, sort_by, search_input],
|
||||||
|
outputs=[generations_gallery, page_info, current_page, total_pages],
|
||||||
|
)
|
||||||
|
|
||||||
|
# Generation selection
|
||||||
|
generations_gallery.select(
|
||||||
|
fn=on_generation_select,
|
||||||
|
inputs=[selected_project_id],
|
||||||
|
outputs=[selected_audio, selected_prompt, selected_model, selected_duration, selected_seed, selected_date],
|
||||||
|
)
|
||||||
|
|
||||||
|
# Actions
|
||||||
|
new_project_btn.click(
|
||||||
|
fn=do_create_project,
|
||||||
|
inputs=[new_project_name],
|
||||||
|
outputs=[project_list, selected_project_name],
|
||||||
|
)
|
||||||
|
|
||||||
|
delete_btn.click(
|
||||||
|
fn=do_delete_generation,
|
||||||
|
inputs=[selected_project_id, selected_audio],
|
||||||
|
outputs=[selected_project_name],
|
||||||
|
).then(
|
||||||
|
fn=load_generations,
|
||||||
|
inputs=[selected_project_id, current_page, model_filter, sort_by, search_input],
|
||||||
|
outputs=[generations_gallery, page_info, current_page, total_pages],
|
||||||
|
)
|
||||||
|
|
||||||
|
export_project_btn.click(
|
||||||
|
fn=do_export_project,
|
||||||
|
inputs=[selected_project_id],
|
||||||
|
outputs=[selected_project_name],
|
||||||
|
)
|
||||||
|
|
||||||
|
return {
|
||||||
|
"project_list": project_list,
|
||||||
|
"generations_gallery": generations_gallery,
|
||||||
|
"selected_audio": selected_audio,
|
||||||
|
"selected_project_id": selected_project_id,
|
||||||
|
"refresh_fn": load_projects,
|
||||||
|
}
|
||||||
397
src/ui/pages/settings_page.py
Normal file
397
src/ui/pages/settings_page.py
Normal file
@@ -0,0 +1,397 @@
|
|||||||
|
"""Settings page for application configuration."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
|
||||||
|
def create_settings_page(
|
||||||
|
get_settings: Callable[[], dict],
|
||||||
|
update_settings: Callable[[dict], bool],
|
||||||
|
get_gpu_info: Callable[[], dict],
|
||||||
|
clear_cache: Callable[[], bool],
|
||||||
|
unload_all_models: Callable[[], bool],
|
||||||
|
) -> dict[str, Any]:
|
||||||
|
"""Create settings management page.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
get_settings: Function to get current settings
|
||||||
|
update_settings: Function to update settings
|
||||||
|
get_gpu_info: Function to get GPU information
|
||||||
|
clear_cache: Function to clear model cache
|
||||||
|
unload_all_models: Function to unload all models
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Dictionary with component references
|
||||||
|
"""
|
||||||
|
|
||||||
|
with gr.Column():
|
||||||
|
gr.Markdown("# Settings")
|
||||||
|
gr.Markdown("Configure AudioCraft Studio")
|
||||||
|
|
||||||
|
with gr.Tabs():
|
||||||
|
# General Settings Tab
|
||||||
|
with gr.TabItem("General"):
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### Output Settings")
|
||||||
|
|
||||||
|
output_dir = gr.Textbox(
|
||||||
|
label="Output Directory",
|
||||||
|
placeholder="/path/to/output",
|
||||||
|
info="Where generated audio files are saved",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
default_format = gr.Dropdown(
|
||||||
|
label="Default Audio Format",
|
||||||
|
choices=[("WAV", "wav"), ("MP3", "mp3"), ("FLAC", "flac"), ("OGG", "ogg")],
|
||||||
|
value="wav",
|
||||||
|
)
|
||||||
|
sample_rate = gr.Dropdown(
|
||||||
|
label="Sample Rate",
|
||||||
|
choices=[
|
||||||
|
("32000 Hz (AudioCraft default)", 32000),
|
||||||
|
("44100 Hz (CD quality)", 44100),
|
||||||
|
("48000 Hz (Video standard)", 48000),
|
||||||
|
],
|
||||||
|
value=32000,
|
||||||
|
)
|
||||||
|
|
||||||
|
normalize_audio = gr.Checkbox(
|
||||||
|
label="Normalize audio output",
|
||||||
|
value=True,
|
||||||
|
info="Normalize audio levels to prevent clipping",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### Interface Settings")
|
||||||
|
|
||||||
|
theme_mode = gr.Radio(
|
||||||
|
label="Theme",
|
||||||
|
choices=["Dark", "Light", "System"],
|
||||||
|
value="Dark",
|
||||||
|
)
|
||||||
|
|
||||||
|
show_advanced = gr.Checkbox(
|
||||||
|
label="Show advanced parameters by default",
|
||||||
|
value=False,
|
||||||
|
)
|
||||||
|
|
||||||
|
auto_play = gr.Checkbox(
|
||||||
|
label="Auto-play generated audio",
|
||||||
|
value=True,
|
||||||
|
)
|
||||||
|
|
||||||
|
# GPU & Memory Tab
|
||||||
|
with gr.TabItem("GPU & Memory"):
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### GPU Information")
|
||||||
|
|
||||||
|
gpu_info_display = gr.JSON(
|
||||||
|
label="GPU Status",
|
||||||
|
value={},
|
||||||
|
)
|
||||||
|
|
||||||
|
refresh_gpu_btn = gr.Button("Refresh GPU Info", size="sm")
|
||||||
|
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### Memory Management")
|
||||||
|
|
||||||
|
comfyui_reserve = gr.Slider(
|
||||||
|
minimum=0,
|
||||||
|
maximum=16,
|
||||||
|
value=10,
|
||||||
|
step=0.5,
|
||||||
|
label="ComfyUI VRAM Reserve (GB)",
|
||||||
|
info="VRAM to reserve for ComfyUI when running alongside",
|
||||||
|
)
|
||||||
|
|
||||||
|
idle_timeout = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=60,
|
||||||
|
value=15,
|
||||||
|
step=1,
|
||||||
|
label="Idle Model Timeout (minutes)",
|
||||||
|
info="Unload models after this period of inactivity",
|
||||||
|
)
|
||||||
|
|
||||||
|
max_loaded = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=5,
|
||||||
|
value=2,
|
||||||
|
step=1,
|
||||||
|
label="Maximum Loaded Models",
|
||||||
|
info="Maximum number of models to keep in memory",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### Cache Management")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
clear_cache_btn = gr.Button("Clear Model Cache", variant="secondary")
|
||||||
|
unload_models_btn = gr.Button("Unload All Models", variant="stop")
|
||||||
|
|
||||||
|
cache_status = gr.Markdown("Cache status: Ready")
|
||||||
|
|
||||||
|
# Model Defaults Tab
|
||||||
|
with gr.TabItem("Model Defaults"):
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### MusicGen Defaults")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
musicgen_variant = gr.Dropdown(
|
||||||
|
label="Default Variant",
|
||||||
|
choices=[
|
||||||
|
("Small", "small"),
|
||||||
|
("Medium", "medium"),
|
||||||
|
("Large", "large"),
|
||||||
|
("Melody", "melody"),
|
||||||
|
],
|
||||||
|
value="medium",
|
||||||
|
)
|
||||||
|
musicgen_duration = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=30,
|
||||||
|
value=10,
|
||||||
|
step=1,
|
||||||
|
label="Default Duration (s)",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### AudioGen Defaults")
|
||||||
|
|
||||||
|
audiogen_duration = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=10,
|
||||||
|
value=5,
|
||||||
|
step=1,
|
||||||
|
label="Default Duration (s)",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### MAGNeT Defaults")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
magnet_variant = gr.Dropdown(
|
||||||
|
label="Default Variant",
|
||||||
|
choices=[
|
||||||
|
("Small Music", "small"),
|
||||||
|
("Medium Music", "medium"),
|
||||||
|
("Small Audio", "audio-small"),
|
||||||
|
("Medium Audio", "audio-medium"),
|
||||||
|
],
|
||||||
|
value="medium",
|
||||||
|
)
|
||||||
|
magnet_decoding_steps = gr.Slider(
|
||||||
|
minimum=10,
|
||||||
|
maximum=100,
|
||||||
|
value=20,
|
||||||
|
step=5,
|
||||||
|
label="Decoding Steps",
|
||||||
|
)
|
||||||
|
|
||||||
|
# API Settings Tab
|
||||||
|
with gr.TabItem("API"):
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### REST API Configuration")
|
||||||
|
|
||||||
|
api_enabled = gr.Checkbox(
|
||||||
|
label="Enable REST API",
|
||||||
|
value=True,
|
||||||
|
info="Enable FastAPI endpoints for programmatic access",
|
||||||
|
)
|
||||||
|
|
||||||
|
api_port = gr.Number(
|
||||||
|
value=8000,
|
||||||
|
label="API Port",
|
||||||
|
precision=0,
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
api_key_display = gr.Textbox(
|
||||||
|
label="API Key",
|
||||||
|
value="••••••••",
|
||||||
|
interactive=False,
|
||||||
|
)
|
||||||
|
regenerate_key_btn = gr.Button("Regenerate", size="sm")
|
||||||
|
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### Rate Limiting")
|
||||||
|
|
||||||
|
rate_limit = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=100,
|
||||||
|
value=10,
|
||||||
|
step=1,
|
||||||
|
label="Requests per minute",
|
||||||
|
)
|
||||||
|
|
||||||
|
max_batch_size = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=10,
|
||||||
|
value=4,
|
||||||
|
step=1,
|
||||||
|
label="Maximum batch size",
|
||||||
|
)
|
||||||
|
|
||||||
|
# Queue Settings Tab
|
||||||
|
with gr.TabItem("Queue"):
|
||||||
|
with gr.Group():
|
||||||
|
gr.Markdown("### Batch Processing")
|
||||||
|
|
||||||
|
max_queue_size = gr.Slider(
|
||||||
|
minimum=10,
|
||||||
|
maximum=500,
|
||||||
|
value=100,
|
||||||
|
step=10,
|
||||||
|
label="Maximum Queue Size",
|
||||||
|
)
|
||||||
|
|
||||||
|
max_workers = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=4,
|
||||||
|
value=1,
|
||||||
|
step=1,
|
||||||
|
label="Concurrent Workers",
|
||||||
|
info="Number of parallel generation workers",
|
||||||
|
)
|
||||||
|
|
||||||
|
priority_queue = gr.Checkbox(
|
||||||
|
label="Enable priority queue",
|
||||||
|
value=False,
|
||||||
|
info="Allow high-priority jobs to skip the queue",
|
||||||
|
)
|
||||||
|
|
||||||
|
# Save button
|
||||||
|
gr.Markdown("---")
|
||||||
|
with gr.Row():
|
||||||
|
save_btn = gr.Button("Save Settings", variant="primary", scale=2)
|
||||||
|
reset_btn = gr.Button("Reset to Defaults", variant="secondary", scale=1)
|
||||||
|
|
||||||
|
settings_status = gr.Markdown("")
|
||||||
|
|
||||||
|
# Event handlers
|
||||||
|
|
||||||
|
def load_settings():
    """Read persisted settings and emit them in the form components' order.

    Returns:
        A 22-tuple whose order must match the settings form's output wiring
        (general → GPU/memory → model defaults → API → queue).
    """
    # (settings key, default) pairs, in form-component order.
    spec = (
        ("output_dir", ""),
        ("default_format", "wav"),
        ("sample_rate", 32000),
        ("normalize_audio", True),
        ("theme_mode", "Dark"),
        ("show_advanced", False),
        ("auto_play", True),
        ("comfyui_reserve_gb", 10),
        ("idle_timeout_minutes", 15),
        ("max_loaded_models", 2),
        ("musicgen_variant", "medium"),
        ("musicgen_duration", 10),
        ("audiogen_duration", 5),
        ("magnet_variant", "medium"),
        ("magnet_decoding_steps", 20),
        ("api_enabled", True),
        ("api_port", 8000),
        ("rate_limit", 10),
        ("max_batch_size", 4),
        ("max_queue_size", 100),
        ("max_workers", 1),
        ("priority_queue", False),
    )
    settings = get_settings()
    return tuple(settings.get(key, default) for key, default in spec)
|
||||||
|
|
||||||
|
def save_settings(
    out_dir, fmt, sr, norm, theme, adv, play,
    comfyui_res, idle_to, max_load,
    mg_var, mg_dur, ag_dur, mn_var, mn_steps,
    api_en, api_p, rate, batch, queue_sz, workers, priority
):
    """Persist the settings form values.

    Parameters arrive positionally from the Gradio ``save_btn.click`` wiring,
    in the same order as the form components.

    Returns:
        Status markdown string for the UI.
    """
    # Settings keys in the same order as the positional parameters above.
    keys = (
        "output_dir", "default_format", "sample_rate", "normalize_audio",
        "theme_mode", "show_advanced", "auto_play",
        "comfyui_reserve_gb", "idle_timeout_minutes", "max_loaded_models",
        "musicgen_variant", "musicgen_duration", "audiogen_duration",
        "magnet_variant", "magnet_decoding_steps",
        "api_enabled", "api_port", "rate_limit", "max_batch_size",
        "max_queue_size", "max_workers", "priority_queue",
    )
    values = (
        out_dir, fmt, sr, norm, theme, adv, play,
        comfyui_res, idle_to, max_load,
        mg_var, mg_dur, ag_dur, mn_var, mn_steps,
        # int() cast kept from the original — gr.Number may deliver a float.
        api_en, int(api_p), rate, batch, queue_sz, workers, priority,
    )
    if update_settings(dict(zip(keys, values))):
        return "✅ Settings saved successfully"
    return "❌ Failed to save settings"
|
||||||
|
|
||||||
|
def do_refresh_gpu():
    """Refresh the GPU status JSON display with the current GPU info dict."""
    return get_gpu_info()
|
||||||
|
|
||||||
|
def do_clear_cache():
    """Clear the model cache; returns a status string for the cache panel."""
    succeeded = clear_cache()
    return "✅ Cache cleared" if succeeded else "❌ Failed to clear cache"
|
||||||
|
|
||||||
|
def do_unload_models():
    """Unload every loaded model; returns a status string for the cache panel."""
    succeeded = unload_all_models()
    return "✅ All models unloaded" if succeeded else "❌ Failed to unload models"
|
||||||
|
|
||||||
|
# Wire up events
|
||||||
|
|
||||||
|
refresh_gpu_btn.click(
|
||||||
|
fn=do_refresh_gpu,
|
||||||
|
outputs=[gpu_info_display],
|
||||||
|
)
|
||||||
|
|
||||||
|
clear_cache_btn.click(
|
||||||
|
fn=do_clear_cache,
|
||||||
|
outputs=[cache_status],
|
||||||
|
)
|
||||||
|
|
||||||
|
unload_models_btn.click(
|
||||||
|
fn=do_unload_models,
|
||||||
|
outputs=[cache_status],
|
||||||
|
)
|
||||||
|
|
||||||
|
save_btn.click(
|
||||||
|
fn=save_settings,
|
||||||
|
inputs=[
|
||||||
|
output_dir, default_format, sample_rate, normalize_audio,
|
||||||
|
theme_mode, show_advanced, auto_play,
|
||||||
|
comfyui_reserve, idle_timeout, max_loaded,
|
||||||
|
musicgen_variant, musicgen_duration, audiogen_duration,
|
||||||
|
magnet_variant, magnet_decoding_steps,
|
||||||
|
api_enabled, api_port, rate_limit, max_batch_size,
|
||||||
|
max_queue_size, max_workers, priority_queue,
|
||||||
|
],
|
||||||
|
outputs=[settings_status],
|
||||||
|
)
|
||||||
|
|
||||||
|
return {
|
||||||
|
"output_dir": output_dir,
|
||||||
|
"default_format": default_format,
|
||||||
|
"sample_rate": sample_rate,
|
||||||
|
"comfyui_reserve": comfyui_reserve,
|
||||||
|
"idle_timeout": idle_timeout,
|
||||||
|
"api_enabled": api_enabled,
|
||||||
|
"save_btn": save_btn,
|
||||||
|
"settings_status": settings_status,
|
||||||
|
"load_fn": load_settings,
|
||||||
|
}
|
||||||
294
src/ui/state.py
Normal file
294
src/ui/state.py
Normal file
@@ -0,0 +1,294 @@
|
|||||||
|
"""State management for Gradio UI."""
|
||||||
|
|
||||||
|
from dataclasses import dataclass, field
|
||||||
|
from typing import Any, Optional
|
||||||
|
|
||||||
|
|
||||||
|
@dataclass
class UIState:
    """Global UI state shared across Gradio callbacks."""

    # --- Current view ---
    current_tab: str = "dashboard"

    # --- Generation state ---
    is_generating: bool = False
    current_job_id: Optional[str] = None

    # --- Selected items ---
    selected_project_id: Optional[str] = None
    selected_generation_id: Optional[str] = None
    selected_preset_id: Optional[str] = None

    # --- Model state ---
    selected_model: str = "musicgen"
    selected_variant: str = "medium"

    # --- Generation parameters (current values) ---
    prompt: str = ""
    duration: float = 10.0
    temperature: float = 1.0
    top_k: int = 250
    top_p: float = 0.0
    cfg_coef: float = 3.0
    seed: Optional[int] = None

    # --- Conditioning ---
    melody_audio: Optional[str] = None
    style_audio: Optional[str] = None
    chords: list[dict[str, Any]] = field(default_factory=list)
    drums_pattern: str = ""
    bpm: float = 120.0

    # --- UI preferences ---
    show_advanced: bool = False
    auto_play: bool = True

    def reset_generation_params(self) -> None:
        """Restore every per-generation parameter to its default.

        bpm is intentionally left alone to match the original behavior
        (it was never reset here).
        """
        defaults = (
            ("prompt", ""),
            ("duration", 10.0),
            ("temperature", 1.0),
            ("top_k", 250),
            ("top_p", 0.0),
            ("cfg_coef", 3.0),
            ("seed", None),
            ("melody_audio", None),
            ("style_audio", None),
            ("drums_pattern", ""),
        )
        for attr, value in defaults:
            setattr(self, attr, value)
        # Fresh list so callers holding the old reference are not mutated.
        self.chords = []

    def apply_preset(self, preset: dict[str, Any]) -> None:
        """Overlay a preset's parameters onto the state.

        Keys absent from ``preset["parameters"]`` keep their current values.
        """
        params = preset.get("parameters", {})
        for attr in ("duration", "temperature", "top_k", "top_p", "cfg_coef"):
            setattr(self, attr, params.get(attr, getattr(self, attr)))

    def to_generation_params(self) -> dict[str, Any]:
        """Snapshot the sampling parameters as a plain dict for a generation call."""
        return dict(
            duration=self.duration,
            temperature=self.temperature,
            top_k=self.top_k,
            top_p=self.top_p,
            cfg_coef=self.cfg_coef,
            seed=self.seed,
        )
|
||||||
|
|
||||||
|
|
||||||
|
# Default presets for each model: curated parameter bundles offered in the UI.
def _preset(pid, name, description, **parameters):
    """Assemble one preset record (module-internal helper)."""
    return {"id": pid, "name": name, "description": description, "parameters": parameters}


DEFAULT_PRESETS = {
    "musicgen": [
        _preset("cinematic", "Cinematic", "Epic orchestral soundscapes",
                duration=30, temperature=1.0, top_k=250, cfg_coef=3.0),
        _preset("electronic", "Electronic", "Synthesizers and beats",
                duration=15, temperature=1.1, top_k=200, cfg_coef=3.5),
        _preset("ambient", "Ambient", "Atmospheric and calm",
                duration=30, temperature=0.9, top_k=300, cfg_coef=2.5),
        _preset("rock", "Rock", "Guitar-driven energy",
                duration=20, temperature=1.0, top_k=250, cfg_coef=3.0),
        _preset("jazz", "Jazz", "Smooth and improvisational",
                duration=20, temperature=1.2, top_k=200, cfg_coef=2.5),
    ],
    "audiogen": [
        _preset("nature", "Nature", "Natural environments",
                duration=10, temperature=1.0, top_k=250, cfg_coef=3.0),
        _preset("urban", "Urban", "City sounds",
                duration=10, temperature=1.0, top_k=250, cfg_coef=3.0),
        _preset("mechanical", "Mechanical", "Machines and tools",
                duration=5, temperature=0.9, top_k=200, cfg_coef=3.5),
        _preset("weather", "Weather", "Rain, thunder, wind",
                duration=10, temperature=1.0, top_k=250, cfg_coef=3.0),
    ],
    # MAGNeT samples with top_p (not top_k) and much higher temperatures.
    "magnet": [
        _preset("fast", "Fast", "Quick generation",
                duration=10, temperature=3.0, top_p=0.9, cfg_coef=3.0),
        _preset("quality", "Quality", "Higher quality output",
                duration=10, temperature=2.5, top_p=0.85, cfg_coef=4.0),
    ],
    "musicgen-style": [
        _preset("style_transfer", "Style Transfer", "Copy style from reference",
                duration=15, temperature=1.0, top_k=250, cfg_coef=3.0,
                eval_q=3, excerpt_length=3.0),
    ],
    # JASCO presets additionally carry a tempo (bpm).
    "jasco": [
        _preset("pop", "Pop", "Pop chord progressions",
                duration=10, temperature=1.0, top_k=250, cfg_coef=3.0, bpm=120),
        _preset("blues", "Blues", "12-bar blues",
                duration=10, temperature=1.0, top_k=250, cfg_coef=3.0, bpm=100),
    ],
}
|
||||||
|
|
||||||
|
|
||||||
|
# Prompt suggestions for each model, surfaced as quick-fill buttons in the UI.
PROMPT_SUGGESTIONS = {
    # Music generation: genre/mood-oriented prompts.
    "musicgen": [
        "Epic orchestral music with dramatic strings and powerful brass",
        "Upbeat electronic dance music with synthesizers and heavy bass",
        "Calm acoustic guitar melody with soft piano accompaniment",
        "Energetic rock song with electric guitars and driving drums",
        "Smooth jazz with saxophone solo and walking bass",
        "Ambient soundscape with ethereal pads and gentle textures",
        "Cinematic trailer music building to an epic climax",
        "Lo-fi hip hop beats with vinyl crackle and mellow keys",
    ],
    # Sound effects and environmental audio.
    "audiogen": [
        "Thunder and heavy rain with occasional lightning strikes",
        "Busy city street with traffic, horns, and distant sirens",
        "Forest ambience with birds singing and wind in trees",
        "Ocean waves crashing on a rocky shore",
        "Crackling fireplace with wood popping",
        "Coffee shop atmosphere with murmuring voices and clinking cups",
        "Construction site with hammering and machinery",
        "Spaceship engine humming with occasional beeps",
    ],
    "magnet": [
        "Energetic pop music with catchy melody",
        "Dark electronic music with deep bass",
        "Cheerful ukulele tune with whistling",
        "Dramatic piano piece with building intensity",
    ],
    # Style-conditioned generation: prompts refer to an uploaded reference clip.
    "musicgen-style": [
        "Generate music in the style of the uploaded reference",
        "Create a variation with similar instrumentation",
        "Compose a piece matching the mood of the reference",
    ],
    # JASCO: chord/drum-conditioned generation.
    "jasco": [
        "Upbeat pop song with the specified chord progression",
        "Mellow jazz piece following the chord changes",
        "Rock anthem with powerful drum pattern",
        "Electronic track with syncopated rhythms",
    ],
}
|
||||||
17
src/ui/tabs/__init__.py
Normal file
17
src/ui/tabs/__init__.py
Normal file
@@ -0,0 +1,17 @@
|
|||||||
|
"""Model tabs for AudioCraft Studio."""
|
||||||
|
|
||||||
|
from src.ui.tabs.dashboard_tab import create_dashboard_tab
|
||||||
|
from src.ui.tabs.musicgen_tab import create_musicgen_tab
|
||||||
|
from src.ui.tabs.audiogen_tab import create_audiogen_tab
|
||||||
|
from src.ui.tabs.magnet_tab import create_magnet_tab
|
||||||
|
from src.ui.tabs.style_tab import create_style_tab
|
||||||
|
from src.ui.tabs.jasco_tab import create_jasco_tab
|
||||||
|
|
||||||
|
# Public API of the tabs package: one builder per model family plus the dashboard.
__all__ = [
    "create_dashboard_tab",
    "create_musicgen_tab",
    "create_audiogen_tab",
    "create_magnet_tab",
    "create_style_tab",
    "create_jasco_tab",
]
|
||||||
283
src/ui/tabs/audiogen_tab.py
Normal file
283
src/ui/tabs/audiogen_tab.py
Normal file
@@ -0,0 +1,283 @@
|
|||||||
|
"""AudioGen tab for text-to-sound generation."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
from src.ui.state import DEFAULT_PRESETS, PROMPT_SUGGESTIONS
|
||||||
|
from src.ui.components.audio_player import create_generation_output
|
||||||
|
|
||||||
|
|
||||||
|
# Variants offered in the AudioGen tab's model dropdown. Only "medium" is
# listed (see the "AudioGen only has medium" note where the dropdown is built).
# vram_mb feeds the "(X.XGB)" hint rendered next to the variant name.
AUDIOGEN_VARIANTS = [
    {"id": "medium", "name": "Medium", "vram_mb": 5000, "description": "1.5B params, balanced quality/speed"},
]
|
||||||
|
|
||||||
|
|
||||||
|
def create_audiogen_tab(
    generate_fn: Callable[..., Any],
    add_to_queue_fn: Callable[..., Any],
) -> dict[str, Any]:
    """Create AudioGen generation tab.

    Args:
        generate_fn: Awaitable called for immediate generation. It is awaited
            with model/sampling kwargs and must return a ``(result, generation)``
            pair, where ``result`` exposes ``.duration`` and ``.seed`` and
            ``generation`` exposes ``.audio_path``.
        add_to_queue_fn: Function that enqueues a generation job and returns
            an object exposing ``.id``.

    Returns:
        Dictionary with component references
    """
    presets = DEFAULT_PRESETS.get("audiogen", [])
    suggestions = PROMPT_SUGGESTIONS.get("audiogen", [])

    with gr.Column():
        gr.Markdown("## 🔊 AudioGen")
        gr.Markdown("Generate sound effects and environmental audio from text")

        with gr.Row():
            # Left column - inputs
            with gr.Column(scale=2):
                # Preset selector
                preset_choices = [(p["name"], p["id"]) for p in presets] + [("Custom", "custom")]
                preset_dropdown = gr.Dropdown(
                    label="Preset",
                    choices=preset_choices,
                    value=presets[0]["id"] if presets else "custom",
                )

                # Model variant (AudioGen only has medium)
                variant_choices = [(f"{v['name']} ({v['vram_mb']/1024:.1f}GB)", v["id"]) for v in AUDIOGEN_VARIANTS]
                variant_dropdown = gr.Dropdown(
                    label="Model Variant",
                    choices=variant_choices,
                    value="medium",
                )

                # Prompt input
                prompt_input = gr.Textbox(
                    label="Prompt",
                    placeholder="Describe the sound you want to generate...",
                    lines=3,
                    max_lines=5,
                )

                # Prompt suggestions (click-to-fill buttons)
                with gr.Accordion("Prompt Suggestions", open=False):
                    suggestion_btns = []
                    for i, suggestion in enumerate(suggestions[:6]):
                        btn = gr.Button(suggestion[:50] + "...", size="sm", variant="secondary")
                        suggestion_btns.append((btn, suggestion))

                # Parameters
                gr.Markdown("### Parameters")

                duration_slider = gr.Slider(
                    minimum=1,
                    maximum=10,
                    value=5,
                    step=1,
                    label="Duration (seconds)",
                    info="AudioGen works best with shorter clips",
                )

                with gr.Accordion("Advanced Parameters", open=False):
                    with gr.Row():
                        temperature_slider = gr.Slider(
                            minimum=0.0,
                            maximum=2.0,
                            value=1.0,
                            step=0.05,
                            label="Temperature",
                        )
                        cfg_slider = gr.Slider(
                            minimum=1.0,
                            maximum=10.0,
                            value=3.0,
                            step=0.5,
                            label="CFG Coefficient",
                        )

                    with gr.Row():
                        top_k_slider = gr.Slider(
                            minimum=0,
                            maximum=500,
                            value=250,
                            step=10,
                            label="Top-K",
                        )
                        top_p_slider = gr.Slider(
                            minimum=0.0,
                            maximum=1.0,
                            value=0.0,
                            step=0.05,
                            label="Top-P",
                        )

                    with gr.Row():
                        seed_input = gr.Number(
                            value=None,
                            label="Seed (empty = random)",
                            precision=0,
                        )

                # Generate buttons
                with gr.Row():
                    generate_btn = gr.Button("🔊 Generate", variant="primary", scale=2)
                    queue_btn = gr.Button("Add to Queue", variant="secondary", scale=1)

            # Right column - output
            with gr.Column(scale=3):
                output = create_generation_output()

    # Event handlers

    # Preset change: push the preset's saved parameters into the sliders,
    # or leave everything untouched for "custom" / unknown ids.
    def apply_preset(preset_id: str):
        for p in presets:
            if p["id"] == preset_id:
                params = p["parameters"]
                return (
                    params.get("duration", 5),
                    params.get("temperature", 1.0),
                    params.get("cfg_coef", 3.0),
                    params.get("top_k", 250),
                    params.get("top_p", 0.0),
                )
        return gr.update(), gr.update(), gr.update(), gr.update(), gr.update()

    preset_dropdown.change(
        fn=apply_preset,
        inputs=[preset_dropdown],
        outputs=[duration_slider, temperature_slider, cfg_slider, top_k_slider, top_p_slider],
    )

    # Prompt suggestions: each button writes its full suggestion text into
    # the prompt box (default arg binds the loop variable).
    for btn, suggestion in suggestion_btns:
        btn.click(
            fn=lambda s=suggestion: s,
            outputs=[prompt_input],
        )

    # Generate. This is an async *generator* (it yields progress updates),
    # so early exits must `yield` a final state and then `return` bare —
    # `return <value>` inside an async generator is a SyntaxError.
    async def do_generate(
        prompt, variant, duration, temperature, cfg_coef, top_k, top_p, seed
    ):
        if not prompt:
            # BUGFIX: was `return (<tuple>)`, which is illegal in an async
            # generator; yield the error state instead, then stop.
            yield (
                gr.update(value="Please enter a prompt"),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )
            return

        yield (
            gr.update(value="🔄 Generating..."),
            gr.update(visible=True, value=0),
            gr.update(),
            gr.update(),
            gr.update(),
            gr.update(),
        )

        try:
            result, generation = await generate_fn(
                model_id="audiogen",
                variant=variant,
                prompts=[prompt],
                duration=duration,
                temperature=temperature,
                top_k=int(top_k),
                top_p=top_p,
                cfg_coef=cfg_coef,
                # `is not None` so an explicit seed of 0 is honored rather
                # than being treated as "random".
                seed=int(seed) if seed is not None else None,
            )

            yield (
                gr.update(value="✅ Generation complete!"),
                gr.update(visible=False),
                gr.update(value=generation.audio_path),
                gr.update(),
                gr.update(value=f"{result.duration:.2f}s"),
                gr.update(value=str(result.seed)),
            )

        except Exception as e:
            yield (
                gr.update(value=f"❌ Error: {str(e)}"),
                gr.update(visible=False),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )

    generate_btn.click(
        fn=do_generate,
        inputs=[
            prompt_input,
            variant_dropdown,
            duration_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
        ],
        outputs=[
            output["status"],
            output["progress"],
            output["player"]["audio"],
            output["player"]["waveform"],
            output["player"]["duration"],
            output["player"]["seed"],
        ],
    )

    # Add to queue (synchronous; returns a status string for the status box)
    def do_add_queue(prompt, variant, duration, temperature, cfg_coef, top_k, top_p, seed):
        if not prompt:
            return "Please enter a prompt"

        job = add_to_queue_fn(
            model_id="audiogen",
            variant=variant,
            prompts=[prompt],
            duration=duration,
            temperature=temperature,
            top_k=int(top_k),
            top_p=top_p,
            cfg_coef=cfg_coef,
            seed=int(seed) if seed is not None else None,
        )

        return f"✅ Added to queue: {job.id}"

    queue_btn.click(
        fn=do_add_queue,
        inputs=[
            prompt_input,
            variant_dropdown,
            duration_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
        ],
        outputs=[output["status"]],
    )

    return {
        "preset": preset_dropdown,
        "variant": variant_dropdown,
        "prompt": prompt_input,
        "duration": duration_slider,
        "temperature": temperature_slider,
        "cfg_coef": cfg_slider,
        "top_k": top_k_slider,
        "top_p": top_p_slider,
        "seed": seed_input,
        "generate_btn": generate_btn,
        "queue_btn": queue_btn,
        "output": output,
    }
|
||||||
166
src/ui/tabs/dashboard_tab.py
Normal file
166
src/ui/tabs/dashboard_tab.py
Normal file
@@ -0,0 +1,166 @@
|
|||||||
|
"""Dashboard tab - home page with model overview and quick actions."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
|
||||||
|
# Static metadata for the dashboard's model cards, keyed by model id.
# Each entry drives one card: icon + name in the heading, description as
# the body text, and capabilities joined into the "Features" line.
MODEL_INFO = {
    "musicgen": {
        "name": "MusicGen",
        "icon": "🎵",
        "description": "Text-to-music generation with optional melody conditioning",
        "capabilities": ["Text prompts", "Melody conditioning", "Stereo output"],
    },
    "audiogen": {
        "name": "AudioGen",
        "icon": "🔊",
        "description": "Text-to-sound effects and environmental audio",
        "capabilities": ["Sound effects", "Ambiences", "Foley"],
    },
    "magnet": {
        "name": "MAGNeT",
        "icon": "⚡",
        "description": "Fast non-autoregressive music generation",
        "capabilities": ["Fast generation", "Music", "Sound effects"],
    },
    "musicgen-style": {
        "name": "MusicGen Style",
        "icon": "🎨",
        "description": "Style-conditioned music from reference audio",
        "capabilities": ["Style transfer", "Reference audio", "Text prompts"],
    },
    "jasco": {
        "name": "JASCO",
        "icon": "🎹",
        "description": "Chord and drum-conditioned music generation",
        "capabilities": ["Chord control", "Drum patterns", "Symbolic conditioning"],
    },
}
|
||||||
|
|
||||||
|
|
||||||
|
def create_dashboard_tab(
    get_queue_status: Callable[[], dict[str, Any]],
    get_recent_generations: Callable[[int], list[dict[str, Any]]],
    get_gpu_status: Callable[[], dict[str, Any]],
) -> dict[str, Any]:
    """Create dashboard tab with model overview and status.

    Args:
        get_queue_status: Function to get generation queue status
            (a dict; only the "queue_size" key is read here).
        get_recent_generations: Function to get recent generations
            (called with a limit; items are dicts read via .get()).
        get_gpu_status: Function to get GPU status
            (dict; "used_gb", "total_gb", "utilization_percent" are read).

    Returns:
        Dictionary with component references
    """

    def refresh_dashboard() -> tuple[str, str, str]:
        """Refresh all dashboard data.

        Returns markdown strings in the order (queue, recent, gpu) — this
        must match the `outputs` wiring of the refresh button below.
        """
        queue = get_queue_status()
        recent = get_recent_generations(5)
        gpu = get_gpu_status()

        # Format queue status
        queue_size = queue.get("queue_size", 0)
        queue_text = f"**Queue:** {queue_size} job(s) pending"

        # Format recent generations (bullet list, prompt truncated to 50 chars)
        if recent:
            recent_items = []
            for gen in recent[:5]:
                model = gen.get("model", "unknown")
                prompt = gen.get("prompt", "")[:50]
                duration = gen.get("duration_seconds", 0)
                recent_items.append(f"• **{model}** ({duration:.0f}s): {prompt}...")
            recent_text = "\n".join(recent_items)
        else:
            recent_text = "No recent generations"

        # Format GPU status (total defaults to 24 GB if the provider
        # omits it — presumably sized for an RTX 4090; confirm)
        used_gb = gpu.get("used_gb", 0)
        total_gb = gpu.get("total_gb", 24)
        util = gpu.get("utilization_percent", 0)
        gpu_text = f"**GPU:** {used_gb:.1f}/{total_gb:.1f} GB ({util:.0f}%)"

        return queue_text, recent_text, gpu_text

    with gr.Column():
        # Header
        gr.Markdown("# AudioCraft Studio")
        gr.Markdown("AI-powered music and sound generation")

        # Status bar (populated on refresh; starts as placeholders)
        with gr.Row():
            queue_status = gr.Markdown("**Queue:** Loading...")
            gpu_status = gr.Markdown("**GPU:** Loading...")
            refresh_btn = gr.Button("🔄 Refresh", size="sm")

        gr.Markdown("---")

        # Model cards
        gr.Markdown("## Models")

        with gr.Row():
            # First row of cards
            for model_id in ["musicgen", "audiogen", "magnet"]:
                info = MODEL_INFO[model_id]
                with gr.Column(scale=1):
                    with gr.Group():
                        gr.Markdown(f"### {info['icon']} {info['name']}")
                        gr.Markdown(info["description"])
                        gr.Markdown("**Features:** " + ", ".join(info["capabilities"]))
                        # elem_id lets external JS/CSS target the card's
                        # open button; no click handler is attached here.
                        gr.Button(
                            f"Open {info['name']}",
                            variant="primary",
                            size="sm",
                            elem_id=f"btn_{model_id}",
                        )

        with gr.Row():
            # Second row of cards
            for model_id in ["musicgen-style", "jasco"]:
                info = MODEL_INFO[model_id]
                with gr.Column(scale=1):
                    with gr.Group():
                        gr.Markdown(f"### {info['icon']} {info['name']}")
                        gr.Markdown(info["description"])
                        gr.Markdown("**Features:** " + ", ".join(info["capabilities"]))
                        gr.Button(
                            f"Open {info['name']}",
                            variant="primary",
                            size="sm",
                            elem_id=f"btn_{model_id}",
                        )

            # Empty column for balance
            with gr.Column(scale=1):
                pass

        gr.Markdown("---")

        # Recent generations and queue
        with gr.Row():
            with gr.Column(scale=1):
                gr.Markdown("## Recent Generations")
                recent_list = gr.Markdown("Loading...")

            with gr.Column(scale=1):
                gr.Markdown("## Quick Actions")
                # NOTE(review): these buttons have no handlers wired here —
                # presumably the app shell attaches navigation; confirm.
                with gr.Group():
                    gr.Button("📁 Browse Projects", variant="secondary")
                    gr.Button("⚙️ Settings", variant="secondary")
                    gr.Button("📖 API Documentation", variant="secondary")

        # Refresh handler (output order matches refresh_dashboard's return)
        refresh_btn.click(
            fn=refresh_dashboard,
            outputs=[queue_status, recent_list, gpu_status],
        )

    return {
        "queue_status": queue_status,
        "gpu_status": gpu_status,
        "recent_list": recent_list,
        "refresh_btn": refresh_btn,
        # Exposed so the shell can call the same refresh on tab load.
        "refresh_fn": refresh_dashboard,
    }
|
||||||
364
src/ui/tabs/jasco_tab.py
Normal file
364
src/ui/tabs/jasco_tab.py
Normal file
@@ -0,0 +1,364 @@
|
|||||||
|
"""JASCO tab for chord and drum-conditioned generation."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
from src.ui.state import DEFAULT_PRESETS, PROMPT_SUGGESTIONS
|
||||||
|
from src.ui.components.audio_player import create_generation_output
|
||||||
|
|
||||||
|
|
||||||
|
# JASCO checkpoint variants shown in the variant dropdown; vram_mb is the
# approximate VRAM footprint used for the size hint in the label.
JASCO_VARIANTS = [
    {"id": "chords", "name": "Chords", "vram_mb": 5000, "description": "Chord-conditioned generation"},
    {"id": "chords-drums", "name": "Chords + Drums", "vram_mb": 5500, "description": "Full symbolic conditioning"},
]

# Common chord progressions offered as one-click fills for the chord box;
# "chords" is a space-separated chord-symbol string matching the input's
# expected format.
CHORD_PRESETS = [
    {"name": "Pop I-V-vi-IV", "chords": "C G Am F"},
    {"name": "Jazz ii-V-I", "chords": "Dm7 G7 Cmaj7"},
    {"name": "Blues I-IV-V", "chords": "A7 D7 E7"},
    {"name": "Rock I-bVII-IV", "chords": "E D A"},
    {"name": "Minor i-VI-III-VII", "chords": "Am F C G"},
]
|
||||||
|
|
||||||
|
|
||||||
|
def create_jasco_tab(
    generate_fn: Callable[..., Any],
    add_to_queue_fn: Callable[..., Any],
) -> dict[str, Any]:
    """Create JASCO generation tab.

    Args:
        generate_fn: Awaitable called for immediate generation. It is awaited
            with model/sampling kwargs plus a ``conditioning`` dict and must
            return a ``(result, generation)`` pair, where ``result`` exposes
            ``.duration`` and ``.seed`` and ``generation`` exposes
            ``.audio_path``.
        add_to_queue_fn: Function that enqueues a generation job and returns
            an object exposing ``.id``.

    Returns:
        Dictionary with component references
    """
    presets = DEFAULT_PRESETS.get("jasco", [])

    with gr.Column():
        gr.Markdown("## 🎹 JASCO")
        gr.Markdown("Generate music conditioned on chords and drum patterns")

        with gr.Row():
            # Left column - inputs
            with gr.Column(scale=2):
                # Preset selector
                preset_choices = [(p["name"], p["id"]) for p in presets] + [("Custom", "custom")]
                preset_dropdown = gr.Dropdown(
                    label="Preset",
                    choices=preset_choices,
                    value=presets[0]["id"] if presets else "custom",
                )

                # Model variant
                variant_choices = [(f"{v['name']} ({v['vram_mb']/1024:.1f}GB)", v["id"]) for v in JASCO_VARIANTS]
                variant_dropdown = gr.Dropdown(
                    label="Model Variant",
                    choices=variant_choices,
                    value="chords-drums",
                )

                # Prompt input (optional — chords are the required input)
                prompt_input = gr.Textbox(
                    label="Text Prompt",
                    placeholder="Describe the music style, mood, instruments...",
                    lines=2,
                    max_lines=4,
                )

                # Chord conditioning
                gr.Markdown("### Chord Progression")

                chord_input = gr.Textbox(
                    label="Chords",
                    placeholder="C G Am F or Cmaj7 Dm7 G7 Cmaj7",
                    lines=1,
                    info="Space-separated chord symbols",
                )

                # Chord presets: buttons split across two rows (3 + 2)
                with gr.Accordion("Chord Presets", open=False):
                    chord_preset_btns = []
                    with gr.Row():
                        for cp in CHORD_PRESETS[:3]:
                            btn = gr.Button(cp["name"], size="sm", variant="secondary")
                            chord_preset_btns.append((btn, cp["chords"]))
                    with gr.Row():
                        for cp in CHORD_PRESETS[3:]:
                            btn = gr.Button(cp["name"], size="sm", variant="secondary")
                            chord_preset_btns.append((btn, cp["chords"]))

                # Drum conditioning (shown only for the chords-drums variant;
                # visibility toggled by on_variant_change below)
                with gr.Group(visible=True) as drum_section:
                    gr.Markdown("### Drum Pattern")

                    drum_input = gr.Audio(
                        label="Drum Reference",
                        type="filepath",
                        sources=["upload"],
                    )
                    gr.Markdown("*Upload a drum loop to condition the rhythm*")

                # Parameters
                gr.Markdown("### Parameters")

                duration_slider = gr.Slider(
                    minimum=1,
                    maximum=30,
                    value=10,
                    step=1,
                    label="Duration (seconds)",
                )

                bpm_slider = gr.Slider(
                    minimum=60,
                    maximum=180,
                    value=120,
                    step=1,
                    label="BPM",
                    info="Tempo for chord timing",
                )

                with gr.Accordion("Advanced Parameters", open=False):
                    with gr.Row():
                        temperature_slider = gr.Slider(
                            minimum=0.0,
                            maximum=2.0,
                            value=1.0,
                            step=0.05,
                            label="Temperature",
                        )
                        cfg_slider = gr.Slider(
                            minimum=1.0,
                            maximum=10.0,
                            value=3.0,
                            step=0.5,
                            label="CFG Coefficient",
                        )

                    with gr.Row():
                        top_k_slider = gr.Slider(
                            minimum=0,
                            maximum=500,
                            value=250,
                            step=10,
                            label="Top-K",
                        )
                        top_p_slider = gr.Slider(
                            minimum=0.0,
                            maximum=1.0,
                            value=0.0,
                            step=0.05,
                            label="Top-P",
                        )

                    with gr.Row():
                        seed_input = gr.Number(
                            value=None,
                            label="Seed (empty = random)",
                            precision=0,
                        )

                # Generate buttons
                with gr.Row():
                    generate_btn = gr.Button("🎹 Generate", variant="primary", scale=2)
                    queue_btn = gr.Button("Add to Queue", variant="secondary", scale=1)

            # Right column - output
            with gr.Column(scale=3):
                output = create_generation_output()

    # Event handlers

    # Preset change: load the preset's saved parameters, or leave the
    # sliders untouched for "custom" / unknown ids.
    def apply_preset(preset_id: str):
        for p in presets:
            if p["id"] == preset_id:
                params = p["parameters"]
                return (
                    params.get("duration", 10),
                    params.get("bpm", 120),
                    params.get("temperature", 1.0),
                    params.get("cfg_coef", 3.0),
                    params.get("top_k", 250),
                    params.get("top_p", 0.0),
                )
        return gr.update(), gr.update(), gr.update(), gr.update(), gr.update(), gr.update()

    preset_dropdown.change(
        fn=apply_preset,
        inputs=[preset_dropdown],
        outputs=[duration_slider, bpm_slider, temperature_slider, cfg_slider, top_k_slider, top_p_slider],
    )

    # Variant change - show/hide drum section
    def on_variant_change(variant: str):
        show_drums = "drums" in variant.lower()
        return gr.update(visible=show_drums)

    variant_dropdown.change(
        fn=on_variant_change,
        inputs=[variant_dropdown],
        outputs=[drum_section],
    )

    # Chord presets: each button writes its progression into the chord box
    # (default arg binds the loop variable).
    for btn, chords in chord_preset_btns:
        btn.click(
            fn=lambda c=chords: c,
            outputs=[chord_input],
        )

    def _build_conditioning(chords, bpm, drums, variant) -> dict[str, Any]:
        """Assemble the symbolic-conditioning payload; drums are included
        only when a drum reference was uploaded AND the variant supports it."""
        conditioning: dict[str, Any] = {
            "chords": chords,
            "bpm": bpm,
        }
        if drums and "drums" in variant.lower():
            conditioning["drums"] = drums
        return conditioning

    # Generate. This is an async *generator* (it yields progress updates),
    # so early exits must `yield` a final state and then `return` bare —
    # `return <value>` inside an async generator is a SyntaxError.
    async def do_generate(
        prompt, variant, chords, drums, duration, bpm, temperature, cfg_coef, top_k, top_p, seed
    ):
        if not chords:
            # BUGFIX: was `return (<tuple>)`, which is illegal in an async
            # generator; yield the error state instead, then stop.
            yield (
                gr.update(value="Please enter a chord progression"),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )
            return

        yield (
            gr.update(value="🔄 Generating..."),
            gr.update(visible=True, value=0),
            gr.update(),
            gr.update(),
            gr.update(),
            gr.update(),
        )

        try:
            result, generation = await generate_fn(
                model_id="jasco",
                variant=variant,
                prompts=[prompt] if prompt else [""],
                duration=duration,
                temperature=temperature,
                top_k=int(top_k),
                top_p=top_p,
                cfg_coef=cfg_coef,
                # `is not None` so an explicit seed of 0 is honored rather
                # than being treated as "random".
                seed=int(seed) if seed is not None else None,
                conditioning=_build_conditioning(chords, bpm, drums, variant),
            )

            yield (
                gr.update(value="✅ Generation complete!"),
                gr.update(visible=False),
                gr.update(value=generation.audio_path),
                gr.update(),
                gr.update(value=f"{result.duration:.2f}s"),
                gr.update(value=str(result.seed)),
            )

        except Exception as e:
            yield (
                gr.update(value=f"❌ Error: {str(e)}"),
                gr.update(visible=False),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )

    generate_btn.click(
        fn=do_generate,
        inputs=[
            prompt_input,
            variant_dropdown,
            chord_input,
            drum_input,
            duration_slider,
            bpm_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
        ],
        outputs=[
            output["status"],
            output["progress"],
            output["player"]["audio"],
            output["player"]["waveform"],
            output["player"]["duration"],
            output["player"]["seed"],
        ],
    )

    # Add to queue (synchronous; returns a status string for the status box)
    def do_add_queue(prompt, variant, chords, drums, duration, bpm, temperature, cfg_coef, top_k, top_p, seed):
        if not chords:
            return "Please enter a chord progression"

        job = add_to_queue_fn(
            model_id="jasco",
            variant=variant,
            prompts=[prompt] if prompt else [""],
            duration=duration,
            temperature=temperature,
            top_k=int(top_k),
            top_p=top_p,
            cfg_coef=cfg_coef,
            seed=int(seed) if seed is not None else None,
            conditioning=_build_conditioning(chords, bpm, drums, variant),
        )

        return f"✅ Added to queue: {job.id}"

    queue_btn.click(
        fn=do_add_queue,
        inputs=[
            prompt_input,
            variant_dropdown,
            chord_input,
            drum_input,
            duration_slider,
            bpm_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
        ],
        outputs=[output["status"]],
    )

    return {
        "preset": preset_dropdown,
        "variant": variant_dropdown,
        "prompt": prompt_input,
        "chords": chord_input,
        "drums": drum_input,
        "duration": duration_slider,
        "bpm": bpm_slider,
        "temperature": temperature_slider,
        "cfg_coef": cfg_slider,
        "top_k": top_k_slider,
        "top_p": top_p_slider,
        "seed": seed_input,
        "generate_btn": generate_btn,
        "queue_btn": queue_btn,
        "output": output,
    }
|
||||||
316
src/ui/tabs/magnet_tab.py
Normal file
316
src/ui/tabs/magnet_tab.py
Normal file
@@ -0,0 +1,316 @@
|
|||||||
|
"""MAGNeT tab for fast non-autoregressive generation."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
from src.ui.state import DEFAULT_PRESETS, PROMPT_SUGGESTIONS
|
||||||
|
from src.ui.components.audio_player import create_generation_output
|
||||||
|
|
||||||
|
|
||||||
|
# MAGNeT checkpoint variants: music models ("small"/"medium") and their
# sound-effect counterparts ("audio-*"). vram_mb is the approximate VRAM
# footprint used for the size hint shown in the dropdown label.
MAGNET_VARIANTS = [
    {"id": "small", "name": "Small Music", "vram_mb": 2000, "description": "Fast music, 300M params"},
    {"id": "medium", "name": "Medium Music", "vram_mb": 5000, "description": "Balanced music, 1.5B params"},
    {"id": "audio-small", "name": "Small Audio", "vram_mb": 2000, "description": "Fast sound effects"},
    {"id": "audio-medium", "name": "Medium Audio", "vram_mb": 5000, "description": "Balanced sound effects"},
]
|
||||||
|
|
||||||
|
|
||||||
|
def create_magnet_tab(
|
||||||
|
generate_fn: Callable[..., Any],
|
||||||
|
add_to_queue_fn: Callable[..., Any],
|
||||||
|
) -> dict[str, Any]:
|
||||||
|
"""Create MAGNeT generation tab.
|
||||||
|
|
||||||
|
Args:
|
||||||
|
generate_fn: Function to call for generation
|
||||||
|
add_to_queue_fn: Function to add to queue
|
||||||
|
|
||||||
|
Returns:
|
||||||
|
Dictionary with component references
|
||||||
|
"""
|
||||||
|
presets = DEFAULT_PRESETS.get("magnet", [])
|
||||||
|
suggestions = PROMPT_SUGGESTIONS.get("musicgen", []) # Reuse music suggestions
|
||||||
|
|
||||||
|
with gr.Column():
|
||||||
|
gr.Markdown("## ⚡ MAGNeT")
|
||||||
|
gr.Markdown("Fast non-autoregressive music and sound generation")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
# Left column - inputs
|
||||||
|
with gr.Column(scale=2):
|
||||||
|
# Preset selector
|
||||||
|
preset_choices = [(p["name"], p["id"]) for p in presets] + [("Custom", "custom")]
|
||||||
|
preset_dropdown = gr.Dropdown(
|
||||||
|
label="Preset",
|
||||||
|
choices=preset_choices,
|
||||||
|
value=presets[0]["id"] if presets else "custom",
|
||||||
|
)
|
||||||
|
|
||||||
|
# Model variant
|
||||||
|
variant_choices = [(f"{v['name']} ({v['vram_mb']/1024:.1f}GB)", v["id"]) for v in MAGNET_VARIANTS]
|
||||||
|
variant_dropdown = gr.Dropdown(
|
||||||
|
label="Model Variant",
|
||||||
|
choices=variant_choices,
|
||||||
|
value="medium",
|
||||||
|
)
|
||||||
|
|
||||||
|
# Prompt input
|
||||||
|
prompt_input = gr.Textbox(
|
||||||
|
label="Prompt",
|
||||||
|
placeholder="Describe the music or sound you want to generate...",
|
||||||
|
lines=3,
|
||||||
|
max_lines=5,
|
||||||
|
)
|
||||||
|
|
||||||
|
# Prompt suggestions
|
||||||
|
with gr.Accordion("Prompt Suggestions", open=False):
|
||||||
|
suggestion_btns = []
|
||||||
|
for i, suggestion in enumerate(suggestions[:4]):
|
||||||
|
btn = gr.Button(suggestion[:60] + "...", size="sm", variant="secondary")
|
||||||
|
suggestion_btns.append((btn, suggestion))
|
||||||
|
|
||||||
|
# Parameters
|
||||||
|
gr.Markdown("### Parameters")
|
||||||
|
|
||||||
|
duration_slider = gr.Slider(
|
||||||
|
minimum=1,
|
||||||
|
maximum=30,
|
||||||
|
value=10,
|
||||||
|
step=1,
|
||||||
|
label="Duration (seconds)",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Accordion("Advanced Parameters", open=False):
|
||||||
|
gr.Markdown("*MAGNeT uses different sampling compared to MusicGen*")
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
temperature_slider = gr.Slider(
|
||||||
|
minimum=1.0,
|
||||||
|
maximum=5.0,
|
||||||
|
value=3.0,
|
||||||
|
step=0.1,
|
||||||
|
label="Temperature",
|
||||||
|
info="Higher values recommended (3.0 default)",
|
||||||
|
)
|
||||||
|
cfg_slider = gr.Slider(
|
||||||
|
minimum=1.0,
|
||||||
|
maximum=10.0,
|
||||||
|
value=3.0,
|
||||||
|
step=0.5,
|
||||||
|
label="CFG Coefficient",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
top_k_slider = gr.Slider(
|
||||||
|
minimum=0,
|
||||||
|
maximum=500,
|
||||||
|
value=0,
|
||||||
|
step=10,
|
||||||
|
label="Top-K",
|
||||||
|
info="0 recommended for MAGNeT",
|
||||||
|
)
|
||||||
|
top_p_slider = gr.Slider(
|
||||||
|
minimum=0.0,
|
||||||
|
maximum=1.0,
|
||||||
|
value=0.9,
|
||||||
|
step=0.05,
|
||||||
|
label="Top-P",
|
||||||
|
info="0.9 recommended for MAGNeT",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
decoding_steps_slider = gr.Slider(
|
||||||
|
minimum=10,
|
||||||
|
maximum=100,
|
||||||
|
value=20,
|
||||||
|
step=5,
|
||||||
|
label="Decoding Steps",
|
||||||
|
info="More steps = better quality, slower",
|
||||||
|
)
|
||||||
|
span_arrangement = gr.Dropdown(
|
||||||
|
label="Span Arrangement",
|
||||||
|
choices=[("No Overlap", "nonoverlap"), ("Overlap", "stride1")],
|
||||||
|
value="nonoverlap",
|
||||||
|
)
|
||||||
|
|
||||||
|
with gr.Row():
|
||||||
|
seed_input = gr.Number(
|
||||||
|
value=None,
|
||||||
|
label="Seed (empty = random)",
|
||||||
|
precision=0,
|
||||||
|
)
|
||||||
|
|
||||||
|
# Generate buttons
|
||||||
|
with gr.Row():
|
||||||
|
generate_btn = gr.Button("⚡ Generate", variant="primary", scale=2)
|
||||||
|
queue_btn = gr.Button("Add to Queue", variant="secondary", scale=1)
|
||||||
|
|
||||||
|
# Right column - output
|
||||||
|
with gr.Column(scale=3):
|
||||||
|
output = create_generation_output()
|
||||||
|
|
||||||
|
# Event handlers
|
||||||
|
|
||||||
|
# Preset change
|
||||||
|
def apply_preset(preset_id: str):
|
||||||
|
for p in presets:
|
||||||
|
if p["id"] == preset_id:
|
||||||
|
params = p["parameters"]
|
||||||
|
return (
|
||||||
|
params.get("duration", 10),
|
||||||
|
params.get("temperature", 3.0),
|
||||||
|
params.get("cfg_coef", 3.0),
|
||||||
|
params.get("top_k", 0),
|
||||||
|
params.get("top_p", 0.9),
|
||||||
|
params.get("decoding_steps", 20),
|
||||||
|
)
|
||||||
|
return gr.update(), gr.update(), gr.update(), gr.update(), gr.update(), gr.update()
|
||||||
|
|
||||||
|
preset_dropdown.change(
|
||||||
|
fn=apply_preset,
|
||||||
|
inputs=[preset_dropdown],
|
||||||
|
outputs=[duration_slider, temperature_slider, cfg_slider, top_k_slider, top_p_slider, decoding_steps_slider],
|
||||||
|
)
|
||||||
|
|
||||||
|
# Prompt suggestions
|
||||||
|
for btn, suggestion in suggestion_btns:
|
||||||
|
btn.click(
|
||||||
|
fn=lambda s=suggestion: s,
|
||||||
|
outputs=[prompt_input],
|
||||||
|
)
|
||||||
|
|
||||||
|
# Generate
|
||||||
|
async def do_generate(
    prompt, variant, duration, temperature, cfg_coef, top_k, top_p, decoding_steps, span_arr, seed
):
    """Run a MAGNeT generation and stream UI updates.

    Async generator yielding 6-tuples of updates for:
    (status, progress, audio player, waveform, duration label, seed label).
    """
    if not prompt:
        # BUG FIX: this function contains `yield`, making it an async
        # generator, and `return <value>` inside an async generator is a
        # SyntaxError in Python. The validation message must be yielded,
        # followed by a bare `return`.
        yield (
            gr.update(value="Please enter a prompt"),
            gr.update(),
            gr.update(),
            gr.update(),
            gr.update(),
            gr.update(),
        )
        return

    # Show in-progress status and reveal the progress bar.
    yield (
        gr.update(value="🔄 Generating..."),
        gr.update(visible=True, value=0),
        gr.update(),
        gr.update(),
        gr.update(),
        gr.update(),
    )

    try:
        result, generation = await generate_fn(
            model_id="magnet",
            variant=variant,
            prompts=[prompt],
            duration=duration,
            temperature=temperature,
            top_k=int(top_k),
            top_p=top_p,
            cfg_coef=cfg_coef,
            decoding_steps=int(decoding_steps),
            span_arrangement=span_arr,
            # BUG FIX: `if seed` treated a user-entered seed of 0 as
            # "random"; only None (empty field) means random.
            seed=int(seed) if seed is not None else None,
        )

        yield (
            gr.update(value="✅ Generation complete!"),
            gr.update(visible=False),
            gr.update(value=generation.audio_path),
            gr.update(),
            gr.update(value=f"{result.duration:.2f}s"),
            gr.update(value=str(result.seed)),
        )

    except Exception as e:
        # Surface the failure in the status line and hide the progress bar.
        yield (
            gr.update(value=f"❌ Error: {str(e)}"),
            gr.update(visible=False),
            gr.update(),
            gr.update(),
            gr.update(),
            gr.update(),
        )
|
||||||
|
|
||||||
|
generate_btn.click(
|
||||||
|
fn=do_generate,
|
||||||
|
inputs=[
|
||||||
|
prompt_input,
|
||||||
|
variant_dropdown,
|
||||||
|
duration_slider,
|
||||||
|
temperature_slider,
|
||||||
|
cfg_slider,
|
||||||
|
top_k_slider,
|
||||||
|
top_p_slider,
|
||||||
|
decoding_steps_slider,
|
||||||
|
span_arrangement,
|
||||||
|
seed_input,
|
||||||
|
],
|
||||||
|
outputs=[
|
||||||
|
output["status"],
|
||||||
|
output["progress"],
|
||||||
|
output["player"]["audio"],
|
||||||
|
output["player"]["waveform"],
|
||||||
|
output["player"]["duration"],
|
||||||
|
output["player"]["seed"],
|
||||||
|
],
|
||||||
|
)
|
||||||
|
|
||||||
|
# Add to queue
|
||||||
|
def do_add_queue(prompt, variant, duration, temperature, cfg_coef, top_k, top_p, decoding_steps, span_arr, seed):
    """Enqueue a MAGNeT generation job; returns a status message string."""
    if not prompt:
        return "Please enter a prompt"

    job = add_to_queue_fn(
        model_id="magnet",
        variant=variant,
        prompts=[prompt],
        duration=duration,
        temperature=temperature,
        top_k=int(top_k),
        top_p=top_p,
        cfg_coef=cfg_coef,
        decoding_steps=int(decoding_steps),
        span_arrangement=span_arr,
        # BUG FIX: only None (empty field) means "random"; 0 is a valid seed.
        seed=int(seed) if seed is not None else None,
    )

    return f"✅ Added to queue: {job.id}"
|
||||||
|
|
||||||
|
queue_btn.click(
|
||||||
|
fn=do_add_queue,
|
||||||
|
inputs=[
|
||||||
|
prompt_input,
|
||||||
|
variant_dropdown,
|
||||||
|
duration_slider,
|
||||||
|
temperature_slider,
|
||||||
|
cfg_slider,
|
||||||
|
top_k_slider,
|
||||||
|
top_p_slider,
|
||||||
|
decoding_steps_slider,
|
||||||
|
span_arrangement,
|
||||||
|
seed_input,
|
||||||
|
],
|
||||||
|
outputs=[output["status"]],
|
||||||
|
)
|
||||||
|
|
||||||
|
return {
|
||||||
|
"preset": preset_dropdown,
|
||||||
|
"variant": variant_dropdown,
|
||||||
|
"prompt": prompt_input,
|
||||||
|
"duration": duration_slider,
|
||||||
|
"temperature": temperature_slider,
|
||||||
|
"cfg_coef": cfg_slider,
|
||||||
|
"top_k": top_k_slider,
|
||||||
|
"top_p": top_p_slider,
|
||||||
|
"decoding_steps": decoding_steps_slider,
|
||||||
|
"span_arrangement": span_arrangement,
|
||||||
|
"seed": seed_input,
|
||||||
|
"generate_btn": generate_btn,
|
||||||
|
"queue_btn": queue_btn,
|
||||||
|
"output": output,
|
||||||
|
}
|
||||||
325
src/ui/tabs/musicgen_tab.py
Normal file
325
src/ui/tabs/musicgen_tab.py
Normal file
@@ -0,0 +1,325 @@
|
|||||||
|
"""MusicGen tab for text-to-music generation."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
from src.ui.state import DEFAULT_PRESETS, PROMPT_SUGGESTIONS
|
||||||
|
from src.ui.components.audio_player import create_generation_output
|
||||||
|
|
||||||
|
|
||||||
|
# Catalogue of MusicGen checkpoints offered in the variant dropdown.
# `vram_mb` is the approximate GPU memory footprint used to render the
# "(X.XGB)" hint next to each variant name in the UI.
MUSICGEN_VARIANTS = [
    {"id": "small", "name": "Small", "vram_mb": 1500, "description": "Fast, 300M params"},
    {"id": "medium", "name": "Medium", "vram_mb": 5000, "description": "Balanced, 1.5B params"},
    {"id": "large", "name": "Large", "vram_mb": 10000, "description": "Best quality, 3.3B params"},
    {"id": "melody", "name": "Melody", "vram_mb": 5000, "description": "With melody conditioning"},
    {"id": "stereo-small", "name": "Stereo Small", "vram_mb": 1800, "description": "Stereo, 300M params"},
    {"id": "stereo-medium", "name": "Stereo Medium", "vram_mb": 6000, "description": "Stereo, 1.5B params"},
    {"id": "stereo-large", "name": "Stereo Large", "vram_mb": 12000, "description": "Stereo, 3.3B params"},
    {"id": "stereo-melody", "name": "Stereo Melody", "vram_mb": 6000, "description": "Stereo with melody"},
]
|
||||||
|
|
||||||
|
|
||||||
|
def create_musicgen_tab(
    generate_fn: Callable[..., Any],
    add_to_queue_fn: Callable[..., Any],
) -> dict[str, Any]:
    """Create MusicGen generation tab.

    Builds the text-to-music UI (preset/variant selectors, prompt box with
    suggestion buttons, optional melody conditioning shown only for
    "melody" variants, sampling parameters, generate/queue buttons, output
    panel) and wires up all event handlers.

    Args:
        generate_fn: Async callable performing a generation; awaited with
            model/prompt/sampling kwargs and expected to return a
            ``(result, generation)`` pair.
        add_to_queue_fn: Callable that enqueues a generation job and
            returns the created job (exposing an ``id`` attribute).

    Returns:
        Dictionary with component references
    """
    presets = DEFAULT_PRESETS.get("musicgen", [])
    suggestions = PROMPT_SUGGESTIONS.get("musicgen", [])

    with gr.Column():
        gr.Markdown("## 🎵 MusicGen")
        gr.Markdown("Generate music from text descriptions")

        with gr.Row():
            # Left column - inputs
            with gr.Column(scale=2):
                # Preset selector
                preset_choices = [(p["name"], p["id"]) for p in presets] + [("Custom", "custom")]
                preset_dropdown = gr.Dropdown(
                    label="Preset",
                    choices=preset_choices,
                    value=presets[0]["id"] if presets else "custom",
                )

                # Model variant (label shows approximate VRAM need)
                variant_choices = [(f"{v['name']} ({v['vram_mb']/1024:.1f}GB)", v["id"]) for v in MUSICGEN_VARIANTS]
                variant_dropdown = gr.Dropdown(
                    label="Model Variant",
                    choices=variant_choices,
                    value="medium",
                )

                # Prompt input
                prompt_input = gr.Textbox(
                    label="Prompt",
                    placeholder="Describe the music you want to generate...",
                    lines=3,
                    max_lines=5,
                )

                # Prompt suggestions (first four, truncated for button labels)
                with gr.Accordion("Prompt Suggestions", open=False):
                    suggestion_btns = []
                    for i, suggestion in enumerate(suggestions[:4]):
                        btn = gr.Button(suggestion[:60] + "...", size="sm", variant="secondary")
                        suggestion_btns.append((btn, suggestion))

                # Melody conditioning (only revealed for melody variants)
                with gr.Group(visible=False) as melody_section:
                    gr.Markdown("### Melody Conditioning")
                    melody_input = gr.Audio(
                        label="Reference Melody",
                        type="filepath",
                        sources=["upload", "microphone"],
                    )
                    gr.Markdown("*Upload audio to condition generation on its melody*")

                # Parameters
                gr.Markdown("### Parameters")

                duration_slider = gr.Slider(
                    minimum=1,
                    maximum=30,
                    value=10,
                    step=1,
                    label="Duration (seconds)",
                )

                with gr.Accordion("Advanced Parameters", open=False):
                    with gr.Row():
                        temperature_slider = gr.Slider(
                            minimum=0.0,
                            maximum=2.0,
                            value=1.0,
                            step=0.05,
                            label="Temperature",
                        )
                        cfg_slider = gr.Slider(
                            minimum=1.0,
                            maximum=10.0,
                            value=3.0,
                            step=0.5,
                            label="CFG Coefficient",
                        )

                    with gr.Row():
                        top_k_slider = gr.Slider(
                            minimum=0,
                            maximum=500,
                            value=250,
                            step=10,
                            label="Top-K",
                        )
                        top_p_slider = gr.Slider(
                            minimum=0.0,
                            maximum=1.0,
                            value=0.0,
                            step=0.05,
                            label="Top-P",
                        )

                    with gr.Row():
                        seed_input = gr.Number(
                            value=None,
                            label="Seed (empty = random)",
                            precision=0,
                        )

                # Generate buttons
                with gr.Row():
                    generate_btn = gr.Button("🎵 Generate", variant="primary", scale=2)
                    queue_btn = gr.Button("Add to Queue", variant="secondary", scale=1)

            # Right column - output
            with gr.Column(scale=3):
                output = create_generation_output()

    # Event handlers

    # Preset change: push the preset's parameter values into the sliders.
    def apply_preset(preset_id: str):
        for p in presets:
            if p["id"] == preset_id:
                params = p["parameters"]
                return (
                    params.get("duration", 10),
                    params.get("temperature", 1.0),
                    params.get("cfg_coef", 3.0),
                    params.get("top_k", 250),
                    params.get("top_p", 0.0),
                )
        # Custom preset - don't change values
        return gr.update(), gr.update(), gr.update(), gr.update(), gr.update()

    preset_dropdown.change(
        fn=apply_preset,
        inputs=[preset_dropdown],
        outputs=[duration_slider, temperature_slider, cfg_slider, top_k_slider, top_p_slider],
    )

    # Variant change - show/hide melody section
    def on_variant_change(variant: str):
        show_melody = "melody" in variant.lower()
        return gr.update(visible=show_melody)

    variant_dropdown.change(
        fn=on_variant_change,
        inputs=[variant_dropdown],
        outputs=[melody_section],
    )

    # Prompt suggestions: clicking a button copies its full text into the
    # prompt box (default arg binds each suggestion at definition time).
    for btn, suggestion in suggestion_btns:
        btn.click(
            fn=lambda s=suggestion: s,
            outputs=[prompt_input],
        )

    # Generate: async generator streaming updates for
    # (status, progress, audio, waveform, duration label, seed label).
    async def do_generate(
        prompt, variant, duration, temperature, cfg_coef, top_k, top_p, seed, melody
    ):
        if not prompt:
            # BUG FIX: `return <value>` is a SyntaxError inside an async
            # generator; the validation message must be yielded instead,
            # followed by a bare `return`.
            yield (
                gr.update(value="Please enter a prompt"),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )
            return

        # Update status
        yield (
            gr.update(value="🔄 Generating..."),
            gr.update(visible=True, value=0),
            gr.update(),
            gr.update(),
            gr.update(),
            gr.update(),
        )

        try:
            conditioning = {}
            if melody:
                conditioning["melody"] = melody

            result, generation = await generate_fn(
                model_id="musicgen",
                variant=variant,
                prompts=[prompt],
                duration=duration,
                temperature=temperature,
                top_k=int(top_k),
                top_p=top_p,
                cfg_coef=cfg_coef,
                # BUG FIX: `if seed` treated a user-entered 0 as "random";
                # only None (empty field) means random.
                seed=int(seed) if seed is not None else None,
                conditioning=conditioning,
            )

            yield (
                gr.update(value="✅ Generation complete!"),
                gr.update(visible=False),
                gr.update(value=generation.audio_path),
                gr.update(),
                gr.update(value=f"{result.duration:.2f}s"),
                gr.update(value=str(result.seed)),
            )

        except Exception as e:
            yield (
                gr.update(value=f"❌ Error: {str(e)}"),
                gr.update(visible=False),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )

    generate_btn.click(
        fn=do_generate,
        inputs=[
            prompt_input,
            variant_dropdown,
            duration_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
            melody_input,
        ],
        outputs=[
            output["status"],
            output["progress"],
            output["player"]["audio"],
            output["player"]["waveform"],
            output["player"]["duration"],
            output["player"]["seed"],
        ],
    )

    # Add to queue
    def do_add_queue(prompt, variant, duration, temperature, cfg_coef, top_k, top_p, seed, melody):
        """Enqueue a MusicGen job; returns a status message string."""
        if not prompt:
            return "Please enter a prompt"

        conditioning = {}
        if melody:
            conditioning["melody"] = melody

        job = add_to_queue_fn(
            model_id="musicgen",
            variant=variant,
            prompts=[prompt],
            duration=duration,
            temperature=temperature,
            top_k=int(top_k),
            top_p=top_p,
            cfg_coef=cfg_coef,
            # BUG FIX: only None means "random"; 0 is a valid seed.
            seed=int(seed) if seed is not None else None,
            conditioning=conditioning,
        )

        return f"✅ Added to queue: {job.id}"

    queue_btn.click(
        fn=do_add_queue,
        inputs=[
            prompt_input,
            variant_dropdown,
            duration_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
            melody_input,
        ],
        outputs=[output["status"]],
    )

    return {
        "preset": preset_dropdown,
        "variant": variant_dropdown,
        "prompt": prompt_input,
        "melody": melody_input,
        "duration": duration_slider,
        "temperature": temperature_slider,
        "cfg_coef": cfg_slider,
        "top_k": top_k_slider,
        "top_p": top_p_slider,
        "seed": seed_input,
        "generate_btn": generate_btn,
        "queue_btn": queue_btn,
        "output": output,
    }
|
||||||
292
src/ui/tabs/style_tab.py
Normal file
292
src/ui/tabs/style_tab.py
Normal file
@@ -0,0 +1,292 @@
|
|||||||
|
"""MusicGen Style tab for style-conditioned generation."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
from typing import Any, Callable, Optional
|
||||||
|
|
||||||
|
from src.ui.state import DEFAULT_PRESETS, PROMPT_SUGGESTIONS
|
||||||
|
from src.ui.components.audio_player import create_generation_output
|
||||||
|
|
||||||
|
|
||||||
|
# MusicGen-Style checkpoints offered in the variant dropdown; `vram_mb` is
# the approximate GPU memory footprint shown as a "(X.XGB)" hint in the UI.
STYLE_VARIANTS = [
    {"id": "medium", "name": "Medium", "vram_mb": 5000, "description": "1.5B params, style conditioning"},
]
|
||||||
|
|
||||||
|
|
||||||
|
def create_style_tab(
    generate_fn: Callable[..., Any],
    add_to_queue_fn: Callable[..., Any],
) -> dict[str, Any]:
    """Create MusicGen Style generation tab.

    Builds the style-conditioned music UI: preset/variant selectors, an
    optional text prompt, a required style-reference audio input, sampling
    parameters, and generate/queue buttons wired to the supplied callbacks.

    Args:
        generate_fn: Async callable performing a generation; awaited with
            model/prompt/sampling kwargs and expected to return a
            ``(result, generation)`` pair.
        add_to_queue_fn: Callable that enqueues a generation job and
            returns the created job (exposing an ``id`` attribute).

    Returns:
        Dictionary with component references
    """
    presets = DEFAULT_PRESETS.get("musicgen-style", [])

    with gr.Column():
        gr.Markdown("## 🎨 MusicGen Style")
        gr.Markdown("Generate music conditioned on the style of reference audio")

        with gr.Row():
            # Left column - inputs
            with gr.Column(scale=2):
                # Preset selector
                preset_choices = [(p["name"], p["id"]) for p in presets] + [("Custom", "custom")]
                preset_dropdown = gr.Dropdown(
                    label="Preset",
                    choices=preset_choices,
                    value=presets[0]["id"] if presets else "custom",
                )

                # Model variant (label shows approximate VRAM need)
                variant_choices = [(f"{v['name']} ({v['vram_mb']/1024:.1f}GB)", v["id"]) for v in STYLE_VARIANTS]
                variant_dropdown = gr.Dropdown(
                    label="Model Variant",
                    choices=variant_choices,
                    value="medium",
                )

                # Prompt input (optional; style audio is the main signal)
                prompt_input = gr.Textbox(
                    label="Text Prompt",
                    placeholder="Describe additional characteristics for the music...",
                    lines=3,
                    max_lines=5,
                    info="Optional: combine with style conditioning",
                )

                # Style conditioning (required)
                gr.Markdown("### Style Conditioning")
                gr.Markdown("*Upload reference audio to extract musical style*")

                style_input = gr.Audio(
                    label="Style Reference",
                    type="filepath",
                    sources=["upload", "microphone"],
                )

                gr.Markdown(
                    "*The model will learn the style (instrumentation, tempo, mood) from this audio*"
                )

                # Parameters
                gr.Markdown("### Parameters")

                duration_slider = gr.Slider(
                    minimum=1,
                    maximum=30,
                    value=10,
                    step=1,
                    label="Duration (seconds)",
                )

                with gr.Accordion("Advanced Parameters", open=False):
                    with gr.Row():
                        temperature_slider = gr.Slider(
                            minimum=0.0,
                            maximum=2.0,
                            value=1.0,
                            step=0.05,
                            label="Temperature",
                        )
                        cfg_slider = gr.Slider(
                            minimum=1.0,
                            maximum=10.0,
                            value=3.0,
                            step=0.5,
                            label="CFG Coefficient",
                        )

                    with gr.Row():
                        top_k_slider = gr.Slider(
                            minimum=0,
                            maximum=500,
                            value=250,
                            step=10,
                            label="Top-K",
                        )
                        top_p_slider = gr.Slider(
                            minimum=0.0,
                            maximum=1.0,
                            value=0.0,
                            step=0.05,
                            label="Top-P",
                        )

                    with gr.Row():
                        seed_input = gr.Number(
                            value=None,
                            label="Seed (empty = random)",
                            precision=0,
                        )

                # Generate buttons
                with gr.Row():
                    generate_btn = gr.Button("🎨 Generate", variant="primary", scale=2)
                    queue_btn = gr.Button("Add to Queue", variant="secondary", scale=1)

            # Right column - output
            with gr.Column(scale=3):
                output = create_generation_output()

    # Event handlers

    # Preset change: push the preset's parameter values into the sliders.
    def apply_preset(preset_id: str):
        for p in presets:
            if p["id"] == preset_id:
                params = p["parameters"]
                return (
                    params.get("duration", 10),
                    params.get("temperature", 1.0),
                    params.get("cfg_coef", 3.0),
                    params.get("top_k", 250),
                    params.get("top_p", 0.0),
                )
        # Custom / unknown preset: leave controls unchanged.
        return gr.update(), gr.update(), gr.update(), gr.update(), gr.update()

    preset_dropdown.change(
        fn=apply_preset,
        inputs=[preset_dropdown],
        outputs=[duration_slider, temperature_slider, cfg_slider, top_k_slider, top_p_slider],
    )

    # Generate: async generator streaming updates for
    # (status, progress, audio, waveform, duration label, seed label).
    async def do_generate(
        prompt, variant, style_audio, duration, temperature, cfg_coef, top_k, top_p, seed
    ):
        if not style_audio:
            # BUG FIX: `return <value>` is a SyntaxError inside an async
            # generator; the validation message must be yielded instead,
            # followed by a bare `return`.
            yield (
                gr.update(value="Please upload a style reference audio"),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )
            return

        yield (
            gr.update(value="🔄 Generating..."),
            gr.update(visible=True, value=0),
            gr.update(),
            gr.update(),
            gr.update(),
            gr.update(),
        )

        try:
            conditioning = {"style": style_audio}

            result, generation = await generate_fn(
                model_id="musicgen-style",
                variant=variant,
                # Text prompt is optional; fall back to an empty prompt.
                prompts=[prompt] if prompt else [""],
                duration=duration,
                temperature=temperature,
                top_k=int(top_k),
                top_p=top_p,
                cfg_coef=cfg_coef,
                # BUG FIX: `if seed` treated a user-entered 0 as "random";
                # only None (empty field) means random.
                seed=int(seed) if seed is not None else None,
                conditioning=conditioning,
            )

            yield (
                gr.update(value="✅ Generation complete!"),
                gr.update(visible=False),
                gr.update(value=generation.audio_path),
                gr.update(),
                gr.update(value=f"{result.duration:.2f}s"),
                gr.update(value=str(result.seed)),
            )

        except Exception as e:
            yield (
                gr.update(value=f"❌ Error: {str(e)}"),
                gr.update(visible=False),
                gr.update(),
                gr.update(),
                gr.update(),
                gr.update(),
            )

    generate_btn.click(
        fn=do_generate,
        inputs=[
            prompt_input,
            variant_dropdown,
            style_input,
            duration_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
        ],
        outputs=[
            output["status"],
            output["progress"],
            output["player"]["audio"],
            output["player"]["waveform"],
            output["player"]["duration"],
            output["player"]["seed"],
        ],
    )

    # Add to queue
    def do_add_queue(prompt, variant, style_audio, duration, temperature, cfg_coef, top_k, top_p, seed):
        """Enqueue a MusicGen Style job; returns a status message string."""
        if not style_audio:
            return "Please upload a style reference audio"

        conditioning = {"style": style_audio}

        job = add_to_queue_fn(
            model_id="musicgen-style",
            variant=variant,
            prompts=[prompt] if prompt else [""],
            duration=duration,
            temperature=temperature,
            top_k=int(top_k),
            top_p=top_p,
            cfg_coef=cfg_coef,
            # BUG FIX: only None means "random"; 0 is a valid seed.
            seed=int(seed) if seed is not None else None,
            conditioning=conditioning,
        )

        return f"✅ Added to queue: {job.id}"

    queue_btn.click(
        fn=do_add_queue,
        inputs=[
            prompt_input,
            variant_dropdown,
            style_input,
            duration_slider,
            temperature_slider,
            cfg_slider,
            top_k_slider,
            top_p_slider,
            seed_input,
        ],
        outputs=[output["status"]],
    )

    return {
        "preset": preset_dropdown,
        "variant": variant_dropdown,
        "prompt": prompt_input,
        "style": style_input,
        "duration": duration_slider,
        "temperature": temperature_slider,
        "cfg_coef": cfg_slider,
        "top_k": top_k_slider,
        "top_p": top_p_slider,
        "seed": seed_input,
        "generate_btn": generate_btn,
        "queue_btn": queue_btn,
        "output": output,
    }
|
||||||
303
src/ui/theme.py
Normal file
303
src/ui/theme.py
Normal file
@@ -0,0 +1,303 @@
|
|||||||
|
"""Custom Gradio theme for AudioCraft Studio."""
|
||||||
|
|
||||||
|
import gradio as gr
|
||||||
|
|
||||||
|
|
||||||
|
def create_theme() -> gr.themes.Base:
    """Create custom theme for AudioCraft Studio.

    Dark slate palette with blue accents; Inter for UI text and
    JetBrains Mono for monospaced text.

    Returns:
        Gradio theme instance
    """
    # Named palette so every color appears exactly once.
    bg_body = "#0f172a"
    bg_panel = "#1e293b"
    bg_raised = "#334155"
    border = "#475569"
    accent = "#3b82f6"
    accent_hover = "#2563eb"
    secondary_hover = "#64748b"
    text_main = "#e2e8f0"
    text_muted = "#94a3b8"
    text_title = "#f1f5f9"

    base = gr.themes.Soft(
        primary_hue=gr.themes.colors.blue,
        secondary_hue=gr.themes.colors.slate,
        neutral_hue=gr.themes.colors.gray,
        font=[
            gr.themes.GoogleFont("Inter"),
            "ui-sans-serif",
            "system-ui",
            "sans-serif",
        ],
        font_mono=[
            gr.themes.GoogleFont("JetBrains Mono"),
            "ui-monospace",
            "monospace",
        ],
    )

    return base.set(
        # Colors
        body_background_fill=bg_body,
        body_background_fill_dark=bg_body,
        background_fill_primary=bg_panel,
        background_fill_primary_dark=bg_panel,
        background_fill_secondary=bg_raised,
        background_fill_secondary_dark=bg_raised,
        border_color_primary=border,
        border_color_primary_dark=border,

        # Text
        body_text_color=text_main,
        body_text_color_dark=text_main,
        body_text_color_subdued=text_muted,
        body_text_color_subdued_dark=text_muted,

        # Buttons
        button_primary_background_fill=accent,
        button_primary_background_fill_dark=accent,
        button_primary_background_fill_hover=accent_hover,
        button_primary_background_fill_hover_dark=accent_hover,
        button_primary_text_color="#ffffff",
        button_primary_text_color_dark="#ffffff",

        button_secondary_background_fill=border,
        button_secondary_background_fill_dark=border,
        button_secondary_background_fill_hover=secondary_hover,
        button_secondary_background_fill_hover_dark=secondary_hover,

        # Inputs
        input_background_fill=bg_panel,
        input_background_fill_dark=bg_panel,
        input_border_color=border,
        input_border_color_dark=border,
        input_border_color_focus=accent,
        input_border_color_focus_dark=accent,

        # Blocks
        block_background_fill=bg_panel,
        block_background_fill_dark=bg_panel,
        block_border_color=bg_raised,
        block_border_color_dark=bg_raised,
        block_label_background_fill=bg_raised,
        block_label_background_fill_dark=bg_raised,
        block_label_text_color=text_main,
        block_label_text_color_dark=text_main,
        block_title_text_color=text_title,
        block_title_text_color_dark=text_title,

        # Tabs
        tab_nav_background_fill=bg_panel,

        # Sliders
        slider_color=accent,
        slider_color_dark=accent,

        # Shadows
        shadow_spread="4px",
        block_shadow="0 4px 6px -1px rgba(0, 0, 0, 0.3)",

        # Spacing
        layout_gap="16px",
        block_padding="16px",
        panel_border_width="1px",

        # Radius
        radius_sm="6px",
        radius_md="8px",
        radius_lg="12px",
    )
|
||||||
|
|
||||||
|
|
||||||
|
# CSS overrides for additional customization beyond what the Gradio theme
# object exposes. Passed to gr.Blocks(css=CUSTOM_CSS); class names here
# (.sidebar, .model-card, .vram-bar, ...) must match elem_classes used
# when the corresponding components are created.
CUSTOM_CSS = """
/* Global styles */
.gradio-container {
    max-width: 100% !important;
}

/* Header styling */
.header-title {
    font-size: 1.5rem;
    font-weight: 700;
    color: #f1f5f9;
}

/* Sidebar styling */
.sidebar {
    background: #1e293b;
    border-right: 1px solid #334155;
    padding: 1rem;
}

.sidebar-nav-btn {
    width: 100%;
    justify-content: flex-start;
    margin-bottom: 0.5rem;
}

/* Model cards */
.model-card {
    background: #334155;
    border-radius: 12px;
    padding: 1rem;
    transition: transform 0.2s, box-shadow 0.2s;
}

.model-card:hover {
    transform: translateY(-2px);
    box-shadow: 0 8px 25px rgba(0, 0, 0, 0.3);
}

/* Audio player */
.audio-player {
    background: #1e293b;
    border-radius: 8px;
    padding: 1rem;
}

/* Progress bar */
.progress-bar {
    background: #334155;
    border-radius: 4px;
    overflow: hidden;
}

.progress-fill {
    background: linear-gradient(90deg, #3b82f6, #8b5cf6);
    height: 100%;
    transition: width 0.3s ease;
}

/* VRAM monitor */
.vram-bar {
    background: #334155;
    border-radius: 4px;
    height: 24px;
    position: relative;
    overflow: hidden;
}

.vram-fill {
    position: absolute;
    left: 0;
    top: 0;
    height: 100%;
    background: linear-gradient(90deg, #22c55e, #eab308, #ef4444);
    transition: width 0.5s ease;
}

.vram-text {
    position: absolute;
    width: 100%;
    text-align: center;
    line-height: 24px;
    font-size: 0.875rem;
    font-weight: 500;
    color: white;
    text-shadow: 0 1px 2px rgba(0, 0, 0, 0.5);
}

/* Queue badge */
.queue-badge {
    background: #3b82f6;
    color: white;
    padding: 0.25rem 0.75rem;
    border-radius: 9999px;
    font-size: 0.875rem;
    font-weight: 500;
}

/* Generation card */
.generation-card {
    background: #334155;
    border-radius: 8px;
    padding: 1rem;
    margin-bottom: 0.5rem;
}

/* Preset chips */
.preset-chip {
    display: inline-block;
    background: #475569;
    color: #e2e8f0;
    padding: 0.25rem 0.75rem;
    border-radius: 9999px;
    font-size: 0.875rem;
    margin: 0.25rem;
    cursor: pointer;
    transition: background 0.2s;
}

.preset-chip:hover {
    background: #3b82f6;
}

.preset-chip.active {
    background: #3b82f6;
}

/* Tag input */
.tag {
    display: inline-flex;
    align-items: center;
    background: #475569;
    color: #e2e8f0;
    padding: 0.25rem 0.5rem;
    border-radius: 4px;
    font-size: 0.75rem;
    margin: 0.125rem;
}

/* Accordion tweaks */
.accordion-header {
    font-weight: 600;
    color: #f1f5f9;
}

/* Status indicators */
.status-dot {
    width: 8px;
    height: 8px;
    border-radius: 50%;
    display: inline-block;
    margin-right: 0.5rem;
}

.status-dot.loaded {
    background: #22c55e;
}

.status-dot.unloaded {
    background: #64748b;
}

.status-dot.loading {
    background: #eab308;
    animation: pulse 1s infinite;
}

@keyframes pulse {
    0%, 100% { opacity: 1; }
    50% { opacity: 0.5; }
}

/* Tooltip */
.tooltip {
    position: relative;
}

.tooltip:hover::after {
    content: attr(data-tooltip);
    position: absolute;
    bottom: 100%;
    left: 50%;
    transform: translateX(-50%);
    background: #1e293b;
    color: #e2e8f0;
    padding: 0.5rem;
    border-radius: 4px;
    font-size: 0.75rem;
    white-space: nowrap;
    z-index: 100;
}

/* Responsive adjustments */
@media (max-width: 768px) {
    .sidebar {
        display: none;
    }

    .mobile-nav {
        display: flex !important;
    }
}
"""
|
||||||
Reference in New Issue
Block a user