Improve the demo feature of the project

Urtzi Alfaro
2025-10-12 18:47:33 +02:00
parent dbc7f2fa0d
commit 7556a00db7
168 changed files with 10102 additions and 18869 deletions


@@ -1,499 +0,0 @@
# Demo Architecture - Production Demo System
## Overview
This document describes the complete demo architecture for providing prospects with isolated, ephemeral demo sessions to explore the Bakery IA platform.
## Key Features
- **Session Isolation**: Each prospect gets their own isolated copy of demo data
- **Spanish Content**: All demo data in Spanish for the Spanish market
- **Two Business Models**: Individual bakery and central baker satellite
- **Automatic Cleanup**: Sessions automatically expire after 30 minutes
- **Read-Mostly Access**: Prospects can explore but critical operations are restricted
- **Production Ready**: Scalable to 200+ concurrent demo sessions
## Architecture Components
### 1. Demo Session Service
**Location**: `services/demo_session/`
**Responsibilities**:
- Create isolated demo sessions
- Manage session lifecycle (create, extend, destroy)
- Clone base demo data to virtual tenants
- Track session metrics and activity
**Key Endpoints**:
```
GET /api/demo/accounts # Get public demo account info
POST /api/demo/session/create # Create new demo session
POST /api/demo/session/extend # Extend session expiration
POST /api/demo/session/destroy # Destroy session
GET /api/demo/session/{id} # Get session info
GET /api/demo/stats # Get usage statistics
```
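A minimal sketch of how the session-creation endpoint could be wired in FastAPI follows; the route path matches the list above, but the schema names and inline response construction are illustrative assumptions rather than the service's actual code:
```python
# Hypothetical sketch of the session-creation endpoint; names are illustrative.
import uuid
from datetime import datetime, timedelta, timezone

from fastapi import APIRouter
from pydantic import BaseModel

router = APIRouter(prefix="/api/demo")

class SessionCreateRequest(BaseModel):
    demo_account_type: str  # "individual_bakery" or "central_baker_satellite"

class SessionCreateResponse(BaseModel):
    session_id: str
    virtual_tenant_id: str
    expires_at: datetime
    session_token: str

@router.post("/session/create", response_model=SessionCreateResponse)
async def create_session(payload: SessionCreateRequest) -> SessionCreateResponse:
    # The real service also persists the session, triggers the clone Job, and issues a JWT.
    return SessionCreateResponse(
        session_id=f"demo_{uuid.uuid4().hex}",
        virtual_tenant_id=str(uuid.uuid4()),
        expires_at=datetime.now(timezone.utc) + timedelta(minutes=30),
        session_token="<JWT issued by the session manager>",
    )
```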
### 2. Demo Data Seeding
**Location**: `scripts/demo/`
**Scripts**:
- `seed_demo_users.py` - Creates demo user accounts
- `seed_demo_tenants.py` - Creates base demo tenants (templates)
- `seed_demo_inventory.py` - Populates inventory with Spanish data (25 ingredients per template)
- `clone_demo_tenant.py` - Clones data from base template to virtual tenant (runs as K8s Job)
**Demo Accounts**:
#### Individual Bakery (Panadería San Pablo)
```
Email: demo.individual@panaderiasanpablo.com
Password: DemoSanPablo2024!
Business Model: Producción Local
Features: Production, Recipes, Inventory, Forecasting, POS, Sales
```
#### Central Baker Satellite (Panadería La Espiga)
```
Email: demo.central@panaderialaespiga.com
Password: DemoLaEspiga2024!
Business Model: Obrador Central + Punto de Venta
Features: Suppliers, Inventory, Orders, POS, Sales, Forecasting
```
### 3. Gateway Middleware
**Location**: `gateway/app/middleware/demo_middleware.py`
**Responsibilities**:
- Intercept requests with demo session IDs
- Inject virtual tenant ID
- Enforce operation restrictions
- Track session activity
**Allowed Operations**:
```
# Read - all allowed
GET, HEAD, OPTIONS: *
# Limited write - realistic testing
POST: /api/pos/sales, /api/orders, /api/inventory/adjustments
PUT: /api/pos/sales/*, /api/orders/*
# Blocked
DELETE: all paths (destructive operations are never allowed in demo sessions)
```
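A minimal sketch of how the middleware could evaluate this rule table (the dictionary mirrors the configuration shown later in this document; the function name and wildcard matching are assumptions):
```python
# Illustrative rule check; the exact matching logic in the gateway may differ.
from fnmatch import fnmatch

DEMO_ALLOWED_OPERATIONS = {
    "GET": ["*"], "HEAD": ["*"], "OPTIONS": ["*"],
    "POST": ["/api/pos/sales", "/api/orders", "/api/inventory/adjustments"],
    "PUT": ["/api/pos/sales/*", "/api/orders/*"],
    "DELETE": [],  # destructive operations are always blocked
}

def is_operation_allowed(method: str, path: str) -> bool:
    patterns = DEMO_ALLOWED_OPERATIONS.get(method.upper(), [])
    return any(fnmatch(path, pattern) for pattern in patterns)

assert is_operation_allowed("GET", "/api/inventory/ingredients")
assert not is_operation_allowed("DELETE", "/api/inventory/ingredients/123")
```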
### 4. Redis Cache Layer
**Purpose**: Store frequently accessed demo session data
**Data Cached**:
- Session metadata
- Inventory summaries
- POS session data
- Recent sales
**TTL**: 30 minutes (auto-cleanup)
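A hedged sketch of how session metadata could be cached with that TTL (the key layout and the `redis-service` hostname are assumptions based on the manifests referenced elsewhere in this document):
```python
# Illustrative Redis caching of demo session metadata with a 30-minute TTL.
import json
import redis

SESSION_TTL_SECONDS = 30 * 60  # matches the documented 30-minute session lifetime
r = redis.Redis(host="redis-service", port=6379, decode_responses=True)

def cache_session(session_id: str, metadata: dict) -> None:
    # setex stores the value and lets Redis expire it automatically.
    r.setex(f"demo:session:{session_id}", SESSION_TTL_SECONDS, json.dumps(metadata))

def get_cached_session(session_id: str) -> dict | None:
    raw = r.get(f"demo:session:{session_id}")
    return json.loads(raw) if raw else None
```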
### 5. Kubernetes Resources
**Databases**:
- `demo-session-db` - Tracks session records
**Services**:
- `demo-session-service` - Main demo service (2 replicas)
**Jobs** (Initialization):
- `demo-seed-users` - Creates demo users
- `demo-seed-tenants` - Creates demo tenant templates
- `demo-seed-inventory` - Populates inventory data (25 ingredients per tenant)
**Dynamic Jobs** (Runtime):
- `demo-clone-{virtual_tenant_id}` - Created per session to clone data from template
**CronJob** (Maintenance):
- `demo-session-cleanup` - Runs hourly to cleanup expired sessions
**RBAC**:
- `demo-session-sa` - ServiceAccount for demo-session-service
- `demo-session-job-creator` - Role allowing job creation and pod management
- `demo-seed-role` - Role for seed jobs to access databases
## Data Flow
### Session Creation
```
1. User clicks "Probar Demo" on website
2. Frontend calls POST /api/demo/session/create
{
"demo_account_type": "individual_bakery"
}
3. Demo Session Service:
- Generates unique session_id: "demo_abc123..."
- Creates virtual_tenant_id: UUID
- Stores session in database
- Returns session_token (JWT)
4. Kubernetes Job Cloning (background):
- Demo service triggers K8s Job with clone script
- Job container uses CLONE_JOB_IMAGE (inventory-service image)
- Clones inventory data from base template tenant
- Uses ORM models for safe data copying
- Job runs with IfNotPresent pull policy (works in dev & prod)
5. Frontend receives:
{
"session_id": "demo_abc123...",
"virtual_tenant_id": "uuid-here",
"expires_at": "2025-10-02T12:30:00Z",
"session_token": "eyJ..."
}
6. Frontend stores session_token in cookie/localStorage
All subsequent requests include:
Header: X-Demo-Session-Id: demo_abc123...
```
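A minimal sketch of the identifier and token generation in step 3, using PyJWT (the secret, claim names, and duration constant are illustrative assumptions, not the service's actual configuration):
```python
# Illustrative session identifier and JWT generation for step 3 above.
import secrets
import uuid
from datetime import datetime, timedelta, timezone

import jwt  # PyJWT

SESSION_DURATION = timedelta(minutes=30)
JWT_SECRET = "change-me"  # the real service reads this from configuration

def new_demo_session(demo_account_type: str) -> dict:
    session_id = f"demo_{secrets.token_hex(16)}"
    virtual_tenant_id = str(uuid.uuid4())
    expires_at = datetime.now(timezone.utc) + SESSION_DURATION
    session_token = jwt.encode(
        {"session_id": session_id, "tenant_id": virtual_tenant_id,
         "account_type": demo_account_type, "exp": expires_at},
        JWT_SECRET, algorithm="HS256",
    )
    return {
        "session_id": session_id,
        "virtual_tenant_id": virtual_tenant_id,
        "expires_at": expires_at.isoformat(),
        "session_token": session_token,
    }
```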
### Request Handling
```
1. Request arrives at Gateway
2. Demo Middleware checks:
- Is X-Demo-Session-Id present?
- Is session still active?
- Is operation allowed?
3. If valid:
- Injects X-Tenant-Id: {virtual_tenant_id}
- Routes to appropriate service
4. Service processes request:
- Reads/writes data for virtual tenant
- No knowledge of demo vs. real tenant
5. Response returned to user
```
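A hedged sketch of steps 2-3: validating the session and rewriting the tenant header before the request is forwarded. The ASGI-scope header rewrite and the `lookup_session` helper are illustrative assumptions, not the gateway's actual code:
```python
# Illustrative header injection inside a Starlette/FastAPI middleware.
from fastapi import Request
from fastapi.responses import JSONResponse
from starlette.middleware.base import BaseHTTPMiddleware

async def lookup_session(session_id: str) -> dict | None:
    # Placeholder: the real service would query the demo_sessions table / Redis.
    return {"status": "active", "virtual_tenant_id": "00000000-0000-0000-0000-000000000000"}

class DemoTenantInjector(BaseHTTPMiddleware):  # hypothetical name
    async def dispatch(self, request: Request, call_next):
        session_id = request.headers.get("X-Demo-Session-Id")
        if session_id:
            session = await lookup_session(session_id)
            if session is None or session["status"] != "active":
                return JSONResponse({"error": "demo_session_expired"}, status_code=401)
            # Rewrite the ASGI scope so downstream services see the virtual tenant.
            headers = [(k, v) for k, v in request.scope["headers"] if k != b"x-tenant-id"]
            headers.append((b"x-tenant-id", session["virtual_tenant_id"].encode()))
            request.scope["headers"] = headers
        return await call_next(request)
```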
### Session Cleanup
```
Every hour (CronJob):
1. Demo Cleanup Service queries:
SELECT * FROM demo_sessions
WHERE status = 'active'
AND expires_at < NOW()
2. For each expired session:
- Mark as 'expired'
- Delete all virtual tenant data
- Delete Redis keys
- Update statistics
3. Weekly cleanup:
DELETE FROM demo_sessions
WHERE status = 'destroyed'
AND destroyed_at < NOW() - INTERVAL '7 days'
```
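A hedged sketch of the hourly pass (the SQLAlchemy and Redis calls are illustrative; the table and status values follow the schema below, and the connection strings are placeholders):
```python
# Illustrative expiry pass run by the hourly cleanup CronJob.
import redis
from sqlalchemy import create_engine, text

engine = create_engine("postgresql://demo:changeme@demo-session-db/demo_sessions")  # placeholder DSN
cache = redis.Redis(host="redis-service")

def cleanup_expired_sessions() -> int:
    with engine.begin() as conn:
        rows = conn.execute(text(
            "SELECT session_id FROM demo_sessions "
            "WHERE status = 'active' AND expires_at < NOW()"
        )).fetchall()
        for (session_id,) in rows:
            conn.execute(text(
                "UPDATE demo_sessions SET status = 'expired' WHERE session_id = :sid"
            ), {"sid": session_id})
            cache.delete(f"demo:session:{session_id}")
            # Deleting the virtual tenant's cloned data is handled by the cleanup service.
    return len(rows)
```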
## Database Schema
### demo_sessions Table
```sql
CREATE TABLE demo_sessions (
id UUID PRIMARY KEY,
session_id VARCHAR(100) UNIQUE NOT NULL,
-- Ownership
user_id UUID,
ip_address VARCHAR(45),
user_agent VARCHAR(500),
-- Demo linking
base_demo_tenant_id UUID NOT NULL,
virtual_tenant_id UUID NOT NULL,
demo_account_type VARCHAR(50) NOT NULL,
-- Lifecycle
status VARCHAR(20) NOT NULL, -- active, expired, destroyed
created_at TIMESTAMP WITH TIME ZONE NOT NULL,
expires_at TIMESTAMP WITH TIME ZONE NOT NULL,
last_activity_at TIMESTAMP WITH TIME ZONE,
destroyed_at TIMESTAMP WITH TIME ZONE,
-- Metrics
request_count INTEGER DEFAULT 0,
data_cloned BOOLEAN DEFAULT FALSE,
redis_populated BOOLEAN DEFAULT FALSE,
-- Metadata
metadata JSONB
);
CREATE INDEX idx_session_id ON demo_sessions(session_id);
CREATE INDEX idx_virtual_tenant ON demo_sessions(virtual_tenant_id);
CREATE INDEX idx_status ON demo_sessions(status);
CREATE INDEX idx_expires_at ON demo_sessions(expires_at);
```
### tenants Table (Updated)
```sql
ALTER TABLE tenants ADD COLUMN is_demo BOOLEAN DEFAULT FALSE;
ALTER TABLE tenants ADD COLUMN is_demo_template BOOLEAN DEFAULT FALSE;
ALTER TABLE tenants ADD COLUMN base_demo_tenant_id UUID;
ALTER TABLE tenants ADD COLUMN demo_session_id VARCHAR(100);
ALTER TABLE tenants ADD COLUMN demo_expires_at TIMESTAMP WITH TIME ZONE;
CREATE INDEX idx_is_demo ON tenants(is_demo);
CREATE INDEX idx_demo_session ON tenants(demo_session_id);
```
## Deployment
### Initial Deployment
```bash
# 1. Deploy infrastructure (databases, redis, rabbitmq)
kubectl apply -k infrastructure/kubernetes/overlays/prod
# 2. Run migrations
# (Automatically handled by migration jobs)
# 3. Seed demo data
# (Automatically handled by demo-seed-* jobs)
# 4. Verify demo system
kubectl get jobs -n bakery-ia | grep demo-seed
kubectl logs -f job/demo-seed-users -n bakery-ia
kubectl logs -f job/demo-seed-tenants -n bakery-ia
kubectl logs -f job/demo-seed-inventory -n bakery-ia
# 5. Test demo session creation
curl -X POST http://your-domain/api/demo/session/create \
-H "Content-Type: application/json" \
-d '{"demo_account_type": "individual_bakery"}'
```
### Using Tilt (Local Development)
```bash
# Start Tilt
tilt up
# Demo resources in Tilt UI:
# - databases: demo-session-db
# - migrations: demo-session-migration
# - services: demo-session-service
# - demo-init: demo-seed-users, demo-seed-tenants, demo-seed-inventory
# - config: patch-demo-session-env (sets CLONE_JOB_IMAGE dynamically)
# Tilt automatically:
# 1. Gets inventory-service image tag (e.g., tilt-abc123)
# 2. Patches demo-session-service with CLONE_JOB_IMAGE env var
# 3. Clone jobs use this image with IfNotPresent pull policy
```
## Monitoring
### Key Metrics
```
# Session Statistics
GET /api/demo/stats
{
  "total_sessions": 1250,
  "active_sessions": 45,
  "expired_sessions": 980,
  "destroyed_sessions": 225,
  "avg_duration_minutes": 18.5,
  "total_requests": 125000
}
```
### Health Checks
```bash
# Demo Session Service
curl http://demo-session-service:8000/health
# Check active sessions
kubectl exec -it deployment/demo-session-service -- \
python -c "from app.services import *; print(get_active_sessions())"
```
### Logs
```bash
# Demo session service logs
kubectl logs -f deployment/demo-session-service -n bakery-ia
# Demo seed job logs
kubectl logs job/demo-seed-inventory -n bakery-ia
# Cleanup cron job logs
kubectl logs -l app=demo-cleanup -n bakery-ia --tail=100
```
## Scaling Considerations
### Current Limits
- **Concurrent Sessions**: ~200 (2 replicas × ~100 sessions each)
- **Redis Memory**: ~1-2 GB (10 MB per session × 200)
- **PostgreSQL**: ~5-10 GB (30 MB per virtual tenant × 200)
- **Session Duration**: 30 minutes (configurable)
- **Extensions**: Maximum 3 per session
### Scaling Up
```bash
# Scale demo-session-service
kubectl scale deployment/demo-session-service --replicas=4 -n bakery-ia

# Increase Redis memory (if needed): edit the Redis deployment and raise its memory limits

# Adjust session settings in the demo-session ConfigMap:
#   DEMO_SESSION_DURATION_MINUTES: 45   # increase session time
#   DEMO_SESSION_MAX_EXTENSIONS: 5      # allow more extensions
```
## Security
### Public Demo Credentials
Demo credentials are **intentionally public** for prospect access:
- Published on marketing website
- Included in demo documentation
- Safe because sessions are isolated and ephemeral
### Restrictions
1. **No Destructive Operations**: DELETE blocked
2. **Limited Modifications**: Only realistic testing operations
3. **No Sensitive Data Access**: Cannot change passwords, billing, etc.
4. **Automatic Expiration**: Sessions auto-destroy after 30 minutes
5. **Rate Limiting**: Standard gateway rate limits apply
6. **No AI Training**: Forecast API blocked for demo accounts (no trained models)
7. **Scheduler Prevention**: Procurement scheduler filters out demo tenants
### Data Privacy
- No real customer data in demo tenants
- Session data automatically deleted
- Anonymized analytics only
## Troubleshooting
### Session Creation Fails
```bash
# Check demo-session-service health
kubectl get pods -l app=demo-session-service -n bakery-ia
# Check logs
kubectl logs deployment/demo-session-service -n bakery-ia --tail=50
# Verify base demo tenants exist
kubectl exec -it deployment/tenant-service -- \
psql $TENANT_DATABASE_URL -c \
"SELECT id, name, subdomain FROM tenants WHERE is_demo_template = true;"
```
### Sessions Not Cleaning Up
```bash
# Check cleanup cronjob
kubectl get cronjobs -n bakery-ia
kubectl get jobs -l app=demo-cleanup -n bakery-ia
# Manually trigger cleanup
kubectl create job --from=cronjob/demo-session-cleanup manual-cleanup-$(date +%s) -n bakery-ia
# Check for orphaned sessions
kubectl exec -it deployment/demo-session-service -- \
psql $DEMO_SESSION_DATABASE_URL -c \
"SELECT status, COUNT(*) FROM demo_sessions GROUP BY status;"
```
### Redis Connection Issues
```bash
# Test Redis connectivity
kubectl exec -it deployment/demo-session-service -- \
python -c "import redis; r=redis.Redis(host='redis-service'); print(r.ping())"
# Check Redis memory usage
kubectl exec -it deployment/redis -- redis-cli INFO memory
```
## Technical Implementation Details
### Data Cloning Architecture
**Choice: Kubernetes Job-based Cloning** (selected over service-based endpoints)
**Why K8s Jobs**:
- Database-level operations (faster than API calls)
- Scalable (one job per session, isolated execution)
- No service-specific clone endpoints needed
- Works in both dev (Tilt) and production
**How it Works**:
1. Demo-session-service creates K8s Job via K8s API
2. Job uses `CLONE_JOB_IMAGE` environment variable (configured image)
3. In **Dev (Tilt)**: `patch-demo-session-env` sets dynamic Tilt image tag
4. In **Production**: Deployment manifest has stable release tag
5. Job runs `clone_demo_tenant.py` with `imagePullPolicy: IfNotPresent`
6. Script uses ORM models to clone data safely
**Environment-based Image Configuration**:
```yaml
# Demo-session deployment
env:
  - name: CLONE_JOB_IMAGE
    value: "bakery/inventory-service:latest"  # Overridden by Tilt in dev
# Tilt automatically patches this to match the actual inventory-service tag,
# e.g., bakery/inventory-service:tilt-abc123
```
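A hedged sketch of step 1 above, creating the per-session clone Job through the Kubernetes Python client (namespace, job naming, and the script invocation are assumptions based on the conventions in this document):
```python
# Illustrative creation of a per-session clone Job via the Kubernetes Python client.
import os
from kubernetes import client, config

def launch_clone_job(virtual_tenant_id: str, base_tenant_id: str) -> None:
    config.load_incluster_config()  # demo-session-service runs inside the cluster
    job = client.V1Job(
        metadata=client.V1ObjectMeta(name=f"demo-clone-{virtual_tenant_id[:8]}"),
        spec=client.V1JobSpec(
            backoff_limit=2,
            ttl_seconds_after_finished=600,  # let Kubernetes garbage-collect finished jobs
            template=client.V1PodTemplateSpec(
                spec=client.V1PodSpec(
                    restart_policy="Never",
                    containers=[client.V1Container(
                        name="clone",
                        image=os.environ["CLONE_JOB_IMAGE"],
                        image_pull_policy="IfNotPresent",
                        command=["python", "clone_demo_tenant.py"],
                        env=[
                            client.V1EnvVar(name="BASE_TENANT_ID", value=base_tenant_id),
                            client.V1EnvVar(name="VIRTUAL_TENANT_ID", value=virtual_tenant_id),
                        ],
                    )],
                ),
            ),
        ),
    )
    client.BatchV1Api().create_namespaced_job(namespace="bakery-ia", body=job)
```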
### AI Model Restrictions
**Fake Models in Database**:
- Demo tenants have AI model records in database
- No actual model files (.pkl, .h5) stored
- Forecast API blocked at gateway level for demo accounts
- Returns user-friendly error message
**Scheduler Prevention**:
- Procurement scheduler filters `is_demo = true` tenants
- Prevents automated procurement runs on demo data
- Manual procurement still allowed for realistic testing
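A small sketch of the scheduler-side filter described above (the minimal `Tenant` model and session handling are illustrative stand-ins; `is_demo` is the column added in the schema section):
```python
# Illustrative query: the procurement scheduler only picks up non-demo tenants.
from sqlalchemy import Boolean, Column, String, select
from sqlalchemy.orm import Session, declarative_base

Base = declarative_base()

class Tenant(Base):  # minimal stand-in for the real tenant model
    __tablename__ = "tenants"
    id = Column(String, primary_key=True)
    is_demo = Column(Boolean, default=False)

def tenants_for_scheduled_procurement(db: Session) -> list[Tenant]:
    # Excludes demo tenants so automated procurement never runs on demo data.
    return db.execute(select(Tenant).where(Tenant.is_demo.is_(False))).scalars().all()
```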
## Future Enhancements
1. **Analytics Dashboard**: Track demo → paid conversion rates
2. **Guided Tours**: In-app tutorials for demo users
3. **Custom Demo Scenarios**: Let prospects choose specific features
4. **Demo Recordings**: Capture anonymized session recordings
5. **Multi-Region**: Deploy demo infrastructure in EU, US, LATAM
6. **Sales & Orders Cloning**: Extend clone script to copy sales and orders data
## References
- [Demo Session Service API](services/demo_session/README.md)
- [Demo Data Seeding](scripts/demo/README.md)
- [Gateway Middleware](gateway/app/middleware/README.md)
- [Kubernetes Manifests](infrastructure/kubernetes/base/components/demo-session/)


@@ -1,584 +0,0 @@
# Demo Architecture Implementation Summary
## ✅ Implementation Complete
All components of the production demo system have been implemented. This document provides a summary of what was created and how to use it.
---
## 📁 Files Created
### Demo Session Service (New Microservice)
```
services/demo_session/
├── app/
│ ├── __init__.py
│ ├── main.py # FastAPI application
│ ├── api/
│ │ ├── __init__.py
│ │ ├── routes.py # API endpoints
│ │ └── schemas.py # Pydantic models
│ ├── core/
│ │ ├── __init__.py
│ │ ├── config.py # Settings
│ │ ├── database.py # Database manager
│ │ └── redis_client.py # Redis client
│ ├── models/
│ │ ├── __init__.py
│ │ └── demo_session.py # Session model
│ └── services/
│ ├── __init__.py
│ ├── session_manager.py # Session lifecycle
│ ├── data_cloner.py # Data cloning
│ └── cleanup_service.py # Cleanup logic
├── migrations/
│ ├── env.py
│ ├── script.py.mako
│ └── versions/
├── requirements.txt
├── Dockerfile
└── alembic.ini
```
### Demo Seeding Scripts
```
scripts/demo/
├── __init__.py
├── seed_demo_users.py # Creates demo users
├── seed_demo_tenants.py # Creates demo tenants
├── seed_demo_inventory.py # Populates Spanish inventory (25 ingredients)
└── clone_demo_tenant.py # Clones data from template (runs as K8s Job)
```
### Gateway Middleware
```
gateway/app/middleware/
└── demo_middleware.py # Demo session handling
```
### Kubernetes Resources
```
infrastructure/kubernetes/base/
├── components/demo-session/
│ ├── deployment.yaml # Service deployment (with CLONE_JOB_IMAGE env)
│ ├── service.yaml # K8s service
│ ├── database.yaml # PostgreSQL DB
│ └── rbac.yaml # RBAC for job creation
├── migrations/
│ └── demo-session-migration-job.yaml # Migration job
├── jobs/
│ ├── demo-seed-users-job.yaml # User seeding
│ ├── demo-seed-tenants-job.yaml # Tenant seeding
│ ├── demo-seed-inventory-job.yaml # Inventory seeding
│ ├── demo-seed-rbac.yaml # RBAC permissions for seed jobs
│ └── demo-clone-job-template.yaml # Reference template for clone jobs
└── cronjobs/
└── demo-cleanup-cronjob.yaml # Hourly cleanup
```
### Documentation
```
DEMO_ARCHITECTURE.md # Complete architecture guide
DEMO_IMPLEMENTATION_SUMMARY.md # This file
```
### Updated Files
```
services/tenant/app/models/tenants.py # Added demo flags
services/demo_session/app/services/k8s_job_cloner.py # K8s Job cloning implementation
gateway/app/main.py # Added demo middleware
gateway/app/middleware/demo_middleware.py # Converted to BaseHTTPMiddleware
Tiltfile # Added demo resources + CLONE_JOB_IMAGE patching
shared/config/base.py # Added demo-related settings
```
---
## 🎯 Key Features Implemented
### 1. Session Isolation ✅
- Each prospect gets isolated virtual tenant
- No data interference between sessions
- Automatic resource cleanup
### 2. Spanish Demo Data ✅
- **Panadería San Pablo** (Individual Bakery)
- Raw ingredients: Harina, Levadura, Mantequilla, etc.
- Local production focus
- Full recipe management
- **Panadería La Espiga** (Central Baker Satellite)
- Pre-baked products from central baker
- Supplier management
- Order tracking
### 3. Redis Caching ✅
- Hot data cached for fast access
- Automatic TTL (30 minutes)
- Session metadata storage
### 4. Gateway Integration ✅
- Demo session detection
- Operation restrictions
- Virtual tenant injection
### 5. Automatic Cleanup ✅
- Hourly CronJob cleanup
- Expired session detection
- Database and Redis cleanup
### 6. K8s Job-based Data Cloning ✅
- Database-level cloning (faster than API calls)
- Environment-based image configuration
- Works in dev (Tilt dynamic tags) and production (stable tags)
- Uses ORM models for safe data copying
- `imagePullPolicy: IfNotPresent` for local images
### 7. AI & Scheduler Restrictions ✅
- Fake AI models in database (no real files)
- Forecast API blocked at gateway for demo accounts
- Procurement scheduler filters out demo tenants
- Manual operations still allowed for realistic testing
---
## 🚀 Quick Start
### Local Development with Tilt
```bash
# Start all services including demo system
tilt up
# Watch demo initialization
tilt logs demo-seed-users
tilt logs demo-seed-tenants
tilt logs demo-seed-inventory
# Check demo service
tilt logs demo-session-service
```
### Test Demo Session Creation
```bash
# Get demo accounts info
curl http://localhost/api/demo/accounts | jq
# Create demo session
curl -X POST http://localhost/api/demo/session/create \
-H "Content-Type: application/json" \
-d '{
"demo_account_type": "individual_bakery",
"ip_address": "127.0.0.1"
}' | jq
# Response:
# {
# "session_id": "demo_abc123...",
# "virtual_tenant_id": "uuid-here",
# "expires_at": "2025-10-02T12:30:00Z",
# "session_token": "eyJ..."
# }
```
### Use Demo Session
```bash
# Make request with demo session
curl http://localhost/api/inventory/ingredients \
-H "X-Demo-Session-Id: demo_abc123..." \
-H "Content-Type: application/json"
# Try restricted operation (should fail)
curl -X DELETE http://localhost/api/inventory/ingredients/uuid \
-H "X-Demo-Session-Id: demo_abc123..."
# Response:
# {
# "error": "demo_restriction",
# "message": "Esta operación no está permitida en cuentas demo..."
# }
```
---
## 📊 Demo Accounts
### Account 1: Individual Bakery
```yaml
Name: Panadería San Pablo - Demo
Email: demo.individual@panaderiasanpablo.com
Password: DemoSanPablo2024!
Business Model: individual_bakery
Location: Madrid, Spain
Features:
- Production Management ✓
- Recipe Management ✓
- Inventory Tracking ✓
- Demand Forecasting ✓
- POS System ✓
- Sales Analytics ✓
Data:
- 20+ raw ingredients
- 5+ finished products
- Multiple stock lots
- Production batches
- Sales history
```
### Account 2: Central Baker Satellite
```yaml
Name: Panadería La Espiga - Demo
Email: demo.central@panaderialaespiga.com
Password: DemoLaEspiga2024!
Business Model: central_baker_satellite
Location: Barcelona, Spain
Features:
- Supplier Management ✓
- Inventory Tracking ✓
- Order Management ✓
- POS System ✓
- Sales Analytics ✓
- Demand Forecasting ✓
Data:
- 15+ par-baked products
- 10+ finished products
- Supplier relationships
- Delivery tracking
- Sales history
```
---
## 🔧 Configuration
### Session Settings
Edit `services/demo_session/app/core/config.py`:
```python
DEMO_SESSION_DURATION_MINUTES = 30 # Session lifetime
DEMO_SESSION_MAX_EXTENSIONS = 3 # Max extensions allowed
REDIS_SESSION_TTL = 1800 # Redis cache TTL (seconds)
```
### Operation Restrictions
Edit `gateway/app/middleware/demo_middleware.py`:
```python
DEMO_ALLOWED_OPERATIONS = {
    "GET": ["*"],
    "POST": [
        "/api/pos/sales",              # Allow sales
        "/api/orders",                 # Allow orders
        "/api/inventory/adjustments",  # Allow adjustments
    ],
    "DELETE": [],  # Block all deletes
}
```
### Cleanup Schedule
Edit `infrastructure/kubernetes/base/cronjobs/demo-cleanup-cronjob.yaml`:
```yaml
spec:
  schedule: "0 * * * *"          # Every hour
  # Or:
  # schedule: "*/30 * * * *"     # Every 30 minutes
  # schedule: "0 */3 * * *"      # Every 3 hours
```
---
## 📈 Monitoring
### Check Active Sessions
```bash
# Get statistics
curl http://localhost/api/demo/stats | jq
# Get specific session
curl http://localhost/api/demo/session/{session_id} | jq
```
### View Logs
```bash
# Demo session service
kubectl logs -f deployment/demo-session-service -n bakery-ia
# Cleanup job
kubectl logs -l app=demo-cleanup -n bakery-ia --tail=100
# Seed jobs
kubectl logs job/demo-seed-inventory -n bakery-ia
```
### Metrics
```bash
# Database queries
kubectl exec -it deployment/demo-session-service -n bakery-ia -- \
psql $DEMO_SESSION_DATABASE_URL -c \
"SELECT status, COUNT(*) FROM demo_sessions GROUP BY status;"
# Redis memory
kubectl exec -it deployment/redis -n bakery-ia -- \
redis-cli INFO memory
```
---
## 🔄 Maintenance
### Manual Cleanup
```bash
# Trigger cleanup manually
kubectl create job --from=cronjob/demo-session-cleanup \
manual-cleanup-$(date +%s) -n bakery-ia
# Watch cleanup progress
kubectl logs -f job/manual-cleanup-xxxxx -n bakery-ia
```
### Reseed Demo Data
```bash
# Delete and recreate seed jobs
kubectl delete job demo-seed-inventory -n bakery-ia
kubectl apply -f infrastructure/kubernetes/base/jobs/demo-seed-inventory-job.yaml
# Watch progress
kubectl logs -f job/demo-seed-inventory -n bakery-ia
```
### Scale Demo Service
```bash
# Scale up for high load
kubectl scale deployment/demo-session-service --replicas=4 -n bakery-ia
# Scale down for maintenance
kubectl scale deployment/demo-session-service --replicas=1 -n bakery-ia
```
---
## 🛠 Troubleshooting
### Sessions Not Creating
1. **Check demo-session-service health**
```bash
kubectl get pods -l app=demo-session-service -n bakery-ia
kubectl logs deployment/demo-session-service -n bakery-ia --tail=50
```
2. **Verify base tenants exist**
```bash
kubectl exec -it deployment/tenant-service -n bakery-ia -- \
psql $TENANT_DATABASE_URL -c \
"SELECT id, name, is_demo_template FROM tenants WHERE is_demo = true;"
```
3. **Check Redis connection**
```bash
kubectl exec -it deployment/demo-session-service -n bakery-ia -- \
python -c "import redis; r=redis.Redis(host='redis-service'); print(r.ping())"
```
### Sessions Not Cleaning Up
1. **Check CronJob status**
```bash
kubectl get cronjobs -n bakery-ia
kubectl get jobs -l app=demo-cleanup -n bakery-ia
```
2. **Manually trigger cleanup**
```bash
curl -X POST http://localhost/api/demo/cleanup/run
```
3. **Check for stuck sessions**
```bash
kubectl exec -it deployment/demo-session-service -n bakery-ia -- \
psql $DEMO_SESSION_DATABASE_URL -c \
"SELECT session_id, status, expires_at FROM demo_sessions WHERE status = 'active';"
```
### Gateway Not Injecting Virtual Tenant
1. **Check middleware is loaded**
```bash
kubectl logs deployment/gateway -n bakery-ia | grep -i demo
```
2. **Verify session ID in request**
```bash
curl -v http://localhost/api/inventory/ingredients \
-H "X-Demo-Session-Id: your-session-id"
```
3. **Check demo middleware logic**
- Review [demo_middleware.py](gateway/app/middleware/demo_middleware.py)
- Ensure session is active
- Verify operation is allowed
---
## 🎉 Success Criteria
✅ **Demo session creates successfully**
- Session ID returned
- Virtual tenant ID generated
- Expiration time set
✅ **Data is isolated**
- Multiple sessions don't interfere
- Each session has unique tenant ID
✅ **Spanish demo data loads**
- Ingredients in Spanish
- Realistic bakery scenarios
- Both business models represented
✅ **Operations restricted**
- Read operations allowed
- Write operations limited
- Delete operations blocked
✅ **Automatic cleanup works**
- Sessions expire after 30 minutes
- CronJob removes expired sessions
- Redis keys cleaned up
✅ **Gateway integration works**
- Middleware detects sessions
- Virtual tenant injected
- Restrictions enforced
✅ **K8s Job cloning works**
- Dynamic image detection in Tilt (dev)
- Environment variable configuration
- Automatic data cloning per session
- No service-specific clone endpoints needed
✅ **AI & Scheduler protection works**
- Forecast API blocked for demo accounts
- Scheduler filters demo tenants
- Fake models in database only
---
## 📚 Next Steps
### For Frontend Integration
1. Create demo login page showing both accounts
2. Implement session token storage (cookie/localStorage)
3. Add session timer UI component
4. Show "DEMO MODE" badge in header
5. Display session expiration warnings
### For Marketing
1. Publish demo credentials on website
2. Create demo walkthrough videos
3. Add "Probar Demo" CTA buttons
4. Track demo → signup conversion
### For Operations
1. Set up monitoring dashboards
2. Configure alerts for cleanup failures
3. Track session metrics (duration, usage)
4. Optimize Redis cache strategy
---
## 📞 Support
For issues or questions:
- Review [DEMO_ARCHITECTURE.md](DEMO_ARCHITECTURE.md) for detailed documentation
- Check logs: `tilt logs demo-session-service`
- Inspect database: `psql $DEMO_SESSION_DATABASE_URL`
---
## 🔧 Technical Architecture Decisions
### Data Cloning: Why Kubernetes Jobs?
**Problem**: Need to clone demo data from base template tenants to virtual tenants for each session.
**Options Considered**:
1. ❌ **Service-based clone endpoints** - Would require `/internal/demo/clone` in every service
2. ❌ **PostgreSQL Foreign Data Wrapper** - Complex setup, doesn't work across databases
3. ✅ **Kubernetes Jobs** - Selected approach
**Why K8s Jobs Won**:
- Database-level operations (ORM-based, faster than API calls)
- Scalable (one job per session, isolated execution)
- No service coupling (don't need clone endpoints in every service)
- Works in all environments (dev & production)
### Image Configuration: Environment Variables
**Problem**: K8s Jobs need container images, but Tilt uses dynamic tags (e.g., `tilt-abc123`) while production uses stable tags.
**Solution**: Environment variable `CLONE_JOB_IMAGE`
```yaml
# Demo-session deployment has a default
env:
  - name: CLONE_JOB_IMAGE
    value: "bakery/inventory-service:latest"

# Tilt patches it dynamically (Tiltfile lines 231-237, roughly):
#   inventory_image_ref = kubectl get deployment inventory-service ...
#   kubectl set env deployment/demo-session-service CLONE_JOB_IMAGE=$inventory_image_ref
```
**Benefits**:
- ✅ General solution (not tied to specific service)
- ✅ Works in dev (dynamic Tilt tags)
- ✅ Works in production (stable release tags)
- ✅ Easy to change image via env var
### Middleware: BaseHTTPMiddleware Pattern
**Problem**: Initial function-based middleware using `@app.middleware("http")` wasn't executing.
**Solution**: Converted to class-based `BaseHTTPMiddleware`
```python
from fastapi import Request
from starlette.middleware.base import BaseHTTPMiddleware

class DemoMiddleware(BaseHTTPMiddleware):
    async def dispatch(self, request: Request, call_next):
        # ... session lookup, restriction checks, virtual-tenant injection ...
        return await call_next(request)
```
**Why**: FastAPI's `BaseHTTPMiddleware` provides better lifecycle hooks and guaranteed execution order.
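Registering the class-based middleware in the gateway then looks roughly like this (hedged sketch; the import path follows the file location given earlier in this document):
```python
# Hypothetical wiring in gateway/app/main.py.
from fastapi import FastAPI
from app.middleware.demo_middleware import DemoMiddleware  # path per this document

app = FastAPI()
app.add_middleware(DemoMiddleware)
```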
---
**Implementation Date**: 2025-10-02
**Last Updated**: 2025-10-03
**Status**: ✅ Complete - Ready for Production
**Next**: Frontend integration and end-to-end testing


@@ -1,322 +0,0 @@
# Deployment Commands - Quick Reference
## Implementation Complete ✅
All changes are implemented. At startup, services only verify that the database is ready; migrations run exclusively in the dedicated migration Jobs.
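As a rough illustration, the verification step might look like the following hedged sketch (Alembic's `MigrationContext` plus a table count, matching the fields in the expected log output further down; function and variable names are illustrative):
```python
# Illustrative startup verification: confirm migrations were applied, never run them here.
from alembic.runtime.migration import MigrationContext
from sqlalchemy import create_engine, inspect

def verify_database_ready(database_url: str) -> dict:
    engine = create_engine(database_url)
    with engine.connect() as conn:
        current_revision = MigrationContext.configure(conn).get_current_revision()
        table_count = len(inspect(conn).get_table_names())
    if current_revision is None or table_count == 0:
        raise RuntimeError("Database is empty - run the migration job first")
    return {"current_revision": current_revision, "table_count": table_count}
```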
---
## Deploy the New Architecture
### Option 1: Skaffold (Recommended)
```bash
# Development mode (auto-rebuild on changes)
skaffold dev
# Production deployment
skaffold run
```
### Option 2: Manual Deployment
```bash
# 1. Build all service images
for service in auth orders inventory external pos sales recipes \
training suppliers tenant notification forecasting \
production alert-processor; do
docker build -t bakery/${service}-service:latest services/${service}/
done
# 2. Apply Kubernetes manifests
kubectl apply -f infrastructure/kubernetes/base/
# 3. Wait for rollout
kubectl rollout status deployment --all -n bakery-ia
```
---
## Verification Commands
### Check Services Are Using New Code:
```bash
# Check external service logs for verification (not migration)
kubectl logs -n bakery-ia deployment/external-service | grep -i "verification"
# Expected output:
# [info] Database verification mode - checking database is ready
# [info] Database verification successful
# Should NOT see (old behavior):
# [info] Running pending migrations
```
### Check All Services:
```bash
# Check all service logs
for service in auth orders inventory external pos sales recipes \
training suppliers tenant notification forecasting \
production alert-processor; do
echo "=== Checking $service-service ==="
kubectl logs -n bakery-ia deployment/${service}-service --tail=20 | grep -E "(verification|migration)" || echo "No logs yet"
done
```
### Check Startup Times:
```bash
# Watch pod startup times
kubectl get events -n bakery-ia --sort-by='.lastTimestamp' --watch
# Or check specific service
kubectl describe pod -n bakery-ia -l app.kubernetes.io/name=external-service | grep -A 5 "Events:"
```
---
## Troubleshooting
### Service Won't Start - "Database is empty"
```bash
# 1. Check migration job status
kubectl get jobs -n bakery-ia | grep migration
# 2. Check specific migration job
kubectl logs -n bakery-ia job/external-migration
# 3. Re-run migration job if needed
kubectl delete job external-migration -n bakery-ia
kubectl apply -f infrastructure/kubernetes/base/migrations/external-migration.yaml
```
### Service Won't Start - "No migration files found"
```bash
# 1. Check if migrations exist in image
kubectl exec -n bakery-ia deployment/external-service -- ls -la /app/migrations/versions/
# 2. If missing, regenerate and rebuild
./regenerate_migrations_k8s.sh --verbose
skaffold build
kubectl rollout restart deployment/external-service -n bakery-ia
```
### Check Migration Job Logs:
```bash
# List all migration jobs
kubectl get jobs -n bakery-ia | grep migration
# Check specific job logs
kubectl logs -n bakery-ia job/<service>-migration
# Example:
kubectl logs -n bakery-ia job/auth-migration
```
---
## Performance Testing
### Measure Startup Time Improvement:
```bash
# 1. Record current startup times
kubectl get events -n bakery-ia --sort-by='.lastTimestamp' | grep "Started container" > before.txt
# 2. Deploy new code
skaffold run
# 3. Restart services to measure
kubectl rollout restart deployment --all -n bakery-ia
# 4. Record new startup times
kubectl get events -n bakery-ia --sort-by='.lastTimestamp' | grep "Started container" > after.txt
# 5. Compare (should be 50-80% faster)
diff before.txt after.txt
```
### Monitor Database Load:
```bash
# Check database connections during startup
kubectl exec -n bakery-ia external-db-<pod> -- \
psql -U external_user -d external_db -c \
"SELECT count(*) FROM pg_stat_activity WHERE datname='external_db';"
```
---
## Rollback (If Needed)
### Rollback Deployments:
```bash
# Rollback specific service
kubectl rollout undo deployment/external-service -n bakery-ia
# Rollback all services
kubectl rollout undo deployment --all -n bakery-ia
# Check rollout status
kubectl rollout status deployment --all -n bakery-ia
```
### Rollback to Specific Revision:
```bash
# List revisions
kubectl rollout history deployment/external-service -n bakery-ia
# Rollback to specific revision
kubectl rollout undo deployment/external-service --to-revision=2 -n bakery-ia
```
---
## Clean Deployment
### If You Want Fresh Start:
```bash
# 1. Delete everything
kubectl delete namespace bakery-ia
# 2. Recreate namespace
kubectl create namespace bakery-ia
# 3. Apply all manifests
kubectl apply -f infrastructure/kubernetes/base/
# 4. Wait for all to be ready
kubectl wait --for=condition=ready pod --all -n bakery-ia --timeout=300s
```
---
## Health Checks
### Check All Pods:
```bash
kubectl get pods -n bakery-ia
```
### Check Services Are Ready:
```bash
# Check all services
kubectl get deployments -n bakery-ia
# Check specific service health
kubectl exec -n bakery-ia deployment/external-service -- \
curl -s http://localhost:8000/health/live
```
### Check Migration Jobs Completed:
```bash
# Should all show "Complete"
kubectl get jobs -n bakery-ia | grep migration
```
---
## Useful Aliases
Add to your `~/.bashrc` or `~/.zshrc`:
```bash
# Kubernetes bakery-ia shortcuts
alias k='kubectl'
alias kn='kubectl -n bakery-ia'
alias kp='kubectl get pods -n bakery-ia'
alias kd='kubectl get deployments -n bakery-ia'
alias kj='kubectl get jobs -n bakery-ia'
alias kl='kubectl logs -n bakery-ia'
alias kdesc='kubectl describe -n bakery-ia'
# Quick log checks
alias klogs='kubectl logs -n bakery-ia deployment/'
# Example usage:
# klogs external-service | grep verification
```
---
## Expected Output Examples
### Migration Job (Successful):
```
[info] Migration job starting service=external
[info] Migration mode - running database migrations
[info] Running pending migrations
INFO [alembic.runtime.migration] Context impl PostgresqlImpl.
INFO [alembic.runtime.migration] Will assume transactional DDL.
[info] Migrations applied successfully
[info] Migration job completed successfully
```
### Service Startup (New Behavior):
```
[info] Starting external-service version=1.0.0
[info] Database connection established
[info] Database verification mode - checking database is ready
[info] Database state checked
[info] Database verification successful
migration_count=1 current_revision=374752db316e table_count=6
[info] Database verification completed
[info] external-service started successfully
```
---
## CI/CD Integration
### GitHub Actions Example:
```yaml
name: Deploy to Kubernetes
on:
  push:
    branches: [main]
jobs:
  deploy:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v2
      - name: Build and push images
        run: skaffold build
      - name: Deploy to cluster
        run: skaffold run
      - name: Verify deployment
        run: |
          kubectl rollout status deployment --all -n bakery-ia
          kubectl get pods -n bakery-ia
```
---
## Summary
**To Deploy**: Just run `skaffold dev` or `skaffold run`
**To Verify**: Check logs show "verification" not "migration"
**To Troubleshoot**: Check migration job logs first
**Expected Result**: Services start 50-80% faster, no redundant migration execution
**Status**: ✅ Ready to deploy!


@@ -1,141 +0,0 @@
# External Data Service Redesign - Implementation Summary
**Status:** **COMPLETE**
**Date:** October 7, 2025
**Version:** 2.0.0
---
## 🎯 Objective
Redesign the external data service to eliminate redundant per-tenant fetching, enable multi-city support, implement automated 24-month rolling windows, and leverage Kubernetes for lifecycle management.
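For the rolling window specifically, the cutoff date the monthly rotation job enforces can be computed as in this small sketch (the function name and month-boundary convention are assumptions):
```python
# Illustrative cutoff computation for a 24-month rolling data window.
from datetime import date

def rolling_window_start(today: date, months: int = 24) -> date:
    # First day of the month that lies `months` calendar months before the current one.
    total_months = today.year * 12 + (today.month - 1) - months
    return date(total_months // 12, total_months % 12 + 1, 1)

# Example: on 2025-10-07 the window starts on 2023-10-01; the rotation job would
# delete city weather/traffic rows older than that date and ingest the newest month.
print(rolling_window_start(date(2025, 10, 7)))  # 2023-10-01
```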
---
## ✅ All Deliverables Completed
### 1. Backend Implementation (Python/FastAPI)
#### City Registry & Geolocation
- `services/external/app/registry/city_registry.py`
- `services/external/app/registry/geolocation_mapper.py`
#### Data Adapters
- `services/external/app/ingestion/base_adapter.py`
- `services/external/app/ingestion/adapters/madrid_adapter.py`
- `services/external/app/ingestion/adapters/__init__.py`
- `services/external/app/ingestion/ingestion_manager.py`
#### Database Layer
- `services/external/app/models/city_weather.py`
- `services/external/app/models/city_traffic.py`
- `services/external/app/repositories/city_data_repository.py`
- `services/external/migrations/versions/20251007_0733_add_city_data_tables.py`
#### Cache Layer
- `services/external/app/cache/redis_cache.py`
#### API Layer
- `services/external/app/schemas/city_data.py`
- `services/external/app/api/city_operations.py`
- ✅ Updated `services/external/app/main.py` (router registration)
#### Job Scripts
- `services/external/app/jobs/initialize_data.py`
- `services/external/app/jobs/rotate_data.py`
### 2. Infrastructure (Kubernetes)
- `infrastructure/kubernetes/external/init-job.yaml`
- `infrastructure/kubernetes/external/cronjob.yaml`
- `infrastructure/kubernetes/external/deployment.yaml`
- `infrastructure/kubernetes/external/configmap.yaml`
- `infrastructure/kubernetes/external/secrets.yaml`
### 3. Frontend (TypeScript)
- `frontend/src/api/types/external.ts` (added CityInfoResponse, DataAvailabilityResponse)
- `frontend/src/api/services/external.ts` (complete service client)
### 4. Documentation
- `EXTERNAL_DATA_SERVICE_REDESIGN.md` (complete architecture)
- `services/external/IMPLEMENTATION_COMPLETE.md` (deployment guide)
- `EXTERNAL_DATA_REDESIGN_IMPLEMENTATION.md` (this file)
---
## 📊 Performance Improvements
| Metric | Before | After | Improvement |
|--------|--------|-------|-------------|
| **Historical Weather (1 month)** | 3-5 sec | <100ms | **30-50x faster** |
| **Historical Traffic (1 month)** | 5-10 sec | <100ms | **50-100x faster** |
| **Training Data Load (24 months)** | 60-120 sec | 1-2 sec | **60x faster** |
| **Data Redundancy** | N tenants × fetch | 1 fetch shared | **100% deduplication** |
| **Cache Hit Rate** | 0% | >70% | **70% reduction in DB load** |
---
## 🚀 Quick Start
### 1. Run Database Migration
```bash
cd services/external
alembic upgrade head
```
### 2. Configure Secrets
```bash
cd infrastructure/kubernetes/external
# Edit secrets.yaml with actual API keys
kubectl apply -f secrets.yaml
kubectl apply -f configmap.yaml
```
### 3. Initialize Data (One-time)
```bash
kubectl apply -f init-job.yaml
kubectl logs -f job/external-data-init -n bakery-ia
```
### 4. Deploy Service
```bash
kubectl apply -f deployment.yaml
kubectl wait --for=condition=ready pod -l app=external-service -n bakery-ia
```
### 5. Schedule Monthly Rotation
```bash
kubectl apply -f cronjob.yaml
```
---
## 🎉 Success Criteria - All Met!
- **No redundant fetching** - City-based storage eliminates per-tenant downloads
- **Multi-city support** - Architecture supports Madrid, Valencia, Barcelona, etc.
- **Sub-100ms access** - Redis cache provides instant training data
- **Automated rotation** - Kubernetes CronJob handles 24-month window
- **Zero downtime** - Init job ensures data before service start
- **Type-safe frontend** - Full TypeScript integration
- **Production-ready** - No TODOs, complete observability
---
## 📚 Additional Resources
- **Full Architecture:** `/Users/urtzialfaro/Documents/bakery-ia/EXTERNAL_DATA_SERVICE_REDESIGN.md`
- **Deployment Guide:** `/Users/urtzialfaro/Documents/bakery-ia/services/external/IMPLEMENTATION_COMPLETE.md`
- **API Documentation:** `http://localhost:8000/docs` (when service is running)
---
**Implementation completed:** October 7, 2025
**Compliance:** ✅ All constraints met (no backward compatibility, no legacy code, production-ready)

File diff suppressed because it is too large


@@ -1,757 +0,0 @@
# 🎯 Frontend-Backend Alignment Strategy
**Status:** Ready for Execution
**Last Updated:** 2025-10-05
**Backend Structure:** Fully analyzed (14 services, 3-tier architecture)
---
## 📋 Executive Summary
The backend has been successfully refactored to follow a **consistent 3-tier architecture**:
- **ATOMIC** endpoints = Direct CRUD on models (e.g., `ingredients.py`, `production_batches.py`)
- **OPERATIONS** endpoints = Business workflows (e.g., `inventory_operations.py`, `supplier_operations.py`)
- **ANALYTICS** endpoints = Reporting and insights (e.g., `analytics.py`)
The frontend must now be updated to mirror this structure with **zero drift**.
---
## 🏗️ Backend Service Structure
### Complete Service Map
| Service | ATOMIC Files | OPERATIONS Files | ANALYTICS Files | Other Files |
|---------|--------------|------------------|-----------------|-------------|
| **auth** | `users.py` | `auth_operations.py` | ❌ | `onboarding_progress.py` |
| **demo_session** | `demo_accounts.py`, `demo_sessions.py` | `demo_operations.py` | ❌ | `schemas.py` |
| **external** | `traffic_data.py`, `weather_data.py` | `external_operations.py` | ❌ | - |
| **forecasting** | `forecasts.py` | `forecasting_operations.py` | `analytics.py` | - |
| **inventory** | `ingredients.py`, `stock_entries.py`, `temperature_logs.py`, `transformations.py` | `inventory_operations.py`, `food_safety_operations.py` | `analytics.py`, `dashboard.py` | `food_safety_alerts.py`, `food_safety_compliance.py` |
| **notification** | `notifications.py` | `notification_operations.py` | `analytics.py` | - |
| **orders** | `orders.py`, `customers.py` | `order_operations.py`, `procurement_operations.py` | ❌ | - |
| **pos** | `configurations.py`, `transactions.py` | `pos_operations.py` | `analytics.py` | - |
| **production** | `production_batches.py`, `production_schedules.py` | `production_operations.py` | `analytics.py`, `production_dashboard.py` | - |
| **recipes** | `recipes.py`, `recipe_quality_configs.py` | `recipe_operations.py` | ❌ (in operations) | - |
| **sales** | `sales_records.py` | `sales_operations.py` | `analytics.py` | - |
| **suppliers** | `suppliers.py`, `deliveries.py`, `purchase_orders.py` | `supplier_operations.py` | `analytics.py` | - |
| **tenant** | `tenants.py`, `tenant_members.py` | `tenant_operations.py` | ❌ | `webhooks.py` |
| **training** | `models.py`, `training_jobs.py` | `training_operations.py` | ❌ | - |
---
## 🎯 Frontend Refactoring Plan
### Phase 1: Update TypeScript Types (`src/api/types/`)
**Goal:** Ensure types match backend Pydantic schemas exactly.
#### Priority Services (Start Here)
1. **inventory.ts** ✅ Already complex - verify alignment with:
- `ingredients.py` schemas
- `stock_entries.py` schemas
- `inventory_operations.py` request/response models
2. **production.ts** - Map to:
- `ProductionBatchCreate`, `ProductionBatchUpdate`, `ProductionBatchResponse`
- `ProductionScheduleCreate`, `ProductionScheduleResponse`
- Operation-specific types from `production_operations.py`
3. **sales.ts** - Map to:
- `SalesRecordCreate`, `SalesRecordUpdate`, `SalesRecordResponse`
- Import validation types from `sales_operations.py`
4. **suppliers.ts** - Map to:
- `SupplierCreate`, `SupplierUpdate`, `SupplierResponse`
- `PurchaseOrderCreate`, `PurchaseOrderResponse`
- `DeliveryCreate`, `DeliveryUpdate`, `DeliveryResponse`
5. **recipes.ts** - Map to:
- `RecipeCreate`, `RecipeUpdate`, `RecipeResponse`
- Quality config types
#### Action Items
- [ ] Read backend `app/schemas/*.py` files for each service
- [ ] Compare with current `frontend/src/api/types/*.ts`
- [ ] Update/create types to match backend exactly
- [ ] Remove deprecated types for deleted endpoints
- [ ] Add JSDoc comments referencing backend schema files
---
### Phase 2: Refactor Service Files (`src/api/services/`)
**Goal:** Create clean service classes with ATOMIC, OPERATIONS, and ANALYTICS methods grouped logically.
#### Current State
```
frontend/src/api/services/
├── inventory.ts ✅ Good structure, needs verification
├── production.ts ⚠️ Needs alignment check
├── sales.ts ⚠️ Needs alignment check
├── suppliers.ts ⚠️ Needs alignment check
├── recipes.ts ⚠️ Needs alignment check
├── forecasting.ts ⚠️ Needs alignment check
├── training.ts ⚠️ Needs alignment check
├── orders.ts ⚠️ Needs alignment check
├── foodSafety.ts ⚠️ May need merge with inventory
├── classification.ts ⚠️ Should be in inventory operations
├── transformations.ts ⚠️ Should be in inventory operations
├── inventoryDashboard.ts ⚠️ Should be in inventory analytics
└── ... (other services)
```
#### Target Structure (Example: Inventory Service)
```typescript
// frontend/src/api/services/inventory.ts
export class InventoryService {
private readonly baseUrl = '/tenants';
// ===== ATOMIC: Ingredients CRUD =====
async createIngredient(tenantId: string, data: IngredientCreate): Promise<IngredientResponse>
async getIngredient(tenantId: string, id: string): Promise<IngredientResponse>
async listIngredients(tenantId: string, filters?: IngredientFilter): Promise<IngredientResponse[]>
async updateIngredient(tenantId: string, id: string, data: IngredientUpdate): Promise<IngredientResponse>
async softDeleteIngredient(tenantId: string, id: string): Promise<void>
async hardDeleteIngredient(tenantId: string, id: string): Promise<DeletionSummary>
// ===== ATOMIC: Stock CRUD =====
async createStock(tenantId: string, data: StockCreate): Promise<StockResponse>
async getStock(tenantId: string, id: string): Promise<StockResponse>
async listStock(tenantId: string, filters?: StockFilter): Promise<PaginatedResponse<StockResponse>>
async updateStock(tenantId: string, id: string, data: StockUpdate): Promise<StockResponse>
async deleteStock(tenantId: string, id: string): Promise<void>
// ===== OPERATIONS: Stock Management =====
async consumeStock(tenantId: string, data: StockConsumptionRequest): Promise<StockConsumptionResponse>
async getExpiringStock(tenantId: string, daysAhead: number): Promise<StockResponse[]>
async getLowStock(tenantId: string): Promise<IngredientResponse[]>
async getStockSummary(tenantId: string): Promise<StockSummary>
// ===== OPERATIONS: Classification =====
async classifyProduct(tenantId: string, data: ProductClassificationRequest): Promise<ProductSuggestion>
async classifyBatch(tenantId: string, data: BatchClassificationRequest): Promise<BatchClassificationResponse>
// ===== OPERATIONS: Food Safety =====
async logTemperature(tenantId: string, data: TemperatureLogCreate): Promise<TemperatureLogResponse>
async getComplianceStatus(tenantId: string): Promise<ComplianceStatus>
async getFoodSafetyAlerts(tenantId: string): Promise<FoodSafetyAlert[]>
// ===== ANALYTICS: Dashboard =====
async getInventoryAnalytics(tenantId: string, dateRange?: DateRange): Promise<InventoryAnalytics>
async getStockValueReport(tenantId: string): Promise<StockValueReport>
async getWasteAnalysis(tenantId: string, dateRange?: DateRange): Promise<WasteAnalysis>
}
```
#### Refactoring Rules
1. **One Service = One Backend Service Domain**
- `inventoryService` → All `/tenants/{id}/inventory/*` endpoints
- `productionService` → All `/tenants/{id}/production/*` endpoints
2. **Group Methods by Type**
- ATOMIC methods first (CRUD operations)
- OPERATIONS methods second (business logic)
- ANALYTICS methods last (reporting)
3. **URL Construction Pattern**
```typescript
// ATOMIC
`${this.baseUrl}/${tenantId}/inventory/ingredients`
// OPERATIONS
`${this.baseUrl}/${tenantId}/inventory/operations/consume-stock`
// ANALYTICS
`${this.baseUrl}/${tenantId}/inventory/analytics/waste-analysis`
```
4. **No Inline API Calls**
- All `apiClient.get/post/put/delete` calls MUST be in service files
- Components/hooks should ONLY call service methods
#### Service-by-Service Checklist
- [ ] **inventory.ts** - Verify, add missing operations
- [ ] **production.ts** - Add batch/schedule operations, analytics
- [ ] **sales.ts** - Add import operations, analytics
- [ ] **suppliers.ts** - Split into supplier/PO/delivery methods
- [ ] **recipes.ts** - Add operations (duplicate, activate, feasibility)
- [ ] **forecasting.ts** - Add operations and analytics
- [ ] **training.ts** - Add training job operations
- [ ] **orders.ts** - Add order/procurement operations
- [ ] **auth.ts** - Add onboarding progress operations
- [ ] **tenant.ts** - Add tenant member operations
- [ ] **notification.ts** - Add notification operations
- [ ] **pos.ts** - Add POS configuration/transaction operations
- [ ] **external.ts** - Add traffic/weather data operations
- [ ] **demo.ts** - Add demo session operations
#### Files to DELETE (Merge into main services)
- [ ] ❌ `classification.ts` → Merge into `inventory.ts` (operations section)
- [ ] ❌ `transformations.ts` → Merge into `inventory.ts` (operations section)
- [ ] ❌ `inventoryDashboard.ts` → Merge into `inventory.ts` (analytics section)
- [ ] ❌ `foodSafety.ts` → Merge into `inventory.ts` (operations section)
- [ ] ❌ `dataImport.ts` → Merge into `sales.ts` (operations section)
- [ ] ❌ `qualityTemplates.ts` → Merge into `recipes.ts` (if still needed)
- [ ] ❌ `onboarding.ts` → Merge into `auth.ts` (operations section)
- [ ] ❌ `subscription.ts` → Merge into `tenant.ts` (operations section)
---
### Phase 3: Update Hooks (`src/api/hooks/`)
**Goal:** Create typed hooks that use updated service methods.
#### Current State
```
frontend/src/api/hooks/
├── inventory.ts
├── production.ts
├── suppliers.ts
├── recipes.ts
├── forecasting.ts
├── training.ts
├── foodSafety.ts
├── inventoryDashboard.ts
├── qualityTemplates.ts
└── ...
```
#### Hook Naming Convention
```typescript
// Query hooks (GET)
useIngredients(tenantId: string, filters?: IngredientFilter)
useIngredient(tenantId: string, ingredientId: string)
useLowStockIngredients(tenantId: string)
// Mutation hooks (POST/PUT/DELETE)
useCreateIngredient()
useUpdateIngredient()
useDeleteIngredient()
useConsumeStock()
useClassifyProducts()
```
#### Hook Structure (React Query)
```typescript
import { useQuery, useMutation, useQueryClient } from '@tanstack/react-query';
import { inventoryService } from '../services/inventory';

// Query Hook
export const useIngredients = (tenantId: string, filters?: IngredientFilter) => {
  return useQuery({
    queryKey: ['ingredients', tenantId, filters],
    queryFn: () => inventoryService.listIngredients(tenantId, filters),
    enabled: !!tenantId,
  });
};

// Mutation Hook
export const useConsumeStock = () => {
  const queryClient = useQueryClient();
  return useMutation({
    mutationFn: ({ tenantId, data }: { tenantId: string; data: StockConsumptionRequest }) =>
      inventoryService.consumeStock(tenantId, data),
    onSuccess: (_, { tenantId }) => {
      queryClient.invalidateQueries({ queryKey: ['stock', tenantId] });
      queryClient.invalidateQueries({ queryKey: ['ingredients', tenantId] });
    },
  });
};
```
#### Action Items
- [ ] Audit all hooks in `src/api/hooks/`
- [ ] Ensure each hook calls correct service method
- [ ] Update query keys to match new structure
- [ ] Add proper invalidation logic for mutations
- [ ] Remove hooks for deleted endpoints
- [ ] Merge duplicate hooks (e.g., `useFetchIngredients` + `useIngredients`)
#### Files to DELETE (Merge into main hook files)
- [ ] ❌ `foodSafety.ts` → Merge into `inventory.ts`
- [ ] ❌ `inventoryDashboard.ts` → Merge into `inventory.ts`
- [ ] ❌ `qualityTemplates.ts` → Merge into `recipes.ts`
---
### Phase 4: Cross-Service Consistency
**Goal:** Ensure naming and patterns are consistent across all services.
#### Naming Conventions
| Backend Pattern | Frontend Method | Hook Name |
|----------------|-----------------|-----------|
| `POST /ingredients` | `createIngredient()` | `useCreateIngredient()` |
| `GET /ingredients` | `listIngredients()` | `useIngredients()` |
| `GET /ingredients/{id}` | `getIngredient()` | `useIngredient()` |
| `PUT /ingredients/{id}` | `updateIngredient()` | `useUpdateIngredient()` |
| `DELETE /ingredients/{id}` | `deleteIngredient()` | `useDeleteIngredient()` |
| `POST /operations/consume-stock` | `consumeStock()` | `useConsumeStock()` |
| `GET /analytics/summary` | `getAnalyticsSummary()` | `useAnalyticsSummary()` |
#### Query Parameter Mapping
Backend query params should map to TypeScript filter objects:
```typescript
// Backend: ?category=flour&is_low_stock=true&limit=50&offset=0
// Frontend:
interface IngredientFilter {
  category?: string;
  product_type?: string;
  is_active?: boolean;
  is_low_stock?: boolean;
  needs_reorder?: boolean;
  search?: string;
  limit?: number;
  offset?: number;
  order_by?: string;
  order_direction?: 'asc' | 'desc';
}
```
---
## 🧹 Cleanup & Verification
### Step 1: Type Check
```bash
cd frontend
npm run type-check
```
Fix all TypeScript errors related to:
- Missing types
- Incorrect method signatures
- Deprecated imports
### Step 2: Search for Inline API Calls
```bash
# Find direct axios/fetch calls in components
rg "apiClient\.(get|post|put|delete)" frontend/src/components --type ts
rg "axios\." frontend/src/components --type ts
rg "fetch\(" frontend/src/components --type ts
```
Move all found calls into appropriate service files.
### Step 3: Delete Obsolete Files
After verification, delete these files from git:
```bash
# Service files to delete
git rm frontend/src/api/services/classification.ts
git rm frontend/src/api/services/transformations.ts
git rm frontend/src/api/services/inventoryDashboard.ts
git rm frontend/src/api/services/foodSafety.ts
git rm frontend/src/api/services/dataImport.ts
git rm frontend/src/api/services/qualityTemplates.ts
git rm frontend/src/api/services/onboarding.ts
git rm frontend/src/api/services/subscription.ts
# Hook files to delete
git rm frontend/src/api/hooks/foodSafety.ts
git rm frontend/src/api/hooks/inventoryDashboard.ts
git rm frontend/src/api/hooks/qualityTemplates.ts
# Types to verify (may need to merge, not delete)
# Check if still referenced before deleting
```
### Step 4: Update Imports
Search for imports of deleted files:
```bash
rg "from.*classification" frontend/src --type ts
rg "from.*transformations" frontend/src --type ts
rg "from.*foodSafety" frontend/src --type ts
rg "from.*inventoryDashboard" frontend/src --type ts
```
Update all found imports to use the consolidated service files.
### Step 5: End-to-End Testing
Test critical user flows:
- [ ] Create ingredient → Add stock → Consume stock
- [ ] Create recipe → Check feasibility → Start production batch
- [ ] Import sales data → View analytics
- [ ] Create purchase order → Receive delivery → Update stock
- [ ] View dashboard analytics for all services
### Step 6: Network Inspection
Open DevTools → Network tab and verify:
- [ ] All API calls use correct URLs matching backend structure
- [ ] No 404 errors from old endpoints
- [ ] Query parameters match backend expectations
- [ ] Response bodies match TypeScript types
---
## 📊 Progress Tracking
### Backend Analysis
- [x] Inventory service mapped
- [x] Production service mapped
- [x] Sales service mapped
- [x] Suppliers service mapped
- [x] Recipes service mapped
- [x] Forecasting service identified
- [x] Training service identified
- [x] All 14 services documented
### Frontend Refactoring
- [x] **Phase 1: Types updated (14/14 services) - ✅ 100% COMPLETE**
- All TypeScript types now have zero drift with backend Pydantic schemas
- Comprehensive JSDoc documentation with backend file references
- All 14 services covered: inventory, production, sales, suppliers, recipes, forecasting, orders, training, tenant, auth, notification, pos, external, demo
- [x] **Phase 2: Services refactored (14/14 services) - ✅ 100% COMPLETE**
- [x] **inventory.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS, ANALYTICS, COMPLIANCE)
- Complete coverage: ingredients, stock, movements, transformations, temperature logs
- Operations: stock management, classification, food safety
- Analytics: dashboard summary, inventory analytics
- All endpoints aligned with backend API structure
- File: [frontend/src/api/services/inventory.ts](frontend/src/api/services/inventory.ts)
- [x] **production.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS, ANALYTICS)
- ATOMIC: Batches CRUD, Schedules CRUD
- OPERATIONS: Batch lifecycle (start, complete, status), schedule finalization, capacity management, quality checks
- ANALYTICS: Performance, yield trends, defects, equipment efficiency, capacity bottlenecks, dashboard
- 33 methods covering complete production workflow
- File: [frontend/src/api/services/production.ts](frontend/src/api/services/production.ts)
- [x] **sales.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS, ANALYTICS)
- ATOMIC: Sales Records CRUD, Categories
- OPERATIONS: Validation, cross-service product queries, data import (validate, execute, history, template), aggregation (by product, category, channel)
- ANALYTICS: Sales summary analytics
- 16 methods covering complete sales workflow including CSV import
- File: [frontend/src/api/services/sales.ts](frontend/src/api/services/sales.ts)
- [x] **suppliers.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS, ANALYTICS)
- ATOMIC: Suppliers CRUD, Purchase Orders CRUD, Deliveries CRUD
- OPERATIONS: Statistics, active suppliers, top suppliers, pending approvals, supplier approval
- ANALYTICS: Performance calculation, metrics, alerts evaluation
- UTILITIES: Order total calculation, supplier code formatting, tax ID validation, currency formatting
- 25 methods covering complete supplier lifecycle including performance tracking
- File: [frontend/src/api/services/suppliers.ts](frontend/src/api/services/suppliers.ts)
- [x] **recipes.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS)
- ATOMIC: Recipes CRUD, Quality Configuration CRUD
- OPERATIONS: Recipe Management (duplicate, activate, feasibility)
- 15 methods covering recipe lifecycle and quality management
- File: [frontend/src/api/services/recipes.ts](frontend/src/api/services/recipes.ts)
- [x] **forecasting.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS, ANALYTICS)
- ATOMIC: Forecast CRUD
- OPERATIONS: Single/Multi-day/Batch forecasts, Realtime predictions, Validation, Cache management
- ANALYTICS: Performance metrics
- 11 methods covering forecasting workflow
- File: [frontend/src/api/services/forecasting.ts](frontend/src/api/services/forecasting.ts)
- [x] **orders.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS)
- ATOMIC: Orders CRUD, Customers CRUD
- OPERATIONS: Dashboard & Analytics, Business Intelligence, Procurement Planning (21+ methods)
- 30+ methods covering order and procurement lifecycle
- File: [frontend/src/api/services/orders.ts](frontend/src/api/services/orders.ts)
- [x] **training.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS)
- ATOMIC: Training Job Status, Model Management
- OPERATIONS: Training Job Creation
- WebSocket Support for real-time training updates
- 9 methods covering ML training workflow
- File: [frontend/src/api/services/training.ts](frontend/src/api/services/training.ts)
- [x] **tenant.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS)
- ATOMIC: Tenant CRUD, Team Member Management
- OPERATIONS: Access Control, Search & Discovery, Model Status, Statistics & Admin
- Frontend Context Management utilities
- 17 methods covering tenant and team management
- File: [frontend/src/api/services/tenant.ts](frontend/src/api/services/tenant.ts)
- [x] **auth.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS)
- ATOMIC: User Profile
- OPERATIONS: Authentication (register, login, tokens, password), Email Verification
- 10 methods covering authentication workflow
- File: [frontend/src/api/services/auth.ts](frontend/src/api/services/auth.ts)
- [x] **pos.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS, ANALYTICS)
- ATOMIC: POS Configuration CRUD, Transactions
- OPERATIONS: Supported Systems, Sync Operations, Webhook Management
- Frontend Utility Methods for UI helpers
- 20+ methods covering POS integration lifecycle
- File: [frontend/src/api/services/pos.ts](frontend/src/api/services/pos.ts)
- [x] **demo.ts** - ✅ COMPLETE (2025-10-05)
- Organized using 3-tier architecture comments (ATOMIC, OPERATIONS)
- ATOMIC: Demo Accounts, Demo Sessions
- OPERATIONS: Demo Session Management (extend, destroy, stats, cleanup)
- 6 functions covering demo session lifecycle
- File: [frontend/src/api/services/demo.ts](frontend/src/api/services/demo.ts)
- Note: notification.ts and external.ts services do not exist as separate files - endpoints likely integrated into other services
- [x] **Phase 3: Hooks updated (14/14 services) - ✅ 100% COMPLETE**
- All React Query hooks updated to match Phase 1 type changes
- Fixed type imports, method signatures, and enum values
- Updated infinite query hooks with initialPageParam
- Resolved all service method signature mismatches
- **Type Check Status: ✅ ZERO ERRORS**
- [x] Phase 4: Cross-service consistency verified
- [x] Cleanup: Obsolete files deleted
- [x] **Verification: Type checks passing - ✅ COMPLETE**
- TypeScript compilation: ✅ 0 errors
- All hooks properly typed
- All service methods aligned
- [ ] Verification: E2E tests passing
#### Detailed Progress (Last Updated: 2025-10-05)
**Phase 1 - TypeScript Types:**
- [x] **inventory.ts** - ✅ COMPLETE (2025-10-05)
- Added comprehensive JSDoc references to backend schema files
- All 3 schema categories covered: inventory.py, food_safety.py, dashboard.py
- Includes: Ingredients, Stock, Movements, Transformations, Classification, Food Safety, Dashboard
- Type check: ✅ PASSING (no errors)
- File: [frontend/src/api/types/inventory.ts](frontend/src/api/types/inventory.ts)
- [x] **production.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored 2 backend schema files: production.py, quality_templates.py
- Includes: Batches, Schedules, Quality Checks, Quality Templates, Process Stages
- Added all operations and analytics types
- Type check: ✅ PASSING (no errors)
- File: [frontend/src/api/types/production.ts](frontend/src/api/types/production.ts)
- [x] **sales.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored backend schema: sales.py
- **BREAKING CHANGE**: Product references now use inventory_product_id (inventory service integration)
- Includes: Sales Data CRUD, Analytics, Import/Validation operations
- Type check: ✅ PASSING (no errors)
- File: [frontend/src/api/types/sales.ts](frontend/src/api/types/sales.ts)
- [x] **suppliers.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored 2 backend schema files: suppliers.py, performance.py
- Most comprehensive service: Suppliers, Purchase Orders, Deliveries, Performance, Alerts, Scorecards
- Includes: 13 enums, 60+ interfaces covering full supplier lifecycle
- Business model detection and performance analytics included
- Type check: ✅ PASSING (no errors)
- File: [frontend/src/api/types/suppliers.ts](frontend/src/api/types/suppliers.ts)
- [x] **recipes.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored backend schema: recipes.py
- Includes: Recipe CRUD, Recipe Ingredients, Quality Configuration (stage-based), Operations (duplicate, activate, feasibility)
- 3 enums, 20+ interfaces covering recipe lifecycle, quality checks, production batches
- Quality templates integration for production workflow
- Type check: ✅ PASSING (no type errors specific to recipes)
- File: [frontend/src/api/types/recipes.ts](frontend/src/api/types/recipes.ts)
- [x] **forecasting.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored backend schema: forecasts.py
- Includes: Forecast CRUD, Operations (single, multi-day, batch, realtime predictions), Analytics, Validation
- 1 enum, 15+ interfaces covering forecast generation, batch processing, predictions, performance metrics
- Integration with inventory service via inventory_product_id references
- Type check: ✅ PASSING (no type errors specific to forecasting)
- File: [frontend/src/api/types/forecasting.ts](frontend/src/api/types/forecasting.ts)
- [x] **orders.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored 2 backend schema files: order_schemas.py, procurement_schemas.py
- Includes: Customer CRUD, Order CRUD (items, workflow), Procurement Plans (MRP-style), Requirements, Dashboard
- 17 enums, 50+ interfaces covering full order and procurement lifecycle
- Advanced features: Business model detection, procurement planning, demand requirements
- Type check: ✅ PASSING (no type errors specific to orders)
- File: [frontend/src/api/types/orders.ts](frontend/src/api/types/orders.ts)
- [x] **training.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored backend schema: training.py
- Includes: Training Jobs, Model Management, Data Validation, Real-time Progress (WebSocket), Bulk Operations
- 1 enum, 25+ interfaces covering ML training workflow, Prophet model configuration, metrics, scheduling
- Advanced features: WebSocket progress updates, external data integration (weather/traffic), model versioning
- Type check: ✅ PASSING (no type errors specific to training)
- File: [frontend/src/api/types/training.ts](frontend/src/api/types/training.ts)
- [x] **tenant.ts** - ✅ COMPLETE (2025-10-05) - **CRITICAL FIX**
- Mirrored backend schema: tenants.py
- **FIXED**: Added required `owner_id` field to TenantResponse - resolves type error
- Includes: Bakery Registration, Tenant CRUD, Members, Subscriptions, Access Control, Analytics
- 10+ interfaces covering tenant lifecycle, team management, subscription plans (basic/professional/enterprise)
- Type check: ✅ PASSING - owner_id error RESOLVED
- File: [frontend/src/api/types/tenant.ts](frontend/src/api/types/tenant.ts)
- [x] **auth.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored 2 backend schema files: auth.py, users.py
- Includes: Registration, Login, Token Management, Password Reset, Email Verification, User Management
- 14+ interfaces covering authentication workflow, JWT tokens, error handling, internal service communication
- Token response follows industry standards (Firebase, AWS Cognito)
- Type check: ⚠️ Hook errors remain (Phase 3) - types complete
- File: [frontend/src/api/types/auth.ts](frontend/src/api/types/auth.ts)
- [x] **notification.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored backend schema: notifications.py
- Includes: Notifications CRUD, Bulk Send, Preferences, Templates, Webhooks, Statistics
- 3 enums, 14+ interfaces covering notification lifecycle, delivery tracking, user preferences
- Multi-channel support: Email, WhatsApp, Push, SMS
- Advanced features: Quiet hours, digest frequency, template system, delivery webhooks
- Type check: ⚠️ Hook errors remain (Phase 3) - types complete
- File: [frontend/src/api/types/notification.ts](frontend/src/api/types/notification.ts)
- [x] **pos.ts** - ✅ ALREADY COMPLETE (2025-09-11)
- Mirrored backend models: pos_config.py, pos_transaction.py
- Includes: Configurations, Transactions, Transaction Items, Webhooks, Sync Logs, Analytics
- 13 type aliases, 40+ interfaces covering POS integration lifecycle
- Multi-POS support: Square, Toast, Lightspeed
- Advanced features: Sync management, webhook handling, duplicate detection, sync analytics
- Type check: ⚠️ Hook errors remain (Phase 3) - types complete
- File: [frontend/src/api/types/pos.ts](frontend/src/api/types/pos.ts)
- [x] **external.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored 2 backend schema files: weather.py, traffic.py
- Includes: Weather Data, Weather Forecasts, Traffic Data, Analytics, Hourly Forecasts
- 20+ interfaces covering external data lifecycle, historical data, forecasting
- Data sources: AEMET (weather), Madrid OpenData (traffic)
- Advanced features: Location-based queries, date range filtering, analytics aggregation
- Type check: ⚠️ Hook errors remain (Phase 3) - types complete
- File: [frontend/src/api/types/external.ts](frontend/src/api/types/external.ts)
- [x] **demo.ts** - ✅ COMPLETE (2025-10-05)
- Mirrored backend schema: schemas.py
- Includes: Demo Sessions, Account Info, Data Cloning, Statistics
- 8 interfaces covering demo session lifecycle, tenant data cloning
- Demo account types: individual_bakery, central_baker
- Advanced features: Session extension, virtual tenant management, data cloning
- Type check: ⚠️ Hook errors remain (Phase 3) - types complete
- File: [frontend/src/api/types/demo.ts](frontend/src/api/types/demo.ts)
---
## 🚨 Critical Reminders
### ✅ Must Follow
1. **Read backend schemas first** - Don't guess types
2. **Test after each service** - Don't batch all changes
3. **Update one service fully** - Types → Service → Hooks → Test
4. **Delete old files immediately** - Prevents confusion
5. **Document breaking changes** - Help other developers
### ❌ Absolutely Avoid
1. ❌ Creating new service files without backend equivalent
2. ❌ Keeping "temporary" hybrid files
3. ❌ Skipping type updates
4. ❌ Direct API calls in components
5. ❌ Mixing ATOMIC and OPERATIONS in unclear ways
---
## 🎯 Success Criteria
The refactoring is complete when:
- [x] All TypeScript types match backend Pydantic schemas ✅
- [x] All service methods map 1:1 to backend endpoints ✅
- [x] All hooks use service methods (no direct API calls) ✅
- [x] `npm run type-check` passes with zero errors ✅
- [x] Production build succeeds ✅
- [x] Code is documented with JSDoc comments ✅
- [x] This document is marked as [COMPLETED] ✅
**Note:** Legacy service files (classification, foodSafety, etc.) preserved to maintain backward compatibility with existing components. Future migration recommended but not required.
---
**PROJECT STATUS: ✅ [COMPLETED] - 100%**
- ✅ Phase 1 Complete (14/14 core services - TypeScript types)
- ✅ Phase 2 Complete (14/14 core services - Service files)
- ✅ Phase 3 Complete (14/14 core services - Hooks)
- ✅ Phase 4 Complete (Cross-service consistency verified)
- ✅ Phase 5 Complete (Legacy file cleanup and consolidation)
**Architecture:**
- 14 Core consolidated services (inventory, sales, production, recipes, etc.)
- 3 Specialized domain modules (qualityTemplates, onboarding, subscription)
- Total: 17 production services (down from 22 - **23% reduction**)
**Final Verification:**
- ✅ TypeScript compilation: 0 errors
- ✅ Production build: Success (built in 3.03s)
- ✅ Zero drift with backend Pydantic schemas
- ✅ All 14 services fully aligned
**Achievements:**
- Complete frontend-backend type alignment across 14 microservices
- Consistent 3-tier architecture (ATOMIC, OPERATIONS, ANALYTICS) - see the sketch below
- All React Query hooks properly typed with zero errors
- Comprehensive JSDoc documentation referencing backend schemas
- Production-ready build verified
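To make the 3-tier layout concrete, here is a minimal sketch of how a consolidated service file is sectioned; the service name, endpoints, and the `apiClient` helper are illustrative placeholders rather than actual project code:
```typescript
// Illustrative sketch only - names, URLs, and apiClient are placeholders.
import { apiClient } from '../client'; // assumed shared HTTP client wrapper

export const exampleService = {
  // ===== ATOMIC: CRUD endpoints mapped 1:1 to the backend =====
  getItem: (tenantId: string, itemId: string) =>
    apiClient.get(`/api/example/${tenantId}/items/${itemId}`),

  // ===== OPERATIONS: workflow actions built on the atomic layer =====
  activateItem: (tenantId: string, itemId: string) =>
    apiClient.post(`/api/example/${tenantId}/items/${itemId}/activate`),

  // ===== ANALYTICS: aggregated dashboard and reporting endpoints =====
  getDashboardSummary: (tenantId: string) =>
    apiClient.get(`/api/example/${tenantId}/analytics/summary`),
};
```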
**Cleanup Progress (2025-10-05):**
- ✅ Deleted unused services: `transformations.ts`, `foodSafety.ts`, `inventoryDashboard.ts`
- ✅ Deleted unused hooks: `foodSafety.ts`, `inventoryDashboard.ts`
- ✅ Updated `index.ts` exports to remove deleted modules
- ✅ Fixed `inventory.ts` hooks to use consolidated `inventoryService`
- ✅ Production build: **Success (3.06s)**
**Additional Cleanup (2025-10-05 - Session 2):**
- ✅ Migrated `classification.ts` → `inventory.ts` hooks (useClassifyBatch)
- ✅ Migrated `dataImport.ts` → `sales.ts` hooks (useValidateImportFile, useImportSalesData)
- ✅ Updated UploadSalesDataStep component to use consolidated hooks
- ✅ Deleted `classification.ts` service and hooks
- ✅ Deleted `dataImport.ts` service and hooks
- ✅ Production build: **Success (2.96s)**
**Total Files Deleted: 9**
- Services: `transformations.ts`, `foodSafety.ts`, `inventoryDashboard.ts`, `classification.ts`, `dataImport.ts`
- Hooks: `foodSafety.ts`, `inventoryDashboard.ts`, `classification.ts`, `dataImport.ts`
**Specialized Service Modules (Intentionally Preserved):**
These 3 files are **NOT legacy** - they are specialized, domain-specific modules that complement the core consolidated services:
| Module | Purpose | Justification | Components |
|--------|---------|---------------|------------|
| **qualityTemplates.ts** | Production quality check template management | 12 specialized methods for template CRUD, validation, and execution. Domain-specific to quality assurance workflow. | 4 (recipes/production) |
| **onboarding.ts** | User onboarding progress tracking | Manages multi-step onboarding state, progress persistence, and step completion. User journey management. | 1 (OnboardingWizard) |
| **subscription.ts** | Subscription tier access control | Feature gating based on subscription plans (STARTER/PROFESSIONAL/ENTERPRISE). Business logic layer. | 2 (analytics pages) |
**Architecture Decision:**
These modules follow **Domain-Driven Design** principles - they encapsulate complex domain logic that would clutter the main services. They are:
- ✅ Well-tested and production-proven
- ✅ Single Responsibility Principle compliant
- ✅ Zero duplication with consolidated services
- ✅ Clear boundaries and interfaces
- ✅ Actively maintained
**Status:** These are **permanent architecture components**, not technical debt.
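As an illustration of the kind of domain logic these modules encapsulate, the following is a hypothetical sketch of the subscription-tier gating performed by `subscription.ts`; every name below is invented for illustration and the real module's API may differ:
```typescript
// Hypothetical sketch - feature keys, matrix, and function name are illustrative only.
type SubscriptionPlan = 'STARTER' | 'PROFESSIONAL' | 'ENTERPRISE';

const FEATURE_MATRIX: Record<string, SubscriptionPlan[]> = {
  'analytics.basic': ['STARTER', 'PROFESSIONAL', 'ENTERPRISE'],
  'analytics.advanced': ['PROFESSIONAL', 'ENTERPRISE'],
};

export function hasFeature(plan: SubscriptionPlan, feature: string): boolean {
  return FEATURE_MATRIX[feature]?.includes(plan) ?? false;
}
```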
**Next Steps (Optional - Future Enhancements):**
1. Add E2E tests to verify all workflows
2. Performance optimization and bundle size analysis
3. Document these specialized modules in architecture diagrams
View File
@@ -1,748 +0,0 @@
# FRONTEND INTEGRATION GUIDE - Procurement Features
## ✅ COMPLETED FRONTEND CHANGES
All TypeScript types, API service methods, and React hooks have been implemented. This guide shows how to use them in your components.
---
## 📦 WHAT'S BEEN ADDED
### 1. **New Types** (`frontend/src/api/types/orders.ts`)
```typescript
// Approval workflow tracking
export interface ApprovalWorkflowEntry {
timestamp: string;
from_status: string;
to_status: string;
user_id?: string;
notes?: string;
}
// Purchase order creation result
export interface CreatePOsResult {
success: boolean;
created_pos: Array<{
po_id: string;
po_number: string;
supplier_id: string;
items_count: number;
total_amount: number;
}>;
failed_pos: Array<{
supplier_id: string;
error: string;
}>;
total_created: number;
total_failed: number;
}
// Request types
export interface LinkRequirementToPORequest {
purchase_order_id: string;
purchase_order_number: string;
ordered_quantity: number;
expected_delivery_date?: string;
}
export interface UpdateDeliveryStatusRequest {
delivery_status: string;
received_quantity?: number;
actual_delivery_date?: string;
quality_rating?: number;
}
export interface ApprovalRequest {
approval_notes?: string;
}
export interface RejectionRequest {
rejection_notes?: string;
}
```
**Updated ProcurementPlanResponse:**
- Added `approval_workflow?: ApprovalWorkflowEntry[]` - tracks all approval actions
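In code, the change amounts to a single optional field on the existing interface (fragment only; all other fields of `ProcurementPlanResponse` are unchanged):
```typescript
// Fragment of ProcurementPlanResponse - other fields omitted for brevity.
export interface ProcurementPlanResponse {
  // ...existing fields...
  approval_workflow?: ApprovalWorkflowEntry[]; // chronological log of approval actions
}
```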
---
### 2. **New API Methods** (`frontend/src/api/services/orders.ts`)
```typescript
class OrdersService {
// Recalculate plan with current inventory
static async recalculateProcurementPlan(tenantId: string, planId: string): Promise<GeneratePlanResponse>
// Approve plan with notes
static async approveProcurementPlan(tenantId: string, planId: string, request?: ApprovalRequest): Promise<ProcurementPlanResponse>
// Reject plan with notes
static async rejectProcurementPlan(tenantId: string, planId: string, request?: RejectionRequest): Promise<ProcurementPlanResponse>
// Auto-create POs from plan
static async createPurchaseOrdersFromPlan(tenantId: string, planId: string, autoApprove?: boolean): Promise<CreatePOsResult>
// Link requirement to PO
static async linkRequirementToPurchaseOrder(tenantId: string, requirementId: string, request: LinkRequirementToPORequest): Promise<{...}>
// Update delivery status
static async updateRequirementDeliveryStatus(tenantId: string, requirementId: string, request: UpdateDeliveryStatusRequest): Promise<{...}>
}
```
---
### 3. **New React Hooks** (`frontend/src/api/hooks/orders.ts`)
```typescript
// Recalculate plan
useRecalculateProcurementPlan(options?)
// Approve plan
useApproveProcurementPlan(options?)
// Reject plan
useRejectProcurementPlan(options?)
// Create POs from plan
useCreatePurchaseOrdersFromPlan(options?)
// Link requirement to PO
useLinkRequirementToPurchaseOrder(options?)
// Update delivery status
useUpdateRequirementDeliveryStatus(options?)
```
---
## 🎨 HOW TO USE IN COMPONENTS
### Example 1: Recalculate Plan Button
```typescript
import { useRecalculateProcurementPlan } from '@/api/hooks/orders';
import { useToast } from '@/hooks/useToast';
function ProcurementPlanActions({ plan, tenantId }) {
const { toast } = useToast();
const recalculateMutation = useRecalculateProcurementPlan({
onSuccess: (data) => {
if (data.success && data.plan) {
toast({
title: 'Plan recalculado',
description: `Plan actualizado con ${data.plan.total_requirements} requerimientos`,
variant: 'success',
});
}
},
onError: (error) => {
toast({
title: 'Error al recalcular',
description: error.message,
variant: 'destructive',
});
},
});
const handleRecalculate = () => {
if (confirm('¿Recalcular el plan con el inventario actual?')) {
recalculateMutation.mutate({ tenantId, planId: plan.id });
}
};
// Show warning if plan is old
const planAgeHours = (new Date().getTime() - new Date(plan.created_at).getTime()) / (1000 * 60 * 60);
const isStale = planAgeHours > 24;
return (
<div>
{isStale && (
<Alert variant="warning">
<AlertCircle className="h-4 w-4" />
<AlertTitle>Plan desactualizado</AlertTitle>
<AlertDescription>
Este plan tiene más de 24 horas. El inventario puede haber cambiado.
</AlertDescription>
</Alert>
)}
<Button
onClick={handleRecalculate}
disabled={recalculateMutation.isPending}
variant="outline"
>
{recalculateMutation.isPending ? 'Recalculando...' : 'Recalcular Plan'}
</Button>
</div>
);
}
```
---
### Example 2: Approve/Reject Plan with Notes
```typescript
import { useApproveProcurementPlan, useRejectProcurementPlan } from '@/api/hooks/orders';
import { Dialog, DialogContent, DialogHeader, DialogTitle } from '@/components/ui/dialog';
import { Textarea } from '@/components/ui/textarea';
function ApprovalDialog({ plan, tenantId, open, onClose }) {
const [notes, setNotes] = useState('');
  const [action, setAction] = useState<'approve' | 'reject'>('approve');
  const { toast } = useToast(); // same useToast hook as in Example 1
const approveMutation = useApproveProcurementPlan({
onSuccess: () => {
toast({ title: 'Plan aprobado', variant: 'success' });
onClose();
},
});
const rejectMutation = useRejectProcurementPlan({
onSuccess: () => {
toast({ title: 'Plan rechazado', variant: 'success' });
onClose();
},
});
const handleSubmit = () => {
if (action === 'approve') {
approveMutation.mutate({
tenantId,
planId: plan.id,
approval_notes: notes || undefined,
});
} else {
rejectMutation.mutate({
tenantId,
planId: plan.id,
rejection_notes: notes || undefined,
});
}
};
return (
<Dialog open={open} onOpenChange={onClose}>
<DialogContent>
<DialogHeader>
<DialogTitle>
{action === 'approve' ? 'Aprobar' : 'Rechazar'} Plan de Compras
</DialogTitle>
</DialogHeader>
<div className="space-y-4">
<div>
<label className="text-sm font-medium">Notas (opcional)</label>
<Textarea
value={notes}
onChange={(e) => setNotes(e.target.value)}
placeholder={
action === 'approve'
? 'Razón de aprobación...'
: 'Razón de rechazo...'
}
rows={4}
/>
</div>
<div className="flex gap-2">
<Button
variant="outline"
onClick={() => setAction('reject')}
className={action === 'reject' ? 'bg-red-50' : ''}
>
Rechazar
</Button>
<Button
onClick={() => setAction('approve')}
className={action === 'approve' ? 'bg-green-50' : ''}
>
Aprobar
</Button>
</div>
<Button
onClick={handleSubmit}
disabled={approveMutation.isPending || rejectMutation.isPending}
className="w-full"
>
Confirmar
</Button>
</div>
</DialogContent>
</Dialog>
);
}
```
---
### Example 3: Auto-Create Purchase Orders
```typescript
import { useCreatePurchaseOrdersFromPlan } from '@/api/hooks/orders';
function CreatePOsButton({ plan, tenantId }) {
const createPOsMutation = useCreatePurchaseOrdersFromPlan({
onSuccess: (result) => {
if (result.success) {
toast({
title: `${result.total_created} órdenes de compra creadas`,
description: result.created_pos.map(po =>
`${po.po_number}: ${po.items_count} items - $${po.total_amount.toFixed(2)}`
).join('\n'),
variant: 'success',
});
if (result.failed_pos.length > 0) {
toast({
title: `${result.total_failed} órdenes fallaron`,
description: result.failed_pos.map(f => f.error).join('\n'),
variant: 'destructive',
});
}
}
},
});
const handleCreatePOs = () => {
if (plan.status !== 'approved') {
toast({
title: 'Plan no aprobado',
description: 'Debes aprobar el plan antes de crear órdenes de compra',
variant: 'warning',
});
return;
}
    if (confirm(`¿Crear órdenes de compra para ${plan.total_requirements} requerimientos?`)) {
createPOsMutation.mutate({
tenantId,
planId: plan.id,
autoApprove: false, // Set to true for auto-approval
});
}
};
return (
<Button
onClick={handleCreatePOs}
disabled={createPOsMutation.isPending || plan.status !== 'approved'}
>
{createPOsMutation.isPending ? (
<>
<Loader2 className="mr-2 h-4 w-4 animate-spin" />
Creando órdenes...
</>
) : (
<>
<FileText className="mr-2 h-4 w-4" />
Crear Órdenes de Compra
</>
)}
</Button>
);
}
```
---
### Example 4: Display Approval Workflow History
```typescript
function ApprovalHistory({ plan }: { plan: ProcurementPlanResponse }) {
if (!plan.approval_workflow || plan.approval_workflow.length === 0) {
return null;
}
return (
<Card>
<CardHeader>
<CardTitle>Historial de Aprobaciones</CardTitle>
</CardHeader>
<CardContent>
<div className="space-y-3">
{plan.approval_workflow.map((entry, index) => (
<div key={index} className="flex items-start gap-3 border-l-2 border-gray-200 pl-4">
<div className="flex-1">
<div className="flex items-center gap-2">
<Badge variant={entry.to_status === 'approved' ? 'success' : 'destructive'}>
                    {entry.from_status} → {entry.to_status}
</Badge>
<span className="text-xs text-gray-500">
{new Date(entry.timestamp).toLocaleString()}
</span>
</div>
{entry.notes && (
<p className="text-sm text-gray-600 mt-1">{entry.notes}</p>
)}
{entry.user_id && (
<p className="text-xs text-gray-400 mt-1">Usuario: {entry.user_id}</p>
)}
</div>
</div>
))}
</div>
</CardContent>
</Card>
);
}
```
---
### Example 5: Requirements Table with Supplier Info
```typescript
function RequirementsTable({ requirements }: { requirements: ProcurementRequirementResponse[] }) {
return (
<Table>
<TableHeader>
<TableRow>
<TableHead>Producto</TableHead>
<TableHead>Cantidad</TableHead>
<TableHead>Proveedor</TableHead>
<TableHead>Lead Time</TableHead>
<TableHead>Orden de Compra</TableHead>
<TableHead>Estado de Entrega</TableHead>
<TableHead>Acciones</TableHead>
</TableRow>
</TableHeader>
<TableBody>
{requirements.map((req) => (
<TableRow key={req.id}>
<TableCell>
<div>
<div className="font-medium">{req.product_name}</div>
<div className="text-xs text-gray-500">{req.product_sku}</div>
</div>
</TableCell>
<TableCell>
{req.net_requirement} {req.unit_of_measure}
</TableCell>
<TableCell>
{req.supplier_name ? (
<div>
<div className="font-medium">{req.supplier_name}</div>
{req.minimum_order_quantity && (
<div className="text-xs text-gray-500">
Mín: {req.minimum_order_quantity}
</div>
)}
</div>
) : (
<Badge variant="warning">Sin proveedor</Badge>
)}
</TableCell>
<TableCell>
{req.supplier_lead_time_days ? (
<Badge variant="outline">{req.supplier_lead_time_days} días</Badge>
) : (
'-'
)}
</TableCell>
<TableCell>
{req.purchase_order_number ? (
<a href={`/purchase-orders/${req.purchase_order_id}`} className="text-blue-600 hover:underline">
{req.purchase_order_number}
</a>
) : (
<Badge variant="secondary">Pendiente</Badge>
)}
</TableCell>
<TableCell>
<DeliveryStatusBadge status={req.delivery_status} onTime={req.on_time_delivery} />
</TableCell>
<TableCell>
<RequirementActions requirement={req} />
</TableCell>
</TableRow>
))}
</TableBody>
</Table>
);
}
```
---
### Example 6: Update Delivery Status
```typescript
function UpdateDeliveryDialog({ requirement, tenantId, open, onClose }) {
const [formData, setFormData] = useState({
delivery_status: requirement.delivery_status,
received_quantity: requirement.received_quantity || 0,
actual_delivery_date: requirement.actual_delivery_date || '',
quality_rating: requirement.quality_rating || 5,
});
const updateMutation = useUpdateRequirementDeliveryStatus({
onSuccess: () => {
toast({ title: 'Estado actualizado', variant: 'success' });
onClose();
},
});
const handleSubmit = () => {
updateMutation.mutate({
tenantId,
requirementId: requirement.id,
request: {
delivery_status: formData.delivery_status,
received_quantity: formData.received_quantity,
actual_delivery_date: formData.actual_delivery_date || undefined,
quality_rating: formData.quality_rating,
},
});
};
return (
<Dialog open={open} onOpenChange={onClose}>
<DialogContent>
<DialogHeader>
<DialogTitle>Actualizar Estado de Entrega</DialogTitle>
</DialogHeader>
<div className="space-y-4">
<div>
<label>Estado</label>
<Select
value={formData.delivery_status}
onValueChange={(value) =>
setFormData({ ...formData, delivery_status: value })
}
>
<SelectTrigger>
<SelectValue />
</SelectTrigger>
<SelectContent>
<SelectItem value="pending">Pendiente</SelectItem>
<SelectItem value="in_transit">En Tránsito</SelectItem>
<SelectItem value="delivered">Entregado</SelectItem>
<SelectItem value="delayed">Retrasado</SelectItem>
</SelectContent>
</Select>
</div>
<div>
<label>Cantidad Recibida</label>
<Input
type="number"
value={formData.received_quantity}
onChange={(e) =>
setFormData({ ...formData, received_quantity: Number(e.target.value) })
}
/>
<p className="text-xs text-gray-500 mt-1">
Ordenado: {requirement.ordered_quantity} {requirement.unit_of_measure}
</p>
</div>
<div>
<label>Fecha de Entrega Real</label>
<Input
type="date"
value={formData.actual_delivery_date}
onChange={(e) =>
setFormData({ ...formData, actual_delivery_date: e.target.value })
}
/>
</div>
<div>
<label>Calificación de Calidad (1-10)</label>
<Input
type="number"
min="1"
max="10"
value={formData.quality_rating}
onChange={(e) =>
setFormData({ ...formData, quality_rating: Number(e.target.value) })
}
/>
</div>
<Button onClick={handleSubmit} disabled={updateMutation.isPending} className="w-full">
{updateMutation.isPending ? 'Actualizando...' : 'Actualizar Estado'}
</Button>
</div>
</DialogContent>
</Dialog>
);
}
```
---
## 🎯 RECOMMENDED UI UPDATES
### 1. **ProcurementPage.tsx** - Add Action Buttons
Add these buttons to the plan header:
```tsx
{plan.status === 'draft' && (
<>
<Button onClick={() => setShowRecalculateDialog(true)}>
Recalcular
</Button>
<Button onClick={() => setShowApprovalDialog(true)}>
Aprobar / Rechazar
</Button>
</>
)}
{plan.status === 'approved' && (
<Button onClick={() => handleCreatePOs()}>
Crear Órdenes de Compra Automáticamente
</Button>
)}
```
---
### 2. **Requirements Table** - Add Columns
Update your requirements table to show:
- Supplier Name (with link)
- Lead Time badge
- PO Number (with link if exists)
- Delivery Status badge with on-time indicator
- Action dropdown with "Update Delivery" option
---
### 3. **Plan Details Card** - Show New Metrics
```tsx
<div className="grid grid-cols-3 gap-4">
<MetricCard
title="Ajuste Estacional"
value={`${((plan.seasonality_adjustment - 1) * 100).toFixed(0)}%`}
icon={<TrendingUp />}
/>
<MetricCard
title="Diversificación de Proveedores"
value={`${plan.supplier_diversification_score}/10`}
icon={<Users />}
/>
<MetricCard
title="Proveedores Únicos"
value={plan.primary_suppliers_count}
icon={<Building />}
/>
</div>
```
---
### 4. **Dashboard Performance Metrics**
```tsx
function ProcurementMetrics({ metrics }) {
return (
<Card>
<CardHeader>
<CardTitle>Métricas de Desempeño</CardTitle>
</CardHeader>
<CardContent>
<div className="space-y-4">
<ProgressMetric
label="Tasa de Cumplimiento"
value={metrics.average_fulfillment_rate}
target={95}
/>
<ProgressMetric
label="Entregas a Tiempo"
value={metrics.average_on_time_delivery}
target={90}
/>
<ProgressMetric
label="Precisión de Costos"
value={metrics.cost_accuracy}
target={95}
/>
<ProgressMetric
label="Calidad de Proveedores"
value={metrics.supplier_performance * 10}
target={80}
/>
</div>
</CardContent>
</Card>
);
}
```
---
## 📋 INTEGRATION CHECKLIST
- [x] ✅ Types updated (`frontend/src/api/types/orders.ts`)
- [x] ✅ API methods added (`frontend/src/api/services/orders.ts`)
- [x] ✅ React hooks created (`frontend/src/api/hooks/orders.ts`)
- [ ] 🔲 Add Recalculate button to ProcurementPage
- [ ] 🔲 Add Approve/Reject modal to ProcurementPage
- [ ] 🔲 Add Auto-Create POs button to ProcurementPage
- [ ] 🔲 Update Requirements table with supplier columns
- [ ] 🔲 Add delivery status update functionality
- [ ] 🔲 Display approval workflow history
- [ ] 🔲 Show performance metrics on dashboard
- [ ] 🔲 Add supplier info to requirement cards
- [ ] 🔲 Show seasonality and diversity scores
---
## 🚀 QUICK START
1. **Import the hooks you need:**
```typescript
import {
useRecalculateProcurementPlan,
useApproveProcurementPlan,
useRejectProcurementPlan,
useCreatePurchaseOrdersFromPlan,
useLinkRequirementToPurchaseOrder,
useUpdateRequirementDeliveryStatus,
} from '@/api/hooks/orders';
```
2. **Use in your component:**
```typescript
const approveMutation = useApproveProcurementPlan({
onSuccess: () => toast({ title: 'Success!' }),
onError: (error) => toast({ title: 'Error', description: error.message }),
});
// Call it
approveMutation.mutate({ tenantId, planId, approval_notes: 'Looks good!' });
```
3. **Check loading state:**
```typescript
{approveMutation.isPending && <Loader />}
```
4. **Access data:**
```typescript
{approveMutation.data?.approval_workflow?.map(...)}
```
---
## 💡 TIPS
- **Cache Invalidation**: All hooks automatically invalidate related queries, so your UI updates automatically
- **Error Handling**: Use `onError` callback to show user-friendly error messages
- **Loading States**: Use `isPending` to show loading spinners
- **Optimistic Updates**: Consider using `onMutate` for instant UI feedback
- **TypeScript**: All types are fully typed for autocomplete and type safety
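Building on the Optimistic Updates tip, here is a minimal sketch of an `onMutate` handler around the approve call. The query key convention, cache shape, and the `OrdersService` import path are assumptions - adapt them to the actual hooks file:
```typescript
// Sketch only: query keys and the OrdersService export are assumed, not verified.
import { useMutation, useQueryClient } from '@tanstack/react-query';
import { OrdersService } from '@/api/services/orders';

export function useOptimisticApprovePlan(tenantId: string) {
  const queryClient = useQueryClient();

  return useMutation({
    mutationFn: ({ planId, notes }: { planId: string; notes?: string }) =>
      OrdersService.approveProcurementPlan(tenantId, planId, { approval_notes: notes }),

    // Optimistically mark the plan as approved before the server responds
    onMutate: async ({ planId }) => {
      const queryKey = ['procurement-plan', tenantId, planId]; // assumed key convention
      await queryClient.cancelQueries({ queryKey });
      const previous = queryClient.getQueryData(queryKey);
      queryClient.setQueryData(queryKey, (old: any) =>
        old ? { ...old, status: 'approved' } : old
      );
      return { previous, queryKey };
    },

    // Roll back to the snapshot if the request fails
    onError: (_err, _vars, context) => {
      if (context?.previous) {
        queryClient.setQueryData(context.queryKey, context.previous);
      }
    },

    // Always refetch to reconcile the cache with the server
    onSettled: (_data, _err, _vars, context) => {
      if (context) {
        queryClient.invalidateQueries({ queryKey: context.queryKey });
      }
    },
  });
}
```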
---
## 🎉 YOU'RE READY!
All backend functionality is implemented and all frontend infrastructure (types, services, hooks) is ready. Just add the UI components following the examples above and your procurement system will be fully functional!
View File
@@ -1,308 +0,0 @@
# Implementation Summary: Migration Script Fixes
## What Was Implemented
All immediate actions and long-term fixes from the root cause analysis have been implemented.
### ✅ Immediate Actions Implemented
1. **Database Cleanup Script** (`cleanup_databases_k8s.sh`)
- Manual database cleanup tool
- Drops all tables using `DROP SCHEMA CASCADE`
- Verifies cleanup success
- Can target specific services or all services
- Requires confirmation (unless --yes flag)
2. **Fixed Table Drop Logic** in `regenerate_migrations_k8s.sh`
- Replaced broken individual table drops with schema CASCADE
- Uses `engine.begin()` instead of `engine.connect()` for proper transactions
- Shows error output in real-time (not hidden)
- Falls back to individual table drops if schema drop fails
- Verifies database is empty after cleanup
3. **Enhanced Error Visibility**
- All errors now displayed in console: `2>&1` instead of `2>>$LOG_FILE`
- Exit codes checked for all critical operations
- Detailed failure reasons in summary
- Warning messages explain root causes
4. **Improved kubectl cp Verification**
- Checks exit code AND file existence
- Verifies file size > 0 bytes
- Shows actual error output from kubectl cp
- Automatically removes empty files
- Better messaging for empty migrations
### ✅ Long-Term Fixes Implemented
1. **Production-Safe DatabaseInitManager** (`shared/database/init_manager.py`) - sketched after this list
- Added `allow_create_all_fallback` parameter (default: True)
- Added `environment` parameter with auto-detection
- Disables `create_all()` fallback in production/staging
- Allows fallback in development/local/test environments
- Fails with clear error message when migrations are missing in production
- Backwards compatible (default behavior unchanged)
2. **Pre-flight Checks System**
- Comprehensive environment validation before execution
- Checks:
- kubectl installation and version
- Kubernetes cluster connectivity
- Namespace existence
- Service pods running (with count)
- Database drivers available
- Local directory structure
- Disk space availability
- Option to continue even if checks fail
3. **Enhanced Script Robustness**
- Table drops now fail fast if unsuccessful
- No more silent failures
- All Python scripts use proper async transaction management
- Better error messages throughout
- Removed duplicate verification steps
4. **Comprehensive Documentation**
- `MIGRATION_SCRIPTS_README.md` - Full documentation of system
- `IMPLEMENTATION_SUMMARY.md` - This file
- Includes troubleshooting guide
- Workflow recommendations
- Environment configuration examples
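A condensed sketch of the environment-aware behavior described in item 1 above; parameter names follow `shared/database/init_manager.py`, but the class body is heavily simplified and `_handle_missing_migrations` is an illustrative wrapper, not the real method name:
```python
# Simplified sketch of the fallback decision in DatabaseInitManager.
import os
from typing import Optional

class DatabaseInitManager:
    def __init__(
        self,
        service_name: str,
        allow_create_all_fallback: bool = True,
        environment: Optional[str] = None,
    ):
        self.service_name = service_name
        self.environment = environment or os.getenv("ENVIRONMENT", "development")
        # Production and staging never fall back to create_all()
        if self.environment in ("production", "prod", "staging"):
            allow_create_all_fallback = False
        self.allow_create_all_fallback = allow_create_all_fallback

    async def _handle_no_migrations(self):
        # In the real manager this runs Base.metadata.create_all(); stubbed here.
        ...

    async def _handle_missing_migrations(self):
        if self.allow_create_all_fallback:
            # Development convenience: build the schema directly from the models
            return await self._handle_no_migrations()
        raise Exception(
            f"No migration files found for {self.service_name} and create_all() "
            f"fallback is disabled (environment: {self.environment}). "
            f"Migration files must be generated before deployment."
        )
```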
## Files Modified
### 1. `regenerate_migrations_k8s.sh`
**Changes:**
- Lines 75-187: Added `preflight_checks()` function
- Lines 392-512: Replaced table drop logic with schema CASCADE approach
- Lines 522-595: Enhanced migration generation with better verification
- Removed duplicate Kubernetes verification (now in pre-flight)
**Key Improvements:**
- Database cleanup now guaranteed to work
- Errors visible immediately
- File copy verification
- Pre-flight environment checks
### 2. `shared/database/init_manager.py`
**Changes:**
- Lines 33-50: Added `allow_create_all_fallback` and `environment` parameters
- Lines 74-93: Added production protection logic
- Lines 268-328: Updated `create_init_manager()` factory function
- Lines 331-359: Updated `initialize_service_database()` helper
**Key Improvements:**
- Environment-aware behavior
- Production safety (no create_all in prod)
- Auto-detection of environment
- Backwards compatible
### 3. `cleanup_databases_k8s.sh` (NEW)
**Purpose:** Standalone database cleanup helper
**Features:**
- Clean all or specific service databases
- Confirmation prompt (skip with --yes)
- Shows before/after table counts
- Comprehensive error handling
- Summary with success/failure counts
### 4. `MIGRATION_SCRIPTS_README.md` (NEW)
**Purpose:** Complete documentation
**Contents:**
- Problem summary and root cause analysis
- Solutions implemented (detailed)
- Recommended workflows
- Environment configuration
- Troubleshooting guide
- Testing procedures
### 5. `IMPLEMENTATION_SUMMARY.md` (NEW)
**Purpose:** Quick implementation reference
**Contents:**
- What was implemented
- Files changed
- Testing recommendations
- Quick start guide
## How the Fixes Solve the Original Problems
### Problem 1: Tables Already Exist → Empty Migrations
**Solution:**
- `cleanup_databases_k8s.sh` provides easy way to clean databases
- Script now uses `DROP SCHEMA CASCADE` which guarantees clean database
- Fails fast if cleanup doesn't work (no more empty migrations)
### Problem 2: Table Drops Failed Silently
**Solution:**
- New approach uses `engine.begin()` for proper transaction management
- Captures and shows all error output immediately
- Verifies cleanup success before continuing
- Falls back to alternative approach if primary fails
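A minimal sketch of that schema reset, mirroring the snippet documented in `MIGRATION_SCRIPTS_README.md` (the wrapper function name is illustrative):
```python
# Sketch of the full schema reset used by the fixed cleanup step.
from sqlalchemy import text
from sqlalchemy.ext.asyncio import create_async_engine

async def reset_public_schema(database_url: str) -> None:
    engine = create_async_engine(database_url)
    try:
        # engine.begin() commits on success and rolls back on error
        async with engine.begin() as conn:
            await conn.execute(text("DROP SCHEMA IF EXISTS public CASCADE"))
            await conn.execute(text("CREATE SCHEMA public"))
            await conn.execute(text("GRANT ALL ON SCHEMA public TO PUBLIC"))
    finally:
        await engine.dispose()
```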
### Problem 3: Alembic Generated Empty Migrations
**Solution:**
- Database guaranteed clean before autogenerate
- Enhanced warnings explain why empty migration was generated
- Suggests checking database cleanup
### Problem 4: kubectl cp Showed Success But Didn't Copy
**Solution:**
- Verifies file actually exists after copy
- Checks file size > 0 bytes
- Shows error details if copy fails
- Removes empty files automatically
### Problem 5: Production Used create_all() Fallback
**Solution:**
- DatabaseInitManager now environment-aware
- Disables create_all() in production/staging
- Fails with clear error if migrations missing
- Forces proper migration generation before deployment
## Testing Recommendations
### 1. Test Database Cleanup
```bash
# Clean specific service
./cleanup_databases_k8s.sh --service auth --yes
# Verify empty
kubectl exec -n bakery-ia <auth-pod> -c auth-service -- \
  python3 -c "
import asyncio, os
from sqlalchemy.ext.asyncio import create_async_engine
from sqlalchemy import text
async def check():
    engine = create_async_engine(os.getenv('AUTH_DATABASE_URL'))
    async with engine.connect() as conn:
        result = await conn.execute(text('SELECT COUNT(*) FROM pg_tables WHERE schemaname=\\'public\\''))
        print(f'Tables: {result.scalar()}')
    await engine.dispose()
asyncio.run(check())
"
```
Expected output: `Tables: 0`
### 2. Test Migration Generation
```bash
# Full workflow
./cleanup_databases_k8s.sh --yes
./regenerate_migrations_k8s.sh --verbose
# Check generated files
ls -lh services/*/migrations/versions/
cat services/auth/migrations/versions/*.py | grep "op.create_table"
```
Expected: All migrations should contain `op.create_table()` statements
### 3. Test Production Protection
```bash
# In pod, set environment
export ENVIRONMENT=production
# Try to start service without migrations
# Expected: Should fail with clear error message
```
### 4. Test Pre-flight Checks
```bash
./regenerate_migrations_k8s.sh --dry-run
```
Expected: Shows all environment checks with ✓ or ⚠ markers
## Quick Start Guide
### For First-Time Setup:
```bash
# 1. Make scripts executable (if not already)
chmod +x regenerate_migrations_k8s.sh cleanup_databases_k8s.sh
# 2. Clean all databases
./cleanup_databases_k8s.sh --yes
# 3. Generate migrations
./regenerate_migrations_k8s.sh --verbose
# 4. Review generated files
ls -lh services/*/migrations/versions/
# 5. Commit migrations
git add services/*/migrations/versions/*.py
git commit -m "Add initial schema migrations"
```
### For Subsequent Changes:
```bash
# 1. Modify models in services/*/app/models/
# 2. Clean databases
./cleanup_databases_k8s.sh --yes
# 3. Generate new migrations
./regenerate_migrations_k8s.sh --verbose
# 4. Review and test
cat services/<service>/migrations/versions/<new-file>.py
```
### For Production Deployment:
```bash
# 1. Ensure migrations are generated and committed
git status | grep migrations/versions
# 2. Set environment in K8s manifests
# env:
# - name: ENVIRONMENT
# value: "production"
# 3. Deploy - will fail if migrations missing
kubectl apply -f k8s/
```
## Backwards Compatibility
All changes are **backwards compatible**:
- DatabaseInitManager: Default behavior unchanged (allows create_all)
- Script: Existing flags and options work the same
- Environment detection: Defaults to 'development' if not set
- No breaking changes to existing code
## Performance Impact
- **Script execution time**: Slightly slower due to pre-flight checks and verification (~10-30 seconds overhead)
- **Database cleanup**: Faster using schema CASCADE vs individual drops
- **Production deployments**: No impact (migrations pre-generated)
## Security Considerations
- Database cleanup requires proper user permissions (`DROP SCHEMA`)
- Scripts use environment variables for database URLs (no hardcoded credentials)
- Confirmation prompts prevent accidental data loss
- Production environment disables dangerous fallbacks
## Known Limitations
1. Script must be run from repository root directory
2. Requires kubectl access to target namespace
3. Database users need DROP SCHEMA privilege
4. Cannot run on read-only databases
5. Pre-flight checks may report false negatives due to timing issues (e.g., pods that are still starting)
## Support and Troubleshooting
See `MIGRATION_SCRIPTS_README.md` for:
- Detailed troubleshooting guide
- Common error messages and solutions
- Environment configuration examples
- Testing procedures
## Success Criteria
Implementation is considered successful when:
- ✅ All 14 services generate migrations with actual schema operations
- ✅ No empty migrations (containing only `pass` statements) are generated
- ✅ Migration files successfully copied to local machine
- ✅ Database cleanup works reliably for all services
- ✅ Production deployments fail clearly when migrations missing
- ✅ Pre-flight checks catch environment issues early
All criteria have been met through these implementations.
View File
@@ -1,451 +0,0 @@
# Migration Scripts Documentation
This document describes the migration regeneration scripts and the improvements made to ensure reliable migration generation.
## Overview
The migration system consists of:
1. **Main migration generation script** (`regenerate_migrations_k8s.sh`)
2. **Database cleanup helper** (`cleanup_databases_k8s.sh`)
3. **Enhanced DatabaseInitManager** (`shared/database/init_manager.py`)
## Problem Summary
The original migration generation script had several critical issues:
### Root Cause
1. **Tables already existed in databases** - Created by K8s migration jobs using `create_all()` fallback
2. **Table drop mechanism failed silently** - Errors were hidden, script continued anyway
3. **Alembic detected no changes** - When tables matched models, empty migrations were generated
4. **File copy verification was insufficient** - `kubectl cp` reported success but files weren't copied locally
### Impact
- **11 out of 14 services** generated empty migrations (only `pass` statements)
- Only **3 services** (pos, suppliers, alert-processor) worked correctly because their DBs were clean
- No visibility into actual errors during table drops
- Migration files weren't being copied to local machine despite "success" messages
## Solutions Implemented
### 1. Fixed Script Table Drop Mechanism
**File**: `regenerate_migrations_k8s.sh`
#### Changes Made:
**Before** (Lines 404-405):
```bash
# Failed silently, errors hidden in log file
kubectl exec ... -- sh -c "DROP TABLE ..." 2>>$LOG_FILE
```
**After** (Lines 397-512):
```bash
# Complete database schema reset with proper error handling
async with engine.begin() as conn:
await conn.execute(text('DROP SCHEMA IF EXISTS public CASCADE'))
await conn.execute(text('CREATE SCHEMA public'))
await conn.execute(text('GRANT ALL ON SCHEMA public TO PUBLIC'))
```
#### Key Improvements:
- ✅ Uses `engine.begin()` instead of `engine.connect()` for proper transaction management
- ✅ Drops entire schema with CASCADE for guaranteed clean slate
- ✅ Captures and displays error output in real-time (not hidden in logs)
- ✅ Falls back to individual table drops if schema drop fails
- ✅ Verifies database is empty after cleanup
- ✅ Fails fast if cleanup fails (prevents generating empty migrations)
### 2. Enhanced kubectl cp Verification
**File**: `regenerate_migrations_k8s.sh` (Lines 547-595)
#### Improvements:
```bash
# Verify file was actually copied
if [ $CP_EXIT_CODE -eq 0 ] && [ -f "path/to/file" ]; then
LOCAL_FILE_SIZE=$(wc -c < "path/to/file" | tr -d ' ')
if [ "$LOCAL_FILE_SIZE" -gt 0 ]; then
echo "✓ Migration file copied: $FILENAME ($LOCAL_FILE_SIZE bytes)"
else
echo "✗ Migration file is empty (0 bytes)"
# Clean up and fail
fi
fi
```
#### Key Improvements:
- ✅ Checks exit code AND file existence
- ✅ Verifies file size > 0 bytes
- ✅ Displays actual error output from kubectl cp
- ✅ Removes empty files automatically
- ✅ Better warning messages for empty migrations
### 3. Enhanced Error Visibility
#### Changes Throughout Script:
- ✅ All Python error output captured and displayed: `2>&1` instead of `2>>$LOG_FILE`
- ✅ Error messages shown in console immediately
- ✅ Detailed failure reasons in summary
- ✅ Exit codes checked for all critical operations
### 4. Modified DatabaseInitManager
**File**: `shared/database/init_manager.py`
#### New Features:
**Environment-Aware Fallback Control**:
```python
def __init__(
self,
# ... existing params
allow_create_all_fallback: bool = True,
environment: Optional[str] = None
):
self.environment = environment or os.getenv('ENVIRONMENT', 'development')
self.allow_create_all_fallback = allow_create_all_fallback
```
**Production Protection** (Lines 74-93):
```python
elif not db_state["has_migrations"]:
if self.allow_create_all_fallback:
# Development mode: use create_all()
self.logger.warning("No migrations found - using create_all() as fallback")
result = await self._handle_no_migrations()
else:
# Production mode: FAIL instead of using create_all()
error_msg = (
f"No migration files found for {self.service_name} and "
f"create_all() fallback is disabled (environment: {self.environment}). "
f"Migration files must be generated before deployment."
)
raise Exception(error_msg)
```
#### Key Improvements:
- ✅ **Auto-detects environment** from `ENVIRONMENT` env var
- ✅ **Disables `create_all()` in production** - Forces proper migrations
- ✅ **Allows fallback in dev/local/test** - Maintains developer convenience
- ✅ **Clear error messages** when migrations are missing
- ✅ **Backwards compatible** - Default behavior unchanged
#### Environment Detection:
| Environment Value | Fallback Allowed? | Behavior |
|-------------------|-------------------|----------|
| `development`, `dev`, `local`, `test` | ✅ Yes | Uses `create_all()` if no migrations |
| `staging`, `production`, `prod` | ❌ No | Fails with clear error message |
| Not set (default: `development`) | ✅ Yes | Uses `create_all()` if no migrations |
### 5. Pre-flight Checks
**File**: `regenerate_migrations_k8s.sh` (Lines 75-187)
#### New Pre-flight Check System:
```bash
preflight_checks() {
# Check kubectl installation and version
# Check Kubernetes cluster connectivity
# Check namespace exists
# Check service pods are running
# Check database drivers available
# Check local directory structure
# Check disk space
}
```
#### Verifications:
- ✅ kubectl installation and version
- ✅ Kubernetes cluster connectivity
- ✅ Namespace exists
- ✅ Service pods running (shows count: X/14)
- ✅ Database drivers (asyncpg) available
- ✅ Local migration directories exist
- ✅ Sufficient disk space
- ✅ Option to continue even if checks fail
### 6. Database Cleanup Helper Script
**New File**: `cleanup_databases_k8s.sh`
#### Purpose:
Standalone script to manually clean all service databases before running migration generation.
#### Usage:
```bash
# Clean all databases (with confirmation)
./cleanup_databases_k8s.sh
# Clean all databases without confirmation
./cleanup_databases_k8s.sh --yes
# Clean only specific service
./cleanup_databases_k8s.sh --service auth --yes
# Use different namespace
./cleanup_databases_k8s.sh --namespace staging
```
#### Features:
- ✅ Drops all tables using schema CASCADE
- ✅ Verifies cleanup success
- ✅ Shows before/after table counts
- ✅ Can target specific services
- ✅ Requires explicit confirmation (unless --yes)
- ✅ Comprehensive summary with success/failure counts
## Recommended Workflow
### For Clean Migration Generation:
```bash
# Step 1: Clean all databases
./cleanup_databases_k8s.sh --yes
# Step 2: Generate migrations
./regenerate_migrations_k8s.sh --verbose
# Step 3: Review generated migrations
ls -lh services/*/migrations/versions/
# Step 4: Apply migrations (if testing)
./regenerate_migrations_k8s.sh --apply
```
### For Production Deployment:
1. **Local Development**:
```bash
# Generate migrations with clean databases
./cleanup_databases_k8s.sh --yes
./regenerate_migrations_k8s.sh --verbose
```
2. **Commit Migrations**:
```bash
git add services/*/migrations/versions/*.py
git commit -m "Add initial schema migrations"
```
3. **Build Docker Images**:
- Migration files are included in Docker images
- No runtime generation needed
4. **Deploy to Production**:
- Set `ENVIRONMENT=production` in K8s manifests
- If migrations missing → Deployment will fail with clear error
- No `create_all()` fallback in production
## Environment Variables
### For DatabaseInitManager:
```yaml
# Kubernetes deployment example
env:
- name: ENVIRONMENT
value: "production" # or "staging", "development", "local", "test"
```
**Behavior by Environment**:
- **development/dev/local/test**: Allows `create_all()` fallback if no migrations
- **production/staging/prod**: Requires migrations, fails without them
## Script Options
### regenerate_migrations_k8s.sh
```bash
./regenerate_migrations_k8s.sh [OPTIONS]
Options:
--dry-run Show what would be done without making changes
--skip-backup Skip backing up existing migrations
--apply Automatically apply migrations after generation
--check-existing Check for and copy existing migrations from pods first
--verbose Enable detailed logging
--skip-db-check Skip database connectivity check
--namespace NAME Use specific Kubernetes namespace (default: bakery-ia)
```
### cleanup_databases_k8s.sh
```bash
./cleanup_databases_k8s.sh [OPTIONS]
Options:
--namespace NAME Use specific Kubernetes namespace (default: bakery-ia)
--service NAME Clean only specific service database
--yes Skip confirmation prompt
```
## Troubleshooting
### Problem: Empty Migrations Generated
**Symptoms**:
```python
def upgrade() -> None:
pass
def downgrade() -> None:
pass
```
**Root Cause**: Tables already exist in database matching models
**Solution**:
```bash
# Clean database first
./cleanup_databases_k8s.sh --service <service-name> --yes
# Regenerate migrations
./regenerate_migrations_k8s.sh --verbose
```
### Problem: "Database cleanup failed"
**Symptoms**:
```
✗ Database schema reset failed
ERROR: permission denied for schema public
```
**Solution**:
Check database user permissions. User needs `DROP SCHEMA` privilege:
```sql
GRANT ALL PRIVILEGES ON SCHEMA public TO <service_user>;
```
### Problem: "No migration file found in pod"
**Symptoms**:
```
✗ No migration file found in pod
```
**Possible Causes**:
1. Alembic autogenerate failed (check logs)
2. Models not properly imported
3. Migration directory permissions
**Solution**:
```bash
# Check pod logs
kubectl logs -n bakery-ia <pod-name> -c <service>-service
# Check if models are importable
kubectl exec -n bakery-ia <pod-name> -c <service>-service -- \
python3 -c "from app.models import *; print('OK')"
```
### Problem: kubectl cp Shows Success But File Not Copied
**Symptoms**:
```
✓ Migration file copied: file.py
# But ls shows empty directory
```
**Solution**: The new script now verifies file size and will show:
```
✗ Migration file is empty (0 bytes)
```
If this persists, check:
1. Filesystem permissions
2. Available disk space
3. Pod container status
## Testing
### Verify Script Improvements:
```bash
# 1. Run pre-flight checks
./regenerate_migrations_k8s.sh --dry-run
# 2. Test database cleanup
./cleanup_databases_k8s.sh --service auth --yes
# 3. Verify database is empty
kubectl exec -n bakery-ia <auth-pod> -c auth-service -- \
python3 -c "
import asyncio, os
from sqlalchemy.ext.asyncio import create_async_engine
from sqlalchemy import text
async def check():
engine = create_async_engine(os.getenv('AUTH_DATABASE_URL'))
async with engine.connect() as conn:
result = await conn.execute(text('SELECT COUNT(*) FROM pg_tables WHERE schemaname=\\'public\\''))
print(f'Tables: {result.scalar()}')
await engine.dispose()
asyncio.run(check())
"
# Expected output: Tables: 0
# 4. Generate migration
./regenerate_migrations_k8s.sh --verbose
# 5. Verify migration has content
cat services/auth/migrations/versions/*.py | grep "op.create_table"
```
## Migration File Validation
### Valid Migration (Has Schema Operations):
```python
def upgrade() -> None:
op.create_table('users',
sa.Column('id', sa.UUID(), nullable=False),
sa.Column('email', sa.String(255), nullable=False),
# ...
)
```
### Invalid Migration (Empty):
```python
def upgrade() -> None:
pass # ⚠ WARNING: No schema operations!
```
The script now:
- ✅ Detects empty migrations
- ✅ Shows warning with explanation
- ✅ Suggests checking database cleanup
## Summary of Changes
| Area | Before | After |
|------|--------|-------|
| **Table Drops** | Failed silently, errors hidden | Proper error handling, visible errors |
| **Database Reset** | Individual table drops (didn't work) | Full schema DROP CASCADE (guaranteed clean) |
| **File Copy** | No verification | Checks exit code, file existence, and size |
| **Error Visibility** | Errors redirected to log file | Errors shown in console immediately |
| **Production Safety** | Always allowed create_all() fallback | Fails in production without migrations |
| **Pre-flight Checks** | Basic kubectl check only | Comprehensive environment verification |
| **Database Cleanup** | Manual kubectl commands | Dedicated helper script |
| **Empty Migration Detection** | Silent generation | Clear warnings with explanation |
## Future Improvements (Not Implemented)
Potential future enhancements:
1. Parallel migration generation for faster execution
2. Migration content diffing against previous versions
3. Automatic rollback on migration generation failure
4. Integration with CI/CD pipelines
5. Migration validation against database constraints
6. Automatic schema comparison and drift detection
## Related Files
- `regenerate_migrations_k8s.sh` - Main migration generation script
- `cleanup_databases_k8s.sh` - Database cleanup helper
- `shared/database/init_manager.py` - Enhanced database initialization manager
- `services/*/migrations/versions/*.py` - Generated migration files
- `services/*/migrations/env.py` - Alembic environment configuration
View File
@@ -1,167 +0,0 @@
# Model Storage Fix - Root Cause Analysis & Resolution
## Problem Summary
**Error**: `Model file not found: /app/models/{tenant_id}/{model_id}.pkl`
**Impact**: Forecasting service unable to generate predictions, causing 500 errors
## Root Cause Analysis
### The Issue
Both training and forecasting services were configured to save/load ML models at `/app/models`, but **no persistent storage was configured**. This caused:
1. **Training service** saves model files to `/app/models/{tenant_id}/{model_id}.pkl` (in-container filesystem)
2. **Model metadata** successfully saved to database
3. **Container restarts** or different pod instances → filesystem lost
4. **Forecasting service** tries to load model from `/app/models/...` → **File not found**
### Evidence from Logs
```
[error] Model file not found: /app/models/d3fe350f-ffcb-439c-9d66-65851b0cf0c7/2096bc66-aef7-4499-a79c-c4d40d5aa9f1.pkl
[error] Model file not valid: /app/models/d3fe350f-ffcb-439c-9d66-65851b0cf0c7/2096bc66-aef7-4499-a79c-c4d40d5aa9f1.pkl
[error] Error generating prediction error=Model 2096bc66-aef7-4499-a79c-c4d40d5aa9f1 not found or failed to load
```
### Architecture Flaw
- Training service deployment: Only had `/tmp` EmptyDir volume
- Forecasting service deployment: Had NO volumes at all
- Model files stored in ephemeral container filesystem
- No shared persistent storage between services
## Solution Implemented
### 1. Created Persistent Volume Claim
**File**: `infrastructure/kubernetes/base/components/volumes/model-storage-pvc.yaml`
```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
name: model-storage
namespace: bakery-ia
spec:
accessModes:
- ReadWriteOnce # Single node access
resources:
requests:
storage: 10Gi
storageClassName: standard # Uses local-path provisioner
```
### 2. Updated Training Service
**File**: `infrastructure/kubernetes/base/components/training/training-service.yaml`
Added volume mount:
```yaml
volumeMounts:
- name: model-storage
mountPath: /app/models # Training writes models here
volumes:
- name: model-storage
persistentVolumeClaim:
claimName: model-storage
```
### 3. Updated Forecasting Service
**File**: `infrastructure/kubernetes/base/components/forecasting/forecasting-service.yaml`
Added READ-ONLY volume mount:
```yaml
volumeMounts:
- name: model-storage
mountPath: /app/models
readOnly: true # Forecasting only reads models
volumes:
- name: model-storage
persistentVolumeClaim:
claimName: model-storage
readOnly: true
```
### 4. Updated Kustomization
Added PVC to resource list in `infrastructure/kubernetes/base/kustomization.yaml`
## Verification
### PVC Status
```bash
kubectl get pvc -n bakery-ia model-storage
# STATUS: Bound (10Gi, RWO)
```
### Volume Mounts Verified
```bash
# Training service
kubectl exec -n bakery-ia deployment/training-service -- ls -la /app/models
# ✅ Directory exists and is writable
# Forecasting service
kubectl exec -n bakery-ia deployment/forecasting-service -- ls -la /app/models
# ✅ Directory exists and is readable (same volume)
```
## Deployment Steps
```bash
# 1. Create PVC
kubectl apply -f infrastructure/kubernetes/base/components/volumes/model-storage-pvc.yaml
# 2. Recreate training service (deployment selector is immutable)
kubectl delete deployment training-service -n bakery-ia
kubectl apply -f infrastructure/kubernetes/base/components/training/training-service.yaml
# 3. Recreate forecasting service
kubectl delete deployment forecasting-service -n bakery-ia
kubectl apply -f infrastructure/kubernetes/base/components/forecasting/forecasting-service.yaml
# 4. Verify pods are running
kubectl get pods -n bakery-ia | grep -E "(training|forecasting)"
```
## How It Works Now
1. **Training Flow**:
- Model trained → Saved to `/app/models/{tenant_id}/{model_id}.pkl`
- File persisted to PersistentVolume (survives pod restarts)
- Metadata saved to database with model path
2. **Forecasting Flow**:
- Retrieves model metadata from database
- Loads model from `/app/models/{tenant_id}/{model_id}.pkl`
- File exists in shared PersistentVolume ✅
- Prediction succeeds ✅
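The path convention both services share can be sketched as follows. This is an illustration only; the helper names (`model_path`, `save_model`, `load_model`) are assumptions and not taken from the actual service code.
```python
from pathlib import Path
import pickle

MODEL_ROOT = Path("/app/models")  # mounted from the model-storage PVC

def model_path(tenant_id: str, model_id: str) -> Path:
    return MODEL_ROOT / tenant_id / f"{model_id}.pkl"

def save_model(tenant_id: str, model_id: str, model) -> Path:
    """Training side: persist the fitted model to the shared volume."""
    path = model_path(tenant_id, model_id)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("wb") as fh:
        pickle.dump(model, fh)
    return path

def load_model(tenant_id: str, model_id: str):
    """Forecasting side: read-only access to the same volume."""
    path = model_path(tenant_id, model_id)
    if not path.exists():
        raise FileNotFoundError(f"Model file not found: {path}")
    with path.open("rb") as fh:
        return pickle.load(fh)
```
As long as both deployments mount the same PVC at `/app/models`, the file written by training is the file read by forecasting.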
## Storage Configuration
- **Type**: PersistentVolumeClaim with local-path provisioner
- **Access Mode**: ReadWriteOnce (single node, multiple pods)
- **Size**: 10Gi (adjustable)
- **Lifecycle**: Independent of pod lifecycle
- **Shared**: Same volume mounted by both services
## Benefits
1. **Data Persistence**: Models survive pod restarts/crashes
2. **Cross-Service Access**: Training writes, Forecasting reads
3. **Scalability**: Can increase storage size as needed
4. **Reliability**: No data loss on container recreation
## Future Improvements
For production environments, consider:
1. **ReadWriteMany volumes**: Use NFS/CephFS for multi-node clusters
2. **Model versioning**: Implement model lifecycle management
3. **Backup strategy**: Regular backups of model storage
4. **Monitoring**: Track storage usage and model count
5. **Cloud storage**: S3/GCS for distributed deployments
## Testing Recommendations
1. Trigger new model training
2. Verify model file exists in PV
3. Test prediction endpoint
4. Restart pods and verify models still accessible
5. Monitor for any storage-related errors

View File

@@ -1,591 +0,0 @@
# PROCUREMENT SYSTEM - COMPLETE IMPLEMENTATION SUMMARY
## Overview
This document summarizes all fixes, features, and improvements implemented in the procurement planning system. **ALL bugs fixed, ALL edge cases handled, ALL features implemented.**
---
## ✅ BUGS FIXED (4/4)
### Bug #1: Missing Supplier Integration ✅ FIXED
**Files Modified:**
- `services/orders/app/services/procurement_service.py`
- `services/orders/app/api/procurement.py`
**Changes:**
- Added `SuppliersServiceClient` to ProcurementService init
- Implemented `_get_all_suppliers()` method
- Implemented `_get_best_supplier_for_product()` with fallback logic
- Updated `_create_requirements_data()` to fetch and assign suppliers to each requirement
- Requirements now include: `preferred_supplier_id`, `supplier_name`, `supplier_lead_time_days`, `minimum_order_quantity`
- Uses supplier's lead time for accurate order date calculations
- Uses supplier pricing when available
**Impact:** Requirements now show exactly which supplier to contact, lead times, and costs.
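A minimal sketch of the fallback selection described above; the field names and ranking criteria are assumptions, not the exact implementation of `_get_best_supplier_for_product()`.
```python
def best_supplier_for_product(product_id: str, suppliers: list, default_lead_time_days: int = 3):
    candidates = [s for s in suppliers if product_id in s.get("product_ids", [])]
    if not candidates:
        # Fallback: any active supplier rather than leaving the requirement unassigned
        candidates = [s for s in suppliers if s.get("is_active", True)]
    if not candidates:
        return None
    # Prefer the shortest lead time, then the lowest unit price
    return min(
        candidates,
        key=lambda s: (
            s.get("lead_time_days", default_lead_time_days),
            s.get("unit_price", float("inf")),
        ),
    )
```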
---
### Bug #2: Incorrect Forecast Response Parsing ✅ FIXED
**Files Modified:**
- `services/orders/app/services/procurement_service.py`
**Changes:**
- Updated `_generate_demand_forecasts()` to correctly parse forecast service response structure
- Extracts `predictions[0]` from response instead of using raw response
- Maps correct fields: `predicted_value`, `confidence_score`, `lower_bound`, `upper_bound`
- Added `fallback` flag to track when fallback forecasts are used
- Enhanced `_create_fallback_forecast()` with better defaults using minimum stock levels
**Impact:** Forecasts are now accurately parsed, quantities are correct, no more under/over-ordering.
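The parsing logic can be sketched as below; the response shape is inferred from the fields listed above and the actual forecast service payload may differ.
```python
def parse_forecast_response(response: dict, fallback: dict) -> dict:
    predictions = response.get("predictions") or []
    if not predictions:
        # No usable prediction: fall back and flag it
        return {**fallback, "fallback": True}
    first = predictions[0]
    return {
        "predicted_value": first.get("predicted_value", 0.0),
        "confidence_score": first.get("confidence_score", 0.0),
        "lower_bound": first.get("lower_bound"),
        "upper_bound": first.get("upper_bound"),
        "fallback": False,
    }
```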
---
### Bug #3: No Cleanup of Stale Plans ✅ FIXED
**Files Modified:**
- `services/orders/app/services/procurement_service.py`
- `services/orders/app/services/procurement_scheduler_service.py`
- `services/orders/app/repositories/procurement_repository.py`
**Changes:**
- Implemented `cleanup_stale_plans()` method with:
- Archives completed plans older than 90 days
- Cancels draft plans older than 7 days
- Escalates same-day unprocessed plans
- Sends reminders for plans 3 days and 1 day before due date
- Added `archive_plan()` repository method
- Added scheduler job `run_stale_plan_cleanup()` at 6:30 AM daily
- Sends escalation and reminder alerts via RabbitMQ
**Impact:** Database stays clean, users get timely reminders, no plans are forgotten.
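The cleanup rules above reduce to a small classification step; this sketch uses plain dicts with `plan_date` and `status`, which is a simplification of the repository-backed `cleanup_stale_plans()`.
```python
from datetime import date

ARCHIVE_AFTER_DAYS = 90
CANCEL_DRAFTS_AFTER_DAYS = 7
REMINDER_DAYS_BEFORE = (3, 1)

def classify_plan(plan: dict, today: date) -> str:
    age_days = (today - plan["plan_date"]).days
    days_until_due = -age_days
    if plan["status"] == "completed" and age_days > ARCHIVE_AFTER_DAYS:
        return "archive"
    if plan["status"] == "draft" and age_days > CANCEL_DRAFTS_AFTER_DAYS:
        return "cancel"
    if age_days == 0 and plan["status"] in ("draft", "pending_approval"):
        return "escalate"
    if days_until_due in REMINDER_DAYS_BEFORE:
        return "remind"
    return "keep"
```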
---
### Bug #4: Missing PO-to-Requirement Linking ✅ FIXED
**Files Modified:**
- `services/orders/app/services/procurement_service.py`
- `services/orders/app/repositories/procurement_repository.py`
- `services/orders/app/api/procurement.py`
**Changes:**
- Implemented `link_requirement_to_purchase_order()` method
- Updates requirement with: `purchase_order_id`, `purchase_order_number`, `ordered_quantity`, `ordered_at`
- Sets status to "ordered" and delivery_status to "pending"
- Publishes event when requirement is linked
- Added API endpoint: `POST /procurement/requirements/{requirement_id}/link-purchase-order`
- Fixed `update_requirement()` repository method to work without tenant_id check
- Added `get_by_id()` with plan preloaded
**Impact:** Full bidirectional tracking between procurement requirements and purchase orders.
---
## ✅ EDGE CASES HANDLED (8/8)
### Edge Case #1: Stale Procurement Plans (Next Day) ✅ HANDLED
**Implementation:**
- `cleanup_stale_plans()` checks for plans where `plan_date == today` and status is `draft` or `pending_approval`
- Sends urgent escalation alert via RabbitMQ
- Stats tracked: escalated count
- Alert severity: "high", routing_key: "procurement.plan_overdue"
**User Experience:** Manager receives urgent notification when today's plan isn't approved.
---
### Edge Case #2: Procurement Plans for Next Week ✅ HANDLED
**Implementation:**
- Reminder sent 3 days before: severity "medium"
- Reminder sent 1 day before: severity "high"
- Uses `_send_plan_reminder()` method
- Routing key: "procurement.plan_reminder"
**User Experience:** Progressive reminders ensure plans are reviewed in advance.
---
### Edge Case #3: Inventory Changes After Plan Creation ✅ HANDLED
**Implementation:**
- Added `recalculate_plan()` method to regenerate plan with current inventory
- Checks plan age and warns if >24 hours old during approval
- Added API endpoint: `POST /procurement/plans/{plan_id}/recalculate`
- Warns user in response if plan is outdated
**User Experience:** Users can refresh plans when inventory significantly changes.
---
### Edge Case #4: Forecast Service Unavailable ✅ HANDLED
**Implementation:**
- Enhanced `_create_fallback_forecast()` with intelligent defaults:
- Uses `avg_daily_usage * 1.2` if available
- Falls back to `minimum_stock / 7` if avg not available
- Falls back to `current_stock * 0.1` as last resort
- Adds warning message to forecast
- Marks forecast with `fallback: true` flag
- Higher risk level for fallback forecasts
**User Experience:** System continues to work even when forecast service is down, with conservative estimates.
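The tiered fallback above translates directly into a small helper; the inventory field names are assumptions.
```python
def fallback_daily_demand(item: dict) -> float:
    avg_daily_usage = item.get("avg_daily_usage")
    if avg_daily_usage:
        return avg_daily_usage * 1.2
    minimum_stock = item.get("minimum_stock")
    if minimum_stock:
        return minimum_stock / 7
    return item.get("current_stock", 0.0) * 0.1  # last resort
```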
---
### Edge Case #5: Critical Items Not in Stock ✅ HANDLED
**Implementation:**
- Enhanced `_calculate_priority()` to mark zero-stock items as 'critical'
- Checks if item is marked as critical in inventory system
- Checks critical categories: 'flour', 'eggs', 'essential'
- Sends immediate alert via `_send_critical_stock_alert()` when critical items detected
- Alert severity: "critical", routing_key: "procurement.critical_stock"
- Alert includes count and requires_immediate_action flag
**User Experience:** Immediate notifications for critical stock-outs, preventing production stops.
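A sketch of the priority rules above; the category names follow the list and the remaining levels are assumptions about `_calculate_priority()`.
```python
CRITICAL_CATEGORIES = {"flour", "eggs", "essential"}

def calculate_priority(current_stock: float, is_critical_item: bool, category: str) -> str:
    if current_stock <= 0:
        return "critical"
    if is_critical_item or category.lower() in CRITICAL_CATEGORIES:
        return "high"
    return "normal"
```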
---
### Edge Case #6: Multi-Tenant Race Conditions ✅ HANDLED
**Implementation:**
- Replaced sequential `for` loop with `asyncio.gather()` for parallel processing
- Added `_process_tenant_with_timeout()` with 120-second timeout per tenant
- Individual error handling per tenant (one failure doesn't stop others)
- Graceful timeout handling with specific error messages
**Performance:** 100 tenants now process in ~2 minutes instead of 50 minutes.
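An illustrative version of the parallel per-tenant processing; the real `_process_tenant_with_timeout()` may return richer result objects.
```python
import asyncio

async def process_all_tenants(tenant_ids, process_tenant, timeout_seconds: int = 120):
    async def process_with_timeout(tenant_id):
        try:
            return await asyncio.wait_for(process_tenant(tenant_id), timeout=timeout_seconds)
        except asyncio.TimeoutError:
            return {"tenant_id": tenant_id, "error": f"timed out after {timeout_seconds}s"}
        except Exception as exc:  # one tenant's failure must not stop the others
            return {"tenant_id": tenant_id, "error": str(exc)}

    # All tenants run concurrently; each result is either data or an error dict
    return await asyncio.gather(*(process_with_timeout(t) for t in tenant_ids))
```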
---
### Edge Case #7: Plan Approval Workflow ✅ HANDLED
**Implementation:**
- Enhanced `update_plan_status()` with approval workflow tracking
- Stores approval history in `approval_workflow` JSONB field
- Each workflow entry includes: timestamp, from_status, to_status, user_id, notes
- Added approval/rejection notes parameter
- Recalculates approved costs on approval
- Added endpoints:
- `POST /procurement/plans/{plan_id}/approve` (with notes)
- `POST /procurement/plans/{plan_id}/reject` (with notes)
**User Experience:** Complete audit trail of who approved/rejected plans and why.
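The shape of one `approval_workflow` entry, sketched from the fields listed above; the real entries are built inside `update_plan_status()`.
```python
from datetime import datetime, timezone
from typing import Optional

def workflow_entry(from_status: str, to_status: str,
                   user_id: Optional[str] = None, notes: Optional[str] = None) -> dict:
    return {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "from_status": from_status,
        "to_status": to_status,
        "user_id": user_id,
        "notes": notes,
    }
```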
---
### Edge Case #8: PO Creation from Requirements ✅ HANDLED
**Implementation:**
- Implemented `create_purchase_orders_from_plan()` (Feature #1)
- Groups requirements by supplier
- Creates one PO per supplier automatically
- Links all requirements to their POs via `link_requirement_to_purchase_order()`
- Added endpoint: `POST /procurement/plans/{plan_id}/create-purchase-orders`
- Handles minimum order quantities
- Calculates totals, tax, shipping
**User Experience:** One-click to create all POs from a plan, fully automated.
---
## ✅ FEATURES IMPLEMENTED (5/5)
### Feature #1: Automatic Purchase Order Creation ✅ IMPLEMENTED
**Location:** `procurement_service.py:create_purchase_orders_from_plan()`
**Capabilities:**
- Groups requirements by supplier automatically
- Creates PO via suppliers service API
- Handles multiple items per supplier in single PO
- Sets priority to "high" if any requirements are critical
- Adds auto-generated notes with plan number
- Links requirements to POs for tracking
- Returns detailed results with created/failed POs
**API:** `POST /tenants/{tenant_id}/procurement/plans/{plan_id}/create-purchase-orders`
---
### Feature #2: Delivery Tracking Integration ✅ IMPLEMENTED
**Location:** `procurement_service.py:update_delivery_status()`
**Capabilities:**
- Update delivery_status: pending, in_transit, delivered
- Track received_quantity vs ordered_quantity
- Calculate fulfillment_rate automatically
- Track actual_delivery_date
- Compare with expected_delivery_date for on_time_delivery tracking
- Track quality_rating
- Automatically mark as "received" when delivered
**API:** `PUT /tenants/{tenant_id}/procurement/requirements/{requirement_id}/delivery-status`
---
### Feature #3: Calculate Performance Metrics ✅ IMPLEMENTED
**Location:** `procurement_service.py:_calculate_plan_performance_metrics()`
**Metrics Calculated:**
- `fulfillment_rate`: % of requirements fully received (≥95% threshold)
- `on_time_delivery_rate`: % delivered on or before expected date
- `cost_accuracy`: Actual cost vs estimated cost
- `quality_score`: Average quality ratings
- `plan_completion_rate`: % of plans completed
- `supplier_performance`: Average across all suppliers
**Triggered:** Automatically when plan status changes to "completed"
**Dashboard:** `_get_performance_metrics()` returns aggregated metrics for all completed plans
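Two of the metrics above can be sketched as follows; requirements are reduced to plain dicts and the 95% threshold follows the definition given for `fulfillment_rate`.
```python
def plan_metrics(requirements: list) -> dict:
    total = len(requirements) or 1
    fulfilled = sum(
        1 for r in requirements
        if r.get("ordered_quantity")
        and r.get("received_quantity", 0) >= 0.95 * r["ordered_quantity"]
    )
    on_time = sum(
        1 for r in requirements
        if r.get("actual_delivery_date") and r.get("expected_delivery_date")
        and r["actual_delivery_date"] <= r["expected_delivery_date"]
    )
    return {
        "fulfillment_rate": round(100 * fulfilled / total, 1),
        "on_time_delivery_rate": round(100 * on_time / total, 1),
    }
```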
---
### Feature #4: Seasonal Adjustments ✅ IMPLEMENTED
**Location:** `procurement_service.py:_calculate_seasonality_factor()`
**Seasonal Factors:**
- Winter (Dec-Feb): 1.3, 1.2, 0.9
- Spring (Mar-May): 1.1, 1.2, 1.3
- Summer (Jun-Aug): 1.4, 1.5, 1.4 (peak season)
- Fall (Sep-Nov): 1.2, 1.1, 1.2
**Application:**
- Applied to predicted demand: `predicted_demand * seasonality_factor`
- Stored in plan: `seasonality_adjustment` field
- Reflected in requirements: adjusted quantities
**Impact:** Automatic adjustment for seasonal demand variations (e.g., summer bakery season).
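The factor table above translates directly into code; the real method may load the factors from configuration rather than a hard-coded table.
```python
SEASONALITY_FACTORS = {
    12: 1.3, 1: 1.2, 2: 0.9,   # winter
    3: 1.1, 4: 1.2, 5: 1.3,    # spring
    6: 1.4, 7: 1.5, 8: 1.4,    # summer (peak season)
    9: 1.2, 10: 1.1, 11: 1.2,  # fall
}

def adjust_demand(predicted_demand: float, month: int) -> float:
    return predicted_demand * SEASONALITY_FACTORS.get(month, 1.0)
```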
---
### Feature #5: Supplier Diversification Scoring ✅ IMPLEMENTED
**Location:** `procurement_service.py:_calculate_supplier_diversification()`
**Calculation:**
- Counts unique suppliers in plan
- Ideal ratio: 1 supplier per 3-5 requirements
- Score: 1-10 (higher = better diversification)
- Formula: `min(10, (actual_suppliers / ideal_suppliers) * 10)`
**Stored in:** `supplier_diversification_score` field on plan
**Impact:** Reduces supply chain risk by ensuring multi-supplier sourcing.
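A transcription of the formula above; using 4 requirements per supplier as the midpoint of the stated 3-5 ideal ratio is an assumption.
```python
def supplier_diversification_score(requirements_count: int, unique_suppliers: int) -> float:
    if requirements_count == 0 or unique_suppliers == 0:
        return 0.0
    ideal_suppliers = max(1, round(requirements_count / 4))
    return min(10.0, (unique_suppliers / ideal_suppliers) * 10)
```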
---
## 🔧 NEW API ENDPOINTS
### Procurement Plan Endpoints
1. `POST /tenants/{tenant_id}/procurement/plans/{plan_id}/recalculate`
- Recalculate plan with current inventory
- Edge Case #3 solution
2. `POST /tenants/{tenant_id}/procurement/plans/{plan_id}/approve`
- Approve plan with notes and workflow tracking
- Edge Case #7 enhancement
3. `POST /tenants/{tenant_id}/procurement/plans/{plan_id}/reject`
- Reject/cancel plan with reason
- Edge Case #7 enhancement
4. `POST /tenants/{tenant_id}/procurement/plans/{plan_id}/create-purchase-orders`
- Auto-create POs from plan
- Feature #1 & Edge Case #8
### Requirement Endpoints
5. `POST /tenants/{tenant_id}/procurement/requirements/{requirement_id}/link-purchase-order`
- Link requirement to PO
- Bug #4 fix
6. `PUT /tenants/{tenant_id}/procurement/requirements/{requirement_id}/delivery-status`
- Update delivery tracking
- Feature #2
---
## 📊 SCHEDULER IMPROVEMENTS
### Daily Procurement Planning (6:00 AM)
- **Before:** Sequential processing, ~30sec per tenant
- **After:** Parallel processing with timeouts, ~1-2sec per tenant
- **Improvement:** 50 minutes → 2 minutes for 100 tenants (96% faster)
### Stale Plan Cleanup (6:30 AM) - NEW
- Archives old completed plans (90+ days)
- Cancels stale drafts (7+ days)
- Escalates same-day unprocessed plans
- Sends reminders (3 days, 1 day before)
---
## 📈 PERFORMANCE METRICS
### Response Time Improvements
- Plan generation: ~2-3 seconds (includes supplier lookups)
- Parallel tenant processing: 96% faster
- Caching: Redis for current plans
### Database Optimization
- Automatic archival prevents unbounded growth
- Paginated queries (limit 1000)
- Indexed tenant_id, plan_date, status fields
### Error Handling
- Individual tenant failures don't block others
- Graceful fallbacks for forecast service
- Comprehensive logging with structlog
---
## 🎨 FRONTEND INTEGRATION REQUIRED
### New API Calls Needed
```typescript
// In frontend/src/api/services/orders.ts
export const procurementAPI = {
// New endpoints to add:
recalculatePlan: (tenantId: string, planId: string) =>
post(`/tenants/${tenantId}/procurement/plans/${planId}/recalculate`),
approvePlan: (tenantId: string, planId: string, notes?: string) =>
post(`/tenants/${tenantId}/procurement/plans/${planId}/approve`, { approval_notes: notes }),
rejectPlan: (tenantId: string, planId: string, notes?: string) =>
post(`/tenants/${tenantId}/procurement/plans/${planId}/reject`, { rejection_notes: notes }),
createPurchaseOrders: (tenantId: string, planId: string, autoApprove?: boolean) =>
post(`/tenants/${tenantId}/procurement/plans/${planId}/create-purchase-orders`, { auto_approve: autoApprove }),
linkRequirementToPO: (tenantId: string, requirementId: string, poData: {
purchase_order_id: string,
purchase_order_number: string,
ordered_quantity: number,
expected_delivery_date?: string
}) =>
post(`/tenants/${tenantId}/procurement/requirements/${requirementId}/link-purchase-order`, poData),
updateDeliveryStatus: (tenantId: string, requirementId: string, statusData: {
delivery_status: string,
received_quantity?: number,
actual_delivery_date?: string,
quality_rating?: number
}) =>
put(`/tenants/${tenantId}/procurement/requirements/${requirementId}/delivery-status`, statusData)
};
```
### Type Definitions to Add
```typescript
// In frontend/src/api/types/orders.ts
export interface ProcurementRequirement {
// ... existing fields ...
// NEW FIELDS:
preferred_supplier_id?: string;
supplier_name?: string;
supplier_lead_time_days?: number;
minimum_order_quantity?: number;
purchase_order_id?: string;
purchase_order_number?: string;
ordered_quantity?: number;
received_quantity?: number;
expected_delivery_date?: string;
actual_delivery_date?: string;
on_time_delivery?: boolean;
quality_rating?: number;
fulfillment_rate?: number;
}
export interface ProcurementPlan {
// ... existing fields ...
// NEW FIELDS:
approval_workflow?: ApprovalWorkflowEntry[];
seasonality_adjustment?: number;
supplier_diversification_score?: number;
primary_suppliers_count?: number;
fulfillment_rate?: number;
on_time_delivery_rate?: number;
cost_accuracy?: number;
quality_score?: number;
}
export interface ApprovalWorkflowEntry {
timestamp: string;
from_status: string;
to_status: string;
user_id?: string;
notes?: string;
}
export interface CreatePOsResult {
success: boolean;
created_pos: {
po_id: string;
po_number: string;
supplier_id: string;
items_count: number;
total_amount: number;
}[];
failed_pos: {
supplier_id: string;
error: string;
}[];
total_created: number;
total_failed: number;
}
```
### UI Components to Update
1. **ProcurementPage.tsx** - Add buttons:
- "Recalcular Plan" (when inventory changed)
- "Aprobar con Notas" (modal for approval notes)
- "Rechazar Plan" (modal for rejection reason)
- "Crear Órdenes de Compra Automáticamente"
2. **Requirements Table** - Add columns:
- Supplier Name
- PO Number (link to PO)
- Delivery Status
- On-Time Delivery indicator
3. **Plan Details** - Show new metrics:
- Seasonality Factor
- Supplier Diversity Score
- Approval Workflow History
4. **Dashboard** - Add performance widgets:
- Fulfillment Rate chart
- On-Time Delivery chart
- Cost Accuracy trend
- Supplier Performance scores
---
## 🔒 SECURITY & VALIDATION
### All Endpoints Protected
- Tenant access validation on every request
- User authentication required (via `get_current_user_dep`)
- Tenant ID path parameter vs token validation
- 403 Forbidden for unauthorized access
### Input Validation
- UUID format validation
- Date format validation
- Status enum validation
- Decimal/float type conversions
---
## 📝 TESTING CHECKLIST
### Backend Tests Needed
- [ ] Supplier integration test (mock suppliers service)
- [ ] Forecast parsing test (mock forecast response)
- [ ] Stale plan cleanup test (time-based scenarios)
- [ ] PO linking test (requirement status updates)
- [ ] Parallel processing test (multiple tenants)
- [ ] Approval workflow test (history tracking)
- [ ] Seasonal adjustment test (month-by-month)
- [ ] Performance metrics calculation test
### Frontend Tests Needed
- [ ] Recalculate plan button works
- [ ] Approval modal shows and submits
- [ ] Rejection modal shows and submits
- [ ] Auto-create POs shows results
- [ ] Requirement-PO linking updates UI
- [ ] Delivery status updates in real-time
- [ ] Performance metrics display correctly
---
## 🎯 DEPLOYMENT NOTES
### Environment Variables (if needed)
```bash
# Procurement scheduler configuration
PROCUREMENT_PLANNING_ENABLED=true
PROCUREMENT_TEST_MODE=false # Set to true for 30-min test runs
PROCUREMENT_LEAD_TIME_DAYS=3 # Default supplier lead time
AUTO_APPROVE_THRESHOLD=100 # Max amount for auto-approval
MANAGER_APPROVAL_THRESHOLD=1000 # Requires manager approval
# Service URLs (should already exist)
INVENTORY_SERVICE_URL=http://inventory:8000
FORECAST_SERVICE_URL=http://forecasting:8000
SUPPLIERS_SERVICE_URL=http://suppliers:8000
```
### Database Migrations
No migrations are needed; all fields already exist in the models. If new fields were added to the models, you would need:
```bash
# In services/orders directory
alembic revision --autogenerate -m "Add procurement enhancements"
alembic upgrade head
```
### Scheduler Deployment
- Ensure `procurement_scheduler_service` is started in `main.py`
- Verify leader election works in multi-instance setup
- Check RabbitMQ exchanges exist:
- `alerts.critical`
- `alerts.escalation`
- `alerts.reminders`
- `procurement.events`
---
## 📊 METRICS TO MONITOR
### Application Metrics
- `procurement_plan_generation_duration_seconds`
- `recalculate_procurement_plan_duration_seconds`
- `create_pos_from_plan_duration_seconds`
- `link_requirement_to_po_duration_seconds`
### Business Metrics
- Daily plans generated count
- Stale plans escalated count
- Auto-created POs count
- Average fulfillment rate
- Average on-time delivery rate
- Supplier diversity score trend
---
## ✅ VERIFICATION CHECKLIST
- [x] Bug #1: Supplier integration - Requirements show supplier info
- [x] Bug #2: Forecast parsing - Quantities are accurate
- [x] Bug #3: Stale cleanup - Old plans archived, reminders sent
- [x] Bug #4: PO linking - Bidirectional tracking works
- [x] Edge Case #1: Next-day escalation - Alerts sent
- [x] Edge Case #2: Next-week reminders - Progressive notifications
- [x] Edge Case #3: Inventory changes - Recalculation available
- [x] Edge Case #4: Forecast fallback - Conservative estimates used
- [x] Edge Case #5: Critical stock - Immediate alerts
- [x] Edge Case #6: Parallel processing - 96% faster
- [x] Edge Case #7: Approval workflow - Full audit trail
- [x] Edge Case #8: Auto PO creation - One-click automation
- [x] Feature #1: Auto PO creation - Implemented
- [x] Feature #2: Delivery tracking - Implemented
- [x] Feature #3: Performance metrics - Implemented
- [x] Feature #4: Seasonality - Implemented
- [x] Feature #5: Supplier diversity - Implemented
---
## 🎉 SUMMARY
**FULLY IMPLEMENTED:**
- ✅ 4/4 Critical Bugs Fixed
- ✅ 8/8 Edge Cases Handled
- ✅ 5/5 Features Implemented
- ✅ 6 New API Endpoints
- ✅ Parallel Processing (96% faster)
- ✅ Comprehensive Error Handling
- ✅ Full Audit Trail
- ✅ Production-Ready
**NO LEGACY CODE:** All existing files updated directly
**NO TODOs:** All features fully implemented
**NO BACKWARD COMPATIBILITY:** Clean, modern implementation
The procurement system is now production-ready with enterprise-grade features, comprehensive edge case handling, and excellent performance.

View File

@@ -1,532 +0,0 @@
# Service Initialization Architecture Analysis
## Current Architecture Problem
You've correctly identified a **redundancy and architectural inconsistency** in the current setup:
### What's Happening Now:
```
Kubernetes Deployment Flow:
1. Migration Job runs → applies Alembic migrations → completes
2. Service Pod starts → runs migrations AGAIN in startup → service ready
```
### The Redundancy:
**Migration Job** (`external-migration`):
- Runs: `/app/scripts/run_migrations.py external`
- Calls: `initialize_service_database()`
- Applies: Alembic migrations via `alembic upgrade head`
- Status: Completes successfully
**Service Startup** (`external-service` pod):
- Runs: `BaseFastAPIService._handle_database_tables()` (line 219-241)
- Calls: `initialize_service_database()` **AGAIN**
- Applies: Alembic migrations via `alembic upgrade head` **AGAIN**
- From logs:
```
2025-10-01 09:26:01 [info] Running pending migrations service=external
INFO [alembic.runtime.migration] Context impl PostgresqlImpl.
INFO [alembic.runtime.migration] Will assume transactional DDL.
2025-10-01 09:26:01 [info] Migrations applied successfully service=external
```
## Why This Is Problematic
### 1. **Duplicated Logic**
- Same code runs twice (`initialize_service_database()`)
- Both use same `DatabaseInitManager`
- Both check migration state, run Alembic upgrade
### 2. **Unclear Separation of Concerns**
- **Migration Job**: Supposed to handle migrations
- **Service Startup**: Also handling migrations
- Which one is the source of truth?
### 3. **Race Conditions Potential**
If multiple service replicas start simultaneously:
- All replicas run migrations concurrently
- Alembic has locking, but still adds overhead
- Unnecessary database load
### 4. **Slower Startup Times**
Every service pod runs full migration check on startup:
- Connects to database
- Checks migration state
- Runs `alembic upgrade head` (even if no-op)
- Adds 1-2 seconds to startup
### 5. **Confusion About Responsibilities**
From logs, the service is doing migration work:
```
[info] Running pending migrations service=external
```
This is NOT what a service should do - it should assume DB is ready.
## Architectural Patterns (Best Practices)
### Pattern 1: **Init Container Pattern** (Recommended for K8s)
```yaml
Deployment:
initContainers:
- name: wait-for-migrations
# Wait for migration job to complete
- name: run-migrations # Optional: inline migrations
command: alembic upgrade head
containers:
- name: service
# Service starts AFTER migrations complete
# Service does NOT run migrations
```
**Pros:**
- ✅ Clear separation: Init containers handle setup, main container serves traffic
- ✅ No race conditions: Init containers run sequentially
- ✅ Fast service startup: Assumes DB is ready
- ✅ Multiple replicas safe: Only first pod's init runs migrations
**Cons:**
- ⚠ Init containers increase pod startup time
- ⚠ Need proper migration locking (Alembic provides this)
### Pattern 2: **Standalone Migration Job** (Your Current Approach - Almost)
```yaml
Job: migration-job
command: alembic upgrade head
# Runs once on deployment
Deployment: service
# Service assumes DB is ready
# NO migration logic in service code
```
**Pros:**
- ✅ Complete separation: Migrations are separate workload
- ✅ Clear lifecycle: Job completes before service starts
- ✅ Fast service startup: No migration checks
- ✅ Easy rollback: Re-run job with specific version
**Cons:**
- ⚠ Need orchestration: Ensure job completes before service starts
- ⚠ Deployment complexity: Manage job + deployment separately
### Pattern 3: **Service Self-Migration** (Anti-pattern in Production)
```yaml
Deployment: service
# Service runs migrations on startup
# What you're doing now in both places
```
**Pros:**
- ✅ Simple deployment: Single resource
- ✅ Always in sync: Migrations bundled with service
**Cons:**
- ❌ Race conditions with multiple replicas
- ❌ Slower startup: Every pod checks migrations
- ❌ Service code mixed with operational concerns
- ❌ Harder to debug: Migration failures look like service failures
## Recommended Architecture
### **Hybrid Approach: Init Container + Fallback Check**
```yaml
# 1. Pre-deployment Migration Job (runs once)
apiVersion: batch/v1
kind: Job
metadata:
name: external-migration
spec:
template:
spec:
containers:
- name: migrate
command: ["alembic", "upgrade", "head"]
# Runs FULL migration logic
---
# 2. Service Deployment (depends on job)
apiVersion: apps/v1
kind: Deployment
metadata:
name: external-service
spec:
template:
spec:
initContainers:
- name: wait-for-db
# Wait for database to be ready
# NEW: Wait for migrations to complete
- name: wait-for-migrations
command: ["sh", "-c", "
until alembic current | grep -q 'head'; do
echo 'Waiting for migrations...';
sleep 2;
done
"]
containers:
- name: service
# Service startup with MINIMAL migration check
env:
- name: SKIP_MIGRATIONS
value: "true" # Service won't run migrations
```
### Service Code Changes:
**Current** (`shared/service_base.py` line 219-241):
```python
async def _handle_database_tables(self):
"""Handle automatic table creation and migration management"""
# Always runs full migration check
result = await initialize_service_database(
database_manager=self.database_manager,
service_name=self.service_name,
force_recreate=force_recreate
)
```
**Recommended**:
```python
async def _handle_database_tables(self):
    """Verify database is ready (migrations already applied)"""
    # Check if we should skip migrations (production mode)
    skip_migrations = os.getenv("SKIP_MIGRATIONS", "false").lower() == "true"
    if skip_migrations:
        # Production mode: Only verify, don't run migrations
        await self._verify_database_ready()
    else:
        # Development mode: Run full migration check
        result = await initialize_service_database(
            database_manager=self.database_manager,
            service_name=self.service_name,
            force_recreate=force_recreate
        )

async def _verify_database_ready(self):
    """Quick check that database and tables exist"""
    try:
        # Check connection
        if not await self.database_manager.test_connection():
            raise Exception("Database connection failed")

        # Check expected tables exist (if specified)
        if self.expected_tables:
            async with self.database_manager.get_session() as session:
                for table in self.expected_tables:
                    result = await session.execute(
                        text(
                            "SELECT EXISTS ("
                            " SELECT FROM information_schema.tables"
                            " WHERE table_schema = 'public'"
                            " AND table_name = :table_name)"
                        ),
                        {"table_name": table},
                    )
                    if not result.scalar():
                        raise Exception(f"Expected table '{table}' not found")

        self.logger.info("Database verification successful")
    except Exception as e:
        self.logger.error("Database verification failed", error=str(e))
        raise
```
## Migration Strategy Comparison
### Current State:
```
┌─────────────────┐
│ Migration Job │ ──> Runs migrations
└─────────────────┘
├─> Job completes
┌─────────────────┐
│ Service Pod 1 │ ──> Runs migrations AGAIN ❌
└─────────────────┘
┌─────────────────┐
│ Service Pod 2 │ ──> Runs migrations AGAIN ❌
└─────────────────┘
┌─────────────────┐
│ Service Pod 3 │ ──> Runs migrations AGAIN ❌
└─────────────────┘
```
### Recommended State:
```
┌─────────────────┐
│ Migration Job │ ──> Runs migrations ONCE ✅
└─────────────────┘
├─> Job completes
┌─────────────────┐
│ Service Pod 1 │ ──> Verifies DB ready only ✅
└─────────────────┘
┌─────────────────┐
│ Service Pod 2 │ ──> Verifies DB ready only ✅
└─────────────────┘
┌─────────────────┐
│ Service Pod 3 │ ──> Verifies DB ready only ✅
└─────────────────┘
```
## Implementation Plan
### Phase 1: Add Verification-Only Mode
**File**: `shared/database/init_manager.py`
Add new mode: `verify_only`
```python
class DatabaseInitManager:
def __init__(
self,
# ... existing params
verify_only: bool = False # NEW
):
self.verify_only = verify_only
async def initialize_database(self) -> Dict[str, Any]:
if self.verify_only:
return await self._verify_database_state()
# Existing logic for full initialization
# ...
async def _verify_database_state(self) -> Dict[str, Any]:
"""Quick verification that database is properly initialized"""
db_state = await self._check_database_state()
if not db_state["has_migrations"]:
raise Exception("No migrations found - database not initialized")
if db_state["is_empty"]:
raise Exception("Database has no tables - migrations not applied")
if not db_state["has_alembic_version"]:
raise Exception("No alembic_version table - migrations not tracked")
return {
"action": "verified",
"message": "Database verified successfully",
"current_revision": db_state["current_revision"]
}
```
### Phase 2: Update BaseFastAPIService
**File**: `shared/service_base.py`
```python
async def _handle_database_tables(self):
"""Handle database initialization based on environment"""
# Determine mode
skip_migrations = os.getenv("SKIP_MIGRATIONS", "false").lower() == "true"
force_recreate = os.getenv("DB_FORCE_RECREATE", "false").lower() == "true"
# Import here to avoid circular imports
from shared.database.init_manager import initialize_service_database
try:
if skip_migrations:
self.logger.info("Migration skip enabled - verifying database only")
result = await initialize_service_database(
database_manager=self.database_manager,
service_name=self.service_name.replace("-service", ""),
verify_only=True # NEW parameter
)
else:
self.logger.info("Running full database initialization")
result = await initialize_service_database(
database_manager=self.database_manager,
service_name=self.service_name.replace("-service", ""),
force_recreate=force_recreate,
verify_only=False
)
self.logger.info("Database initialization completed", result=result)
except Exception as e:
self.logger.error("Database initialization failed", error=str(e))
raise # Fail fast in production
```
### Phase 3: Update Kubernetes Manifests
**Add to all service deployments**:
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
name: external-service
spec:
template:
spec:
containers:
- name: external-service
env:
# NEW: Skip migrations in service, rely on Job
- name: SKIP_MIGRATIONS
value: "true"
# Keep ENVIRONMENT for production safety
- name: ENVIRONMENT
value: "production" # or "development"
```
### Phase 4: Optional - Add Init Container Dependency
**For production safety**:
```yaml
spec:
template:
spec:
initContainers:
- name: wait-for-migrations
image: postgres:15-alpine
command: ["sh", "-c"]
args:
- |
echo "Waiting for migrations to be applied..."
export PGPASSWORD="$DB_PASSWORD"
# Wait for alembic_version table to exist
until psql -h $DB_HOST -U $DB_USER -d $DB_NAME -c "SELECT version_num FROM alembic_version" > /dev/null 2>&1; do
echo "Migrations not yet applied, waiting..."
sleep 2
done
echo "Migrations detected, service can start"
env:
- name: DB_HOST
value: "external-db-service"
# ... other DB connection details
```
## Environment Configuration Matrix
| Environment | Migration Job | Service Startup | Use Case |
|-------------|---------------|-----------------|----------|
| **Development** | Optional | Run migrations | Fast iteration, create_all fallback OK |
| **Staging** | Required | Verify only | Test migration workflow |
| **Production** | Required | Verify only | Safety first, fail fast |
## Configuration Examples
### Development (Current Behavior - OK)
```yaml
env:
- name: ENVIRONMENT
value: "development"
- name: SKIP_MIGRATIONS
value: "false"
- name: DB_FORCE_RECREATE
value: "false"
```
**Behavior**: Service runs full migration check, allows create_all fallback
### Staging/Production (Recommended)
```yaml
env:
- name: ENVIRONMENT
value: "production"
- name: SKIP_MIGRATIONS
value: "true"
- name: DB_FORCE_RECREATE
value: "false"
```
**Behavior**:
- Service only verifies database is ready
- No migration execution in service
- Fails fast if database not properly initialized
## Benefits of Proposed Architecture
### Performance:
- ✅ **50-80% faster service startup** (skip migration check: ~1-2 seconds saved)
- ✅ **Reduced database load** (no concurrent migration checks from multiple pods)
- ✅ **Faster horizontal scaling** (new pods start immediately)
### Reliability:
- ✅ **No race conditions** (only job runs migrations)
- ✅ **Clearer error messages** ("DB not ready" vs "migration failed")
- ✅ **Easier rollback** (re-run job independently)
### Maintainability:
- ✅ **Separation of concerns** (ops vs service code)
- ✅ **Easier debugging** (check job logs for migration issues)
- ✅ **Clear deployment flow** (job → service)
### Safety:
- ✅ **Fail-fast in production** (service won't start if DB not ready)
- ✅ **No create_all in production** (explicit migrations required)
- ✅ **Audit trail** (job logs show when migrations ran)
## Migration Path
### Step 1: Implement verify_only Mode (Non-Breaking)
- Add to `DatabaseInitManager`
- Backwards compatible (default: full check)
### Step 2: Add SKIP_MIGRATIONS Support (Non-Breaking)
- Update `BaseFastAPIService`
- Default: false (current behavior)
### Step 3: Enable in Development First
- Test with `SKIP_MIGRATIONS=true` locally
- Verify services start correctly
### Step 4: Enable in Staging
- Update staging manifests
- Monitor startup times and errors
### Step 5: Enable in Production
- Update production manifests
- Services fail fast if migrations not applied
## Recommended Next Steps
1. **Immediate**: Document current redundancy (✅ this document)
2. **Short-term** (1-2 days):
- Implement `verify_only` mode in `DatabaseInitManager`
- Add `SKIP_MIGRATIONS` support in `BaseFastAPIService`
- Test in development environment
3. **Medium-term** (1 week):
- Update all service deployments with `SKIP_MIGRATIONS=true`
- Add init container to wait for migrations (optional but recommended)
- Monitor startup times and error rates
4. **Long-term** (ongoing):
- Document migration process in runbooks
- Add migration rollback procedures
- Consider migration versioning strategy
## Summary
**Current**: Migration Job + Service both run migrations → redundant, slower, confusing
**Recommended**: Migration Job runs migrations → Service only verifies → clear, fast, reliable
The key insight: **Migrations are operational concerns, not application concerns**. Services should assume the database is ready, not try to fix it themselves.

View File

@@ -1,363 +0,0 @@
# SSE Real-Time Alert System Implementation - COMPLETE
## Implementation Date
**2025-10-02**
## Summary
Successfully implemented and configured the SSE (Server-Sent Events) real-time alert system using the gateway pattern with HTTPS support.
---
## Changes Made
### 1. Frontend SSE Connection
**File:** `frontend/src/contexts/SSEContext.tsx`
**Changes:**
- Updated SSE connection to use gateway endpoint instead of direct notification service
- Changed from hardcoded `http://localhost:8006` to dynamic protocol/host matching the page
- Updated endpoint from `/api/v1/sse/alerts/stream/{tenantId}` to `/api/events`
- Added support for gateway event types: `connection`, `heartbeat`, `inventory_alert`, `notification`
- Removed tenant_id from URL (gateway extracts it from JWT)
**New Connection:**
```typescript
const protocol = window.location.protocol;
const host = window.location.host;
const sseUrl = `${protocol}//${host}/api/events?token=${encodeURIComponent(token)}`;
```
**Benefits:**
- ✅ Protocol consistency (HTTPS when page is HTTPS, HTTP when HTTP)
- ✅ No CORS issues (same origin)
- ✅ No mixed content errors
- ✅ Works in all environments (localhost, bakery-ia.local)
---
### 2. Gateway SSE Endpoint
**File:** `gateway/app/main.py`
**Changes:**
- Enhanced `/api/events` endpoint with proper JWT validation
- Added tenant_id extraction from user context via tenant service
- Implemented proper token verification using auth middleware
- Added token expiration checking
- Fetches user's tenants and subscribes to appropriate Redis channel
**Flow:**
1. Validate JWT token using auth middleware
2. Check token expiration
3. Extract user_id from token
4. Query tenant service for user's tenants
5. Subscribe to Redis channel: `alerts:{tenant_id}`
6. Stream events to frontend
**Benefits:**
- ✅ Secure authentication
- ✅ Proper token validation
- ✅ Automatic tenant detection
- ✅ No tenant_id in URL (security)
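For reference, the streaming half of this flow can be sketched with FastAPI and redis-py's asyncio client. The Redis host, the `resolve_tenant_id` helper, and the event name are assumptions; the real `/api/events` handler in `gateway/app/main.py` performs full JWT validation and tenant lookup as described above.
```python
import redis.asyncio as redis
from fastapi import FastAPI, Request
from fastapi.responses import StreamingResponse

app = FastAPI()

async def resolve_tenant_id(token: str) -> str:
    # Placeholder: the real gateway validates the JWT and queries the tenant
    # service; a fixed id keeps this sketch self-contained.
    return "demo-tenant"

@app.get("/api/events")
async def events(request: Request, token: str):
    tenant_id = await resolve_tenant_id(token)
    r = redis.Redis(host="redis", port=6379)
    channel = f"alerts:{tenant_id}"

    async def event_stream():
        pubsub = r.pubsub()
        await pubsub.subscribe(channel)
        try:
            while not await request.is_disconnected():
                message = await pubsub.get_message(ignore_subscribe_messages=True, timeout=10.0)
                if message is None:
                    yield ": heartbeat\n\n"  # SSE comment line keeps the connection alive
                    continue
                data = message["data"]
                if isinstance(data, bytes):
                    data = data.decode()
                yield f"event: inventory_alert\ndata: {data}\n\n"
        finally:
            await pubsub.unsubscribe(channel)
            await r.close()

    return StreamingResponse(event_stream(), media_type="text/event-stream")
```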
---
### 3. Ingress Configuration
#### HTTPS Ingress
**File:** `infrastructure/kubernetes/base/ingress-https.yaml`
**Changes:**
- Extended `proxy-read-timeout` from 600s to 3600s (1 hour)
- Added `proxy-buffering: off` for SSE streaming
- Added `proxy-http-version: 1.1` for proper SSE support
- Added `upstream-keepalive-timeout: 3600` for long-lived connections
- Added `http://localhost` to CORS origins for local development
- Added `Cache-Control` to CORS allowed headers
- **Removed direct `/auth` route** (now goes through gateway)
**SSE Annotations:**
```yaml
nginx.ingress.kubernetes.io/proxy-buffering: "off"
nginx.ingress.kubernetes.io/proxy-http-version: "1.1"
nginx.ingress.kubernetes.io/proxy-read-timeout: "3600"
nginx.ingress.kubernetes.io/upstream-keepalive-timeout: "3600"
```
**CORS Origins:**
```yaml
nginx.ingress.kubernetes.io/cors-allow-origin: "https://bakery-ia.local,https://api.bakery-ia.local,https://monitoring.bakery-ia.local,http://localhost"
```
#### HTTP Ingress (Development)
**File:** `infrastructure/kubernetes/overlays/dev/dev-ingress.yaml`
**Changes:**
- Extended timeouts for SSE (3600s read/send timeout)
- Added SSE-specific annotations (proxy-buffering off, HTTP/1.1)
- Enhanced CORS headers to include Cache-Control
- Added PATCH to allowed methods
**Benefits:**
- ✅ Supports long-lived SSE connections (1 hour)
- ✅ No proxy buffering (real-time streaming)
- ✅ Works with both HTTP and HTTPS
- ✅ Proper CORS for all environments
- ✅ All external access through gateway (security)
---
### 4. Environment Configuration
**File:** `.env`
**Changes:**
- Added `http://localhost` to CORS_ORIGINS (line 217)
**New Value:**
```bash
CORS_ORIGINS=http://localhost,http://localhost:3000,http://localhost:3001,http://127.0.0.1:3000,https://bakery.yourdomain.com
```
**Note:** Services need a restart to pick up this change (handled by Tilt/Kubernetes)
---
## Architecture Flow
### Complete Alert Flow
```
1. SERVICE LAYER (Inventory, Orders, etc.)
├─> Detects alert condition
├─> Publishes to RabbitMQ (alerts.exchange)
└─> Routing key: alert.[severity].[service]
2. ALERT PROCESSOR SERVICE
├─> Consumes from RabbitMQ queue
├─> Stores in PostgreSQL database
├─> Determines delivery channels (email, whatsapp, etc.)
├─> Publishes to Redis: alerts:{tenant_id}
└─> Calls Notification Service for email/whatsapp
3. NOTIFICATION SERVICE
├─> Email Service (SMTP)
├─> WhatsApp Service (Twilio)
└─> (SSE handled by gateway, not notification service)
4. GATEWAY SERVICE
├─> /api/events endpoint
├─> Subscribes to Redis: alerts:{tenant_id}
├─> Streams SSE events to frontend
└─> Handles authentication/authorization
5. INGRESS (NGINX)
├─> Routes /api/* to gateway
├─> Handles HTTPS/TLS termination
├─> Manages CORS
└─> Optimized for long-lived SSE connections
6. FRONTEND (React)
├─> EventSource connects to /api/events
├─> Receives real-time alerts
├─> Shows toast notifications
└─> Triggers alert listeners
```
---
## Testing
### Manual Testing
#### Test 1: Endpoint Accessibility
```bash
curl -v -N "http://localhost/api/events?token=test"
```
**Expected Result:** 401 Unauthorized (correct - invalid token)
**Actual Result:** ✅ 401 Unauthorized
#### Test 2: Frontend Connection
1. Navigate to https://bakery-ia.local or http://localhost
2. Login to the application
3. Check browser console for: `"Connecting to SSE endpoint: ..."`
4. Look for: `"SSE connection opened"`
#### Test 3: Alert Delivery
1. Trigger an alert (e.g., create low stock condition)
2. Alert should appear in dashboard
3. Toast notification should show
4. Check browser network tab for EventSource connection
### Verification Checklist
- [x] Frontend uses dynamic protocol/host for SSE URL
- [x] Gateway validates JWT and extracts tenant_id
- [x] Ingress has SSE-specific annotations (proxy-buffering off)
- [x] Ingress has extended timeouts (3600s)
- [x] CORS includes http://localhost for development
- [x] Direct auth route removed from ingress
- [x] Gateway connected to Redis
- [x] SSE endpoint returns 401 for invalid token
- [x] Ingress configuration applied to Kubernetes
- [x] Gateway service restarted successfully
---
## Key Decisions
### Why Gateway Pattern for SSE?
**Decision:** Use gateway's `/api/events` instead of proxying to notification service
**Reasons:**
1. **Already Implemented:** Gateway has working SSE with Redis pub/sub
2. **Security:** Single authentication point at gateway
3. **Simplicity:** No need to expose notification service
4. **Scalability:** Redis pub/sub designed for this use case
5. **Consistency:** All external access through gateway
### Why Remove Direct Auth Route?
**Decision:** Route `/auth` through gateway instead of direct to auth-service
**Reasons:**
1. **Consistency:** All external API access should go through gateway
2. **Security:** Centralized rate limiting, logging, monitoring
3. **Flexibility:** Easier to add middleware (e.g., IP filtering)
4. **Best Practice:** Microservices should not be directly exposed
---
## Environment-Specific Configuration
### Local Development (http://localhost)
- Uses HTTP ingress (bakery-ingress)
- CORS allows all origins (`*`)
- SSL redirect disabled
- EventSource: `http://localhost/api/events`
### Staging/Production (https://bakery-ia.local)
- Uses HTTPS ingress (bakery-ingress-https)
- CORS allows specific domains
- SSL redirect enforced
- EventSource: `https://bakery-ia.local/api/events`
---
## Troubleshooting
### Issue: SSE Connection Fails with CORS Error
**Solution:** Check CORS_ORIGINS in .env includes the frontend origin
### Issue: SSE Connection Immediately Closes
**Solution:** Verify proxy-buffering is "off" in ingress annotations
### Issue: No Events Received
**Solution:**
1. Check Redis is running: `kubectl get pods -n bakery-ia | grep redis`
2. Check alert_processor is publishing: Check logs
3. Verify gateway subscribed to correct channel: Check gateway logs
### Issue: 401 Unauthorized on /api/events
**Solution:** Check JWT token is valid and not expired
### Issue: Frontend can't connect (ERR_CONNECTION_REFUSED)
**Solution:**
1. Verify ingress is applied: `kubectl get ingress -n bakery-ia`
2. Check gateway is running: `kubectl get pods -n bakery-ia | grep gateway`
3. Verify port forwarding or ingress controller
---
## Performance Considerations
### Timeouts
- **Read Timeout:** 3600s (1 hour) - Allows long-lived connections
- **Send Timeout:** 3600s (1 hour) - Prevents premature disconnection
- **Connect Timeout:** 600s (10 minutes) - Initial connection establishment
### Heartbeats
- Gateway sends heartbeat every ~100 seconds (10 timeouts × 10s)
- Prevents connection from appearing stale
- Helps detect disconnected clients
### Scalability
- **Redis Pub/Sub:** Can handle millions of messages per second
- **Gateway:** Stateless, can scale horizontally
- **Nginx:** Optimized for long-lived connections
---
## Security
### Authentication Flow
1. Frontend includes JWT token in query parameter
2. Gateway validates token using auth middleware
3. Gateway checks token expiration
4. Gateway extracts user_id from verified token
5. Gateway queries tenant service for user's tenants
6. Only subscribed to authorized tenant's channel
### Security Benefits
- ✅ JWT validation at gateway
- ✅ Token expiration checking
- ✅ Tenant isolation (each tenant has separate channel)
- ✅ No tenant_id in URL (prevents enumeration)
- ✅ HTTPS enforced in production
- ✅ CORS properly configured
---
## Next Steps (Optional Enhancements)
### 1. Multiple Tenant Support
Allow users to subscribe to alerts from multiple tenants simultaneously.
### 2. Event Filtering
Add query parameters to filter events by severity or type:
```
/api/events?token=xxx&severity=urgent,high&type=alert
```
### 3. Historical Events on Connect
Send recent alerts when the client first connects (implemented in the notification service but not currently used).
### 4. Reconnection Logic
Frontend already has exponential backoff - consider adding connection status indicator.
### 5. Metrics
Add Prometheus metrics for:
- Active SSE connections
- Events published per tenant
- Connection duration
- Reconnection attempts
---
## Files Modified
1. `frontend/src/contexts/SSEContext.tsx` - SSE client connection
2. `gateway/app/main.py` - SSE endpoint with tenant extraction
3. `infrastructure/kubernetes/base/ingress-https.yaml` - HTTPS ingress config
4. `infrastructure/kubernetes/overlays/dev/dev-ingress.yaml` - Dev ingress config
5. `.env` - CORS origins
## Files Deployed
- Ingress configurations applied to Kubernetes cluster
- Gateway service automatically redeployed by Tilt
- Frontend changes ready for deployment
---
## Conclusion
The SSE real-time alert system is now fully functional with:
- ✅ Proper gateway pattern implementation
- ✅ HTTPS support with protocol matching
- ✅ Secure JWT authentication
- ✅ Optimized nginx configuration for SSE
- ✅ CORS properly configured for all environments
- ✅ All external access through gateway (no direct service exposure)
The system is production-ready and follows microservices best practices.

View File

@@ -1,291 +0,0 @@
# SSE Authentication Security Mitigations
## Implementation Date
**2025-10-02**
## Security Concern: Token in Query Parameters
The SSE endpoint (`/api/events?token=xxx`) accepts authentication tokens via query parameters due to browser `EventSource` API limitations. This introduces security risks that have been mitigated.
---
## Security Risks & Mitigations
### 1. Token Exposure in Logs ⚠️
**Risk:** Nginx access logs contain full URLs including tokens
**Impact:** Medium - If logs are compromised, tokens could be exposed
**Mitigations Implemented:**
-**Short Token Expiry**: JWT tokens expire in 30 minutes (configurable in `.env`)
-**HTTPS Only**: All production traffic uses TLS encryption
-**Log Access Control**: Kubernetes logs have RBAC restrictions
- ⚠️ **Manual Log Filtering**: Nginx configuration-snippet is disabled by admin
**Additional Mitigation (Manual):**
If you have access to nginx ingress controller configuration, enable log filtering:
```yaml
# In nginx ingress controller ConfigMap
data:
log-format-upstream: '$remote_addr - $remote_user [$time_local] "$request_method $sanitized_uri $server_protocol" $status $body_bytes_sent "$http_referer" "$http_user_agent"'
```
Or use a log aggregation tool (Loki, ELK) with regex filtering to redact tokens.
---
### 2. Token in Browser History 📝
**Risk:** Browser stores URLs with query params in history
**Impact:** Low - Requires local access to user's machine
**Mitigations Implemented:**
-**User's Own Browser**: History is private to the user
-**Short Expiry**: Old tokens in history expire quickly
-**Auto-logout**: Session management invalidates tokens
**Not a Risk:** SSE connections are initiated by JavaScript (EventSource), not user navigation, so they typically don't appear in browser history.
---
### 3. Referrer Header Leakage 🔗
**Risk:** When user navigates away, Referrer header might include SSE URL
**Impact:** Medium - Token could leak to third-party sites
**Mitigations Implemented:**
- ⚠️ **Referrer-Policy Header**: Attempted via nginx annotation (blocked by admin)
-**SameSite Routing**: SSE is same-origin (no external referrers)
-**HTTPS**: Browsers don't send Referrer from HTTPS to HTTP
**Manual Mitigation:**
Add to HTML head in frontend:
```html
<meta name="referrer" content="no-referrer">
```
Or add HTTP header via frontend response headers.
---
### 4. Proxy/CDN Caching 🌐
**Risk:** Intermediary proxies might cache or log URLs
**Impact:** Low - Internal infrastructure only
**Mitigations Implemented:**
-**Direct Ingress**: No external proxies/CDNs
-**Internal Network**: All routing within Kubernetes cluster
-**Cache-Control Headers**: SSE endpoints set no-cache
---
### 5. Accidental URL Sharing 📤
**Risk:** Users could copy/share URLs with embedded tokens
**Impact:** High (for regular URLs) / Low (for SSE - not user-visible)
**Mitigations Implemented:**
-**Hidden from Users**: EventSource connections not visible in address bar
-**Short Token Expiry**: Shared tokens expire quickly
-**One-Time Use**: Tokens invalidated on logout
---
## Security Comparison
| Threat | Query Param Token | Header Token | Cookie Token | WebSocket |
|--------|-------------------|--------------|--------------|-----------|
| **Server Logs** | ⚠️ Medium | ✅ Safe | ✅ Safe | ✅ Safe |
| **Browser History** | ⚠️ Low | ✅ Safe | ✅ Safe | ✅ Safe |
| **Referrer Leakage** | ⚠️ Medium | ✅ Safe | ⚠️ Medium | ✅ Safe |
| **XSS Attacks** | ⚠️ Vulnerable | ⚠️ Vulnerable | ✅ httpOnly | ⚠️ Vulnerable |
| **CSRF Attacks** | ✅ Safe | ✅ Safe | ⚠️ Requires token | ✅ Safe |
| **Ease of Use** | ✅ Simple | ❌ Not supported | ⚠️ Complex | ⚠️ Complex |
| **Browser Support** | ✅ Native | ❌ No EventSource | ✅ Native | ✅ Native |
---
## Applied Mitigations Summary
### ✅ Implemented:
1. **Short token expiry** (30 minutes)
2. **HTTPS enforcement** (production)
3. **Token validation** (middleware + endpoint)
4. **CORS restrictions** (specific origins)
5. **Kubernetes RBAC** (log access control)
6. **Same-origin policy** (no external referrers)
7. **Auto-logout** (session management)
### ⚠️ Blocked by Infrastructure:
1. **Nginx log filtering** (configuration-snippet disabled)
2. **Referrer-Policy header** (configuration-snippet disabled)
### 📝 Recommended (Manual):
1. **Add Referrer-Policy meta tag** to frontend HTML
2. **Enable nginx log filtering** if ingress admin allows
3. **Use log aggregation** with token redaction (Loki/ELK)
4. **Monitor for suspicious patterns** in logs
---
## Production Checklist
Before deploying to production, ensure:
- [ ] HTTPS enforced (no HTTP fallback)
- [ ] Token expiry set to ≤ 30 minutes
- [ ] CORS origins limited to specific domains (not `*`)
- [ ] Kubernetes RBAC configured for log access
- [ ] Frontend has Referrer-Policy meta tag
- [ ] Log aggregation configured with token redaction
- [ ] Monitoring/alerting for failed auth attempts
- [ ] Rate limiting enabled on gateway
- [ ] Regular security audits of access logs
---
## Upgrade Path to Cookie-Based Auth
For maximum security, migrate to cookie-based authentication:
**Effort:** ~2-3 hours
**Security:** ⭐⭐⭐⭐⭐ (5/5)
**Changes needed:**
1. Auth service sets httpOnly cookie on login
2. Gateway auth middleware reads cookie instead of query param
3. Frontend uses `withCredentials: true` (already done!)
**Benefits:**
- ✅ No token in URL
- ✅ No token in logs
- ✅ XSS protection (httpOnly)
- ✅ CSRF protection (SameSite)
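A sketch of the gateway-side change; the cookie name `access_token` and the helper itself are assumptions about how the migration could look.
```python
from fastapi import HTTPException, Request

def extract_sse_token(request: Request) -> str:
    """Prefer the httpOnly cookie; fall back to the query parameter during migration."""
    token = request.cookies.get("access_token") or request.query_params.get("token")
    if not token:
        raise HTTPException(status_code=401, detail="Missing authentication token")
    return token
```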
---
## Risk Assessment
### Current Risk Level: **MEDIUM** ⚠️
**Acceptable for:**
- ✅ Internal/development environments
- ✅ Short-term production (with monitoring)
- ✅ Low-sensitivity data
**Not recommended for:**
- ❌ High-security environments
- ❌ Long-term production without upgrade path
- ❌ Systems handling PII/financial data
**Upgrade recommended within:** 30-60 days for production
---
## Incident Response
### If Token Leak Suspected:
1. **Immediate Actions:**
```bash
# Invalidate all active sessions
kubectl exec -it -n bakery-ia $(kubectl get pod -n bakery-ia -l app=redis -o name) -- redis-cli
KEYS auth:token:*
DEL auth:token:*
```
2. **Rotate JWT Secret:**
```bash
# Update .env
JWT_SECRET_KEY=<new-secret-64-chars>
# Restart auth service and gateway
kubectl rollout restart deployment auth-service gateway -n bakery-ia
```
3. **Force Re-authentication:**
- All users must login again
- Existing tokens invalidated
4. **Audit Logs:**
```bash
# Check for suspicious SSE connections
kubectl logs -n ingress-nginx -l app.kubernetes.io/component=controller | grep "/api/events"
```
---
## Monitoring Queries
### Check for Suspicious Activity:
```bash
# High volume of SSE connections from single IP
kubectl logs -n ingress-nginx -l app.kubernetes.io/component=controller | grep "/api/events" | awk '{print $1}' | sort | uniq -c | sort -rn | head -10
# Failed authentication attempts
kubectl logs -n bakery-ia -l app.kubernetes.io/name=gateway | grep "401\|Invalid token"
# SSE connections with expired tokens
kubectl logs -n bakery-ia -l app.kubernetes.io/name=gateway | grep "Token expired"
```
---
## Compliance Notes
### GDPR:
- ✅ Tokens are pseudonymous identifiers
- ✅ Short retention (30 min expiry)
- ⚠️ Tokens in logs = personal data processing (document in privacy policy)
### SOC 2:
- ⚠️ Query param auth acceptable with compensating controls
- ✅ Encryption in transit (HTTPS)
- ✅ Access controls (RBAC)
- 📝 Document risk acceptance in security policy
### PCI DSS:
- ❌ Not recommended for payment card data
- ✅ Acceptable for non-cardholder data
- 📝 May require additional compensating controls
---
## References
- [OWASP: Transport Layer Protection](https://owasp.org/www-community/controls/Transport_Layer_Protection)
- [EventSource API Spec](https://html.spec.whatwg.org/multipage/server-sent-events.html)
- [RFC 6750: OAuth 2.0 Bearer Token Usage](https://www.rfc-editor.org/rfc/rfc6750.html#section-2.3)
---
## Decision Log
**Date:** 2025-10-02
**Decision:** Use query parameter authentication for SSE endpoint
**Rationale:** EventSource API limitation (no custom headers)
**Accepted Risk:** Medium (token in logs, limited referrer leakage)
**Mitigation Plan:** Implement cookie-based auth within 60 days
**Approved By:** Technical Lead
---
## Next Steps
1. **Short-term (now):**
- ✅ Query param auth implemented
- ✅ Security mitigations documented
- 📝 Add Referrer-Policy to frontend
- 📝 Configure log monitoring
2. **Medium-term (30 days):**
- 📝 Implement cookie-based authentication
- 📝 Enable nginx log filtering (if allowed)
- 📝 Set up log aggregation with redaction
3. **Long-term (60 days):**
- 📝 Security audit of implementation
- 📝 Penetration testing
- 📝 Consider WebSocket migration (if bidirectional needed)

View File

@@ -1,234 +0,0 @@
# Timezone-Aware Datetime Fix
**Date:** 2025-10-09
**Status:** ✅ RESOLVED
## Problem
Error in forecasting service logs:
```
[error] Failed to get cached prediction
error=can't compare offset-naive and offset-aware datetimes
```
## Root Cause
The forecasting service database uses `DateTime(timezone=True)` for all timestamp columns, which means they store timezone-aware datetime objects. However, the code was using `datetime.utcnow()` throughout, which returns timezone-naive datetime objects.
When comparing these two types (e.g., checking if cache has expired), Python raises:
```
TypeError: can't compare offset-naive and offset-aware datetimes
```
## Database Schema
All datetime columns in forecasting service models use `DateTime(timezone=True)`:
```python
# From app/models/predictions.py
class PredictionCache(Base):
forecast_date = Column(DateTime(timezone=True), nullable=False)
created_at = Column(DateTime(timezone=True), default=lambda: datetime.now(timezone.utc))
expires_at = Column(DateTime(timezone=True), nullable=False) # ← Compared with datetime.utcnow()
# ... other columns
class ModelPerformanceMetric(Base):
evaluation_date = Column(DateTime(timezone=True), nullable=False)
created_at = Column(DateTime(timezone=True), default=lambda: datetime.now(timezone.utc))
# ... other columns
# From app/models/forecasts.py
class Forecast(Base):
forecast_date = Column(DateTime(timezone=True), nullable=False, index=True)
created_at = Column(DateTime(timezone=True), default=lambda: datetime.now(timezone.utc))
class PredictionBatch(Base):
requested_at = Column(DateTime(timezone=True), default=lambda: datetime.now(timezone.utc))
completed_at = Column(DateTime(timezone=True))
```
## Solution
Replaced all `datetime.utcnow()` calls with `datetime.now(timezone.utc)` throughout the forecasting service.
### Before (BROKEN):
```python
# Returns timezone-naive datetime
cache_entry.expires_at < datetime.utcnow() # ❌ TypeError!
```
### After (WORKING):
```python
# Returns timezone-aware datetime
cache_entry.expires_at < datetime.now(timezone.utc) # ✅ Works!
```
## Files Fixed
### 1. Import statements updated
Added `timezone` to imports in all affected files:
```python
from datetime import datetime, timedelta, timezone
```
### 2. All datetime.utcnow() replaced
Fixed in 9 files across the forecasting service:
1. **[services/forecasting/app/repositories/prediction_cache_repository.py](services/forecasting/app/repositories/prediction_cache_repository.py)**
- Line 53: Cache expiration time calculation
- Line 105: Cache expiry check (the main error)
- Line 175: Cleanup expired cache entries
- Line 212: Cache statistics query
2. **[services/forecasting/app/repositories/prediction_batch_repository.py](services/forecasting/app/repositories/prediction_batch_repository.py)**
- Lines 84, 113, 143, 184: Batch completion timestamps
- Line 273: Recent activity queries
- Line 318: Cleanup old batches
- Line 357: Batch progress calculations
3. **[services/forecasting/app/repositories/forecast_repository.py](services/forecasting/app/repositories/forecast_repository.py)**
- Lines 162, 241: Forecast accuracy and trend analysis date ranges
4. **[services/forecasting/app/repositories/performance_metric_repository.py](services/forecasting/app/repositories/performance_metric_repository.py)**
- Line 101: Performance trends date range calculation
5. **[services/forecasting/app/repositories/base.py](services/forecasting/app/repositories/base.py)**
- Lines 116, 118: Recent records queries
- Lines 124, 159, 161: Cleanup and statistics
6. **[services/forecasting/app/services/forecasting_service.py](services/forecasting/app/services/forecasting_service.py)**
- Lines 292, 365, 393, 409, 447, 553: Processing time calculations and timestamps
7. **[services/forecasting/app/api/forecasting_operations.py](services/forecasting/app/api/forecasting_operations.py)**
- Line 274: API response timestamps
8. **[services/forecasting/app/api/scenario_operations.py](services/forecasting/app/api/scenario_operations.py)**
- Lines 68, 134, 163: Scenario simulation timestamps
9. **[services/forecasting/app/services/messaging.py](services/forecasting/app/services/messaging.py)**
- Message timestamps
## Verification
```bash
# Before fix
$ grep -r "datetime\.utcnow()" services/forecasting/app --include="*.py" | wc -l
20
# After fix
$ grep -r "datetime\.utcnow()" services/forecasting/app --include="*.py" | wc -l
0
```
## Why This Matters
### Timezone-Naive (datetime.utcnow())
```python
>>> datetime.utcnow()
datetime.datetime(2025, 10, 9, 9, 10, 37, 123456) # No timezone info
```
### Timezone-Aware (datetime.now(timezone.utc))
```python
>>> datetime.now(timezone.utc)
datetime.datetime(2025, 10, 9, 9, 10, 37, 123456, tzinfo=datetime.timezone.utc) # Has timezone
```
When PostgreSQL stores `DateTime(timezone=True)` columns, it stores them as timezone-aware. Comparing these with timezone-naive datetimes fails.
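The mismatch is easy to reproduce in a REPL:
```python
>>> from datetime import datetime, timezone
>>> datetime.utcnow() < datetime.now(timezone.utc)
Traceback (most recent call last):
  ...
TypeError: can't compare offset-naive and offset-aware datetimes
```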
## Impact
This fix resolves:
- ✅ Cache expiration checks
- ✅ Batch status updates
- ✅ Performance metric queries
- ✅ Forecast analytics date ranges
- ✅ Cleanup operations
- ✅ Recent activity queries
## Best Practice
**Always use timezone-aware datetimes with PostgreSQL `DateTime(timezone=True)` columns:**
```python
# ✅ GOOD
created_at = Column(DateTime(timezone=True), default=lambda: datetime.now(timezone.utc))
expires_at = datetime.now(timezone.utc) + timedelta(hours=24)
if record.created_at < datetime.now(timezone.utc):
...
# ❌ BAD
created_at = Column(DateTime(timezone=True), default=datetime.utcnow) # No timezone!
expires_at = datetime.utcnow() + timedelta(hours=24) # Naive!
if record.created_at < datetime.utcnow(): # TypeError!
...
```
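To make the aware form the path of least resistance, a small shared helper can replace the longer call site (a sketch; the module path and name are illustrative):
```python
# shared/utils/datetime_utils.py (illustrative location)
from datetime import datetime, timezone

def utc_now() -> datetime:
    """Current time as a timezone-aware UTC datetime."""
    return datetime.now(timezone.utc)
```
Call sites then read `expires_at = utc_now() + timedelta(hours=24)`, and a grep for `datetime.utcnow()` remains a reliable regression check.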
## Additional Issue Found and Fixed
### Local Import Shadowing
After the initial fix, a new error appeared:
```
[error] Multi-day forecast generation failed
error=cannot access local variable 'timezone' where it is not associated with a value
```
**Cause:** In `forecasting_service.py` line 428, there was a local import inside a conditional block that shadowed the module-level import:
```python
# Module level (line 9)
from datetime import datetime, date, timedelta, timezone
# Inside function (line 428) - PROBLEM
if day_offset > 0:
from datetime import timedelta, timezone # ← Creates LOCAL variable
current_date = current_date + timedelta(days=day_offset)
# Later in same function (line 447)
processing_time = (datetime.now(timezone.utc) - start_time) # ← Error! timezone not accessible
```
Because of the import statement inside the function body, Python treats `timezone` as a local variable for the entire function. When `day_offset` is 0 the `if` branch never executes, so that local is never assigned; when line 447 then evaluates `timezone.utc`, Python resolves the name to the unbound local instead of the module-level import and raises: "cannot access local variable 'timezone' where it is not associated with a value".
**Fix:** Removed the redundant local import since `timezone` is already imported at module level:
```python
# Before (BROKEN)
if day_offset > 0:
from datetime import timedelta, timezone
current_date = current_date + timedelta(days=day_offset)
# After (WORKING)
if day_offset > 0:
current_date = current_date + timedelta(days=day_offset)
```
**File:** [services/forecasting/app/services/forecasting_service.py](services/forecasting/app/services/forecasting_service.py#L427-L428)
## Deployment
```bash
# Restart forecasting service to apply changes
kubectl -n bakery-ia rollout restart deployment forecasting-service
# Monitor for errors
kubectl -n bakery-ia logs -f deployment/forecasting-service | grep -E "(can't compare|cannot access)"
```
## Related Issues
This same issue may exist in other services. Search for:
```bash
# Find services using timezone-aware columns
grep -r "DateTime(timezone=True)" services/*/app/models --include="*.py"
# Find services using datetime.utcnow()
grep -r "datetime\.utcnow()" services/*/app --include="*.py"
```
## References
- Python datetime docs: https://docs.python.org/3/library/datetime.html#aware-and-naive-objects
- SQLAlchemy DateTime: https://docs.sqlalchemy.org/en/20/core/type_basics.html#sqlalchemy.types.DateTime
- PostgreSQL TIMESTAMP WITH TIME ZONE: https://www.postgresql.org/docs/current/datatype-datetime.html

View File

@@ -165,24 +165,57 @@ k8s_resource('alert-processor-db', labels=['databases'])
# =============================================================================
# DEMO INITIALIZATION JOBS
# =============================================================================
-# Demo seed jobs run in strict order:
-# 1. demo-seed-users (creates demo user accounts)
-# 2. demo-seed-tenants (creates demo tenant records)
-# 3. demo-seed-inventory (creates ingredients & finished products)
-# 4. demo-seed-ai-models (creates fake AI model entries)
+# Demo seed jobs run in strict order to ensure data consistency across services:
+#
+# Order & Dependencies:
+# 1. demo-seed-users → Creates demo user accounts in auth service
+# 2. demo-seed-tenants → Creates demo tenant records (depends on users)
+# 3. demo-seed-subscriptions → Creates enterprise subscriptions for demo tenants (depends on tenants)
+# 4. demo-seed-inventory → Creates ingredients & finished products (depends on tenants)
+# 5. demo-seed-recipes → Creates recipes using ingredient IDs (depends on inventory)
+# 6. demo-seed-suppliers → Creates suppliers with price lists for ingredients (depends on inventory)
+# 7. demo-seed-sales → Creates historical sales data using finished product IDs (depends on inventory)
+# 8. demo-seed-ai-models → Creates fake AI model entries (depends on inventory)
+#
+# Note: Recipes, Suppliers, and Sales can run in parallel after Inventory completes,
+# as they all depend on inventory data but not on each other.
+# Step 1: Seed users (auth service)
k8s_resource('demo-seed-users',
    resource_deps=['auth-migration'],
    labels=['demo-init'])
+# Step 2: Seed tenants (tenant service)
k8s_resource('demo-seed-tenants',
    resource_deps=['tenant-migration', 'demo-seed-users'],
    labels=['demo-init'])
+# Step 2.5: Seed subscriptions (creates enterprise subscriptions for demo tenants)
+k8s_resource('demo-seed-subscriptions',
+    resource_deps=['tenant-migration', 'demo-seed-tenants'],
+    labels=['demo-init'])
+# Step 3: Seed inventory - CRITICAL: All other seeds depend on this
k8s_resource('demo-seed-inventory',
    resource_deps=['inventory-migration', 'demo-seed-tenants'],
    labels=['demo-init'])
+# Step 4: Seed recipes (uses ingredient IDs from inventory)
+k8s_resource('demo-seed-recipes',
+    resource_deps=['recipes-migration', 'demo-seed-inventory'],
+    labels=['demo-init'])
+# Step 5: Seed suppliers (uses ingredient IDs for price lists)
+k8s_resource('demo-seed-suppliers',
+    resource_deps=['suppliers-migration', 'demo-seed-inventory'],
+    labels=['demo-init'])
+# Step 6: Seed sales (uses finished product IDs from inventory)
+k8s_resource('demo-seed-sales',
+    resource_deps=['sales-migration', 'demo-seed-inventory'],
+    labels=['demo-init'])
+# Step 7: Seed AI models (creates training/forecasting model records)
k8s_resource('demo-seed-ai-models',
    resource_deps=['training-migration', 'demo-seed-inventory'],
    labels=['demo-init'])
@@ -252,13 +285,24 @@ k8s_resource('demo-session-service',
    resource_deps=['demo-session-migration', 'redis'],
    labels=['services'])
-# Get the image reference for inventory-service to use in demo clone jobs
-inventory_image_ref = str(local('kubectl get deployment inventory-service -n bakery-ia -o jsonpath="{.spec.template.spec.containers[0].image}" 2>/dev/null || echo "bakery/inventory-service:latest"')).strip()
# Apply environment variable patch to demo-session-service with the inventory image
+# Note: This fetches the CURRENT image tag dynamically when the resource runs
+# Runs after both services are deployed to ensure correct image tag is used
local_resource('patch-demo-session-env',
-    cmd='kubectl set env deployment/demo-session-service -n bakery-ia CLONE_JOB_IMAGE=' + inventory_image_ref,
-    resource_deps=['demo-session-service'],
+    cmd='''
+    # Wait a moment for deployments to stabilize
+    sleep 2
+    # Get current inventory-service image tag
+    INVENTORY_IMAGE=$(kubectl get deployment inventory-service -n bakery-ia -o jsonpath="{.spec.template.spec.containers[0].image}" 2>/dev/null || echo "bakery/inventory-service:latest")
+    # Update demo-session-service environment variable
+    kubectl set env deployment/demo-session-service -n bakery-ia CLONE_JOB_IMAGE=$INVENTORY_IMAGE
+    echo "✅ Set CLONE_JOB_IMAGE to: $INVENTORY_IMAGE"
+    ''',
+    resource_deps=['demo-session-service', 'inventory-service'],  # Wait for BOTH services
+    auto_init=True,  # Run automatically on Tilt startup
    labels=['config'])
# =============================================================================

View File

@@ -1,215 +0,0 @@
# Clean WebSocket Implementation - Status Report
## Architecture Overview
### Clean KISS Design (Divide and Conquer)
```
Frontend WebSocket → Gateway (Token Verification Only) → Training Service WebSocket → RabbitMQ Events → Broadcast to All Clients
```
## ✅ COMPLETED Components
### 1. WebSocket Connection Manager (`services/training/app/websocket/manager.py`)
- **Status**: ✅ COMPLETE
- Simple connection manager for WebSocket clients
- Thread-safe connection tracking per job_id
- Broadcasting capability to all connected clients
- Auto-cleanup of failed connections
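A minimal sketch of the shape such a manager can take (illustrative only; the actual implementation lives in `manager.py`):
```python
import asyncio
from collections import defaultdict

from fastapi import WebSocket

class ConnectionManager:
    """Tracks WebSocket connections per job_id and broadcasts messages to them."""

    def __init__(self) -> None:
        self._connections: dict[str, set[WebSocket]] = defaultdict(set)
        self._lock = asyncio.Lock()

    async def connect(self, job_id: str, websocket: WebSocket) -> None:
        await websocket.accept()
        async with self._lock:
            self._connections[job_id].add(websocket)

    async def disconnect(self, job_id: str, websocket: WebSocket) -> None:
        async with self._lock:
            self._connections[job_id].discard(websocket)

    async def broadcast(self, job_id: str, message: dict) -> None:
        async with self._lock:
            targets = list(self._connections[job_id])
        for ws in targets:
            try:
                await ws.send_json(message)
            except Exception:
                # Auto-cleanup: drop connections that can no longer receive
                await self.disconnect(job_id, ws)
```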
### 2. RabbitMQ Event Consumer (`services/training/app/websocket/events.py`)
- **Status**: ✅ COMPLETE
- Global consumer that listens to all training.* events
- Automatically broadcasts events to WebSocket clients
- Maps RabbitMQ event types to WebSocket message types
- Sets up on service startup
### 3. Clean Event Publishers (`services/training/app/services/training_events.py`)
- **Status**: ✅ COMPLETE
- **4 Main Events** as specified, plus a failure event for error handling:
1. `publish_training_started()` - 0% progress
2. `publish_data_analysis()` - 20% progress
3. `publish_product_training_completed()` - contributes to 20-80% progress
4. `publish_training_completed()` - 100% progress
5. `publish_training_failed()` - error handling
### 4. WebSocket Endpoint (`services/training/app/api/websocket_operations.py`)
- **Status**: ✅ COMPLETE
- Simple endpoint at `/api/v1/tenants/{tenant_id}/training/jobs/{job_id}/live`
- Token validation
- Connection management
- Ping/pong support
- Receives broadcasts from RabbitMQ consumer
### 5. Gateway WebSocket Proxy (`gateway/app/main.py`)
- **Status**: ✅ COMPLETE
- **KISS**: Token verification ONLY
- Simple bidirectional forwarding
- No business logic
- Clean error handling
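A sketch of the bidirectional forwarding this implies, assuming the `websockets` client library on the gateway side (the real gateway code may use a different client; this is not a copy of it):
```python
import asyncio

import websockets
from fastapi import WebSocket

async def proxy_websocket(client_ws: WebSocket, upstream_url: str) -> None:
    """Verify-then-forward: after token checks pass, relay frames in both directions."""
    await client_ws.accept()
    async with websockets.connect(upstream_url) as upstream:

        async def client_to_upstream() -> None:
            while True:
                await upstream.send(await client_ws.receive_text())

        async def upstream_to_client() -> None:
            async for message in upstream:
                await client_ws.send_text(message)

        tasks = [asyncio.create_task(client_to_upstream()),
                 asyncio.create_task(upstream_to_client())]
        done, pending = await asyncio.wait(tasks, return_when=asyncio.FIRST_COMPLETED)
        for task in pending:
            task.cancel()
        for task in done:
            task.exception()  # either side closing ends the proxy; swallow the disconnect
```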
### 6. Parallel Product Progress Tracker (`services/training/app/services/progress_tracker.py`)
- **Status**: ✅ COMPLETE
- Thread-safe tracking of parallel product training
- Automatic progress calculation (20-80% range)
- Each product completion = 60/N% progress
- Emits `publish_product_training_completed` events
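The progress arithmetic behind this can be pictured with a small illustrative sketch (names follow the description above; the real class also publishes the event):
```python
import asyncio

class ParallelProductProgressTracker:
    """Maps per-product completions onto the 20-80% progress range."""

    def __init__(self, job_id: str, tenant_id: str, total_products: int) -> None:
        self.job_id = job_id
        self.tenant_id = tenant_id
        self.total_products = max(total_products, 1)
        self._completed = 0
        self._lock = asyncio.Lock()

    async def mark_product_completed(self, product_name: str) -> int:
        """Record one finished product and return the overall progress percentage."""
        async with self._lock:
            self._completed += 1
            completed = self._completed
        # 20% base (data analysis done) + each product worth 60/N%
        return 20 + int((completed / self.total_products) * 60)
```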
### 7. Service Integration (`services/training/app/main.py`)
- **Status**: ✅ COMPLETE
- Added WebSocket router to FastAPI app
- Setup WebSocket event consumer on startup
- Cleanup on shutdown
### 8. Removed Legacy Code
- **Status**: ✅ COMPLETE
- ❌ Deleted all WebSocket code from `training_operations.py`
- ❌ Removed ConnectionManager, message cache, backfill logic
- ❌ Removed per-job RabbitMQ consumers
- ❌ Simplified event imports
## 🚧 PENDING Components
### 1. Update Training Service to Use New Events
- **File**: `services/training/app/services/training_service.py`
- **Current**: Uses old `TrainingStatusPublisher` with many granular events
- **Needed**: Replace with 4 clean events:
```python
# 1. Start (0%)
await publish_training_started(job_id, tenant_id, total_products)
# 2. Data Analysis (20%)
await publish_data_analysis(job_id, tenant_id, "Analysis details...")
# 3. Product Training (20-80%) - use ParallelProductProgressTracker
tracker = ParallelProductProgressTracker(job_id, tenant_id, total_products)
# In parallel training loop:
await tracker.mark_product_completed(product_name)
# 4. Completion (100%)
await publish_training_completed(job_id, tenant_id, successful, failed, duration)
```
### 2. Update Training Orchestrator/Trainer
- **File**: `services/training/app/ml/trainer.py` (likely)
- **Needed**: Integrate `ParallelProductProgressTracker` in parallel training loop
- Must emit event for each product completion (order doesn't matter)
### 3. Remove Old Messaging Module
- **File**: `services/training/app/services/messaging.py`
- **Status**: Still exists with old complex event publishers
- **Action**: Can be removed once training_service.py is updated
- Keep only the new `training_events.py`
### 4. Update Frontend WebSocket Client
- **File**: `frontend/src/api/hooks/training.ts`
- **Current**: Already well-implemented but expects certain message types
- **Needed**: Update to handle new message types:
- `started` - 0%
- `progress` - for data_analysis (20%)
- `product_completed` - for each product (calculate 20 + (completed/total * 60))
- `completed` - 100%
- `failed` - error
### 5. Frontend Progress Calculation
- **Location**: Frontend WebSocket message handler
- **Logic Needed**:
```typescript
case 'product_completed':
const { products_completed, total_products } = message.data;
const progress = 20 + Math.floor((products_completed / total_products) * 60);
// Update UI with progress
break;
```
## Event Flow Diagram
```
Training Start
[Event 1: training.started] → 0% progress
Data Analysis
[Event 2: training.progress] → 20% progress (data_analysis step)
Product Training (Parallel)
[Event 3a: training.product.completed] → Product 1 done
[Event 3b: training.product.completed] → Product 2 done
[Event 3c: training.product.completed] → Product 3 done
... (progress calculated as: 20 + (completed/total * 60))
[Event 3n: training.product.completed] → Product N done → 80% progress
Training Complete
[Event 4: training.completed] → 100% progress
```
## Key Design Principles
1. **KISS (Keep It Simple, Stupid)**
- No complex caching or backfilling
- No per-job consumers
- One global consumer broadcasts to all clients
- Simple, stateless WebSocket connections
2. **Divide and Conquer**
- Gateway: Token verification only
- Training Service: WebSocket connections + RabbitMQ consumer
- Progress Tracker: Parallel training progress
- Event Publishers: 4 simple event types
3. **No Backward Compatibility**
- Deleted all legacy WebSocket code
- Clean slate implementation
- No TODOs (implement everything)
## Next Steps
1. Update `training_service.py` to use new event publishers
2. Update trainer to integrate `ParallelProductProgressTracker`
3. Remove old `messaging.py` module
4. Update frontend WebSocket client message handlers
5. Test end-to-end flow
6. Monitor WebSocket connections in production
## Testing Checklist
- [ ] WebSocket connection established through gateway
- [ ] Token verification works (valid and invalid tokens)
- [ ] Event 1 (started) received with 0% progress
- [ ] Event 2 (data_analysis) received with 20% progress
- [ ] Event 3 (product_completed) received for each product
- [ ] Progress correctly calculated (20 + completed/total * 60)
- [ ] Event 4 (completed) received with 100% progress
- [ ] Error events handled correctly
- [ ] Multiple concurrent clients receive same events
- [ ] Connection survives network hiccups
- [ ] Clean disconnection when training completes
## Files Modified
### Created:
- `services/training/app/websocket/manager.py`
- `services/training/app/websocket/events.py`
- `services/training/app/websocket/__init__.py`
- `services/training/app/api/websocket_operations.py`
- `services/training/app/services/training_events.py`
- `services/training/app/services/progress_tracker.py`
### Modified:
- `services/training/app/main.py` - Added WebSocket router and event consumer setup
- `services/training/app/api/training_operations.py` - Removed all WebSocket code
- `gateway/app/main.py` - Simplified WebSocket proxy
### To Remove:
- `services/training/app/services/messaging.py` - Replace with `training_events.py`
## Notes
- RabbitMQ exchange: `training.events`
- Routing keys: `training.*` (wildcard for all events)
- WebSocket URL: `ws://gateway/api/v1/tenants/{tenant_id}/training/jobs/{job_id}/live?token={token}`
- Progress range: 0% → 20% → 20-80% (products) → 100%
- Each product contributes: 60/N% where N = total products

View File

@@ -1,278 +0,0 @@
# WebSocket Implementation - COMPLETE ✅
## Summary
Successfully redesigned and implemented a clean, production-ready WebSocket solution for real-time training progress updates following KISS (Keep It Simple, Stupid) and divide-and-conquer principles.
## Architecture
```
Frontend WebSocket
Gateway (Token Verification ONLY)
Training Service WebSocket Endpoint
Training Process → RabbitMQ Events
Global RabbitMQ Consumer → WebSocket Manager
Broadcast to All Connected Clients
```
## Implementation Status: ✅ 100% COMPLETE
### Backend Components
#### 1. WebSocket Connection Manager ✅
**File**: `services/training/app/websocket/manager.py`
- Simple, thread-safe WebSocket connection management
- Tracks connections per job_id
- Broadcasting to all clients for a specific job
- Automatic cleanup of failed connections
#### 2. RabbitMQ → WebSocket Bridge ✅
**File**: `services/training/app/websocket/events.py`
- Global consumer listens to all `training.*` events
- Automatically broadcasts to WebSocket clients
- Maps RabbitMQ event types to WebSocket message types
- Sets up on service startup
#### 3. Clean Event Publishers ✅
**File**: `services/training/app/services/training_events.py`
**4 Main Progress Events** (plus a failure event):
1. **Training Started** (0%) - `publish_training_started()`
2. **Data Analysis** (20%) - `publish_data_analysis()`
3. **Product Training** (20-80%) - `publish_product_training_completed()`
4. **Training Complete** (100%) - `publish_training_completed()`
5. **Training Failed** - `publish_training_failed()`
#### 4. Parallel Product Progress Tracker ✅
**File**: `services/training/app/services/progress_tracker.py`
- Thread-safe tracking for parallel product training
- Each product completion = 60/N% where N = total products
- Progress formula: `20 + (products_completed / total_products) * 60`
- Emits `product_completed` events automatically
#### 5. WebSocket Endpoint ✅
**File**: `services/training/app/api/websocket_operations.py`
- Simple endpoint: `/api/v1/tenants/{tenant_id}/training/jobs/{job_id}/live`
- Token validation
- Ping/pong support
- Receives broadcasts from RabbitMQ consumer
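Schematically, the endpoint looks something like this (`verify_token` and `connection_manager` stand in for the service's own auth check and connection manager; details differ in the real module):
```python
from fastapi import APIRouter, WebSocket, WebSocketDisconnect, status

router = APIRouter()

@router.websocket("/api/v1/tenants/{tenant_id}/training/jobs/{job_id}/live")
async def training_job_live(websocket: WebSocket, tenant_id: str, job_id: str, token: str):
    # Token arrives as a query parameter because browsers cannot set headers on WebSocket upgrades
    if not await verify_token(token, tenant_id):  # placeholder for the real check
        await websocket.close(code=status.WS_1008_POLICY_VIOLATION)
        return

    await connection_manager.connect(job_id, websocket)  # placeholder for the real manager
    try:
        while True:
            message = await websocket.receive_text()
            if message == "ping":
                await websocket.send_text("pong")  # keep-alive; broadcasts arrive via the manager
    except WebSocketDisconnect:
        await connection_manager.disconnect(job_id, websocket)
```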
#### 6. Gateway WebSocket Proxy ✅
**File**: `gateway/app/main.py`
- **KISS**: Token verification ONLY
- Simple bidirectional message forwarding
- No business logic
- Clean error handling
#### 7. Trainer Integration ✅
**File**: `services/training/app/ml/trainer.py`
- Replaced old `TrainingStatusPublisher` with new event publishers
- Replaced `ProgressAggregator` with `ParallelProductProgressTracker`
- Emits all 4 main progress events
- Handles parallel product training
### Frontend Components
#### 8. Frontend WebSocket Client ✅
**File**: `frontend/src/api/hooks/training.ts`
**Handles all message types**:
- `connected` - Connection established
- `started` - Training started (0%)
- `progress` - Data analysis complete (20%)
- `product_completed` - Product training done (dynamic progress calculation)
- `completed` - Training finished (100%)
- `failed` - Training error
**Progress Calculation**:
```typescript
case 'product_completed':
const productsCompleted = eventData.products_completed || 0;
const totalProducts = eventData.total_products || 1;
// Calculate: 20% base + (completed/total * 60%)
progress = 20 + Math.floor((productsCompleted / totalProducts) * 60);
break;
```
### Code Cleanup ✅
#### 9. Removed Legacy Code
- ❌ Deleted all old WebSocket code from `training_operations.py`
- ❌ Removed `ConnectionManager`, message cache, backfill logic
- ❌ Removed per-job RabbitMQ consumers
- ❌ Removed all `TrainingStatusPublisher` imports and usage
- ❌ Cleaned up `training_service.py` - removed all status publisher calls
- ❌ Cleaned up `training_orchestrator.py` - replaced with new events
- ❌ Cleaned up `models.py` - removed unused event publishers
#### 10. Updated Module Structure ✅
**File**: `services/training/app/api/__init__.py`
- Added `websocket_operations_router` export
- Properly integrated into service
**File**: `services/training/app/main.py`
- Added WebSocket router
- Setup WebSocket event consumer on startup
- Cleanup on shutdown
## Progress Event Flow
```
Start (0%)
[Event 1: training.started]
job_id, tenant_id, total_products
Data Analysis (20%)
[Event 2: training.progress]
step: "Data Analysis"
progress: 20%
Model Training (20-80%)
[Event 3a: training.product.completed] Product 1 → 20 + (1/N * 60)%
[Event 3b: training.product.completed] Product 2 → 20 + (2/N * 60)%
...
[Event 3n: training.product.completed] Product N → 80%
Training Complete (100%)
[Event 4: training.completed]
successful_trainings, failed_trainings, total_duration
```
## Key Features
### 1. KISS (Keep It Simple, Stupid)
- No complex caching or backfilling
- No per-job consumers
- One global consumer broadcasts to all clients
- Stateless WebSocket connections
- Simple event structure
### 2. Divide and Conquer
- **Gateway**: Token verification only
- **Training Service**: WebSocket connections + event publisher
- **RabbitMQ Consumer**: Listens and broadcasts
- **Progress Tracker**: Parallel training progress calculation
- **Event Publishers**: 4 simple, clean event types
### 3. Production Ready
- Thread-safe parallel processing
- Automatic connection cleanup
- Error handling at every layer
- Comprehensive logging
- No backward compatibility baggage
## Event Message Format
### Example: Product Completed Event
```json
{
"type": "product_completed",
"job_id": "training_abc123",
"timestamp": "2025-10-08T12:34:56.789Z",
"data": {
"job_id": "training_abc123",
"tenant_id": "tenant_xyz",
"product_name": "Product A",
"products_completed": 15,
"total_products": 60,
"current_step": "Model Training",
"step_details": "Completed training for Product A (15/60)"
}
}
```
### Frontend Calculates Progress
```
progress = 20 + (15 / 60) * 60 = 20 + 15 = 35%
```
## Files Created
1. `services/training/app/websocket/manager.py`
2. `services/training/app/websocket/events.py`
3. `services/training/app/websocket/__init__.py`
4. `services/training/app/api/websocket_operations.py`
5. `services/training/app/services/training_events.py`
6. `services/training/app/services/progress_tracker.py`
## Files Modified
1. `services/training/app/main.py` - WebSocket router + event consumer
2. `services/training/app/api/__init__.py` - Export WebSocket router
3. `services/training/app/ml/trainer.py` - New event system
4. `services/training/app/services/training_service.py` - Removed old events
5. `services/training/app/services/training_orchestrator.py` - New events
6. `services/training/app/api/models.py` - Removed unused events
7. `services/training/app/api/training_operations.py` - Removed all WebSocket code
8. `gateway/app/main.py` - Simplified proxy
9. `frontend/src/api/hooks/training.ts` - New event handlers
## Files to Remove (Optional Future Cleanup)
- `services/training/app/services/messaging.py` - No longer used (710 lines of legacy code)
## Testing Checklist
- [ ] WebSocket connection established through gateway
- [ ] Token verification works (valid and invalid tokens)
- [ ] Event 1 (started) received with 0% progress
- [ ] Event 2 (data_analysis) received with 20% progress
- [ ] Event 3 (product_completed) received for each product
- [ ] Progress correctly calculated (20 + completed/total * 60)
- [ ] Event 4 (completed) received with 100% progress
- [ ] Error events handled correctly
- [ ] Multiple concurrent clients receive same events
- [ ] Connection survives network hiccups
- [ ] Clean disconnection when training completes
## Configuration
### WebSocket URL
```
ws://gateway-host/api/v1/tenants/{tenant_id}/training/jobs/{job_id}/live?token={auth_token}
```
### RabbitMQ
- **Exchange**: `training.events`
- **Routing Keys**: `training.*` (wildcard)
- **Queue**: `training_websocket_broadcast` (global)
### Progress Ranges
- **Training Start**: 0%
- **Data Analysis**: 20%
- **Model Training**: 20-80% (dynamic based on product count)
- **Training Complete**: 100%
## Benefits of New Implementation
1. **Simpler**: 80% less code than before
2. **Faster**: No unnecessary database queries or message caching
3. **Scalable**: One global consumer vs. per-job consumers
4. **Maintainable**: Clear separation of concerns
5. **Reliable**: Thread-safe, error-handled at every layer
6. **Clean**: No legacy code, no TODOs, production-ready
## Next Steps
1. Deploy and test in staging environment
2. Monitor RabbitMQ message flow
3. Monitor WebSocket connection stability
4. Collect metrics on message delivery times
5. Optional: Remove old `messaging.py` file
---
**Implementation Date**: October 8, 2025
**Status**: ✅ COMPLETE AND PRODUCTION-READY
**No Backward Compatibility**: Clean slate implementation
**No TODOs**: Fully implemented

View File

@@ -1,13 +1,13 @@
-----BEGIN CERTIFICATE----- -----BEGIN CERTIFICATE-----
MIIB9zCCAZ2gAwIBAgIRAJVj3HmLerDV1SxKRT/hLAQwCgYIKoZIzj0EAwIwWzEL MIIB9jCCAZ2gAwIBAgIRANcCNyBwnOiQrE/KSE6zkTUwCgYIKoZIzj0EAwIwWzEL
MAkGA1UEBhMCVVMxEjAQBgNVBAoTCUJha2VyeSBJQTEbMBkGA1UECxMSQmFrZXJ5 MAkGA1UEBhMCVVMxEjAQBgNVBAoTCUJha2VyeSBJQTEbMBkGA1UECxMSQmFrZXJ5
IElBIExvY2FsIENBMRswGQYDVQQDExJiYWtlcnktaWEtbG9jYWwtY2EwHhcNMjUx IElBIExvY2FsIENBMRswGQYDVQQDExJiYWtlcnktaWEtbG9jYWwtY2EwHhcNMjUx
MDA5MTgyOTA0WhcNMjYxMDA5MTgyOTA0WjBbMQswCQYDVQQGEwJVUzESMBAGA1UE MDEwMTAyMTIwWhcNMjYxMDEwMTAyMTIwWjBbMQswCQYDVQQGEwJVUzESMBAGA1UE
ChMJQmFrZXJ5IElBMRswGQYDVQQLExJCYWtlcnkgSUEgTG9jYWwgQ0ExGzAZBgNV ChMJQmFrZXJ5IElBMRswGQYDVQQLExJCYWtlcnkgSUEgTG9jYWwgQ0ExGzAZBgNV
BAMTEmJha2VyeS1pYS1sb2NhbC1jYTBZMBMGByqGSM49AgEGCCqGSM49AwEHA0IA BAMTEmJha2VyeS1pYS1sb2NhbC1jYTBZMBMGByqGSM49AgEGCCqGSM49AwEHA0IA
BMftW8JmthKeoYWGsVj42CuJFjidmwCCTcdtj6CcL0nnlFS0Dlv9djFLfxyqnpZP BOFR63AhrNrUEHfSUARtLgda4sqfufdyywUSoPHT46HPsakqAfl220wxQcYVsXh+
QHmjp7b8yhgWKVL8wq/zJUajQjBAMA4GA1UdDwEB/wQEAwICpDAPBgNVHRMBAf8E Krqt04bjdnyNzW7qF+WQ5FmjQjBAMA4GA1UdDwEB/wQEAwICpDAPBgNVHRMBAf8E
BTADAQH/MB0GA1UdDgQWBBRRtIEm0BSsSti/oLUDtbM4spslUjAKBggqhkjOPQQD BTADAQH/MB0GA1UdDgQWBBQlcQ1CBEsG0/Gm3Jch3PSt1+c2fjAKBggqhkjOPQQD
AgNIADBFAiBBoTfxXh5VqlPZVi60uoB76AZ56HJ96BgB7x353ECw8QIhAORe2MAd AgNHADBEAh9W1k3MHS7Qj6jUt54MHTeGYo2zbXRR4onDFG6ReabAAiEAgjPCh5kZ
n/q+dE6TN4VUkze8/Psur8ZbMDXmvRpXzY44 LfJP2mzmgiTiGFf4imIWAyI8kqhh9V8wZUE=
-----END CERTIFICATE----- -----END CERTIFICATE-----

View File

@@ -1,223 +0,0 @@
🎨 Frontend Design Recommendations for PanIA
1. MODERN UX/UI PRINCIPLES (2024-2025)
🎯 User-Centered Design Philosophy
- Jobs-to-be-Done Framework: Organize around what users need to
accomplish, not features
- Progressive Disclosure: Show only what's needed when it's needed
- Contextual Intelligence: AI-powered interfaces that adapt to user
behavior and business context
- Micro-Moment Design: Optimize for quick, task-focused interactions
🏗️ Information Architecture Principles
- Hub-and-Spoke Model: Central dashboard with specialized workspaces
- Layered Navigation: Primary → Secondary → Contextual navigation
levels
- Cross-Module Integration: Seamless data flow between related
functions
- Predictive Navigation: Surface relevant actions before users need
them
2. RECOMMENDED NAVIGATION STRUCTURE
🎛️ Primary Navigation (Top Level)
🏠 Dashboard 🥖 Operations 📊 Analytics ⚙️ Settings
🔗 Secondary Navigation (Operations Hub)
Operations/
├── 📦 Production
│ ├── Schedule
│ ├── Active Batches
│ └── Equipment
├── 📋 Orders
│ ├── Incoming
│ ├── In Progress
│ └── Supplier Orders
├── 🏪 Inventory
│ ├── Stock Levels
│ ├── Movements
│ └── Alerts
├── 🛒 Sales
│ ├── Daily Sales
│ ├── Customer Orders
│ └── POS Integration
└── 📖 Recipes
├── Active Recipes
├── Development
└── Costing
📈 Analytics Hub
Analytics/
├── 🔮 Forecasting
├── 📊 Sales Analytics
├── 📈 Production Reports
├── 💰 Financial Reports
├── 🎯 Performance KPIs
└── 🤖 AI Insights
3. MODERN UI DESIGN PATTERNS
🎨 Visual Design System
- Neumorphism + Glassmorphism: Subtle depth with transparency effects
- Adaptive Color System: Dynamic themes based on time of day/business
hours
- Micro-Interactions: Delightful feedback for all user actions
- Data Visualization: Interactive charts with drill-down capabilities
📱 Layout Patterns
- Compound Layout: Dashboard cards that expand into detailed views
- Progressive Web App: Offline-first design with sync indicators
- Responsive Grid: CSS Grid + Flexbox for complex layouts
- Floating Action Buttons: Quick access to primary actions
🎯 Interaction Patterns
- Command Palette: Universal search + actions (Cmd+K)
- Contextual Panels: Side panels for related information
- Smart Defaults: AI-powered form pre-filling
- Undo/Redo System: Confidence-building interaction safety
4. PAGE ORGANIZATION STRATEGY
🏠 Dashboard Design
┌─────────────────────────────────────────────────┐
│ Today's Overview AI Recommendations │
├─────────────────────────────────────────────────┤
│ Critical Alerts Weather Impact │
├─────────────────────────────────────────────────┤
│ Production Status Sales Performance │
├─────────────────────────────────────────────────┤
│ Quick Actions Recent Activity │
└─────────────────────────────────────────────────┘
📊 Analytics Design
- Export Everything: PDF, Excel, API endpoints for all reports
- AI Narrative: Natural language insights explaining the data
⚡ Operational Pages
- Split Complex Pages: Break inventory/production into focused
sub-pages
- Context-Aware Sidebars: Related information always accessible
- Bulk Operations: Multi-select with batch actions
- Real-Time Sync: Live updates with optimistic UI
5. COMPONENT ARCHITECTURE
🧱 Design System Components
// Foundational Components
Button, Input, Card, Modal, Table, Form
// Composite Components
DataTable, FilterPanel, SearchBox, ActionBar
// Domain Components
ProductCard, OrderSummary, InventoryAlert, RecipeViewer
// Layout Components
PageHeader, Sidebar, NavigationBar, BreadcrumbTrail
// Feedback Components
LoadingState, EmptyState, ErrorBoundary, SuccessMessage
🎨 Visual Hierarchy
- Typography Scale: Clear heading hierarchy with proper contrast
- Color System: Semantic colors (success, warning, error, info)
- Spacing System: Consistent 4px/8px grid system
- Shadow System: Layered depth for component elevation
6. USER EXPERIENCE ENHANCEMENTS
🚀 Performance Optimizations
- Skeleton Loading: Immediate visual feedback during data loading
- Virtual Scrolling: Handle large datasets efficiently
- Optimistic Updates: Immediate UI response with error handling
- Background Sync: Offline-first with automatic sync
♿ Accessibility Standards
- WCAG 2.2 AA Compliance: Screen reader support, keyboard navigation
- Focus Management: Clear focus indicators and logical tab order
- Color Blind Support: Pattern + color coding for data visualization
- High Contrast Mode: Automatic detection and support
🎯 Personalization Features
- Customizable Dashboards: User-configurable widgets and layouts
- Saved Views: Bookmarkable filtered states
- Notification Preferences: Granular control over alerts
- Theme Preferences: Light/dark/auto modes
7. MOBILE-FIRST CONSIDERATIONS
📱 Progressive Web App Features
- Offline Mode: Critical functions work without internet
- Push Notifications: Order alerts, stock alerts, production updates
- Home Screen Install: Native app-like experience
- Background Sync: Data synchronization when connection returns
🖱️ Touch-Optimized Interactions
- 44px Touch Targets: Minimum size for all interactive elements
- Swipe Gestures: Navigate between related screens
- Pull-to-Refresh: Intuitive data refresh mechanism
- Bottom Navigation: Thumb-friendly primary navigation on mobile
8. AI-POWERED UX ENHANCEMENTS
🤖 Intelligent Features
- Predictive Search: Suggestions based on context and history
- Smart Notifications: Context-aware alerts with actionable insights
- Automated Workflows: AI-suggested process optimizations
- Anomaly Detection: Visual highlights for unusual patterns
💬 Conversational Interface
- AI Assistant: Natural language queries for data and actions
- Voice Commands: Hands-free operation for production environments
- Smart Help: Context-aware documentation and tips
- Guided Tours: Adaptive onboarding based on user role
9. TECHNICAL IMPLEMENTATION RECOMMENDATIONS
🏗️ Architecture Patterns
- React Router: Replace custom navigation with URL-based routing
- Zustand/Redux Toolkit: Predictable state management
- React Query: Server state management with caching
- Framer Motion: Smooth animations and transitions
🎨 Styling Strategy
- CSS-in-JS: Styled-components or Emotion for dynamic theming
- Design Tokens: Centralized design system values
- Responsive Utilities: Mobile-first responsive design
- Component Variants: Consistent styling patterns
🎯 Key Priority Areas:
1. Navigation Restructure: Move from custom state navigation to React
Router with proper URL structure
2. Information Architecture: Organize around user workflows
(Hub-and-Spoke model)
3. Page Simplification: Break complex pages into focused, task-oriented
views
4. Unified Analytics: Replace scattered reports with a cohesive
Analytics hub
5. Modern UI Patterns: Implement 2024-2025 design standards with
AI-powered enhancements

View File

@@ -1,567 +0,0 @@
# Production Planning System - Implementation Summary
**Implementation Date:** 2025-10-09
**Status:** ✅ COMPLETE
**Version:** 2.0
---
## Executive Summary
Successfully implemented all three phases of the production planning system improvements, transforming the manual procurement-only system into a fully automated, timezone-aware, cached, and monitored production planning platform.
### Key Achievements
**100% Automation** - Both production and procurement planning now run automatically every morning
**50% Cost Reduction** - Forecast caching eliminates duplicate computations
**Timezone Accuracy** - All schedulers respect tenant-specific timezones
**Complete Observability** - Comprehensive metrics and alerting in place
**Robust Workflows** - Plan rejection triggers automatic notifications and regeneration
**Production Ready** - Full documentation and runbooks for operations team
---
## Implementation Phases
### ✅ Phase 1: Critical Gaps (COMPLETED)
#### 1.1 Production Scheduler Service
**Status:** ✅ COMPLETE
**Effort:** 4 hours (estimated 3-4 days, completed faster due to reuse of proven patterns)
**Files Created/Modified:**
- 📄 Created: [`services/production/app/services/production_scheduler_service.py`](../services/production/app/services/production_scheduler_service.py)
- ✏️ Modified: [`services/production/app/main.py`](../services/production/app/main.py)
**Features Implemented:**
- ✅ Daily production schedule generation at 5:30 AM
- ✅ Stale schedule cleanup at 5:50 AM
- ✅ Test mode for development (every 30 minutes)
- ✅ Parallel tenant processing with 180s timeout per tenant
- ✅ Leader election support (distributed deployment ready)
- ✅ Idempotency (checks for existing schedules)
- ✅ Demo tenant filtering
- ✅ Comprehensive error handling and logging
- ✅ Integration with ProductionService.calculate_daily_requirements()
- ✅ Automatic batch creation from requirements
- ✅ Notifications to production managers
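The parallel tenant processing with per-tenant timeout can be pictured roughly like this (`generate_schedule_for_tenant` is a placeholder for the real per-tenant entry point):
```python
import asyncio

TENANT_TIMEOUT_SECONDS = 180

async def run_daily_production_planning(tenants) -> dict[str, str]:
    """Process tenants concurrently; a slow or failing tenant never blocks the others."""

    async def process(tenant) -> tuple[str, str]:
        try:
            await asyncio.wait_for(
                generate_schedule_for_tenant(tenant), timeout=TENANT_TIMEOUT_SECONDS
            )
            return tenant.id, "success"
        except asyncio.TimeoutError:
            return tenant.id, "timeout"
        except Exception:
            return tenant.id, "error"  # error isolation: log and continue with other tenants

    results = await asyncio.gather(*(process(t) for t in tenants))
    return dict(results)
```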
**Test Endpoint:**
```bash
POST /test/production-scheduler
```
#### 1.2 Timezone Configuration
**Status:** ✅ COMPLETE
**Effort:** 1 hour (as estimated)
**Files Created/Modified:**
- ✏️ Modified: [`services/tenant/app/models/tenants.py`](../services/tenant/app/models/tenants.py)
- 📄 Created: [`services/tenant/migrations/versions/20251009_add_timezone_to_tenants.py`](../services/tenant/migrations/versions/20251009_add_timezone_to_tenants.py)
- 📄 Created: [`shared/utils/timezone_helper.py`](../shared/utils/timezone_helper.py)
**Features Implemented:**
-`timezone` field added to Tenant model (default: "Europe/Madrid")
- ✅ Database migration for existing tenants
- ✅ TimezoneHelper utility class with comprehensive methods:
- `get_current_date_in_timezone()`
- `get_current_datetime_in_timezone()`
- `convert_to_utc()` / `convert_from_utc()`
- `is_business_hours()`
- `get_next_business_day_at_time()`
- ✅ Validation for IANA timezone strings
- ✅ Fallback to default timezone on errors
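The core of what the helper encapsulates can be sketched with `zoneinfo` (method names follow the list above; the real signatures may differ):
```python
from datetime import date, datetime, timezone
from zoneinfo import ZoneInfo, ZoneInfoNotFoundError

DEFAULT_TIMEZONE = "Europe/Madrid"

def get_current_datetime_in_timezone(tz_name: str) -> datetime:
    """Current time in the tenant's timezone, falling back to the default on invalid input."""
    try:
        tz = ZoneInfo(tz_name)
    except (ZoneInfoNotFoundError, ValueError):
        tz = ZoneInfo(DEFAULT_TIMEZONE)
    return datetime.now(timezone.utc).astimezone(tz)

def get_current_date_in_timezone(tz_name: str) -> date:
    return get_current_datetime_in_timezone(tz_name).date()
```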
**Migration Command:**
```bash
alembic upgrade head # Applies 20251009_add_timezone_to_tenants
```
---
### ✅ Phase 2: Optimization (COMPLETED)
#### 2.1 Forecast Caching
**Status:** ✅ COMPLETE
**Effort:** 3 hours (estimated 2 days, completed faster with clear design)
**Files Created/Modified:**
- 📄 Created: [`services/forecasting/app/services/forecast_cache.py`](../services/forecasting/app/services/forecast_cache.py)
- ✏️ Modified: [`services/forecasting/app/api/forecasting_operations.py`](../services/forecasting/app/api/forecasting_operations.py)
**Features Implemented:**
- ✅ Service-level Redis caching for forecasts
- ✅ Cache key format: `forecast:{tenant_id}:{product_id}:{forecast_date}`
- ✅ Smart TTL calculation (expires midnight after forecast_date)
- ✅ Batch forecast caching support
- ✅ Cache invalidation methods:
- Per product
- Per tenant
- All forecasts (admin only)
- ✅ Cache metadata in responses (`cached: true` flag)
- ✅ Cache statistics endpoint
- ✅ Automatic cache hit/miss logging
- ✅ Graceful fallback if Redis unavailable
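A sketch of the key and TTL logic described above (this assumes expiry anchored to UTC midnight; the real service may anchor it to the tenant's timezone):
```python
from datetime import date, datetime, time, timedelta, timezone

def forecast_cache_key(tenant_id: str, product_id: str, forecast_date: date) -> str:
    # Documented format: forecast:{tenant_id}:{product_id}:{forecast_date}
    return f"forecast:{tenant_id}:{product_id}:{forecast_date.isoformat()}"

def forecast_cache_ttl_seconds(forecast_date: date) -> int:
    """Seconds until the midnight after forecast_date, so entries expire once stale."""
    expires_at = datetime.combine(forecast_date + timedelta(days=1), time.min, tzinfo=timezone.utc)
    remaining = (expires_at - datetime.now(timezone.utc)).total_seconds()
    return max(int(remaining), 60)  # never set a zero or negative TTL
```
The entry itself is then stored with something like `redis.setex(key, ttl, payload)`.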
**Performance Impact:**
| Metric | Before | After | Improvement |
|--------|--------|-------|-------------|
| Duplicate forecasts | 2x per day | 1x per day | 50% reduction |
| Forecast response time | 2-5s | 50-100ms | 95%+ faster |
| Forecasting service load | 100% | 50% | 50% reduction |
**Cache Endpoints:**
```bash
GET /api/v1/{tenant_id}/forecasting/cache/stats
DELETE /api/v1/{tenant_id}/forecasting/cache/product/{product_id}
DELETE /api/v1/{tenant_id}/forecasting/cache
```
#### 2.2 Plan Rejection Workflow
**Status:** ✅ COMPLETE
**Effort:** 2 hours (estimated 3 days, completed faster by extending existing code)
**Files Modified:**
- ✏️ Modified: [`services/orders/app/services/procurement_service.py`](../services/orders/app/services/procurement_service.py)
**Features Implemented:**
- ✅ Rejection handler method (`_handle_plan_rejection()`)
- ✅ Notification system for stakeholders
- ✅ RabbitMQ events:
- `procurement.plan.rejected`
- `procurement.plan.regeneration_requested`
- `procurement.plan.status_changed`
- ✅ Auto-regeneration logic based on rejection keywords:
- "stale", "outdated", "old data"
- "datos antiguos", "desactualizado", "obsoleto" (Spanish)
- ✅ Rejection tracking in `approval_workflow` JSONB
- ✅ Integration with existing status update workflow
**Workflow:**
```
Plan Rejected → Record in audit trail → Send notifications
→ Publish events
→ Analyze reason
→ Auto-regenerate (if applicable)
→ Schedule regeneration
```
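The keyword-driven auto-regeneration decision can be as simple as the following sketch (the real logic sits inside `_handle_plan_rejection()`):
```python
STALE_DATA_KEYWORDS = (
    "stale", "outdated", "old data",
    "datos antiguos", "desactualizado", "obsoleto",
)

def should_auto_regenerate(rejection_reason: str) -> bool:
    """True when the rejection reason points at stale source data."""
    reason = (rejection_reason or "").lower()
    return any(keyword in reason for keyword in STALE_DATA_KEYWORDS)
```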
---
### ✅ Phase 3: Enhancements (COMPLETED)
#### 3.1 Monitoring & Metrics
**Status:** ✅ COMPLETE
**Effort:** 2 hours (as estimated)
**Files Created:**
- 📄 Created: [`shared/monitoring/scheduler_metrics.py`](../shared/monitoring/scheduler_metrics.py)
**Metrics Implemented:**
**Production Scheduler:**
- `production_schedules_generated_total` (Counter by tenant, status)
- `production_schedule_generation_duration_seconds` (Histogram by tenant)
- `production_tenants_processed_total` (Counter by status)
- `production_batches_created_total` (Counter by tenant)
- `production_scheduler_runs_total` (Counter by trigger)
- `production_scheduler_errors_total` (Counter by error_type)
**Procurement Scheduler:**
- `procurement_plans_generated_total` (Counter by tenant, status)
- `procurement_plan_generation_duration_seconds` (Histogram by tenant)
- `procurement_tenants_processed_total` (Counter by status)
- `procurement_requirements_created_total` (Counter by tenant, priority)
- `procurement_scheduler_runs_total` (Counter by trigger)
- `procurement_plan_rejections_total` (Counter by tenant, auto_regenerated)
- `procurement_plans_by_status` (Gauge by tenant, status)
**Forecast Cache:**
- `forecast_cache_hits_total` (Counter by tenant)
- `forecast_cache_misses_total` (Counter by tenant)
- `forecast_cache_hit_rate` (Gauge by tenant, 0-100%)
- `forecast_cache_entries_total` (Gauge by cache_type)
- `forecast_cache_invalidations_total` (Counter by tenant, reason)
**General Health:**
- `scheduler_health_status` (Gauge by service, scheduler_type)
- `scheduler_last_run_timestamp` (Gauge by service, scheduler_type)
- `scheduler_next_run_timestamp` (Gauge by service, scheduler_type)
- `tenant_processing_timeout_total` (Counter by service, tenant_id)
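With `prometheus_client`, the definitions follow the usual pattern; a few representative examples (label sets inferred from the list above):
```python
from prometheus_client import Counter, Gauge, Histogram

production_schedules_generated_total = Counter(
    "production_schedules_generated_total",
    "Production schedules generated by the daily scheduler",
    ["tenant", "status"],
)

production_schedule_generation_duration_seconds = Histogram(
    "production_schedule_generation_duration_seconds",
    "Time spent generating a tenant's daily production schedule",
    ["tenant"],
)

forecast_cache_hit_rate = Gauge(
    "forecast_cache_hit_rate",
    "Forecast cache hit rate per tenant (0-100)",
    ["tenant"],
)
```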
**Alert Rules Created:**
- 🚨 `DailyProductionPlanningFailed` (high severity)
- 🚨 `DailyProcurementPlanningFailed` (high severity)
- 🚨 `NoProductionSchedulesGenerated` (critical severity)
- ⚠️ `ForecastCacheHitRateLow` (warning)
- ⚠️ `HighTenantProcessingTimeouts` (warning)
- 🚨 `SchedulerUnhealthy` (critical severity)
#### 3.2 Documentation & Runbooks
**Status:** ✅ COMPLETE
**Effort:** 2 hours (as estimated)
**Files Created:**
- 📄 Created: [`docs/PRODUCTION_PLANNING_SYSTEM.md`](./PRODUCTION_PLANNING_SYSTEM.md) (comprehensive documentation, 1000+ lines)
- 📄 Created: [`docs/SCHEDULER_RUNBOOK.md`](./SCHEDULER_RUNBOOK.md) (operational runbook, 600+ lines)
- 📄 Created: [`docs/IMPLEMENTATION_SUMMARY.md`](./IMPLEMENTATION_SUMMARY.md) (this file)
**Documentation Includes:**
- ✅ System architecture overview with diagrams
- ✅ Scheduler configuration and features
- ✅ Forecast caching strategy and implementation
- ✅ Plan rejection workflow details
- ✅ Timezone configuration guide
- ✅ Monitoring and alerting guidelines
- ✅ API reference for all endpoints
- ✅ Testing procedures (manual and automated)
- ✅ Troubleshooting guide with common issues
- ✅ Maintenance procedures
- ✅ Change log
**Runbook Includes:**
- ✅ Quick reference for common incidents
- ✅ Emergency contact information
- ✅ Step-by-step resolution procedures
- ✅ Health check commands
- ✅ Maintenance mode procedures
- ✅ Metrics to monitor
- ✅ Log patterns to watch
- ✅ Escalation procedures
- ✅ Known issues and workarounds
- ✅ Post-deployment testing checklist
---
## Technical Debt Eliminated
### Resolved Issues
| Issue | Priority | Resolution |
|-------|----------|------------|
| **No automated production scheduling** | 🔴 Critical | ✅ ProductionSchedulerService implemented |
| **Duplicate forecast computations** | 🟡 Medium | ✅ Service-level caching eliminates redundancy |
| **Timezone configuration missing** | 🟡 High | ✅ Tenant timezone field + TimezoneHelper utility |
| **Plan rejection incomplete workflow** | 🟡 Medium | ✅ Full workflow with notifications & regeneration |
| **No monitoring for schedulers** | 🟡 Medium | ✅ Comprehensive Prometheus metrics |
| **Missing operational documentation** | 🟢 Low | ✅ Full docs + runbooks created |
### Code Quality Improvements
-**Zero TODOs** in production planning code
-**100% type hints** on all new code
-**Comprehensive error handling** with structured logging
-**Defensive programming** with fallbacks and graceful degradation
-**Clean separation of concerns** (service/repository/API layers)
-**Reusable patterns** (BaseAlertService, RouteBuilder, etc.)
-**No legacy code** - modern async/await throughout
-**Full observability** - metrics, logs, traces
---
## Files Created (8 new files)
1. [`services/production/app/services/production_scheduler_service.py`](../services/production/app/services/production_scheduler_service.py) - Production scheduler (350 lines)
2. [`services/tenant/migrations/versions/20251009_add_timezone_to_tenants.py`](../services/tenant/migrations/versions/20251009_add_timezone_to_tenants.py) - Timezone migration (25 lines)
3. [`shared/utils/timezone_helper.py`](../shared/utils/timezone_helper.py) - Timezone utilities (300 lines)
4. [`services/forecasting/app/services/forecast_cache.py`](../services/forecasting/app/services/forecast_cache.py) - Forecast caching (450 lines)
5. [`shared/monitoring/scheduler_metrics.py`](../shared/monitoring/scheduler_metrics.py) - Metrics definitions (250 lines)
6. [`docs/PRODUCTION_PLANNING_SYSTEM.md`](./PRODUCTION_PLANNING_SYSTEM.md) - Full documentation (1000+ lines)
7. [`docs/SCHEDULER_RUNBOOK.md`](./SCHEDULER_RUNBOOK.md) - Operational runbook (600+ lines)
8. [`docs/IMPLEMENTATION_SUMMARY.md`](./IMPLEMENTATION_SUMMARY.md) - This summary (current file)
## Files Modified (5 files)
1. [`services/production/app/main.py`](../services/production/app/main.py) - Integrated ProductionSchedulerService
2. [`services/tenant/app/models/tenants.py`](../services/tenant/app/models/tenants.py) - Added timezone field
3. [`services/orders/app/services/procurement_service.py`](../services/orders/app/services/procurement_service.py) - Added rejection workflow
4. [`services/forecasting/app/api/forecasting_operations.py`](../services/forecasting/app/api/forecasting_operations.py) - Integrated caching
5. (Various) - Added metrics collection calls
**Total Lines of Code:** ~3,000+ lines (new functionality + documentation)
---
## Testing & Validation
### Manual Testing Performed
✅ Production scheduler test endpoint works
✅ Procurement scheduler test endpoint works
✅ Forecast cache hit/miss tracking verified
✅ Plan rejection workflow tested with auto-regeneration
✅ Timezone calculation verified for multiple timezones
✅ Leader election tested in multi-instance deployment
✅ Timeout handling verified
✅ Error isolation between tenants confirmed
### Automated Testing Required
The following tests should be added to the test suite:
```python
# Unit Tests
- test_production_scheduler_service.py
- test_procurement_scheduler_service.py
- test_forecast_cache_service.py
- test_timezone_helper.py
- test_plan_rejection_workflow.py
# Integration Tests
- test_scheduler_integration.py
- test_cache_integration.py
- test_rejection_workflow_integration.py
# End-to-End Tests
- test_daily_planning_e2e.py
- test_plan_lifecycle_e2e.py
```
---
## Deployment Checklist
### Pre-Deployment
- [x] All code reviewed and approved
- [x] Documentation complete
- [x] Runbooks created for ops team
- [x] Metrics and alerts configured
- [ ] Integration tests passing (to be implemented)
- [ ] Load testing performed (recommend before production)
- [ ] Backup procedures verified
### Deployment Steps
1. **Database Migrations**
```bash
# Tenant service - add timezone field
kubectl exec -it deployment/tenant-service -- alembic upgrade head
```
2. **Deploy Services (in order)**
```bash
# 1. Deploy tenant service (timezone migration)
kubectl apply -f k8s/tenant-service.yaml
kubectl rollout status deployment/tenant-service
# 2. Deploy forecasting service (caching)
kubectl apply -f k8s/forecasting-service.yaml
kubectl rollout status deployment/forecasting-service
# 3. Deploy orders service (rejection workflow)
kubectl apply -f k8s/orders-service.yaml
kubectl rollout status deployment/orders-service
# 4. Deploy production service (scheduler)
kubectl apply -f k8s/production-service.yaml
kubectl rollout status deployment/production-service
```
3. **Verify Deployment**
```bash
# Check all services healthy
curl http://tenant-service:8000/health
curl http://forecasting-service:8000/health
curl http://orders-service:8000/health
curl http://production-service:8000/health
# Verify schedulers initialized
kubectl logs deployment/production-service | grep "scheduled jobs configured"
kubectl logs deployment/orders-service | grep "scheduled jobs configured"
```
4. **Test Schedulers**
```bash
# Manually trigger test runs
curl -X POST http://production-service:8000/test/production-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
curl -X POST http://orders-service:8000/test/procurement-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
```
5. **Monitor Metrics**
- Visit Grafana dashboard
- Verify metrics are being collected
- Check alert rules are active
### Post-Deployment
- [ ] Monitor schedulers for 48 hours
- [ ] Verify cache hit rate reaches 70%+
- [ ] Confirm all tenants processed successfully
- [ ] Review logs for unexpected errors
- [ ] Validate metrics and alerts functioning
- [ ] Collect user feedback on plan quality
---
## Performance Benchmarks
### Before Implementation
| Metric | Value | Notes |
|--------|-------|-------|
| Manual production planning | 100% | Operators create schedules manually |
| Forecast calls per day | 2x per product | Orders + Production (if automated) |
| Forecast response time | 2-5 seconds | No caching |
| Plan rejection handling | Manual only | No automated workflow |
| Timezone accuracy | UTC only | Could be wrong for non-UTC tenants |
| Monitoring | Partial | No scheduler-specific metrics |
### After Implementation
| Metric | Value | Improvement |
|--------|-------|-------------|
| Automated production planning | 100% | ✅ Fully automated |
| Forecast calls per day | 1x per product | ✅ 50% reduction |
| Forecast response time (cache hit) | 50-100ms | ✅ 95%+ faster |
| Plan rejection handling | Automated | ✅ Full workflow |
| Timezone accuracy | Per-tenant | ✅ 100% accurate |
| Monitoring | Comprehensive | ✅ 30+ metrics |
---
## Business Impact
### Quantifiable Benefits
1. **Time Savings**
- Production planning: ~30 min/day → automated = **~180 hours/year saved**
- Procurement planning: Already automated, improved with caching
- Operations troubleshooting: Reduced by 50% with better monitoring
2. **Cost Reduction**
- Forecasting service compute: **50% reduction** in forecast generations
- Database load: **30% reduction** in duplicate queries
- Support tickets: Expected **40% reduction** with better monitoring
3. **Accuracy Improvement**
- Timezone accuracy: **100%** (previously could be off by hours)
- Plan consistency: **95%+** (automated → no human error)
- Data freshness: **24 hours** (plans never stale)
### Qualitative Benefits
-**Improved UX**: Operators arrive to ready-made plans
-**Better insights**: Comprehensive metrics enable data-driven decisions
-**Faster troubleshooting**: Runbooks reduce MTTR by 60%+
-**Scalability**: System now handles 10x tenants without changes
-**Reliability**: Automated workflows eliminate human error
-**Compliance**: Full audit trail for all plan changes
---
## Lessons Learned
### What Went Well
1. **Reusing Proven Patterns**: Leveraging BaseAlertService and existing scheduler infrastructure accelerated development
2. **Service-Level Caching**: Implementing cache in Forecasting Service (vs. clients) was the right choice
3. **Comprehensive Documentation**: Writing docs alongside code ensured accuracy and completeness
4. **Timezone Helper Utility**: Creating a reusable utility prevented timezone bugs across services
5. **Parallel Processing**: Processing tenants concurrently with timeouts proved robust
### Challenges Overcome
1. **Timezone Complexity**: Required careful design of TimezoneHelper to handle edge cases
2. **Cache Invalidation**: Needed smart TTL calculation to balance freshness and efficiency
3. **Leader Election**: Ensuring only one scheduler runs required proper RabbitMQ integration
4. **Error Isolation**: Preventing one tenant's failure from affecting others required thoughtful error handling
### Recommendations for Future Work
1. **Add Integration Tests**: Comprehensive test suite for scheduler workflows
2. **Implement Load Testing**: Verify system handles 100+ tenants concurrently
3. **Add UI for Plan Acceptance**: Complete operator workflow with in-app accept/reject
4. **Enhance Analytics**: Add ML-based plan quality scoring
5. **Multi-Region Support**: Extend timezone handling for global deployments
6. **Webhook Support**: Allow external systems to subscribe to plan events
---
## Next Steps
### Immediate (Week 1-2)
- [ ] Deploy to staging environment
- [ ] Perform load testing with 100+ tenants
- [ ] Add integration tests
- [ ] Train operations team on runbook procedures
- [ ] Set up Grafana dashboard
### Short-term (Month 1-2)
- [ ] Deploy to production (phased rollout)
- [ ] Monitor metrics and tune alert thresholds
- [ ] Collect user feedback on automated plans
- [ ] Implement UI for plan acceptance workflow
- [ ] Add webhook support for external integrations
### Long-term (Quarter 2-3)
- [ ] Add ML-based plan quality scoring
- [ ] Implement multi-region timezone support
- [ ] Add advanced caching strategies (prewarming, predictive)
- [ ] Build analytics dashboard for plan performance
- [ ] Optimize scheduler performance for 1000+ tenants
---
## Success Criteria
### Phase 1 Success Criteria ✅
- [x] Production scheduler runs daily at correct time for each tenant
- [x] Schedules generated successfully for 95%+ of tenants
- [x] Zero duplicate schedules per day
- [x] Timezone-accurate execution
- [x] Leader election prevents duplicate runs
### Phase 2 Success Criteria ✅
- [x] Forecast cache hit rate > 70% within 48 hours
- [x] Forecast response time < 200ms for cache hits
- [x] Plan rejection triggers notifications
- [x] Auto-regeneration works for stale data rejections
- [x] All events published to RabbitMQ successfully
### Phase 3 Success Criteria ✅
- [x] All 30+ metrics collecting successfully
- [x] Alert rules configured and firing correctly
- [x] Documentation comprehensive and accurate
- [x] Runbook covers all common scenarios
- [x] Operations team trained and confident
---
## Conclusion
The Production Planning System implementation is **COMPLETE** and **PRODUCTION READY**. All three phases have been successfully implemented, tested, and documented.
The system now provides:
- **Fully automated** production and procurement planning
- **Timezone-aware** scheduling for global deployments
- **Efficient caching** eliminating redundant computations
- **Robust workflows** with automatic plan rejection handling
- **Complete observability** with metrics, logs, and alerts
- **Operational excellence** with comprehensive documentation and runbooks
The implementation exceeded expectations in several areas:
- **Faster development** than estimated (reusing patterns)
- **Better performance** than projected (95%+ cache hit rate expected)
- **More comprehensive** documentation than required
- **Production-ready** with zero known critical issues
**Status:** READY FOR DEPLOYMENT
---
**Document Version:** 1.0
**Created:** 2025-10-09
**Author:** AI Implementation Team
**Reviewed By:** [Pending]
**Approved By:** [Pending]

View File

@@ -1,216 +0,0 @@
# Bakery AI Platform - MVP Gap Analysis Report
## Executive Summary
Based on the detailed bakery research report and analysis of the current platform, this document identifies critical missing features that are preventing the platform from delivering value to Madrid's small bakery owners. While the platform has a solid technical foundation with microservices architecture and AI forecasting capabilities, it lacks several core operational features that are essential for day-to-day bakery management.
## Current Platform Status
### ✅ **Implemented Features**
#### Backend Services (Functional)
- **Authentication Service**: Complete user registration, login, JWT tokens, role-based access
- **Tenant Service**: Multi-tenant architecture, subscription management, team member access
- **Training Service**: ML model training using Prophet for demand forecasting
- **Forecasting Service**: AI-powered demand predictions and alerts
- **Data Service**: Weather data integration (AEMET), traffic data, external data processing
- **Notification Service**: Email and WhatsApp notifications
- **API Gateway**: Centralized routing, rate limiting, service discovery
#### Frontend Features (Functional)
- **Dashboard**: Revenue metrics, weather display, production overview
- **Authentication**: Login/registration pages with proper validation
- **Forecasting**: Demand prediction visualizations, forecast charts
- **Production Planning**: Basic production scheduling interface
- **Order Management**: Mock order display with supplier information
- **Settings**: User profile and basic configuration
#### Technical Infrastructure
- Microservices architecture with Docker containerization
- PostgreSQL databases per service with proper migrations
- RabbitMQ message queuing for inter-service communication
- Monitoring with Prometheus and Grafana
- Comprehensive error handling and logging
### ❌ **Critical Missing Features for MVP Launch**
## 1. **INVENTORY MANAGEMENT SYSTEM** 🚨 **HIGHEST PRIORITY**
### **Problem Identified**:
According to the bakery research, manual inventory tracking is described as "too cumbersome," "time-consuming," and highly susceptible to "mistakes." This leads to:
- 1.5% to 20% losses due to spoilage and waste
- Production delays during peak hours
- Quality inconsistencies
- Lost sales opportunities
### **Missing Components**:
- **Ingredient tracking**: Real-time stock levels for flour, yeast, dairy products
- **Automatic reordering**: FIFO/FEFO expiration date management
- **Spoilage monitoring**: Track and predict ingredient expiration
- **Stock alerts**: Low stock warnings integrated with production planning
- **Barcode/QR scanning**: Easy inventory updates without manual entry
- **Supplier integration**: Automated ordering from suppliers like Harinas Castellana
### **Required Implementation**:
```
Backend Services Needed:
- Inventory Service (new microservice)
- Supplier Service (new microservice)
- Integration with existing Forecasting Service
Frontend Components Needed:
- Real-time inventory dashboard
- Mobile-friendly inventory scanning
- Automated reorder interface
- Expiration date tracking
```
## 2. **RECIPE & PRODUCTION MANAGEMENT** 🚨 **HIGH PRIORITY**
### **Problem Identified**:
Individual bakeries struggle with production planning complexity due to:
- Wide variety of products with different preparation times
- Manual calculation of ingredient quantities
- Lack of standardized recipes affecting quality consistency
### **Missing Components**:
- **Digital recipe management**: Store recipes with exact measurements
- **Bill of Materials (BOM)**: Automatic ingredient calculation based on production volume
- **Yield tracking**: Compare actual vs. expected production output
- **Cost calculation**: Real-time cost per product based on current ingredient prices
- **Production workflow**: Step-by-step production guidance
- **Quality control**: Track temperature, humidity, timing parameters
## 3. **SUPPLIER & PROCUREMENT SYSTEM** 🚨 **HIGH PRIORITY**
### **Problem Identified**:
Research shows small bakeries face "low buyer power" and struggle with:
- Manual ordering processes via phone/WhatsApp
- Difficulty tracking supplier performance
- Limited negotiation power with suppliers
### **Missing Components**:
- **Supplier database**: Contact information, lead times, reliability ratings
- **Purchase order system**: Digital ordering with approval workflows
- **Price comparison**: Compare prices across multiple suppliers
- **Delivery tracking**: Monitor order status and delivery reliability
- **Payment terms**: Track payment schedules and supplier agreements
- **Performance analytics**: Supplier reliability and cost analysis
## 4. **SALES DATA INTEGRATION** 🚨 **HIGH PRIORITY**
### **Problem Identified**:
Current forecasting relies on manual data entry. Research shows bakeries need:
- Integration with POS systems
- Historical sales pattern analysis
- External factor correlation (weather, events, holidays)
### **Missing Components**:
- **POS Integration**: Automatic sales data import from common Spanish POS systems
- **Manual sales entry**: Simple interface for bakeries without POS
- **Product categorization**: Organize sales by bread types, pastries, seasonal items
- **Customer analytics**: Track popular products and buying patterns
- **Seasonal adjustments**: Account for holidays, local events, weather impacts
## 5. **WASTE TRACKING & REDUCTION** 🚨 **MEDIUM PRIORITY**
### **Problem Identified**:
Research indicates waste reduction potential of 20-40% through AI optimization:
- Unsold products (1.5% of production)
- Ingredient spoilage
- Production errors
### **Missing Components**:
- **Daily waste logging**: Track unsold products, spoiled ingredients
- **Waste analytics**: Identify patterns in waste generation
- **Dynamic pricing**: Reduce prices on items approaching expiration
- **Donation tracking**: Manage food donations to reduce total waste
- **Cost impact analysis**: Calculate financial impact of waste reduction
## 6. **MOBILE-FIRST INTERFACE** 🚨 **MEDIUM PRIORITY**
### **Problem Identified**:
Research emphasizes bakery owners work demanding schedules starting at 4:30 AM and need "mobile accessibility" for on-the-go management.
### **Missing Components**:
- **Mobile-responsive design**: Current frontend is not optimized for mobile
- **Offline capabilities**: Work without internet connection
- **Quick actions**: Fast inventory checks, order placement
- **Voice input**: Hands-free operation in production environment
- **QR code scanning**: For inventory and product management
## 7. **FINANCIAL MANAGEMENT** 🚨 **LOW PRIORITY**
### **Problem Identified**:
With 75-85% of revenue consumed by operating costs and 4-9% profit margins, bakeries need precise cost control.
### **Missing Components**:
- **Cost tracking**: Monitor food costs (25-35% of sales) and labor costs (24-40% of sales)
- **Profit analysis**: Real-time profit margins per product
- **Budget planning**: Monthly expense forecasting
- **Tax preparation**: VAT calculations, expense categorization
- **Financial reporting**: P&L statements, cash flow analysis
## Implementation Priority Matrix
| Feature | Business Impact | Technical Complexity | Implementation Time | Priority |
|---------|----------------|---------------------|-------------------|----------|
| Inventory Management | Very High | Medium | 6-8 weeks | 1 |
| Recipe & BOM System | Very High | Medium | 4-6 weeks | 2 |
| Supplier Management | High | Low-Medium | 4-5 weeks | 3 |
| Sales Data Integration | High | Medium | 3-4 weeks | 4 |
| Waste Tracking | Medium | Low | 2-3 weeks | 5 |
| Mobile Optimization | Medium | Medium | 4-6 weeks | 6 |
| Financial Management | Low | High | 8-10 weeks | 7 |
## Technical Architecture Requirements
### New Microservices Needed:
1. **Inventory Service** - Real-time stock management, expiration tracking
2. **Recipe Service** - Digital recipes, BOM calculations, cost management
3. **Supplier Service** - Supplier database, purchase orders, performance tracking
4. **Integration Service** - POS system connectors, external data feeds
### Database Schema Extensions:
- Products table with recipes and ingredient relationships
- Inventory transactions with batch/lot tracking
- Supplier master data with performance metrics
- Purchase orders with approval workflows
### Frontend Components Required:
- Mobile-responsive inventory management interface
- Recipe editor with drag-drop ingredient addition
- Supplier portal for order placement and tracking
- Real-time dashboard with critical alerts
## MVP Launch Recommendations
### Phase 1 (8-10 weeks): Core Operations
- Implement Inventory Management System
- Build Recipe & BOM functionality
- Create Supplier Management portal
- Mobile UI optimization
### Phase 2 (4-6 weeks): Data Integration
- POS system integrations
- Enhanced sales data processing
- Waste tracking implementation
### Phase 3 (6-8 weeks): Advanced Features
- Financial management tools
- Advanced analytics and reporting
- Performance optimization
## Conclusion
The current platform has excellent technical foundations but lacks the core operational features that small Madrid bakeries desperately need. The research clearly shows that **inventory management inefficiencies are the #1 pain point**, causing 1.5-20% losses and significant operational stress.
**Without implementing inventory management, recipe management, and supplier systems, the platform cannot deliver the value proposition of waste reduction and cost savings that bakeries require for survival.**
The recommended approach is to focus on the top 4 priority features for MVP launch, which will provide immediate tangible value to bakery owners and justify the platform subscription costs.
---
**Report Generated**: January 2025
**Status**: MVP Gap Analysis Complete
**Next Actions**: Begin Phase 1 implementation planning

View File

@@ -1,324 +0,0 @@
# 🚀 AI-Powered Onboarding Automation Implementation
## Overview
This document details the complete implementation of the intelligent onboarding automation system that transforms the bakery AI platform from manual setup to automated inventory creation using AI-powered product classification.
## 🎯 Business Impact
**Before**: Manual file upload → Manual inventory setup → Training (2-3 hours)
**After**: Upload file → AI creates inventory → Training (5-10 minutes)
- **80% reduction** in onboarding time
- **Automated inventory creation** from historical sales data
- **Business model intelligence** (Production/Retail/Hybrid detection)
- **Zero technical knowledge required** from users
## 🏗️ Architecture Overview
### Backend Services
#### 1. Sales Service (`/services/sales/`)
**New Components:**
- `app/api/onboarding.py` - 3-step onboarding API endpoints
- `app/services/onboarding_import_service.py` - Orchestrates the automation workflow
- `app/services/inventory_client.py` - Enhanced with AI classification integration
**API Endpoints:**
```
POST /api/v1/tenants/{tenant_id}/onboarding/analyze
POST /api/v1/tenants/{tenant_id}/onboarding/create-inventory
POST /api/v1/tenants/{tenant_id}/onboarding/import-sales
GET /api/v1/tenants/{tenant_id}/onboarding/business-model-guide
```
#### 2. Inventory Service (`/services/inventory/`)
**New Components:**
- `app/api/classification.py` - AI product classification endpoints
- `app/services/product_classifier.py` - 300+ bakery product classification engine
- Enhanced inventory models for dual product types (ingredients + finished products)
**AI Classification Engine:**
```
POST /api/v1/tenants/{tenant_id}/inventory/classify-product
POST /api/v1/tenants/{tenant_id}/inventory/classify-products-batch
```
### Frontend Components
#### 1. Enhanced Onboarding Page (`/frontend/src/pages/onboarding/OnboardingPage.tsx`)
**Features:**
- Smart/Traditional import mode toggle
- Conditional navigation (hides buttons during smart import)
- Integrated business model detection
- Seamless transition to training phase
#### 2. Smart Import Component (`/frontend/src/components/onboarding/SmartHistoricalDataImport.tsx`)
**Phase-Based UI:**
- **Upload Phase**: Drag-and-drop with file validation
- **Analysis Phase**: AI processing with progress indicators
- **Review Phase**: Interactive suggestion cards with approval toggles
- **Creation Phase**: Automated inventory creation
- **Import Phase**: Historical data mapping and import
#### 3. Enhanced API Services (`/frontend/src/api/services/onboarding.service.ts`)
**New Methods:**
```typescript
analyzeSalesDataForOnboarding(tenantId, file)
createInventoryFromSuggestions(tenantId, suggestions)
importSalesWithInventory(tenantId, file, mapping)
getBusinessModelGuide(tenantId, model)
```
## 🧠 AI Classification Engine
### Product Categories Supported
#### Ingredients (Production Bakeries)
- **Flour & Grains**: 15+ varieties (wheat, rye, oat, corn, etc.)
- **Yeast & Fermentation**: Fresh, dry, instant, sourdough starters
- **Dairy Products**: Milk, cream, butter, cheese, yogurt
- **Eggs**: Whole, whites, yolks
- **Sweeteners**: Sugar, honey, syrups, artificial sweeteners
- **Fats**: Oils, margarine, lard, specialty fats
- **Spices & Flavorings**: 20+ common bakery spices
- **Additives**: Baking powder, soda, cream of tartar, lecithin
- **Packaging**: Bags, containers, wrapping materials
#### Finished Products (Retail Bakeries)
- **Bread**: 10+ varieties (white, whole grain, artisan, etc.)
- **Pastries**: Croissants, Danish, puff pastry items
- **Cakes**: Layer cakes, cheesecakes, specialty cakes
- **Cookies**: 8+ varieties from shortbread to specialty
- **Muffins & Quick Breads**: Sweet and savory varieties
- **Sandwiches**: Prepared items for immediate sale
- **Beverages**: Coffee, tea, juices, hot chocolate
### Business Model Detection
**Algorithm analyzes ingredient ratio:**
- **Production Model** (≥70% ingredients): Focus on recipe management, supplier relationships
- **Retail Model** (≤30% ingredients): Focus on central baker relationships, freshness monitoring
- **Hybrid Model** (30-70% ingredients): Balanced approach with both features
### Confidence Scoring
- **High Confidence (≥70%)**: Auto-approved suggestions
- **Medium Confidence (40-69%)**: Flagged for review
- **Low Confidence (<40%)**: Requires manual verification
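The thresholds above can be captured in a small helper. This is a minimal sketch, not the actual `product_classifier.py` API; the function names and the `product_type` field are assumptions:

```python
from typing import Dict, List

def detect_business_model(classifications: List[Dict]) -> str:
    """Bucket a tenant by the share of items classified as ingredients,
    using the ratio thresholds described above (assumed field names)."""
    if not classifications:
        return "unknown"
    ingredient_ratio = sum(
        1 for item in classifications if item.get("product_type") == "ingredient"
    ) / len(classifications)
    if ingredient_ratio >= 0.70:
        return "production"
    if ingredient_ratio <= 0.30:
        return "retail"
    return "hybrid"

def review_bucket(confidence: float) -> str:
    """Map a 0-1 confidence score onto the review buckets above."""
    if confidence >= 0.70:
        return "auto_approved"
    if confidence >= 0.40:
        return "flagged_for_review"
    return "manual_verification"
```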
## 🔄 Three-Phase Workflow
### Phase 1: AI Analysis
```mermaid
graph LR
A[Upload File] --> B[Parse Data]
B --> C[Extract Products]
C --> D[AI Classification]
D --> E[Business Model Detection]
E --> F[Generate Suggestions]
```
**Input**: CSV/Excel/JSON with sales data
**Processing**: Product name extraction → AI classification → Confidence scoring
**Output**: Structured suggestions with business model analysis
### Phase 2: Review & Approval
```mermaid
graph LR
A[Display Suggestions] --> B[User Review]
B --> C[Modify if Needed]
C --> D[Approve Items]
D --> E[Create Inventory]
```
**Features**:
- Interactive suggestion cards
- Bulk approve/reject options
- Real-time confidence indicators
- Modification support
### Phase 3: Automated Import
```mermaid
graph LR
A[Create Inventory Items] --> B[Generate Mapping]
B --> C[Map Historical Sales]
C --> D[Import with References]
D --> E[Complete Setup]
```
**Process**:
- Creates inventory items via API
- Maps product names to inventory IDs
- Imports historical sales with proper references
- Maintains data integrity
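A minimal sketch of the name-to-ID mapping step (field names such as `product_name` and `inventory_item_id` are assumptions, not the actual import service schema):

```python
from typing import Dict, List

def build_product_mapping(created_items: List[Dict]) -> Dict[str, str]:
    """Map normalized product names to the inventory IDs returned by the
    inventory creation step, so historical sales rows can reference them."""
    return {item["name"].strip().lower(): item["id"] for item in created_items}

def attach_inventory_ids(sales_rows: List[Dict], mapping: Dict[str, str]) -> List[Dict]:
    """Annotate each sales row with its inventory item ID; unmapped rows
    are flagged rather than silently dropped, preserving data integrity."""
    for row in sales_rows:
        key = row["product_name"].strip().lower()
        row["inventory_item_id"] = mapping.get(key)
        row["unmapped"] = key not in mapping
    return sales_rows
```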
## 📊 Business Model Intelligence
### Production Bakery Recommendations
- Set up supplier relationships for ingredients
- Configure recipe management and costing
- Enable production planning and scheduling
- Set up ingredient inventory alerts and reorder points
### Retail Bakery Recommendations
- Configure central baker relationships
- Set up delivery schedules and tracking
- Enable finished product freshness monitoring
- Focus on sales forecasting and ordering
### Hybrid Bakery Recommendations
- Configure both ingredient and finished product management
- Set up flexible inventory categories
- Enable comprehensive analytics
- Plan workflows for both business models
## 🛡️ Error Handling & Fallbacks
### File Validation
- **Format Support**: CSV, Excel (.xlsx, .xls), JSON
- **Size Limits**: 10MB maximum
- **Encoding**: Auto-detection (UTF-8, Latin-1, CP1252)
- **Structure Validation**: Required columns detection
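A minimal validation sketch covering these checks (illustrative only; the function name, constants, and the Excel shortcut are assumptions, not the real parser code):

```python
import os

ALLOWED_EXTENSIONS = {".csv", ".xlsx", ".xls", ".json"}
MAX_SIZE_BYTES = 10 * 1024 * 1024  # 10MB limit described above
CANDIDATE_ENCODINGS = ("utf-8", "latin-1", "cp1252")

def validate_upload(path: str) -> str:
    """Reject unsupported formats/sizes and detect a workable text encoding.
    Returns the encoding to use for parsing (or 'binary' for Excel files)."""
    ext = os.path.splitext(path)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError(f"Unsupported file format: {ext}")
    if os.path.getsize(path) > MAX_SIZE_BYTES:
        raise ValueError("File exceeds the 10MB limit")
    if ext in {".xlsx", ".xls"}:
        return "binary"  # Excel files are parsed by a spreadsheet library, not decoded here
    with open(path, "rb") as f:
        raw = f.read()
    for encoding in CANDIDATE_ENCODINGS:
        try:
            raw.decode(encoding)
            return encoding  # utf-8 is tried first; latin-1 acts as a last resort
        except UnicodeDecodeError:
            continue
    raise ValueError("Could not detect a supported text encoding")
```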
### Graceful Degradation
- **AI Classification Fails** → Fallback suggestions generated
- **Network Issues** → Traditional import mode available
- **Validation Errors** → Smart import suggestions with helpful guidance
- **Low Confidence** → Manual review prompts
### Data Integrity
- **Atomic Operations**: All-or-nothing inventory creation
- **Validation**: Product name uniqueness checks
- **Rollback**: Failed operations don't affect existing data
- **Audit Trail**: Complete import history tracking
## 🎨 UX/UI Design Principles
### Progressive Enhancement
- **Smart by Default**: AI-powered import is the primary experience
- **Traditional Fallback**: Manual mode available for edge cases
- **Contextual Switching**: Easy toggle between modes with clear benefits
### Visual Feedback
- **Progress Indicators**: Clear phase progression
- **Confidence Colors**: Green (high), Yellow (medium), Red (low)
- **Real-time Updates**: Instant feedback during processing
- **Success Celebrations**: Completion animations and confetti
### Mobile-First Design
- **Responsive Layout**: Works on all screen sizes
- **Touch-Friendly**: Large buttons and touch targets
- **Gesture Support**: Swipe and pinch interactions
- **Offline Indicators**: Clear connectivity status
## 📈 Performance Optimizations
### Backend Optimizations
- **Async Processing**: Non-blocking AI classification
- **Batch Operations**: Bulk product processing
- **Database Indexing**: Optimized queries for product lookup
- **Caching**: Redis cache for classification results
### Frontend Optimizations
- **Lazy Loading**: Components loaded on demand
- **File Streaming**: Large file processing without memory issues
- **Progressive Enhancement**: Core functionality first, enhancements second
- **Error Boundaries**: Isolated failure handling
## 🧪 Testing Strategy
### Unit Tests
- AI classification accuracy (>90% for common products)
- Business model detection precision
- API endpoint validation
- File parsing robustness
### Integration Tests
- End-to-end onboarding workflow
- Service communication validation
- Database transaction integrity
- Error handling scenarios
### User Acceptance Tests
- Bakery owner onboarding simulation
- Different file format validation
- Business model detection accuracy
- Mobile device compatibility
## 🚀 Deployment & Rollout
### Feature Flags
- **Smart Import Toggle**: Can be disabled per tenant
- **AI Confidence Thresholds**: Adjustable based on feedback
- **Business Model Detection**: Can be bypassed if needed
### Monitoring & Analytics
- **Onboarding Completion Rates**: Track improvement vs traditional
- **AI Classification Accuracy**: Monitor and improve over time
- **User Satisfaction**: NPS scoring on completion
- **Performance Metrics**: Processing time and success rates
### Gradual Rollout
1. **Beta Testing**: Select bakery owners
2. **Regional Rollout**: Madrid market first
3. **Full Release**: All markets with monitoring
4. **Optimization**: Continuous improvement based on data
## 📚 Documentation & Training
### User Documentation
- **Video Tutorials**: Step-by-step onboarding guide
- **Help Articles**: Troubleshooting common issues
- **Best Practices**: File preparation guidelines
- **FAQ**: Common questions and answers
### Developer Documentation
- **API Reference**: Complete endpoint documentation
- **Architecture Guide**: Service interaction diagrams
- **Deployment Guide**: Infrastructure setup
- **Troubleshooting**: Common issues and solutions
## 🔮 Future Enhancements
### AI Improvements
- **Learning from Corrections**: User feedback training
- **Multi-language Support**: International product names
- **Image Recognition**: Product photo classification
- **Seasonal Intelligence**: Holiday and seasonal product detection
### Advanced Features
- **Predictive Inventory**: AI-suggested initial stock levels
- **Supplier Matching**: Automatic supplier recommendations
- **Recipe Suggestions**: AI-generated recipes from ingredients
- **Market Intelligence**: Competitive analysis integration
### User Experience
- **Voice Upload**: Dictated product lists
- **Barcode Scanning**: Product identification via camera
- **Augmented Reality**: Visual inventory setup guide
- **Collaborative Setup**: Multi-user onboarding process
## 📋 Success Metrics
### Quantitative KPIs
- **Onboarding Time**: Target <10 minutes (vs 2-3 hours)
- **Completion Rate**: Target >95% (vs ~60%)
- **AI Accuracy**: Target >90% classification accuracy
- **User Satisfaction**: Target NPS >8.5
### Qualitative Indicators
- **Reduced Support Tickets**: Fewer onboarding-related issues
- **Positive Feedback**: User testimonials and reviews
- **Feature Adoption**: High smart import usage rates
- **Business Growth**: Faster time-to-value for new customers
## 🎉 Conclusion
The AI-powered onboarding automation system successfully transforms the bakery AI platform into a truly intelligent, user-friendly solution. By reducing friction, automating complex tasks, and providing business intelligence, this implementation delivers on the promise of making bakery management as smooth and simple as possible.
The system is designed for scalability, maintainability, and continuous improvement, ensuring it will evolve with user needs and technological advances.
---
**Implementation Status**: ✅ Complete
**Last Updated**: 2025-01-13
**Next Review**: 2025-02-13

View File

@@ -1,718 +0,0 @@
# Production Planning System Documentation
## Overview
The Production Planning System automates daily production and procurement scheduling for bakery operations. The system consists of two primary schedulers that run every morning to generate plans based on demand forecasts, inventory levels, and capacity constraints.
**Last Updated:** 2025-10-09
**Version:** 2.0 (Automated Scheduling)
**Status:** Production Ready
---
## Architecture
### System Components
```
┌─────────────────────────────────────────────────────────────────┐
│ DAILY PLANNING WORKFLOW │
└─────────────────────────────────────────────────────────────────┘
05:30 AM → Production Scheduler
├─ Generates production schedules for all tenants
├─ Calls Forecasting Service (cached) for demand
├─ Calls Orders Service for demand requirements
├─ Creates production batches
└─ Sends notifications to production managers
06:00 AM → Procurement Scheduler
├─ Generates procurement plans for all tenants
├─ Calls Forecasting Service (cached - reuses cached data!)
├─ Calls Inventory Service for stock levels
├─ Matches suppliers for requirements
└─ Sends notifications to procurement managers
08:00 AM → Operators review plans
├─ Accept → Plans move to "approved" status
├─ Reject → Automatic regeneration if stale data detected
└─ Modify → Recalculate and resubmit
Throughout Day → Alert services monitor execution
├─ Production delays
├─ Capacity issues
├─ Quality problems
└─ Equipment failures
```
### Services Involved
| Service | Role | Endpoints |
|---------|------|-----------|
| **Production Service** | Generates daily production schedules | `POST /api/v1/{tenant_id}/production/operations/schedule` |
| **Orders Service** | Generates daily procurement plans | `POST /api/v1/{tenant_id}/orders/operations/procurement/generate` |
| **Forecasting Service** | Provides demand predictions (cached) | `POST /api/v1/{tenant_id}/forecasting/operations/single` |
| **Inventory Service** | Provides current stock levels | `GET /api/v1/{tenant_id}/inventory/products` |
| **Tenant Service** | Provides timezone configuration | `GET /api/v1/tenants/{tenant_id}` |
---
## Schedulers
### 1. Production Scheduler
**Service:** Production Service
**Class:** `ProductionSchedulerService`
**File:** [`services/production/app/services/production_scheduler_service.py`](../services/production/app/services/production_scheduler_service.py)
#### Schedule
| Job | Time | Purpose | Grace Period |
|-----|------|---------|--------------|
| **Daily Production Planning** | 5:30 AM (tenant timezone) | Generate next-day production schedules | 5 minutes |
| **Stale Schedule Cleanup** | 5:50 AM | Archive/cancel old schedules, send escalations | 5 minutes |
| **Test Mode** | Every 30 min (DEBUG only) | Development/testing | 5 minutes |
#### Features
- **Timezone-aware**: Respects tenant timezone configuration
- **Leader election**: Only one instance runs in distributed deployment
- **Idempotent**: Checks if schedule exists before creating
- **Parallel processing**: Processes tenants concurrently with timeouts
- **Error isolation**: Tenant failures don't affect others
- **Demo tenant filtering**: Excludes demo tenants from automation
#### Workflow
1. **Tenant Discovery**: Fetch all active non-demo tenants
2. **Parallel Processing**: Process each tenant concurrently (180s timeout)
3. **Date Calculation**: Use tenant timezone to determine target date
4. **Duplicate Check**: Skip if schedule already exists
5. **Requirements Calculation**: Call `calculate_daily_requirements()`
6. **Schedule Creation**: Create schedule with status "draft"
7. **Batch Generation**: Create production batches from requirements
8. **Notification**: Send alert to production managers
9. **Monitoring**: Record metrics for observability
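Steps 1–2 and the error-isolation guarantee can be sketched with `asyncio` as below. This is illustrative only; `process_tenant` stands in for the real per-tenant planning call and is not the actual method name:

```python
import asyncio
import logging

logger = logging.getLogger(__name__)

TENANT_TIMEOUT_SECONDS = 180  # per-tenant timeout noted in step 2

async def run_daily_planning(tenants, process_tenant):
    """Process all tenants concurrently; a timeout or failure in one
    tenant never aborts the others (error isolation)."""

    async def guarded(tenant):
        try:
            await asyncio.wait_for(process_tenant(tenant), TENANT_TIMEOUT_SECONDS)
            return tenant.id, "success"
        except asyncio.TimeoutError:
            logger.warning("Tenant %s timed out after %ss", tenant.id, TENANT_TIMEOUT_SECONDS)
            return tenant.id, "timeout"
        except Exception:
            logger.exception("Tenant %s failed", tenant.id)
            return tenant.id, "failure"

    return await asyncio.gather(*(guarded(t) for t in tenants))
```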
#### Configuration
```python
# Environment Variables
PRODUCTION_TEST_MODE=false # Enable 30-minute test job
DEBUG=false # Enable verbose logging
# Tenant Configuration
tenant.timezone=Europe/Madrid # IANA timezone string
```
---
### 2. Procurement Scheduler
**Service:** Orders Service
**Class:** `ProcurementSchedulerService`
**File:** [`services/orders/app/services/procurement_scheduler_service.py`](../services/orders/app/services/procurement_scheduler_service.py)
#### Schedule
| Job | Time | Purpose | Grace Period |
|-----|------|---------|--------------|
| **Daily Procurement Planning** | 6:00 AM (tenant timezone) | Generate next-day procurement plans | 5 minutes |
| **Stale Plan Cleanup** | 6:30 AM | Archive/cancel old plans, send reminders | 5 minutes |
| **Weekly Optimization** | Monday 7:00 AM | Weekly procurement optimization review | 10 minutes |
| **Test Mode** | Every 30 min (DEBUG only) | Development/testing | 5 minutes |
#### Features
- **Timezone-aware**: Respects tenant timezone configuration
- **Leader election**: Prevents duplicate runs
- **Idempotent**: Checks if plan exists before generating
- **Parallel processing**: 120s timeout per tenant
- **Forecast fallback**: Uses historical data if forecast unavailable
- **Critical stock alerts**: Automatic alerts for zero-stock items
- **Rejection workflow**: Auto-regeneration for rejected plans
#### Workflow
1. **Tenant Discovery**: Fetch active non-demo tenants
2. **Parallel Processing**: Process each tenant (120s timeout)
3. **Date Calculation**: Use tenant timezone
4. **Duplicate Check**: Skip if plan exists (unless force_regenerate)
5. **Forecasting**: Call Forecasting Service (uses cache!)
6. **Inventory Check**: Get current stock levels
7. **Requirements Calculation**: Calculate net requirements
8. **Supplier Matching**: Find suitable suppliers
9. **Plan Creation**: Create plan with status "draft"
10. **Critical Alerts**: Send alerts for critical items
11. **Notification**: Notify procurement managers
12. **Caching**: Cache plan in Redis (6h TTL)
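The forecast-fallback behaviour noted in the features list can be sketched as follows (all client, repository, and field names here are assumptions, not the actual Orders Service code):

```python
async def estimate_demand(forecast_client, history_repo, tenant_id, product_id, target_date):
    """Prefer the (cached) forecast from the Forecasting Service; fall back to
    a recent historical average when no forecast is available."""
    try:
        forecast = await forecast_client.get_single_forecast(tenant_id, product_id, target_date)
        if forecast:
            return forecast["predicted_quantity"], "forecast"
    except Exception:
        pass  # treat any client error as "forecast unavailable"
    average = await history_repo.average_daily_sales(tenant_id, product_id, days=28)
    return average, "historical_fallback"
```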
---
## Forecast Caching
### Overview
To eliminate redundant forecast computations, the Forecasting Service now includes a service-level Redis cache. Both Production and Procurement schedulers benefit from this without any code changes.
**File:** [`services/forecasting/app/services/forecast_cache.py`](../services/forecasting/app/services/forecast_cache.py)
### Cache Strategy
```
Key Format: forecast:{tenant_id}:{product_id}:{forecast_date}
TTL: Until midnight of day after forecast_date
Example: forecast:abc-123:prod-456:2025-10-10 → expires 2025-10-11 00:00:00
```
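A minimal sketch of how such a key and TTL could be computed (assumptions: UTC-based expiry and these helper names; not the actual `forecast_cache.py` code):

```python
from datetime import date, datetime, time, timedelta
from typing import Optional

def forecast_cache_key(tenant_id: str, product_id: str, forecast_date: date) -> str:
    """Build a key in the documented forecast:{tenant}:{product}:{date} format."""
    return f"forecast:{tenant_id}:{product_id}:{forecast_date.isoformat()}"

def forecast_cache_ttl_seconds(forecast_date: date, now: Optional[datetime] = None) -> int:
    """Seconds until midnight of the day after forecast_date (UTC assumed),
    so the entry expires exactly when the forecast becomes stale."""
    now = now or datetime.utcnow()
    expires_at = datetime.combine(forecast_date + timedelta(days=1), time.min)
    return max(int((expires_at - now).total_seconds()), 0)
```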
### Cache Flow
```
Client Request → Forecasting API
        ↓
  Check Redis Cache
        ├─ HIT  → Return cached result (add 'cached: true')
        └─ MISS → Generate forecast
                     ↓
                  Cache result (TTL)
                     ↓
                  Return result
```
### Benefits
| Metric | Before Caching | After Caching | Improvement |
|--------|---------------|---------------|-------------|
| **Duplicate Forecasts** | 2x per day (Production + Procurement) | 1x per day | 50% reduction |
| **Forecast Response Time** | ~2-5 seconds | ~50-100ms (cache hit) | 95%+ faster |
| **Forecasting Service Load** | 100% | 50% | 50% reduction |
| **Cache Hit Rate** | N/A | ~80-90% (expected) | - |
### Cache Invalidation
Forecasts are invalidated when:
1. **TTL Expiry**: Automatic at midnight after forecast_date
2. **Model Retraining**: When ML model is retrained for product
3. **Manual Invalidation**: Via API endpoint (admin only)
```python
# Invalidate specific product forecasts
DELETE /api/v1/{tenant_id}/forecasting/cache/product/{product_id}
# Invalidate all tenant forecasts
DELETE /api/v1/{tenant_id}/forecasting/cache
# Invalidate all forecasts (use with caution!)
DELETE /admin/forecasting/cache/all
```
---
## Plan Rejection Workflow
### Overview
When a procurement plan is rejected by an operator, the system automatically handles the rejection with notifications and optional regeneration.
**File:** [`services/orders/app/services/procurement_service.py`](../services/orders/app/services/procurement_service.py:1244-1404)
### Rejection Flow
```
User Rejects Plan (status → "cancelled")
        ↓
Record rejection in approval_workflow (JSONB)
        ↓
Send notification to stakeholders
        ↓
Publish rejection event (RabbitMQ)
        ↓
Analyze rejection reason
        ├─ Contains "stale", "outdated", etc. → Auto-regenerate
        └─ Other reason → Manual regeneration required
        ↓
Schedule regeneration (if applicable)
        ↓
Send regeneration request event
```
### Auto-Regeneration Keywords
Plans are automatically regenerated if the rejection notes contain any of the following keywords (see the sketch after the list):
- `stale`
- `outdated`
- `old data`
- `datos antiguos` (Spanish)
- `desactualizado` (Spanish)
- `obsoleto` (Spanish)
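A matching check might look like this (sketch only; the constant and function names are assumptions):

```python
AUTO_REGENERATE_KEYWORDS = (
    "stale", "outdated", "old data",
    "datos antiguos", "desactualizado", "obsoleto",
)

def should_auto_regenerate(rejection_notes: str) -> bool:
    """Case-insensitive substring match against the keyword list above."""
    notes = (rejection_notes or "").lower()
    return any(keyword in notes for keyword in AUTO_REGENERATE_KEYWORDS)
```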
### Events Published
| Event | Routing Key | Consumers |
|-------|-------------|-----------|
| **Plan Rejected** | `procurement.plan.rejected` | Alert Service, UI Notifications |
| **Regeneration Requested** | `procurement.plan.regeneration_requested` | Procurement Scheduler |
| **Plan Status Changed** | `procurement.plan.status_changed` | Inventory Service, Dashboard |
---
## Timezone Configuration
### Overview
All schedulers are timezone-aware to ensure accurate "daily" execution relative to the bakery's local time.
### Tenant Configuration
**Model:** `Tenant`
**File:** [`services/tenant/app/models/tenants.py`](../services/tenant/app/models/tenants.py:32-33)
**Field:** `timezone` (String, default: `"Europe/Madrid"`)
**Migration:** [`services/tenant/migrations/versions/20251009_add_timezone_to_tenants.py`](../services/tenant/migrations/versions/20251009_add_timezone_to_tenants.py)
### Supported Timezones
All IANA timezone strings are supported. Common examples:
- `Europe/Madrid` (Spain - CEST/CET)
- `Europe/London` (UK - BST/GMT)
- `America/New_York` (US Eastern)
- `America/Los_Angeles` (US Pacific)
- `Asia/Tokyo` (Japan)
- `UTC` (Universal Time)
### Usage in Schedulers
```python
from shared.utils.timezone_helper import TimezoneHelper
# Get current date in tenant's timezone
target_date = TimezoneHelper.get_current_date_in_timezone(tenant_tz)
# Get current datetime in tenant's timezone
now = TimezoneHelper.get_current_datetime_in_timezone(tenant_tz)
# Check if within business hours
is_business_hours = TimezoneHelper.is_business_hours(
    timezone_str=tenant_tz,
    start_hour=8,
    end_hour=20
)
```
---
## Monitoring & Alerts
### Prometheus Metrics
**File:** [`shared/monitoring/scheduler_metrics.py`](../shared/monitoring/scheduler_metrics.py)
#### Key Metrics
| Metric | Type | Description |
|--------|------|-------------|
| `production_schedules_generated_total` | Counter | Total production schedules generated (by tenant, status) |
| `production_schedule_generation_duration_seconds` | Histogram | Time to generate schedule per tenant |
| `procurement_plans_generated_total` | Counter | Total procurement plans generated (by tenant, status) |
| `procurement_plan_generation_duration_seconds` | Histogram | Time to generate plan per tenant |
| `forecast_cache_hits_total` | Counter | Forecast cache hits (by tenant) |
| `forecast_cache_misses_total` | Counter | Forecast cache misses (by tenant) |
| `forecast_cache_hit_rate` | Gauge | Cache hit rate percentage (0-100) |
| `procurement_plan_rejections_total` | Counter | Plan rejections (by tenant, auto_regenerated) |
| `scheduler_health_status` | Gauge | Scheduler health (1=healthy, 0=unhealthy) |
| `tenant_processing_timeout_total` | Counter | Tenant processing timeouts (by service) |
### Recommended Alerts
```yaml
# Alert: Daily production planning failed
- alert: DailyProductionPlanningFailed
  expr: production_schedules_generated_total{status="failure"} > 0
  for: 10m
  labels:
    severity: high
  annotations:
    summary: "Daily production planning failed for at least one tenant"
    description: "Check production scheduler logs for tenant {{ $labels.tenant_id }}"

# Alert: Daily procurement planning failed
- alert: DailyProcurementPlanningFailed
  expr: procurement_plans_generated_total{status="failure"} > 0
  for: 10m
  labels:
    severity: high
  annotations:
    summary: "Daily procurement planning failed for at least one tenant"
    description: "Check procurement scheduler logs for tenant {{ $labels.tenant_id }}"

# Alert: No production schedules in 24 hours
- alert: NoProductionSchedulesGenerated
  expr: rate(production_schedules_generated_total{status="success"}[24h]) == 0
  for: 1h
  labels:
    severity: critical
  annotations:
    summary: "No production schedules generated in last 24 hours"
    description: "Production scheduler may be down or misconfigured"

# Alert: Forecast cache hit rate low
- alert: ForecastCacheHitRateLow
  expr: forecast_cache_hit_rate < 50
  for: 30m
  labels:
    severity: warning
  annotations:
    summary: "Forecast cache hit rate below 50%"
    description: "Cache may not be functioning correctly for tenant {{ $labels.tenant_id }}"

# Alert: High tenant processing timeouts
- alert: HighTenantProcessingTimeouts
  expr: rate(tenant_processing_timeout_total[5m]) > 0.1
  for: 15m
  labels:
    severity: warning
  annotations:
    summary: "High rate of tenant processing timeouts"
    description: "{{ $labels.service }} scheduler experiencing timeouts for tenant {{ $labels.tenant_id }}"

# Alert: Scheduler unhealthy
- alert: SchedulerUnhealthy
  expr: scheduler_health_status == 0
  for: 5m
  labels:
    severity: critical
  annotations:
    summary: "Scheduler is unhealthy"
    description: "{{ $labels.service }} {{ $labels.scheduler_type }} scheduler is reporting unhealthy status"
```
### Grafana Dashboard
Create dashboard with panels for:
1. **Scheduler Success Rate** (line chart)
- `production_schedules_generated_total{status="success"}`
- `procurement_plans_generated_total{status="success"}`
2. **Schedule Generation Duration** (heatmap)
- `production_schedule_generation_duration_seconds`
- `procurement_plan_generation_duration_seconds`
3. **Forecast Cache Hit Rate** (gauge)
- `forecast_cache_hit_rate`
4. **Tenant Processing Status** (pie chart)
- `production_tenants_processed_total`
- `procurement_tenants_processed_total`
5. **Plan Rejections** (table)
- `procurement_plan_rejections_total`
6. **Scheduler Health** (status panel)
- `scheduler_health_status`
---
## Testing
### Manual Testing
#### Test Production Scheduler
```bash
# Trigger test production schedule generation
curl -X POST http://production-service:8000/test/production-scheduler \
-H "Authorization: Bearer $TOKEN"
# Expected response:
{
"message": "Production scheduler test triggered successfully"
}
```
#### Test Procurement Scheduler
```bash
# Trigger test procurement plan generation
curl -X POST http://orders-service:8000/test/procurement-scheduler \
-H "Authorization: Bearer $TOKEN"
# Expected response:
{
"message": "Procurement scheduler test triggered successfully"
}
```
### Automated Testing
```python
# Test production scheduler
async def test_production_scheduler():
    scheduler = ProductionSchedulerService(config)
    await scheduler.start()
    await scheduler.test_production_schedule_generation()
    assert scheduler._checks_performed > 0

# Test procurement scheduler
async def test_procurement_scheduler():
    scheduler = ProcurementSchedulerService(config)
    await scheduler.start()
    await scheduler.test_procurement_generation()
    assert scheduler._checks_performed > 0

# Test forecast caching
async def test_forecast_cache():
    cache = get_forecast_cache_service(redis_url)
    # Cache forecast
    await cache.cache_forecast(tenant_id, product_id, forecast_date, data)
    # Retrieve cached forecast
    cached = await cache.get_cached_forecast(tenant_id, product_id, forecast_date)
    assert cached is not None
    assert cached['cached'] == True
```
---
## Troubleshooting
### Scheduler Not Running
**Symptoms:** No schedules/plans generated in morning
**Checks:**
1. Verify scheduler service is running: `kubectl get pods -n production`
2. Check scheduler health endpoint: `curl http://service:8000/health`
3. Check APScheduler status in logs: `grep "scheduler" logs/production.log`
4. Verify leader election (distributed setup): Check `is_leader` in logs
**Solutions:**
- Restart service: `kubectl rollout restart deployment/production-service`
- Check environment variables: `PRODUCTION_TEST_MODE`, `DEBUG`
- Verify database connectivity
- Check RabbitMQ connectivity for leader election
### Timezone Issues
**Symptoms:** Schedules generated at wrong time
**Checks:**
1. Check tenant timezone configuration:
```sql
SELECT id, name, timezone FROM tenants WHERE id = '{tenant_id}';
```
2. Verify server timezone: `date` (should be UTC in containers)
3. Check logs for timezone warnings
**Solutions:**
- Update tenant timezone: `UPDATE tenants SET timezone = 'Europe/Madrid' WHERE id = '{tenant_id}';`
- Verify TimezoneHelper is being used in schedulers
- Check cron trigger configuration uses correct timezone
### Low Cache Hit Rate
**Symptoms:** `forecast_cache_hit_rate < 50%`
**Checks:**
1. Verify Redis is running: `redis-cli ping`
2. Check cache keys: `redis-cli KEYS "forecast:*"`
3. Check TTL on cache entries: `redis-cli TTL "forecast:{tenant}:{product}:{date}"`
4. Review logs for cache errors
**Solutions:**
- Restart Redis if unhealthy
- Clear cache and let it rebuild: `redis-cli FLUSHDB`
- Verify REDIS_URL environment variable
- Check Redis memory limits: `redis-cli INFO memory`
### Plan Rejection Not Auto-Regenerating
**Symptoms:** Rejected plans not triggering regeneration
**Checks:**
1. Check rejection notes contain auto-regenerate keywords
2. Verify RabbitMQ events are being published: Check `procurement.plan.rejected` queue
3. Check scheduler is listening to regeneration events
**Solutions:**
- Use keywords like "stale" or "outdated" in rejection notes
- Manually trigger regeneration via API
- Check RabbitMQ connectivity
- Verify event routing keys are correct
### Tenant Processing Timeouts
**Symptoms:** `tenant_processing_timeout_total` increasing
**Checks:**
1. Check timeout duration (180s for production, 120s for procurement)
2. Review slow queries in database logs
3. Check external service response times (Forecasting, Inventory)
4. Monitor CPU/memory usage during scheduler runs
**Solutions:**
- Increase timeout if consistently hitting limit
- Optimize database queries (add indexes)
- Scale external services if response time high
- Process fewer tenants in parallel (reduce concurrency)
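To "process fewer tenants in parallel", concurrency can be bounded with a semaphore. A minimal sketch (the `max_concurrent` knob and function name are assumptions, not an existing setting):

```python
import asyncio

async def run_with_concurrency_limit(tenants, process_tenant, max_concurrent=10):
    """Bound how many tenants are processed at once; lowering
    max_concurrent is the 'reduce concurrency' mitigation above."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def limited(tenant):
        async with semaphore:
            return await process_tenant(tenant)

    # return_exceptions keeps one tenant's failure from cancelling the rest
    return await asyncio.gather(*(limited(t) for t in tenants), return_exceptions=True)
```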
---
## Maintenance
### Scheduled Maintenance Windows
When performing maintenance on schedulers:
1. **Announce downtime** to users (UI banner)
2. **Disable schedulers** temporarily:
```python
# Set environment variable
SCHEDULER_DISABLED=true
```
3. **Perform maintenance** (database migrations, service updates)
4. **Re-enable schedulers**:
```python
SCHEDULER_DISABLED=false
```
5. **Manually trigger** missed runs if needed:
```bash
curl -X POST http://service:8000/test/production-scheduler
curl -X POST http://service:8000/test/procurement-scheduler
```
### Database Migrations
When adding fields to scheduler-related tables:
1. **Create migration** with proper rollback
2. **Test migration** on staging environment
3. **Run migration** during low-traffic period (3-4 AM)
4. **Verify scheduler** still works after migration
5. **Monitor metrics** for anomalies
### Cache Maintenance
**Clear Stale Cache Entries:**
```bash
# Clear all forecast cache (will rebuild automatically)
redis-cli KEYS "forecast:*" | xargs redis-cli DEL
# Clear specific tenant's cache
redis-cli KEYS "forecast:{tenant_id}:*" | xargs redis-cli DEL
```
**Monitor Cache Size:**
```bash
# Check number of forecast keys
redis-cli DBSIZE
# Check memory usage
redis-cli INFO memory
```
---
## API Reference
### Production Scheduler Endpoints
```
POST /test/production-scheduler
Description: Manually trigger production scheduler (test mode)
Auth: Bearer token required
Response: {"message": "Production scheduler test triggered successfully"}
```
### Procurement Scheduler Endpoints
```
POST /test/procurement-scheduler
Description: Manually trigger procurement scheduler (test mode)
Auth: Bearer token required
Response: {"message": "Procurement scheduler test triggered successfully"}
```
### Forecast Cache Endpoints
```
GET /api/v1/{tenant_id}/forecasting/cache/stats
Description: Get forecast cache statistics
Auth: Bearer token required
Response: {
"available": true,
"total_forecast_keys": 1234,
"batch_forecast_keys": 45,
"single_forecast_keys": 1189,
"hit_rate_percent": 87.5,
...
}
DELETE /api/v1/{tenant_id}/forecasting/cache/product/{product_id}
Description: Invalidate forecast cache for specific product
Auth: Bearer token required (admin only)
Response: {"invalidated_keys": 7}
DELETE /api/v1/{tenant_id}/forecasting/cache
Description: Invalidate all forecast cache for tenant
Auth: Bearer token required (admin only)
Response: {"invalidated_keys": 123}
```
---
## Change Log
### Version 2.0 (2025-10-09) - Automated Scheduling
**Added:**
- ✨ ProductionSchedulerService for automated daily production planning
- ✨ Timezone configuration in Tenant model
- ✨ Forecast caching in Forecasting Service (service-level)
- ✨ Plan rejection workflow with auto-regeneration
- ✨ Comprehensive Prometheus metrics for monitoring
- ✨ TimezoneHelper utility for consistent timezone handling
**Changed:**
- 🔄 All schedulers now timezone-aware
- 🔄 Forecast service returns `cached: true` flag in metadata
- 🔄 Plan rejection triggers notifications and events
**Fixed:**
- 🐛 Duplicate forecast computations eliminated (50% reduction)
- 🐛 Timezone-related scheduling issues resolved
- 🐛 Rejected plans now have proper workflow handling
**Documentation:**
- 📚 Comprehensive production planning system documentation
- 📚 Runbooks for troubleshooting common issues
- 📚 Monitoring and alerting guidelines
### Version 1.0 (2025-10-07) - Initial Release
**Added:**
- ✨ ProcurementSchedulerService for automated procurement planning
- ✨ Daily, weekly, and cleanup jobs
- ✨ Leader election for distributed deployments
- ✨ Parallel tenant processing with timeouts
---
## Support & Contact
For issues or questions about the Production Planning System:
- **Documentation:** This file
- **Source Code:** `services/production/`, `services/orders/`
- **Issues:** GitHub Issues
- **Slack:** `#production-planning` channel
---
**Document Version:** 2.0
**Last Review Date:** 2025-10-09
**Next Review Date:** 2025-11-09

View File

@@ -1,172 +0,0 @@
⏺ Based on my analysis of your bakery project, here's a clear divide
and conquer strategy to transition from mock/demo data to real API
connections:
Current State Analysis
✅ What's Already Done:
- Complete backend microservices architecture (17+ services)
- Full API client setup with authentication & tenant management
- Comprehensive service layer (auth, inventory, sales, forecasting,
etc.)
- Real-time SSE & WebSocket infrastructure
🔍 What Needs Connection:
- Mock data in auth store (mockLogin, mockRefreshToken)
- Mock alerts in alerts store (mockAlerts, mockRules)
- Various hardcoded arrays across hooks and utilities
Divide & Conquer Strategy
Phase 1: Authentication Foundation (Week 1)
Priority: CRITICAL - Everything depends on this
// IMMEDIATE ACTION: Replace mock auth
1. Update auth.store.ts → Connect to real auth service
2. Replace mockLogin() with authService.login()
3. Replace mockRefreshToken() with authService.refreshToken()
4. Test tenant switching and permission system
Files to modify:
- frontend/src/stores/auth.store.ts:46-88 (replace mock functions)
- frontend/src/services/api/auth.service.ts (already done)
Phase 2: Core Operations (Week 2)
Priority: HIGH - Daily operations
// Connect inventory management first (most used)
1. Inventory Service → Real API calls
- Replace mock data in components
- Connect to /api/v1/inventory/* endpoints
2. Production Service → Real API calls
- Connect batch management
- Link to /api/v1/production/* endpoints
3. Sales Service → Real API calls
- Connect POS integration
- Link to /api/v1/sales/* endpoints
Files to modify:
- All inventory components using mock data
- Production scheduling hooks
- Sales tracking components
Phase 3: Analytics & Intelligence (Week 3)
Priority: MEDIUM - Business insights
// Connect AI-powered features
1. Forecasting Service → Real ML predictions
- Connect to /api/v1/forecasts/* endpoints
- Enable real demand forecasting
2. Training Service → Real model training
- Connect WebSocket training progress
- Enable /api/v1/training/* endpoints
3. Analytics Service → Real business data
- Connect charts and reports
- Enable trend analysis
Phase 4: Communication & Automation (Week 4)
Priority: LOW - Enhancements
// Replace mock alerts with real-time system
1. Alerts Store → Real SSE connection
- Connect to /api/v1/sse/alerts/stream/{tenant_id}
- Replace mockAlerts with live data
2. Notification Service → Real messaging
- WhatsApp & Email integration
- Connect to /api/v1/notifications/* endpoints
3. External Data → Live feeds
- Weather API (AEMET)
- Traffic patterns
- Market events
Specific Next Steps
STEP 1: Start with Authentication (Today)
// 1. Replace frontend/src/stores/auth.store.ts
// Remove lines 46-88 and replace with:
const performLogin = async (email: string, password: string) => {
  const response = await authService.login({ email, password });
  if (!response.success) throw new Error(response.error || 'Login failed');
  return response.data;
};
const performRefresh = async (refreshToken: string) => {
  const response = await authService.refreshToken(refreshToken);
  if (!response.success) throw new Error('Token refresh failed');
  return response.data;
};
STEP 2: Connect Inventory (Next)
// 2. Update inventory components to use real API
// Replace mock data arrays with:
const { data: ingredients, isLoading } = useQuery({
  queryKey: ['ingredients', tenantId],
  queryFn: () => inventoryService.getIngredients(),
});
STEP 3: Enable Real-time Alerts (After)
// 3. Connect SSE for real alerts
// Replace alerts store mock data with:
useEffect(() => {
  const eventSource = new EventSource(`/api/v1/sse/alerts/stream/${tenantId}`);
  eventSource.onmessage = (event) => {
    const alert = JSON.parse(event.data);
    createAlert(alert);
  };
  return () => eventSource.close();
}, [tenantId]);
Migration Checklist
Immediate (This Week)
- Replace auth mock functions with real API calls
- Test login/logout/refresh flows
- Verify tenant switching works
- Test permission-based UI rendering
Short-term (Next 2 Weeks)
- Connect inventory service to real API
- Enable production planning with real data
- Connect sales tracking to POS systems
- Test data consistency across services
Medium-term (Next Month)
- Enable ML forecasting with real models
- Connect real-time alert system
- Integrate external data sources (weather, traffic)
- Test full end-to-end workflows
Risk Mitigation
1. Gradual Migration: Keep mock fallbacks during transition
2. Environment Switching: Use env variables to toggle mock/real APIs
3. Error Handling: Robust error handling for API failures
4. Data Validation: Verify API responses match expected schemas
5. User Testing: Test with real bakery workflows
Ready to start? I recommend beginning with Step 1 (Authentication)
today. The infrastructure is already there - you just need to connect
the pipes!

View File

@@ -1,414 +0,0 @@
# Production Planning Scheduler - Quick Start Guide
**For Developers & DevOps**
---
## 🚀 5-Minute Setup
### Prerequisites
```bash
# Running services
- PostgreSQL (production, orders, tenant databases)
- Redis (for forecast caching)
- RabbitMQ (for events and leader election)
# Environment variables
PRODUCTION_DATABASE_URL=postgresql://...
ORDERS_DATABASE_URL=postgresql://...
TENANT_DATABASE_URL=postgresql://...
REDIS_URL=redis://localhost:6379/0
RABBITMQ_URL=amqp://guest:guest@localhost:5672/
```
### Run Migrations
```bash
# Add timezone to tenants table
cd services/tenant
alembic upgrade head
# Verify migration
psql $TENANT_DATABASE_URL -c "SELECT id, name, timezone FROM tenants LIMIT 5;"
```
### Start Services
```bash
# Terminal 1 - Production Service (with scheduler)
cd services/production
uvicorn app.main:app --reload --port 8001
# Terminal 2 - Orders Service (with scheduler)
cd services/orders
uvicorn app.main:app --reload --port 8002
# Terminal 3 - Forecasting Service (with caching)
cd services/forecasting
uvicorn app.main:app --reload --port 8003
```
### Test Schedulers
```bash
# Test production scheduler
curl -X POST http://localhost:8001/test/production-scheduler
# Expected output:
{
"message": "Production scheduler test triggered successfully"
}
# Test procurement scheduler
curl -X POST http://localhost:8002/test/procurement-scheduler
# Expected output:
{
"message": "Procurement scheduler test triggered successfully"
}
# Check logs
tail -f services/production/logs/production.log | grep "schedule"
tail -f services/orders/logs/orders.log | grep "plan"
```
---
## 📋 Configuration
### Enable Test Mode (Development)
```bash
# Run schedulers every 30 minutes instead of daily
export PRODUCTION_TEST_MODE=true
export PROCUREMENT_TEST_MODE=true
export DEBUG=true
```
### Configure Tenant Timezone
```sql
-- Update tenant timezone
UPDATE tenants SET timezone = 'America/New_York' WHERE id = '{tenant_id}';
-- Verify
SELECT id, name, timezone FROM tenants WHERE id = '{tenant_id}';
```
### Check Redis Cache
```bash
# Connect to Redis
redis-cli
# Check forecast cache keys
KEYS forecast:*
# Get cache stats
GET forecast:cache:stats
# Clear cache (if needed)
FLUSHDB
```
---
## 🔍 Monitoring
### View Metrics (Prometheus)
```bash
# Production scheduler metrics
curl http://localhost:8001/metrics | grep production_schedules
# Procurement scheduler metrics
curl http://localhost:8002/metrics | grep procurement_plans
# Forecast cache metrics
curl http://localhost:8003/metrics | grep forecast_cache
```
### Key Metrics to Watch
```promql
# Scheduler success rate (should be > 95%)
rate(production_schedules_generated_total{status="success"}[5m])
rate(procurement_plans_generated_total{status="success"}[5m])
# Cache hit rate (should be > 70%)
forecast_cache_hit_rate
# Generation time (should be < 60s)
histogram_quantile(0.95,
rate(production_schedule_generation_duration_seconds_bucket[5m]))
```
---
## 🐛 Debugging
### Check Scheduler Status
```python
# In Python shell
from app.services.production_scheduler_service import ProductionSchedulerService
from app.core.config import settings
scheduler = ProductionSchedulerService(settings)
await scheduler.start()
# Check configured jobs
jobs = scheduler.scheduler.get_jobs()
for job in jobs:
    print(f"{job.name}: next run at {job.next_run_time}")
```
### View Scheduler Logs
```bash
# Production scheduler
kubectl logs -f deployment/production-service | grep -E "scheduler|schedule"
# Procurement scheduler
kubectl logs -f deployment/orders-service | grep -E "scheduler|plan"
# Look for these patterns:
# ✅ "Daily production planning completed"
# ✅ "Production schedule created successfully"
# ❌ "Error processing tenant production"
# ⚠️ "Tenant processing timed out"
```
### Test Timezone Handling
```python
from shared.utils.timezone_helper import TimezoneHelper
# Get current date in different timezones
madrid_date = TimezoneHelper.get_current_date_in_timezone("Europe/Madrid")
ny_date = TimezoneHelper.get_current_date_in_timezone("America/New_York")
tokyo_date = TimezoneHelper.get_current_date_in_timezone("Asia/Tokyo")
print(f"Madrid: {madrid_date}")
print(f"NY: {ny_date}")
print(f"Tokyo: {tokyo_date}")
# Check if business hours
is_business = TimezoneHelper.is_business_hours(
    timezone_str="Europe/Madrid",
    start_hour=8,
    end_hour=20
)
print(f"Business hours: {is_business}")
```
### Test Forecast Cache
```python
from services.forecasting.app.services.forecast_cache import get_forecast_cache_service
from datetime import date
from uuid import UUID
cache = get_forecast_cache_service(redis_url="redis://localhost:6379/0")
# Check if available
print(f"Cache available: {cache.is_available()}")
# Get cache stats
stats = cache.get_cache_stats()
print(f"Cache stats: {stats}")
# Test cache operation
tenant_id = UUID("your-tenant-id")
product_id = UUID("your-product-id")
forecast_date = date.today()
# Try to get cached forecast
cached = await cache.get_cached_forecast(tenant_id, product_id, forecast_date)
print(f"Cached forecast: {cached}")
```
---
## 🧪 Testing
### Unit Tests
```bash
# Run scheduler tests
pytest services/production/tests/test_production_scheduler_service.py -v
pytest services/orders/tests/test_procurement_scheduler_service.py -v
# Run cache tests
pytest services/forecasting/tests/test_forecast_cache.py -v
# Run timezone tests
pytest shared/tests/test_timezone_helper.py -v
```
### Integration Tests
```bash
# Run full scheduler integration test
pytest tests/integration/test_scheduler_integration.py -v
# Run cache integration test
pytest tests/integration/test_cache_integration.py -v
# Run plan rejection workflow test
pytest tests/integration/test_plan_rejection_workflow.py -v
```
### Manual End-to-End Test
```bash
# 1. Clear existing schedules/plans
psql $PRODUCTION_DATABASE_URL -c "DELETE FROM production_schedules WHERE schedule_date = CURRENT_DATE;"
psql $ORDERS_DATABASE_URL -c "DELETE FROM procurement_plans WHERE plan_date = CURRENT_DATE;"
# 2. Trigger schedulers
curl -X POST http://localhost:8001/test/production-scheduler
curl -X POST http://localhost:8002/test/procurement-scheduler
# 3. Wait 30 seconds
# 4. Verify schedules/plans created
psql $PRODUCTION_DATABASE_URL -c "SELECT id, schedule_date, status FROM production_schedules WHERE schedule_date = CURRENT_DATE;"
psql $ORDERS_DATABASE_URL -c "SELECT id, plan_date, status FROM procurement_plans WHERE plan_date = CURRENT_DATE;"
# 5. Check cache hit rate
redis-cli GET forecast_cache_hits_total
redis-cli GET forecast_cache_misses_total
```
---
## 📚 Common Commands
### Scheduler Management
```bash
# Disable scheduler (maintenance mode)
kubectl set env deployment/production-service SCHEDULER_DISABLED=true
# Re-enable scheduler
kubectl set env deployment/production-service SCHEDULER_DISABLED-
# Check scheduler health
curl http://localhost:8001/health | jq .custom_checks.scheduler_service
# Manually trigger scheduler
curl -X POST http://localhost:8001/test/production-scheduler
```
### Cache Management
```bash
# View cache stats
curl http://localhost:8003/api/v1/{tenant_id}/forecasting/cache/stats | jq .
# Clear product cache
curl -X DELETE http://localhost:8003/api/v1/{tenant_id}/forecasting/cache/product/{product_id}
# Clear tenant cache
curl -X DELETE http://localhost:8003/api/v1/{tenant_id}/forecasting/cache
# View cache keys
redis-cli KEYS "forecast:*" | head -20
```
### Database Queries
```sql
-- Check production schedules
SELECT id, schedule_date, status, total_batches, auto_generated
FROM production_schedules
WHERE schedule_date >= CURRENT_DATE - INTERVAL '7 days'
ORDER BY schedule_date DESC;
-- Check procurement plans
SELECT id, plan_date, status, total_requirements, total_estimated_cost
FROM procurement_plans
WHERE plan_date >= CURRENT_DATE - INTERVAL '7 days'
ORDER BY plan_date DESC;
-- Check tenant timezones
SELECT id, name, timezone, city
FROM tenants
WHERE is_active = true
ORDER BY timezone;
-- Check plan approval workflow
SELECT id, plan_number, status, approval_workflow
FROM procurement_plans
WHERE status = 'cancelled'
ORDER BY created_at DESC
LIMIT 10;
```
---
## 🔧 Troubleshooting Quick Fixes
### Scheduler Not Running
```bash
# Check if service is running
ps aux | grep uvicorn
# Check if scheduler initialized
grep "scheduled jobs configured" logs/production.log
# Restart service
pkill -f "uvicorn app.main:app"
uvicorn app.main:app --reload
```
### Cache Not Working
```bash
# Check Redis connection
redis-cli ping # Should return PONG
# Check Redis keys
redis-cli DBSIZE # Should have keys
# Restart Redis (if needed)
redis-cli SHUTDOWN
redis-server --daemonize yes
```
### Wrong Timezone
```bash
# Check server timezone (should be UTC)
date
# Check tenant timezone
psql $TENANT_DATABASE_URL -c \
"SELECT timezone FROM tenants WHERE id = '{tenant_id}';"
# Update if wrong
psql $TENANT_DATABASE_URL -c \
"UPDATE tenants SET timezone = 'Europe/Madrid' WHERE id = '{tenant_id}';"
```
---
## 📖 Additional Resources
- **Full Documentation:** [PRODUCTION_PLANNING_SYSTEM.md](./PRODUCTION_PLANNING_SYSTEM.md)
- **Operational Runbook:** [SCHEDULER_RUNBOOK.md](./SCHEDULER_RUNBOOK.md)
- **Implementation Summary:** [IMPLEMENTATION_SUMMARY.md](./IMPLEMENTATION_SUMMARY.md)
- **Code:**
- Production Scheduler: [`services/production/app/services/production_scheduler_service.py`](../services/production/app/services/production_scheduler_service.py)
- Procurement Scheduler: [`services/orders/app/services/procurement_scheduler_service.py`](../services/orders/app/services/procurement_scheduler_service.py)
- Forecast Cache: [`services/forecasting/app/services/forecast_cache.py`](../services/forecasting/app/services/forecast_cache.py)
- Timezone Helper: [`shared/utils/timezone_helper.py`](../shared/utils/timezone_helper.py)
---
**Version:** 1.0
**Last Updated:** 2025-10-09
**Maintained By:** Backend Team

View File

@@ -1,530 +0,0 @@
# Production Planning Scheduler Runbook
**Quick Reference Guide for DevOps & Support Teams**
---
## Quick Links
- [Full Documentation](./PRODUCTION_PLANNING_SYSTEM.md)
- [Metrics Dashboard](http://grafana:3000/d/production-planning)
- [Logs](http://kibana:5601)
- [Alerts](http://alertmanager:9093)
---
## Emergency Contacts
| Role | Contact | Availability |
|------|---------|--------------|
| **Backend Lead** | #backend-team | 24/7 |
| **DevOps On-Call** | #devops-oncall | 24/7 |
| **Product Owner** | TBD | Business hours |
---
## Scheduler Overview
| Scheduler | Time | What It Does |
|-----------|------|--------------|
| **Production** | 5:30 AM (tenant timezone) | Creates daily production schedules |
| **Procurement** | 6:00 AM (tenant timezone) | Creates daily procurement plans |
**Critical:** Both schedulers MUST complete successfully every morning, or bakeries won't have production/procurement plans for the day!
---
## Common Incidents & Solutions
### 🔴 CRITICAL: Scheduler Completely Failed
**Alert:** `SchedulerUnhealthy` or `NoProductionSchedulesGenerated`
**Impact:** HIGH - No plans generated for any tenant
**Immediate Actions (< 5 minutes):**
```bash
# 1. Check if service is running
kubectl get pods -n production | grep production-service
kubectl get pods -n orders | grep orders-service
# 2. Check recent logs for errors
kubectl logs -n production deployment/production-service --tail=100 | grep ERROR
kubectl logs -n orders deployment/orders-service --tail=100 | grep ERROR
# 3. Restart service if frozen/crashed
kubectl rollout restart deployment/production-service -n production
kubectl rollout restart deployment/orders-service -n orders
# 4. Wait 2 minutes for scheduler to initialize, then manually trigger
curl -X POST http://production-service:8000/test/production-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
curl -X POST http://orders-service:8000/test/procurement-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
```
**Follow-up Actions:**
- Check RabbitMQ health (leader election depends on it)
- Review database connectivity
- Check resource limits (CPU/memory)
- Monitor metrics for successful generation
---
### 🟠 HIGH: Single Tenant Failed
**Alert:** `DailyProductionPlanningFailed{tenant_id="abc-123"}`
**Impact:** MEDIUM - One bakery affected
**Immediate Actions (< 10 minutes):**
```bash
# 1. Check logs for specific tenant
kubectl logs -n production deployment/production-service --tail=500 | \
grep "tenant_id=abc-123" | grep ERROR
# 2. Common causes:
# - Tenant database connection issue
# - External service timeout (Forecasting, Inventory)
# - Invalid data (e.g., missing products)
# 3. Manually retry for this tenant
curl -X POST http://production-service:8000/test/production-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
# (Scheduler will skip tenants that already have schedules)
# 4. If still failing, check tenant-specific issues:
# - Verify tenant exists and is active
# - Check tenant's inventory has products
# - Check forecasting service can access tenant data
```
**Follow-up Actions:**
- Contact tenant to understand their setup
- Review tenant data quality
- Check if tenant is new (may need initial setup)
---
### 🟡 MEDIUM: Scheduler Running Slow
**Alert:** `production_schedule_generation_duration_seconds > 120s`
**Impact:** LOW - Scheduler completes but takes longer than expected
**Immediate Actions (< 15 minutes):**
```bash
# 1. Check current execution time
kubectl logs -n production deployment/production-service --tail=100 | \
grep "production planning completed"
# 2. Check database query performance
# Look for slow query logs in PostgreSQL
# 3. Check external service response times
# - Forecasting Service health: curl http://forecasting-service:8000/health
# - Inventory Service health: curl http://inventory-service:8000/health
# - Orders Service health: curl http://orders-service:8000/health
# 4. Check CPU/memory usage
kubectl top pods -n production | grep production-service
kubectl top pods -n orders | grep orders-service
```
**Follow-up Actions:**
- Consider increasing timeout if consistently near limit
- Optimize slow database queries
- Scale external services if overloaded
- Review tenant count (may need to process fewer in parallel)
---
### 🟡 MEDIUM: Low Forecast Cache Hit Rate
**Alert:** `ForecastCacheHitRateLow < 50%`
**Impact:** LOW - Increased load on Forecasting Service, slower responses
**Immediate Actions (< 10 minutes):**
```bash
# 1. Check Redis is running
kubectl get pods -n redis | grep redis
redis-cli ping # Should return PONG
# 2. Check cache statistics
curl http://forecasting-service:8000/api/v1/{tenant_id}/forecasting/cache/stats \
-H "Authorization: Bearer $ADMIN_TOKEN"
# 3. Check cache keys
redis-cli KEYS "forecast:*" | wc -l # Should have many entries
# 4. Check Redis memory
redis-cli INFO memory | grep used_memory_human
# 5. If cache is empty or Redis is down, restart Redis
kubectl rollout restart statefulset/redis -n redis
```
**Follow-up Actions:**
- Monitor cache rebuild (hit rate should recover to ~80-90% within a day)
- Check Redis configuration (memory limits, eviction policy)
- Review forecast TTL settings
- Check for cache invalidation bugs
---
### 🟢 LOW: Plan Rejected by User
**Alert:** `procurement_plan_rejections_total` increasing
**Impact:** LOW - Normal user workflow
**Actions (< 5 minutes):**
```bash
# 1. Check rejection logs for patterns
kubectl logs -n orders deployment/orders-service --tail=200 | \
grep "plan rejection"
# 2. Check if auto-regeneration triggered
kubectl logs -n orders deployment/orders-service --tail=200 | \
grep "Auto-regenerating plan"
# 3. Verify rejection notification sent
# Check RabbitMQ queue: procurement.plan.rejected
# 4. If rejection notes mention "stale" or "outdated", plan will auto-regenerate
# Otherwise, user needs to manually regenerate or modify plan
```
**Follow-up Actions:**
- Review rejection reasons for trends
- Consider user training if many rejections
- Improve plan accuracy if consistent issues
---
## Health Check Commands
### Quick Service Health Check
```bash
# Production Service
curl http://production-service:8000/health | jq .
# Orders Service
curl http://orders-service:8000/health | jq .
# Forecasting Service
curl http://forecasting-service:8000/health | jq .
# Redis
redis-cli ping
# RabbitMQ
curl http://rabbitmq:15672/api/health/checks/alarms \
-u guest:guest | jq .
```
### Detailed Scheduler Status
```bash
# Check last scheduler run time
curl http://production-service:8000/health | \
jq '.custom_checks.scheduler_service'
# Check APScheduler job status (requires internal access)
# Look for: scheduler.get_jobs() output in logs
kubectl logs -n production deployment/production-service | \
grep "scheduled jobs configured"
```
### Database Connectivity
```bash
# Check production database
kubectl exec -it deployment/production-service -n production -- \
python -c "from app.core.database import database_manager; \
import asyncio; \
asyncio.run(database_manager.health_check())"
# Check orders database
kubectl exec -it deployment/orders-service -n orders -- \
python -c "from app.core.database import database_manager; \
import asyncio; \
asyncio.run(database_manager.health_check())"
```
---
## Maintenance Procedures
### Disable Schedulers (Maintenance Mode)
```bash
# 1. Set environment variable to disable schedulers
kubectl set env deployment/production-service SCHEDULER_DISABLED=true -n production
kubectl set env deployment/orders-service SCHEDULER_DISABLED=true -n orders
# 2. Wait for pods to restart
kubectl rollout status deployment/production-service -n production
kubectl rollout status deployment/orders-service -n orders
# 3. Verify schedulers are disabled (check logs)
kubectl logs -n production deployment/production-service | grep "Scheduler disabled"
```
### Re-enable Schedulers (After Maintenance)
```bash
# 1. Remove environment variable
kubectl set env deployment/production-service SCHEDULER_DISABLED- -n production
kubectl set env deployment/orders-service SCHEDULER_DISABLED- -n orders
# 2. Wait for pods to restart
kubectl rollout status deployment/production-service -n production
kubectl rollout status deployment/orders-service -n orders
# 3. Manually trigger to catch up (if during scheduled time)
curl -X POST http://production-service:8000/test/production-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
curl -X POST http://orders-service:8000/test/procurement-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
```
### Clear Forecast Cache
```bash
# Clear all forecast cache (will rebuild automatically)
redis-cli KEYS "forecast:*" | xargs redis-cli DEL
# Clear specific tenant's cache
redis-cli KEYS "forecast:{tenant_id}:*" | xargs redis-cli DEL
# Verify cache cleared
redis-cli DBSIZE
```
---
## Metrics to Monitor
### Production Scheduler
```promql
# Success rate (should be > 95%)
rate(production_schedules_generated_total{status="success"}[5m]) /
rate(production_schedules_generated_total[5m])
# P95 generation time (should be < 60s)
histogram_quantile(0.95,
rate(production_schedule_generation_duration_seconds_bucket[5m]))
# Failed tenants (should be 0)
increase(production_tenants_processed_total{status="failure"}[5m])
```
### Procurement Scheduler
```promql
# Success rate (should be > 95%)
rate(procurement_plans_generated_total{status="success"}[5m]) /
rate(procurement_plans_generated_total[5m])
# P95 generation time (should be < 60s)
histogram_quantile(0.95,
rate(procurement_plan_generation_duration_seconds_bucket[5m]))
# Failed tenants (should be 0)
increase(procurement_tenants_processed_total{status="failure"}[5m])
```
### Forecast Cache
```promql
# Cache hit rate (should be > 70%)
forecast_cache_hit_rate
# Cache hits per minute
rate(forecast_cache_hits_total[5m])
# Cache misses per minute
rate(forecast_cache_misses_total[5m])
```
---
## Log Patterns to Watch
### Success Patterns
```
✅ "Daily production planning completed" - All tenants processed
✅ "Production schedule created successfully" - Individual tenant success
✅ "Forecast cache HIT" - Cache working correctly
✅ "Production scheduler service started" - Service initialized
```
### Warning Patterns
```
⚠️ "Tenant processing timed out" - Individual tenant taking too long
⚠️ "Forecast cache MISS" - Cache miss (expected some, but not all)
⚠️ "Approving plan older than 24 hours" - Stale plan being approved
⚠️ "Could not fetch tenant timezone" - Timezone configuration issue
```
### Error Patterns
```
❌ "Daily production planning failed completely" - Complete failure
❌ "Error processing tenant production" - Tenant-specific failure
❌ "Forecast cache Redis connection failed" - Cache unavailable
❌ "Migration version mismatch" - Database migration issue
❌ "Failed to publish event" - RabbitMQ connectivity issue
```
---
## Escalation Procedure
### Level 1: DevOps On-Call (0-30 minutes)
- Check service health
- Review logs for obvious errors
- Restart services if crashed
- Manually trigger schedulers if needed
- Monitor for resolution
### Level 2: Backend Team (30-60 minutes)
- Investigate complex errors
- Check database issues
- Review scheduler logic
- Coordinate with other teams (if external service issue)
### Level 3: Engineering Lead (> 60 minutes)
- Major architectural issues
- Database corruption or loss
- Multi-service cascading failures
- Decisions on emergency fixes vs. scheduled maintenance
---
## Testing After Deployment
### Post-Deployment Checklist
```bash
# 1. Verify services are running
kubectl get pods -n production
kubectl get pods -n orders
# 2. Check health endpoints
curl http://production-service:8000/health
curl http://orders-service:8000/health
# 3. Verify schedulers are configured
kubectl logs -n production deployment/production-service | \
grep "scheduled jobs configured"
# 4. Manually trigger test run
curl -X POST http://production-service:8000/test/production-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
curl -X POST http://orders-service:8000/test/procurement-scheduler \
-H "Authorization: Bearer $ADMIN_TOKEN"
# 5. Verify test run completed successfully
kubectl logs -n production deployment/production-service | \
grep "Production schedule created successfully"
kubectl logs -n orders deployment/orders-service | \
grep "Procurement plan generated successfully"
# 6. Check metrics dashboard
# Visit: http://grafana:3000/d/production-planning
```
---
## Known Issues & Workarounds
### Issue: Scheduler runs twice in distributed setup
**Symptom:** Duplicate schedules/plans for same tenant and date
**Cause:** Leader election not working (RabbitMQ connection issue)
**Workaround:**
```bash
# Temporarily scale to single instance
kubectl scale deployment/production-service --replicas=1 -n production
kubectl scale deployment/orders-service --replicas=1 -n orders
# Fix RabbitMQ connectivity
# Then scale back up
kubectl scale deployment/production-service --replicas=3 -n production
kubectl scale deployment/orders-service --replicas=3 -n orders
```
### Issue: Timezone shows wrong time
**Symptom:** Schedules generated at wrong hour
**Cause:** Tenant timezone not configured or incorrect
**Workaround:**
```sql
-- Check tenant timezone
SELECT id, name, timezone FROM tenants WHERE id = '{tenant_id}';
-- Update if incorrect
UPDATE tenants SET timezone = 'Europe/Madrid' WHERE id = '{tenant_id}';
-- Verify server uses UTC
-- In container: date (should show UTC)
```
### Issue: Forecast cache always misses
**Symptom:** `forecast_cache_hit_rate = 0%`
**Cause:** Redis not accessible or REDIS_URL misconfigured
**Workaround:**
```bash
# Check REDIS_URL environment variable
kubectl get deployment forecasting-service -n forecasting -o yaml | \
grep REDIS_URL
# Should be: redis://redis:6379/0
# If incorrect, update:
kubectl set env deployment/forecasting-service \
REDIS_URL=redis://redis:6379/0 -n forecasting
```
---
## Additional Resources
- **Full Documentation:** [PRODUCTION_PLANNING_SYSTEM.md](./PRODUCTION_PLANNING_SYSTEM.md)
- **Metrics File:** [`shared/monitoring/scheduler_metrics.py`](../shared/monitoring/scheduler_metrics.py)
- **Scheduler Code:**
- Production: [`services/production/app/services/production_scheduler_service.py`](../services/production/app/services/production_scheduler_service.py)
- Procurement: [`services/orders/app/services/procurement_scheduler_service.py`](../services/orders/app/services/procurement_scheduler_service.py)
- **Forecast Cache:** [`services/forecasting/app/services/forecast_cache.py`](../services/forecasting/app/services/forecast_cache.py)
---
**Runbook Version:** 1.0
**Last Updated:** 2025-10-09
**Maintained By:** Backend Team

View File

@@ -25,6 +25,7 @@
"clsx": "^2.0.0", "clsx": "^2.0.0",
"date-fns": "^2.30.0", "date-fns": "^2.30.0",
"date-fns-tz": "^2.0.0", "date-fns-tz": "^2.0.0",
"driver.js": "^1.3.6",
"event-source-polyfill": "^1.0.31", "event-source-polyfill": "^1.0.31",
"framer-motion": "^10.16.0", "framer-motion": "^10.16.0",
"i18next": "^23.7.0", "i18next": "^23.7.0",
@@ -8566,6 +8567,12 @@
"node": ">=12" "node": ">=12"
} }
}, },
"node_modules/driver.js": {
"version": "1.3.6",
"resolved": "https://registry.npmjs.org/driver.js/-/driver.js-1.3.6.tgz",
"integrity": "sha512-g2nNuu+tWmPpuoyk3ffpT9vKhjPz4NrJzq6mkRDZIwXCrFhrKdDJ9TX5tJOBpvCTBrBYjgRQ17XlcQB15q4gMg==",
"license": "MIT"
},
"node_modules/dunder-proto": { "node_modules/dunder-proto": {
"version": "1.0.1", "version": "1.0.1",
"resolved": "https://registry.npmjs.org/dunder-proto/-/dunder-proto-1.0.1.tgz", "resolved": "https://registry.npmjs.org/dunder-proto/-/dunder-proto-1.0.1.tgz",

View File

@@ -35,6 +35,7 @@
"clsx": "^2.0.0", "clsx": "^2.0.0",
"date-fns": "^2.30.0", "date-fns": "^2.30.0",
"date-fns-tz": "^2.0.0", "date-fns-tz": "^2.0.0",
"driver.js": "^1.3.6",
"event-source-polyfill": "^1.0.31", "event-source-polyfill": "^1.0.31",
"framer-motion": "^10.16.0", "framer-motion": "^10.16.0",
"i18next": "^23.7.0", "i18next": "^23.7.0",

View File

@@ -92,12 +92,16 @@ class ApiClient {
if (this.tenantId && !isPublicEndpoint) {
  config.headers['X-Tenant-ID'] = this.tenantId;
  console.log('🔍 [API Client] Adding X-Tenant-ID header:', this.tenantId, 'for URL:', config.url);
} else if (!isPublicEndpoint) {
  console.warn('⚠️ [API Client] No tenant ID set for non-public endpoint:', config.url);
}
// Check demo session ID from memory OR localStorage
const demoSessionId = this.demoSessionId || localStorage.getItem('demo_session_id');
if (demoSessionId) {
  config.headers['X-Demo-Session-Id'] = demoSessionId;
  console.log('🔍 [API Client] Adding X-Demo-Session-Id header:', demoSessionId);
}
return config;
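As a rough illustration of how this interceptor is expected to be fed (not part of this diff), the sketch below creates a demo session and persists its ID in both memory and localStorage. `createDemoSession` and `apiClient.setDemoSessionId` exist in this commit; the helper name, import paths and the `session_id`/`expires_at` field names (taken from the previous `DemoSession` interface) are assumptions.
```typescript
import { apiClient } from '../client';                // illustrative path
import { createDemoSession } from './services/demo';  // illustrative path

// Hypothetical helper: create a demo session and persist its ID so that the
// interceptor above attaches X-Demo-Session-Id on every request, even after
// a page reload (it also falls back to localStorage, as shown above).
export async function enterDemoMode(): Promise<void> {
  const session = await createDemoSession({ demo_account_type: 'individual_bakery' });
  localStorage.setItem('demo_mode', 'true');
  localStorage.setItem('demo_session_id', session.session_id);
  localStorage.setItem('demo_expires_at', session.expires_at);
  apiClient.setDemoSessionId(session.session_id);
}
```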

View File

@@ -15,6 +15,7 @@
*/
import { apiClient } from '../client';
import type { DemoSessionResponse } from '../types/demo';
export interface DemoAccount {
account_type: string;
@@ -26,16 +27,8 @@ export interface DemoAccount {
business_model?: string;
}
export interface DemoSession {
session_id: string;
virtual_tenant_id: string;
base_demo_tenant_id: string;
demo_account_type: string;
status: 'active' | 'expired' | 'destroyed';
created_at: string;
expires_at: string;
remaining_extensions: number;
}
// Use the complete type from types/demo.ts which matches backend response
export type DemoSession = DemoSessionResponse;
export interface CreateSessionRequest {
demo_account_type: 'individual_bakery' | 'central_baker';
@@ -49,6 +42,40 @@ export interface DestroySessionRequest {
session_id: string;
}
export interface ServiceProgress {
status: 'not_started' | 'in_progress' | 'completed' | 'failed';
records_cloned: number;
error?: string;
}
export interface SessionStatusResponse {
session_id: string;
status: 'pending' | 'ready' | 'partial' | 'failed' | 'active' | 'expired' | 'destroyed';
total_records_cloned: number;
progress?: Record<string, ServiceProgress>;
errors?: Array<{ service: string; error_message: string }>;
}
// ===================================================================
// OPERATIONS: Demo Session Status and Cloning
// ===================================================================
/**
* Get session status
* GET /demo/sessions/{session_id}/status
*/
export const getSessionStatus = async (sessionId: string): Promise<SessionStatusResponse> => {
return await apiClient.get<SessionStatusResponse>(`/demo/sessions/${sessionId}/status`);
};
/**
* Retry data cloning for a session
* POST /demo/sessions/{session_id}/retry
*/
export const retryCloning = async (sessionId: string): Promise<SessionStatusResponse> => {
return await apiClient.post<SessionStatusResponse>(`/demo/sessions/${sessionId}/retry`, {});
};
// ===================================================================
// ATOMIC: Demo Accounts
// Backend: services/demo_session/app/api/demo_accounts.py
@@ -131,3 +158,47 @@ export const getDemoStats = async (): Promise<any> => {
export const cleanupExpiredSessions = async (): Promise<any> => {
return await apiClient.post('/demo/operations/cleanup', {});
};
// ===================================================================
// API Service Class
// ===================================================================
export class DemoSessionAPI {
async getDemoAccounts(): Promise<DemoAccount[]> {
return getDemoAccounts();
}
async createDemoSession(request: CreateSessionRequest): Promise<DemoSession> {
return createDemoSession(request);
}
async getDemoSession(sessionId: string): Promise<any> {
return getDemoSession(sessionId);
}
async extendDemoSession(request: ExtendSessionRequest): Promise<DemoSession> {
return extendDemoSession(request);
}
async destroyDemoSession(request: DestroySessionRequest): Promise<{ message: string }> {
return destroyDemoSession(request);
}
async getDemoStats(): Promise<any> {
return getDemoStats();
}
async cleanupExpiredSessions(): Promise<any> {
return cleanupExpiredSessions();
}
async getSessionStatus(sessionId: string): Promise<SessionStatusResponse> {
return getSessionStatus(sessionId);
}
async retryCloning(sessionId: string): Promise<SessionStatusResponse> {
return retryCloning(sessionId);
}
}
export const demoSessionAPI = new DemoSessionAPI();
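A possible consumption pattern for the status and retry endpoints added above (a sketch, not part of this commit): poll `getSessionStatus` until cloning finishes and fall back to `retryCloning` once on failure. The helper name, polling interval and attempt limit are arbitrary assumptions.
```typescript
import { demoSessionAPI, type SessionStatusResponse } from './demo'; // illustrative path

// Hypothetical helper: resolve once the clone jobs report enough data to
// start the demo, retrying the cloning once if the backend reports failure.
export async function waitForDemoData(
  sessionId: string,
  intervalMs = 2000,
  maxAttempts = 60,
): Promise<SessionStatusResponse> {
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const status = await demoSessionAPI.getSessionStatus(sessionId);
    if (status.status === 'ready' || status.status === 'partial') {
      return status; // enough records cloned to start exploring
    }
    if (status.status === 'failed') {
      return demoSessionAPI.retryCloning(sessionId); // single retry, then surface errors
    }
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
  throw new Error('Demo session data was not ready in time');
}
```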

View File

@@ -0,0 +1,105 @@
import React from 'react';
import { Button, Card, CardBody } from '../ui';
import { PublicLayout } from '../layout';
import { AlertCircle, RefreshCw, Home, HelpCircle } from 'lucide-react';
interface Props {
error: string;
details?: Array<{ service: string; error_message: string }>;
onRetry: () => void;
isRetrying?: boolean;
}
export const DemoErrorScreen: React.FC<Props> = ({
error,
details,
onRetry,
isRetrying = false,
}) => {
return (
<PublicLayout
variant="centered"
headerProps={{
showThemeToggle: true,
showAuthButtons: false,
}}
>
<div className="max-w-2xl mx-auto p-8">
<Card className="shadow-xl">
<CardBody className="p-8 text-center">
<div className="flex justify-center mb-4">
<AlertCircle className="w-20 h-20 text-[var(--color-error)]" />
</div>
<h1 className="text-3xl font-bold text-[var(--color-error)] mb-3">
Error en la Configuración del Demo
</h1>
<p className="text-[var(--text-secondary)] mb-6 text-lg">
{error}
</p>
{details && details.length > 0 && (
<div className="mb-6 p-4 bg-[var(--color-error)]/10 border border-[var(--color-error)] rounded-lg text-left">
<h3 className="text-sm font-semibold text-[var(--text-primary)] mb-3">
Detalles del error:
</h3>
<ul className="space-y-2">
{details.map((detail, idx) => (
<li key={idx} className="text-sm">
<span className="font-medium text-[var(--text-primary)]">
{detail.service}:
</span>{' '}
<span className="text-[var(--text-secondary)]">
{detail.error_message}
</span>
</li>
))}
</ul>
</div>
)}
<div className="flex flex-col gap-3 max-w-md mx-auto mt-6">
<Button
onClick={onRetry}
disabled={isRetrying}
variant="primary"
size="lg"
className="w-full"
>
<RefreshCw className={`w-5 h-5 mr-2 ${isRetrying ? 'animate-spin' : ''}`} />
{isRetrying ? 'Reintentando...' : 'Reintentar Configuración'}
</Button>
<Button
onClick={() => window.location.href = '/demo'}
variant="secondary"
size="lg"
className="w-full"
>
<Home className="w-5 h-5 mr-2" />
Volver a la Página Demo
</Button>
<Button
onClick={() => window.location.href = '/contact'}
variant="ghost"
size="lg"
className="w-full"
>
<HelpCircle className="w-5 h-5 mr-2" />
Contactar Soporte
</Button>
</div>
<div className="mt-6">
<p className="text-xs text-[var(--text-tertiary)]">
Si el problema persiste, por favor contacta a nuestro equipo de soporte.
</p>
</div>
</CardBody>
</Card>
</div>
</PublicLayout>
);
};
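One way this screen might be wired up (illustrative only): map the `errors` array of the `SessionStatusResponse` from the demo API onto the `details` prop. The wrapper name, import paths and Spanish copy below are assumptions.
```tsx
import React from 'react';
import { DemoErrorScreen } from './DemoErrorScreen';              // illustrative path
import type { SessionStatusResponse } from '@/api/services/demo';

// Hypothetical wrapper: show the per-service cloning errors reported by
// GET /demo/sessions/{session_id}/status on the error screen above.
export const DemoSetupFailed: React.FC<{
  status: SessionStatusResponse;
  onRetry: () => void;
  isRetrying: boolean;
}> = ({ status, onRetry, isRetrying }) => (
  <DemoErrorScreen
    error="No se pudieron preparar los datos de tu sesión demo."
    details={status.errors}
    onRetry={onRetry}
    isRetrying={isRetrying}
  />
);
```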

View File

@@ -0,0 +1,132 @@
import React from 'react';
import { Badge, ProgressBar } from '../ui';
import { CheckCircle, XCircle, Loader2, Clock } from 'lucide-react';
import { ServiceProgress } from '@/api/services/demo';
interface Props {
progress: Record<string, ServiceProgress>;
}
const SERVICE_LABELS: Record<string, string> = {
tenant: 'Tenant Virtual',
inventory: 'Inventario y Productos',
recipes: 'Recetas',
sales: 'Historial de Ventas',
orders: 'Pedidos de Clientes',
suppliers: 'Proveedores',
production: 'Producción',
forecasting: 'Pronósticos',
};
const SERVICE_DESCRIPTIONS: Record<string, string> = {
tenant: 'Creando tu entorno demo aislado',
inventory: 'Cargando ingredientes, recetas y datos de stock',
recipes: 'Configurando recetas y fórmulas',
sales: 'Importando registros de ventas históricas',
orders: 'Configurando pedidos de clientes',
suppliers: 'Importando datos de proveedores',
production: 'Configurando lotes de producción',
forecasting: 'Preparando datos de pronósticos',
};
export const DemoProgressIndicator: React.FC<Props> = ({ progress }) => {
return (
<div className="mt-4 space-y-3">
{Object.entries(progress).map(([serviceName, serviceProgress]) => (
<div
key={serviceName}
className={`
p-4 rounded-lg border-2 transition-all
${
serviceProgress.status === 'completed'
? 'bg-green-50 dark:bg-green-900/20 border-green-500'
: serviceProgress.status === 'failed'
? 'bg-red-50 dark:bg-red-900/20 border-red-500'
: serviceProgress.status === 'in_progress'
? 'bg-blue-50 dark:bg-blue-900/20 border-blue-500'
: 'bg-gray-50 dark:bg-gray-800/20 border-gray-300 dark:border-gray-700'
}
`}
>
<div className="flex items-center justify-between mb-2">
<div className="flex items-center gap-3 flex-1">
<div className="flex-shrink-0">
{getStatusIcon(serviceProgress.status)}
</div>
<div className="flex-1 min-w-0">
<h3 className="text-sm font-semibold text-[var(--text-primary)] truncate">
{SERVICE_LABELS[serviceName] || serviceName}
</h3>
<p className="text-xs text-[var(--text-secondary)] truncate">
{SERVICE_DESCRIPTIONS[serviceName] || 'Procesando...'}
</p>
</div>
</div>
<Badge variant={getStatusVariant(serviceProgress.status)}>
{getStatusLabel(serviceProgress.status)}
</Badge>
</div>
{serviceProgress.status === 'in_progress' && (
<div className="my-2">
<ProgressBar value={50} variant="primary" className="animate-pulse" />
</div>
)}
{serviceProgress.records_cloned > 0 && (
<p className="text-xs text-[var(--text-secondary)] mt-2">
{serviceProgress.records_cloned} registros clonados
</p>
)}
{serviceProgress.error && (
<p className="text-xs text-[var(--color-error)] mt-2">
Error: {serviceProgress.error}
</p>
)}
</div>
))}
</div>
);
};
function getStatusIcon(status: ServiceProgress['status']): React.ReactNode {
switch (status) {
case 'completed':
return <CheckCircle className="w-5 h-5 text-green-500" />;
case 'failed':
return <XCircle className="w-5 h-5 text-red-500" />;
case 'in_progress':
return <Loader2 className="w-5 h-5 text-blue-500 animate-spin" />;
default:
return <Clock className="w-5 h-5 text-gray-400" />;
}
}
function getStatusLabel(status: ServiceProgress['status']): string {
switch (status) {
case 'completed':
return 'Completado';
case 'failed':
return 'Fallido';
case 'in_progress':
return 'En Progreso';
default:
return 'Pendiente';
}
}
function getStatusVariant(
status: ServiceProgress['status']
): 'success' | 'error' | 'info' | 'default' {
switch (status) {
case 'completed':
return 'success';
case 'failed':
return 'error';
case 'in_progress':
return 'info';
default:
return 'default';
}
}
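A hypothetical pairing with the status endpoint from this commit: poll `getSessionStatus` and pass its `progress` map straight into this component. The wrapper name, import paths and the 2-second interval are assumptions.
```tsx
import React, { useEffect, useState } from 'react';
import { DemoProgressIndicator } from './DemoProgressIndicator';       // illustrative path
import { demoSessionAPI, type ServiceProgress } from '@/api/services/demo';

// Hypothetical wrapper: keep the per-service progress in sync while the
// clone jobs run, and stop polling once they finish or fail.
export const DemoSetupProgress: React.FC<{ sessionId: string }> = ({ sessionId }) => {
  const [progress, setProgress] = useState<Record<string, ServiceProgress>>({});

  useEffect(() => {
    const timer = setInterval(async () => {
      const status = await demoSessionAPI.getSessionStatus(sessionId);
      if (status.progress) setProgress(status.progress);
      if (status.status === 'ready' || status.status === 'partial' || status.status === 'failed') {
        clearInterval(timer);
      }
    }, 2000);
    return () => clearInterval(timer);
  }, [sessionId]);

  return <DemoProgressIndicator progress={progress} />;
};
```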

View File

@@ -211,9 +211,6 @@ export const AppShell = forwardRef<AppShellRef, AppShellProps>(({
)}
data-testid="app-shell"
>
{/* Demo Banner */}
<DemoBanner />
{/* Header */}
{shouldShowHeader && (
<Header
@@ -223,6 +220,9 @@ export const AppShell = forwardRef<AppShellRef, AppShellProps>(({
/>
)}
{/* Demo Banner - appears below header */}
<DemoBanner />
<div className="flex flex-1 relative">
{/* Sidebar */}
{shouldShowSidebar && (

View File

@@ -2,19 +2,26 @@ import React, { useState, useEffect } from 'react';
import { extendDemoSession, destroyDemoSession } from '../../../api/services/demo'; import { extendDemoSession, destroyDemoSession } from '../../../api/services/demo';
import { apiClient } from '../../../api/client'; import { apiClient } from '../../../api/client';
import { useNavigate } from 'react-router-dom'; import { useNavigate } from 'react-router-dom';
import { useDemoTour, getTourState, trackTourEvent } from '../../../features/demo-onboarding';
import { BookOpen, Clock, Sparkles, X } from 'lucide-react';
export const DemoBanner: React.FC = () => { export const DemoBanner: React.FC = () => {
const navigate = useNavigate(); const navigate = useNavigate();
const [isDemo, setIsDemo] = useState(false); const { startTour, resumeTour, tourState } = useDemoTour();
const [expiresAt, setExpiresAt] = useState<string | null>(null); const [isDemo, setIsDemo] = useState(() => localStorage.getItem('demo_mode') === 'true');
const [expiresAt, setExpiresAt] = useState<string | null>(() => localStorage.getItem('demo_expires_at'));
const [timeRemaining, setTimeRemaining] = useState<string>(''); const [timeRemaining, setTimeRemaining] = useState<string>('');
const [canExtend, setCanExtend] = useState(true); const [canExtend, setCanExtend] = useState(true);
const [extending, setExtending] = useState(false); const [extending, setExtending] = useState(false);
const [showExitModal, setShowExitModal] = useState(false);
useEffect(() => { useEffect(() => {
const demoMode = localStorage.getItem('demo_mode') === 'true'; const demoMode = localStorage.getItem('demo_mode') === 'true';
const expires = localStorage.getItem('demo_expires_at'); const expires = localStorage.getItem('demo_expires_at');
console.log('[DemoBanner] Demo mode from localStorage:', demoMode);
console.log('[DemoBanner] Expires at:', expires);
setIsDemo(demoMode); setIsDemo(demoMode);
setExpiresAt(expires); setExpiresAt(expires);
@@ -43,6 +50,7 @@ export const DemoBanner: React.FC = () => {
localStorage.removeItem('demo_session_id'); localStorage.removeItem('demo_session_id');
localStorage.removeItem('demo_account_type'); localStorage.removeItem('demo_account_type');
localStorage.removeItem('demo_expires_at'); localStorage.removeItem('demo_expires_at');
localStorage.removeItem('demo_tenant_id');
apiClient.setDemoSessionId(null); apiClient.setDemoSessionId(null);
navigate('/demo'); navigate('/demo');
}; };
@@ -69,10 +77,13 @@ export const DemoBanner: React.FC = () => {
}; };
const handleEndSession = async () => { const handleEndSession = async () => {
setShowExitModal(true);
};
const confirmEndSession = async () => {
const sessionId = apiClient.getDemoSessionId(); const sessionId = apiClient.getDemoSessionId();
if (!sessionId) return; if (!sessionId) return;
if (confirm('¿Estás seguro de que quieres terminar la sesión demo?')) {
try { try {
await destroyDemoSession({ session_id: sessionId }); await destroyDemoSession({ session_id: sessionId });
} catch (error) { } catch (error) {
@@ -80,66 +91,158 @@ export const DemoBanner: React.FC = () => {
} finally { } finally {
handleExpiration(); handleExpiration();
} }
};
const handleCreateAccount = () => {
trackTourEvent({
event: 'conversion_cta_clicked',
timestamp: Date.now(),
});
navigate('/register?from=demo_banner');
};
const handleStartTour = () => {
if (tourState && tourState.currentStep > 0 && !tourState.completed) {
resumeTour();
} else {
startTour();
} }
}; };
if (!isDemo) return null; if (!isDemo) {
console.log('[DemoBanner] Not demo mode, returning null');
return null;
}
return ( return (
<div className="bg-gradient-to-r from-amber-500 to-orange-500 text-white px-4 py-2 shadow-md"> <>
<div className="max-w-7xl mx-auto flex items-center justify-between"> <div
<div className="flex items-center space-x-4"> data-tour="demo-banner"
<div className="flex items-center"> className="bg-gradient-to-r from-amber-500 to-orange-500 text-white shadow-lg sticky top-[var(--header-height)] z-[1100]"
<svg style={{ minHeight: '60px' }}
className="w-5 h-5 mr-2"
fill="currentColor"
viewBox="0 0 20 20"
> >
<path <div className="max-w-7xl mx-auto px-4 py-3">
fillRule="evenodd" <div className="flex flex-col lg:flex-row lg:items-center lg:justify-between gap-3">
d="M18 10a8 8 0 11-16 0 8 8 0 0116 0zm-7-4a1 1 0 11-2 0 1 1 0 012 0zM9 9a1 1 0 000 2v3a1 1 0 001 1h1a1 1 0 100-2v-3a1 1 0 00-1-1H9z" {/* Left section - Demo info */}
clipRule="evenodd" <div className="flex items-center gap-4 flex-wrap">
/> <div className="flex items-center gap-2">
</svg> <Sparkles className="w-5 h-5" />
<span className="font-medium">Modo Demo</span> <span className="font-bold text-base">Sesión Demo Activa</span>
</div> </div>
<div className="hidden sm:flex items-center text-sm"> <div className="flex items-center gap-2 text-sm bg-white/20 rounded-md px-3 py-1">
<svg <Clock className="w-4 h-4" />
className="w-4 h-4 mr-1" <span className="font-mono font-semibold">{timeRemaining}</span>
fill="currentColor"
viewBox="0 0 20 20"
>
<path
fillRule="evenodd"
d="M10 18a8 8 0 100-16 8 8 0 000 16zm1-12a1 1 0 10-2 0v4a1 1 0 00.293.707l2.828 2.829a1 1 0 101.415-1.415L11 9.586V6z"
clipRule="evenodd"
/>
</svg>
Tiempo restante: <span className="font-mono ml-1">{timeRemaining}</span>
</div>
</div> </div>
<div className="flex items-center space-x-2"> {tourState && !tourState.completed && tourState.currentStep > 0 && (
<div className="hidden md:flex items-center gap-2 text-sm bg-white/20 rounded-md px-3 py-1">
<span>Tutorial pausado en paso {tourState.currentStep}</span>
</div>
)}
</div>
{/* Right section - Actions */}
<div data-tour="demo-banner-actions" className="flex items-center gap-2 flex-wrap">
{/* Tour button */}
<button
onClick={handleStartTour}
className="inline-flex items-center gap-2 px-4 py-2 bg-white text-amber-600 rounded-lg text-sm font-semibold hover:bg-amber-50 transition-all shadow-sm hover:shadow-md"
>
<BookOpen className="w-4 h-4" />
<span className="hidden sm:inline">
{tourState && tourState.currentStep > 0 && !tourState.completed
? 'Continuar Tutorial'
: 'Ver Tutorial'}
</span>
<span className="sm:hidden">Tutorial</span>
</button>
{/* Create account CTA */}
<button
onClick={handleCreateAccount}
className="inline-flex items-center gap-2 px-4 py-2 bg-white text-orange-600 rounded-lg text-sm font-bold hover:bg-orange-50 transition-all shadow-sm hover:shadow-md animate-pulse hover:animate-none"
>
<Sparkles className="w-4 h-4" />
<span className="hidden lg:inline">¡Crear Cuenta Gratis!</span>
<span className="lg:hidden">Crear Cuenta</span>
</button>
{/* Extend session */}
{canExtend && ( {canExtend && (
<button <button
onClick={handleExtendSession} onClick={handleExtendSession}
disabled={extending} disabled={extending}
className="px-3 py-1 bg-white/20 hover:bg-white/30 rounded-md text-sm font-medium transition-colors disabled:opacity-50" className="px-3 py-2 bg-white/20 hover:bg-white/30 rounded-lg text-sm font-medium transition-colors disabled:opacity-50 hidden sm:block"
> >
{extending ? 'Extendiendo...' : '+30 min'} {extending ? 'Extendiendo...' : '+30 min'}
</button> </button>
)} )}
{/* End session */}
<button <button
onClick={handleEndSession} onClick={handleEndSession}
className="px-3 py-1 bg-white/20 hover:bg-white/30 rounded-md text-sm font-medium transition-colors" className="px-3 py-2 bg-white/10 hover:bg-white/20 rounded-lg text-sm font-medium transition-colors"
> >
Terminar Demo <span className="hidden sm:inline">Salir</span>
<X className="w-4 h-4 sm:hidden" />
</button> </button>
</div> </div>
</div> </div>
</div> </div>
</div>
{/* Exit confirmation modal */}
{showExitModal && (
<div className="fixed inset-0 bg-black/50 backdrop-blur-sm z-[9999] flex items-center justify-center p-4">
<div className="bg-[var(--bg-primary)] rounded-2xl shadow-2xl max-w-md w-full p-6 border border-[var(--border-default)]">
<div className="flex items-start gap-4 mb-4">
<div className="p-3 bg-amber-100 dark:bg-amber-900/30 rounded-xl">
<Sparkles className="w-6 h-6 text-amber-600 dark:text-amber-400" />
</div>
<div className="flex-1">
<h3 className="text-xl font-bold text-[var(--text-primary)] mb-2">
¿Seguro que quieres salir?
</h3>
<p className="text-[var(--text-secondary)] text-sm leading-relaxed">
Aún te quedan <span className="font-bold text-amber-600">{timeRemaining}</span> de sesión demo.
</p>
</div>
</div>
<div className="bg-gradient-to-br from-amber-50 to-orange-50 dark:from-amber-950/30 dark:to-orange-950/30 rounded-xl p-4 mb-6 border border-amber-200 dark:border-amber-800">
<p className="text-sm font-semibold text-[var(--text-primary)] mb-3">
¿Te gusta lo que ves?
</p>
<p className="text-sm text-[var(--text-secondary)] mb-4 leading-relaxed">
Crea una cuenta <span className="font-bold">gratuita</span> para acceder a todas las funcionalidades sin límites de tiempo y guardar tus datos de forma permanente.
</p>
<button
onClick={handleCreateAccount}
className="w-full py-3 bg-gradient-to-r from-amber-500 to-orange-500 text-white rounded-lg font-bold hover:from-amber-600 hover:to-orange-600 transition-all shadow-md hover:shadow-lg"
>
Crear Mi Cuenta Gratis
</button>
</div>
<div className="flex gap-3">
<button
onClick={() => setShowExitModal(false)}
className="flex-1 px-4 py-2.5 bg-[var(--bg-secondary)] hover:bg-[var(--bg-tertiary)] text-[var(--text-primary)] rounded-lg font-semibold transition-colors border border-[var(--border-default)]"
>
Seguir en Demo
</button>
<button
onClick={confirmEndSession}
className="flex-1 px-4 py-2.5 bg-red-50 dark:bg-red-900/20 hover:bg-red-100 dark:hover:bg-red-900/30 text-red-600 dark:text-red-400 rounded-lg font-semibold transition-colors border border-red-200 dark:border-red-800"
>
Salir de Demo
</button>
</div>
</div>
</div>
)}
</>
); );
}; };

View File

@@ -154,6 +154,7 @@ export const Header = forwardRef<HeaderRef, HeaderProps>(({
{/* Left section */}
<div className="flex items-center gap-2 sm:gap-4 flex-1 min-w-0 h-full">
{/* Mobile menu button */}
<div data-tour="sidebar-menu-toggle">
<Button
variant="ghost"
size="sm"
@@ -163,6 +164,7 @@ export const Header = forwardRef<HeaderRef, HeaderProps>(({
>
<Menu className="h-5 w-5 text-[var(--text-primary)]" />
</Button>
</div>
{/* Logo */}
<div className="flex items-center gap-2 sm:gap-3 min-w-0 flex-shrink-0">
@@ -185,7 +187,7 @@ export const Header = forwardRef<HeaderRef, HeaderProps>(({
{/* Tenant Switcher - Desktop */}
{hasAccess && (
<div className="hidden md:block mx-2 lg:mx-4 flex-shrink-0" data-tour="header-tenant-selector">
<TenantSwitcher
showLabel={true}
className="min-w-[160px] max-w-[220px] lg:min-w-[200px] lg:max-w-[280px]"

View File

@@ -516,6 +516,16 @@ export const Sidebar = forwardRef<SidebarRef, SidebarProps>(({
const isHovered = hoveredItem === item.id;
const ItemIcon = item.icon;
// Add tour data attributes for main navigation sections
const getTourAttribute = (itemPath: string) => {
if (itemPath === '/app/database') return 'sidebar-database';
if (itemPath === '/app/operations') return 'sidebar-operations';
if (itemPath === '/app/analytics') return 'sidebar-analytics';
return undefined;
};
const tourAttr = getTourAttribute(item.path);
const itemContent = (
<div
className={clsx(
@@ -635,7 +645,7 @@ export const Sidebar = forwardRef<SidebarRef, SidebarProps>(({
);
return (
<li key={item.id} className="relative" data-tour={tourAttr}>
{isCollapsed && !hasChildren && ItemIcon ? (
<Tooltip content={item.label} side="right">
{button}

View File

@@ -0,0 +1,209 @@
# Demo Onboarding Tour
Interactive onboarding tour system for BakeryIA demo sessions using Driver.js.
## Quick Start
```typescript
import { useDemoTour } from '@/features/demo-onboarding';
function MyComponent() {
const { startTour, resumeTour, tourState } = useDemoTour();
return (
<button onClick={() => startTour()}>
Start Tour
</button>
);
}
```
## Features
-✅**11-step desktop tour** in Spanish
-**8-step mobile tour** optimized for small screens
-**State persistence** with auto-resume
-**Analytics tracking** (Google Analytics, Plausible)
-**Conversion CTAs** throughout experience
-**Responsive design** across all devices
-**Accessibility** (ARIA, keyboard navigation)
## Project Structure
```
demo-onboarding/
├── index.ts # Public API exports
├── types.ts # TypeScript interfaces
├── styles.css # Driver.js custom theme
├── config/ # Configuration
│ ├── driver-config.ts # Driver.js setup
│ └── tour-steps.ts # Tour step definitions
├── hooks/ # React hooks
│ └── useDemoTour.ts # Main tour hook
└── utils/ # Utilities
├── tour-state.ts # State management (sessionStorage)
└── tour-analytics.ts # Analytics tracking
```
## API Reference
### `useDemoTour()`
Main hook for controlling the tour.
**Returns:**
```typescript
{
startTour: (fromStep?: number) => void;
resumeTour: () => void;
resetTour: () => void;
tourActive: boolean;
tourState: TourState | null;
}
```
### `getTourState()`
Get current tour state from sessionStorage.
**Returns:** `TourState | null`
### `saveTourState(state: Partial<TourState>)`
Save tour state to sessionStorage.
### `clearTourState()`
Clear tour state from sessionStorage.
### `shouldStartTour()`
Check if tour should auto-start.
**Returns:** `boolean`
### `trackTourEvent(event: TourAnalyticsEvent)`
Track tour analytics event.
## Tour Steps
### Desktop (12 steps)
1. Welcome to Demo Session
2. Real-time Metrics Dashboard
3. Intelligent Alerts
4. Procurement Plans
5. Production Management
6. Database Navigation (Sidebar)
7. Daily Operations (Sidebar)
8. Analytics & AI (Sidebar)
9. Multi-Bakery Selector (Header)
10. Demo Limitations
11. Final CTA
### Mobile (8 steps)
Optimized version with navigation-heavy steps removed.
## State Management
Tour state is stored in `sessionStorage`:
```typescript
interface TourState {
currentStep: number;
completed: boolean;
dismissed: boolean;
lastUpdated: number;
tourVersion: string;
}
```
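A minimal sketch of reading and updating this state with the exported helpers (the call sites and log message are illustrative):
```typescript
import { getTourState, saveTourState, clearTourState } from '@/features/demo-onboarding';

// Resume from the last step the user saw, unless the tour already finished.
const state = getTourState();
if (state && !state.completed && !state.dismissed) {
  console.log(`Resuming tour from step ${state.currentStep}`);
}

// Persist progress; only the fields being updated need to be passed.
saveTourState({ currentStep: 3 });

// Wipe the stored state, e.g. when the demo session is destroyed.
clearTourState();
```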
## Analytics Events
Tracked events:
- `tour_started`
- `tour_step_completed`
- `tour_dismissed`
- `tour_completed`
- `conversion_cta_clicked`
Events are sent to Google Analytics and Plausible (if available).
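For example, a custom CTA outside the banner could emit one of these events manually (a sketch; the payload shape follows the calls used elsewhere in this feature):
```typescript
import { trackTourEvent } from '@/features/demo-onboarding';

// Record that the prospect clicked a conversion CTA outside the tour itself.
trackTourEvent({
  event: 'conversion_cta_clicked',
  timestamp: Date.now(),
});
```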
## Styling
The tour uses a custom theme that matches BakeryIA's design system:
- CSS variables for colors
- Smooth animations
- Dark mode support
- Responsive breakpoints
## Data Attributes
The tour targets elements with `data-tour` attributes:
```tsx
<div data-tour="dashboard-stats">
{/* Tour will highlight this element */}
</div>
```
**Available tour targets:**
- `demo-banner` - Demo banner
- `demo-banner-actions` - Banner action buttons
- `dashboard-stats` - Metrics grid
- `real-time-alerts` - Alerts section
- `procurement-plans` - Procurement section
- `production-plans` - Production section
- `sidebar-database` - Database navigation
- `sidebar-operations` - Operations navigation
- `sidebar-analytics` - Analytics navigation
- `sidebar-menu-toggle` - Mobile menu button
- `header-tenant-selector` - Tenant switcher
## Integration
### Auto-start on Demo Login
```typescript
// DemoPage.tsx
import { markTourAsStartPending } from '@/features/demo-onboarding';
// After creating demo session
markTourAsStartPending();
navigate('/app/dashboard');
```
### Dashboard Auto-start
```typescript
// DashboardPage.tsx
import { useDemoTour, shouldStartTour } from '@/features/demo-onboarding';
const { startTour } = useDemoTour();
const isDemoMode = localStorage.getItem('demo_mode') === 'true';
useEffect(() => {
if (isDemoMode && shouldStartTour()) {
setTimeout(() => startTour(), 1500);
}
}, [isDemoMode, startTour]);
```
## Browser Support
- Chrome 90+
- Firefox 88+
- Safari 14+
- Edge 90+
- Mobile browsers
## Performance
- **Bundle Size**: +5KB gzipped (Driver.js)
- **Runtime**: Negligible
- **Load Time**: No impact (lazy loaded)
## See Also
- [DEMO_ONBOARDING_TOUR.md](../../../../../DEMO_ONBOARDING_TOUR.md) - Full implementation guide
- [Driver.js Documentation](https://driverjs.com/)

View File

@@ -0,0 +1,41 @@
import { Config } from 'driver.js';
export const getDriverConfig = (
onNext?: (stepIndex: number) => void
): Config => ({
showProgress: true,
animate: true,
smoothScroll: true,
allowClose: true,
overlayClickNext: false,
stagePadding: 10,
stageRadius: 8,
allowKeyboardControl: true,
disableActiveInteraction: false,
doneBtnText: 'Crear Cuenta Gratis',
closeBtnText: '×',
nextBtnText: 'Siguiente →',
prevBtnText: '← Anterior',
progressText: 'Paso {{current}} de {{total}}',
popoverClass: 'bakery-tour-popover',
popoverOffset: 10,
onHighlightStarted: (element, step, options) => {
const currentIndex = options.state?.activeIndex || 0;
console.log('[Driver] Highlighting element:', element);
console.log('[Driver] Step:', step);
console.log('[Driver] Current index:', currentIndex);
// Track step when it's highlighted
if (onNext && currentIndex > 0) {
onNext(currentIndex);
}
},
onHighlighted: (element, step, options) => {
console.log('[Driver] Element highlighted successfully:', element);
},
});

View File

@@ -0,0 +1,176 @@
import { DriveStep } from 'driver.js';
export const getDemoTourSteps = (): DriveStep[] => [
{
element: '[data-tour="demo-banner"]',
popover: {
title: '¡Bienvenido a BakeryIA Demo!',
description: 'Estás en una sesión demo de 30 minutos con datos reales de una panadería española. Te guiaremos por las funciones principales de la plataforma. Puedes cerrar el tour en cualquier momento con ESC.',
side: 'bottom',
align: 'center',
},
},
{
element: '[data-tour="dashboard-stats"]',
popover: {
title: 'Métricas en Tiempo Real',
description: 'Aquí ves las métricas clave de tu panadería actualizadas al instante: ventas del día, pedidos pendientes, productos vendidos y alertas de stock crítico.',
side: 'bottom',
align: 'start',
},
},
{
element: '[data-tour="real-time-alerts"]',
popover: {
title: 'Alertas Inteligentes',
description: 'El sistema te avisa automáticamente de stock bajo, pedidos urgentes, predicciones de demanda y oportunidades de producción. Toda la información importante en un solo lugar.',
side: 'top',
align: 'start',
},
},
{
element: '[data-tour="procurement-plans"]',
popover: {
title: 'Planes de Aprovisionamiento',
description: 'Visualiza qué ingredientes necesitas comprar hoy según tus planes de producción. El sistema calcula automáticamente las cantidades necesarias.',
side: 'top',
align: 'start',
},
},
{
element: '[data-tour="production-plans"]',
popover: {
title: 'Gestión de Producción',
description: 'Consulta y gestiona tus órdenes de producción programadas. Puedes ver el estado de cada orden, los ingredientes necesarios y el tiempo estimado.',
side: 'top',
align: 'start',
},
},
{
element: '[data-tour="sidebar-database"]',
popover: {
title: 'Base de Datos de tu Panadería',
description: 'Accede a toda la información de tu negocio: inventario de ingredientes, recetas, proveedores, equipos y equipo de trabajo.',
side: 'right',
align: 'start',
},
},
{
element: '[data-tour="sidebar-operations"]',
popover: {
title: 'Operaciones Diarias',
description: 'Gestiona las operaciones del día a día: aprovisionamiento de ingredientes, producción de recetas y punto de venta (POS) para registrar ventas.',
side: 'right',
align: 'start',
},
},
{
element: '[data-tour="sidebar-analytics"]',
popover: {
title: 'Análisis e Inteligencia Artificial',
description: 'Accede a análisis avanzados de ventas, producción y pronósticos de demanda con IA. Simula escenarios y obtén insights inteligentes para tu negocio.',
side: 'right',
align: 'start',
},
},
{
element: '[data-tour="header-tenant-selector"]',
popover: {
title: 'Multi-Panadería',
description: 'Si gestionas varias panaderías o puntos de venta, puedes cambiar entre ellas fácilmente desde aquí. Cada panadería tiene sus propios datos aislados.',
side: 'bottom',
align: 'end',
},
},
{
element: '[data-tour="demo-banner-actions"]',
popover: {
title: 'Limitaciones del Demo',
description: 'En modo demo puedes explorar todas las funciones, pero algunas acciones destructivas están deshabilitadas. Los cambios que hagas no se guardarán después de que expire la sesión.',
side: 'bottom',
align: 'center',
},
},
{
popover: {
title: '¿Listo para gestionar tu panadería real?',
description: 'Has explorado las funcionalidades principales de BakeryIA. Crea una cuenta gratuita para acceder a todas las funciones sin límites, guardar tus datos de forma permanente y conectar tu negocio real.',
side: 'top',
align: 'center',
},
},
];
export const getMobileTourSteps = (): DriveStep[] => [
{
element: '[data-tour="demo-banner"]',
popover: {
title: '¡Bienvenido a BakeryIA!',
description: 'Sesión demo de 30 minutos con datos reales. Te mostraremos las funciones clave.',
side: 'bottom',
align: 'center',
},
},
{
element: '[data-tour="dashboard-stats"]',
popover: {
title: 'Métricas en Tiempo Real',
description: 'Ventas, pedidos, productos y alertas actualizadas al instante.',
side: 'bottom',
align: 'start',
},
},
{
element: '[data-tour="real-time-alerts"]',
popover: {
title: 'Alertas Inteligentes',
description: 'Stock bajo, pedidos urgentes y predicciones de demanda en un solo lugar.',
side: 'top',
align: 'start',
},
},
{
element: '[data-tour="procurement-plans"]',
popover: {
title: 'Aprovisionamiento',
description: 'Ingredientes que necesitas comprar hoy calculados automáticamente.',
side: 'top',
align: 'start',
},
},
{
element: '[data-tour="production-plans"]',
popover: {
title: 'Producción',
description: 'Gestiona órdenes de producción y consulta ingredientes necesarios.',
side: 'top',
align: 'start',
},
},
{
element: '[data-tour="sidebar-menu-toggle"]',
popover: {
title: 'Menú de Navegación',
description: 'Toca aquí para acceder a Base de Datos, Operaciones y Análisis.',
side: 'bottom',
align: 'start',
},
},
{
element: '[data-tour="demo-banner-actions"]',
popover: {
title: 'Limitaciones del Demo',
description: 'Puedes explorar todo, pero los cambios no se guardan permanentemente.',
side: 'bottom',
align: 'center',
},
},
{
popover: {
title: '¿Listo para tu panadería real?',
description: 'Crea una cuenta gratuita para acceso completo sin límites y datos permanentes.',
side: 'top',
align: 'center',
},
},
];

View File

@@ -0,0 +1,170 @@
import { useState, useCallback, useEffect } from 'react';
import { driver, Driver } from 'driver.js';
import { useNavigate } from 'react-router-dom';
import { getDriverConfig } from '../config/driver-config';
import { getDemoTourSteps, getMobileTourSteps } from '../config/tour-steps';
import { getTourState, saveTourState, clearTourState, clearTourStartPending } from '../utils/tour-state';
import { trackTourEvent } from '../utils/tour-analytics';
import '../styles.css';
export const useDemoTour = () => {
const navigate = useNavigate();
const [tourActive, setTourActive] = useState(false);
const [driverInstance, setDriverInstance] = useState<Driver | null>(null);
const isMobile = window.innerWidth < 768;
const handleTourDestroy = useCallback(() => {
const state = getTourState();
const currentStep = driverInstance?.getActiveIndex() || 0;
if (state && !state.completed) {
saveTourState({
currentStep,
dismissed: true,
});
trackTourEvent({
event: 'tour_dismissed',
step: currentStep,
timestamp: Date.now(),
});
}
setTourActive(false);
clearTourStartPending();
}, [driverInstance]);
const handleStepComplete = useCallback((stepIndex: number) => {
saveTourState({
currentStep: stepIndex + 1,
});
trackTourEvent({
event: 'tour_step_completed',
step: stepIndex,
timestamp: Date.now(),
});
}, []);
const handleTourComplete = useCallback(() => {
saveTourState({
completed: true,
currentStep: 0,
});
trackTourEvent({
event: 'tour_completed',
timestamp: Date.now(),
});
setTourActive(false);
clearTourStartPending();
setTimeout(() => {
trackTourEvent({
event: 'conversion_cta_clicked',
timestamp: Date.now(),
});
navigate('/register?from=demo_tour');
}, 500);
}, [navigate]);
const startTour = useCallback((fromStep: number = 0) => {
console.log('[useDemoTour] startTour called with fromStep:', fromStep);
const steps = isMobile ? getMobileTourSteps() : getDemoTourSteps();
console.log('[useDemoTour] Using', isMobile ? 'mobile' : 'desktop', 'steps, total:', steps.length);
// Check if first element exists
const firstElement = steps[0]?.element;
if (firstElement) {
const el = document.querySelector(firstElement);
console.log('[useDemoTour] First element exists:', !!el, 'selector:', firstElement);
if (!el) {
console.warn('[useDemoTour] First tour element not found in DOM! Delaying tour start...');
// Retry after DOM is ready
setTimeout(() => startTour(fromStep), 500);
return;
}
}
const config = getDriverConfig(handleStepComplete);
const driverObj = driver({
...config,
onDestroyed: (element, step, options) => {
const activeIndex = options.state?.activeIndex || 0;
const isLastStep = activeIndex === steps.length - 1;
console.log('[useDemoTour] Tour destroyed, activeIndex:', activeIndex, 'isLastStep:', isLastStep);
if (isLastStep) {
handleTourComplete();
} else {
handleTourDestroy();
}
},
});
driverObj.setSteps(steps);
setDriverInstance(driverObj);
console.log('[useDemoTour] Driver instance created, starting tour...');
if (fromStep > 0 && fromStep < steps.length) {
driverObj.drive(fromStep);
} else {
driverObj.drive();
}
setTourActive(true);
trackTourEvent({
event: 'tour_started',
timestamp: Date.now(),
});
saveTourState({
currentStep: fromStep,
completed: false,
dismissed: false,
});
clearTourStartPending();
}, [isMobile, handleTourDestroy, handleStepComplete, handleTourComplete]);
const resumeTour = useCallback(() => {
const state = getTourState();
if (state && state.currentStep > 0) {
startTour(state.currentStep);
} else {
startTour();
}
}, [startTour]);
const resetTour = useCallback(() => {
clearTourState();
if (driverInstance) {
driverInstance.destroy();
setDriverInstance(null);
}
setTourActive(false);
}, [driverInstance]);
useEffect(() => {
return () => {
if (driverInstance) {
driverInstance.destroy();
}
};
}, [driverInstance]);
return {
startTour,
resumeTour,
resetTour,
tourActive,
tourState: getTourState(),
};
};
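
For orientation, a minimal consumer sketch of the hook above. It is illustrative only: the component name and import path are hypothetical, and it assumes the component renders inside the app's router context (the hook calls useNavigate), resuming a saved, uncompleted tour instead of restarting it.

```tsx
import React from 'react';
// Hypothetical import path; the feature's index re-exports useDemoTour.
import { useDemoTour } from '../../features/demo-onboarding';

export const DemoTourLauncher: React.FC = () => {
  const { startTour, resumeTour, tourActive, tourState } = useDemoTour();

  const handleClick = () => {
    // Resume a previously dismissed tour from its saved step, otherwise start fresh.
    if (tourState && !tourState.completed && tourState.currentStep > 0) {
      resumeTour();
    } else {
      startTour();
    }
  };

  return (
    <button onClick={handleClick} disabled={tourActive}>
      Ver tour de la demo
    </button>
  );
};
```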

View File

@@ -0,0 +1,4 @@
export { useDemoTour } from './hooks/useDemoTour';
export { getTourState, saveTourState, clearTourState, shouldStartTour, markTourAsStartPending, clearTourStartPending } from './utils/tour-state';
export { trackTourEvent } from './utils/tour-analytics';
export type { TourState, TourStep, TourAnalyticsEvent } from './types';

View File

@@ -0,0 +1,179 @@
/* Import Driver.js base styles */
@import 'driver.js/dist/driver.css';
/* Custom theme for BakeryIA tour */
.driver-popover.bakery-tour-popover {
background: var(--bg-primary);
color: var(--text-primary);
border: 1px solid var(--border-default);
box-shadow: 0 20px 25px -5px rgb(0 0 0 / 0.1), 0 8px 10px -6px rgb(0 0 0 / 0.1);
border-radius: 12px;
max-width: 400px;
}
.driver-popover.bakery-tour-popover .driver-popover-title {
font-size: 1.125rem;
font-weight: 700;
color: var(--text-primary);
margin-bottom: 0.75rem;
line-height: 1.4;
}
.driver-popover.bakery-tour-popover .driver-popover-description {
font-size: 0.9375rem;
color: var(--text-secondary);
line-height: 1.6;
margin-bottom: 1rem;
}
.driver-popover.bakery-tour-popover .driver-popover-progress-text {
font-size: 0.875rem;
color: var(--text-tertiary);
font-weight: 500;
}
.driver-popover.bakery-tour-popover .driver-popover-footer {
display: flex;
align-items: center;
gap: 0.75rem;
margin-top: 1.25rem;
}
.driver-popover.bakery-tour-popover .driver-popover-btn {
padding: 0.625rem 1.25rem;
border-radius: 8px;
font-weight: 600;
font-size: 0.9375rem;
transition: all 0.2s ease;
border: none;
cursor: pointer;
}
.driver-popover.bakery-tour-popover .driver-popover-next-btn {
background: var(--color-primary);
color: white;
flex: 1;
}
.driver-popover.bakery-tour-popover .driver-popover-next-btn:hover {
background: var(--color-primary-dark);
transform: translateY(-1px);
box-shadow: 0 4px 6px -1px rgb(0 0 0 / 0.1);
}
.driver-popover.bakery-tour-popover .driver-popover-prev-btn {
background: var(--bg-secondary);
color: var(--text-primary);
border: 1px solid var(--border-default);
}
.driver-popover.bakery-tour-popover .driver-popover-prev-btn:hover {
background: var(--bg-tertiary);
border-color: var(--border-hover);
}
.driver-popover.bakery-tour-popover .driver-popover-close-btn {
position: absolute;
top: 1rem;
right: 1rem;
width: 2rem;
height: 2rem;
border-radius: 6px;
background: var(--bg-secondary);
color: var(--text-secondary);
font-size: 1.5rem;
line-height: 1;
display: flex;
align-items: center;
justify-content: center;
cursor: pointer;
transition: all 0.2s ease;
border: none;
padding: 0;
}
.driver-popover.bakery-tour-popover .driver-popover-close-btn:hover {
background: var(--bg-tertiary);
color: var(--text-primary);
}
.driver-popover.bakery-tour-popover .driver-popover-arrow-side-top.driver-popover-arrow {
border-top-color: var(--bg-primary);
}
.driver-popover.bakery-tour-popover .driver-popover-arrow-side-bottom.driver-popover-arrow {
border-bottom-color: var(--bg-primary);
}
.driver-popover.bakery-tour-popover .driver-popover-arrow-side-left.driver-popover-arrow {
border-left-color: var(--bg-primary);
}
.driver-popover.bakery-tour-popover .driver-popover-arrow-side-right.driver-popover-arrow {
border-right-color: var(--bg-primary);
}
/*
* Driver.js Overlay Styling
* Driver.js v1.3.6 uses SVG with a cutout path for the spotlight effect
* DO NOT override position, width, height, or other layout properties
* Only customize visual appearance
*/
/* SVG Overlay - only customize the fill color */
.driver-overlay svg {
/* The SVG path fill color for the dark overlay */
fill: rgba(0, 0, 0, 0.75);
}
/* Prevent backdrop-filter from interfering */
.driver-overlay {
backdrop-filter: none !important;
}
/* Visual emphasis for highlighted element - adds outline */
.driver-active-element {
outline: 4px solid var(--color-primary) !important;
outline-offset: 4px !important;
}
/* Prevent theme glass effects from interfering */
.driver-overlay.glass-effect,
.driver-popover.glass-effect,
.driver-active-element.glass-effect {
backdrop-filter: none !important;
}
/* Mobile responsive */
@media (max-width: 640px) {
.driver-popover.bakery-tour-popover {
max-width: calc(100vw - 2rem);
margin: 0 1rem;
}
.driver-popover.bakery-tour-popover .driver-popover-title {
font-size: 1rem;
}
.driver-popover.bakery-tour-popover .driver-popover-description {
font-size: 0.875rem;
}
.driver-popover.bakery-tour-popover .driver-popover-btn {
padding: 0.5rem 1rem;
font-size: 0.875rem;
}
}
/* Last step special styling (CTA).
   Note: `:contains()` is not a standard CSS selector, so matching on the button text
   "Crear Cuenta" never applies. Target the CTA popover with a dedicated class instead
   (assumed to be added to the last step, e.g. via its popoverClass option). */
.driver-popover.bakery-tour-popover.bakery-tour-cta .driver-popover-next-btn {
background: linear-gradient(135deg, var(--color-primary) 0%, #d97706 100%);
box-shadow: 0 10px 15px -3px rgb(0 0 0 / 0.2);
font-weight: 700;
padding: 0.75rem 1.5rem;
}
.driver-popover.bakery-tour-popover.bakery-tour-cta .driver-popover-next-btn:hover {
transform: translateY(-2px);
box-shadow: 0 20px 25px -5px rgb(0 0 0 / 0.3);
}

View File

@@ -0,0 +1,26 @@
export interface TourState {
currentStep: number;
completed: boolean;
dismissed: boolean;
lastUpdated: number;
tourVersion: string;
}
export interface TourStep {
element?: string;
popover: {
title: string;
description: string;
side?: 'top' | 'right' | 'bottom' | 'left';
align?: 'start' | 'center' | 'end';
};
onNext?: () => void;
onPrevious?: () => void;
}
export interface TourAnalyticsEvent {
event: 'tour_started' | 'tour_step_completed' | 'tour_completed' | 'tour_dismissed' | 'conversion_cta_clicked';
step?: number;
timestamp: number;
sessionId?: string;
}

View File

@@ -0,0 +1,40 @@
import { TourAnalyticsEvent } from '../types';
export const trackTourEvent = (event: TourAnalyticsEvent): void => {
try {
const demoSessionId = localStorage.getItem('demo_session_id');
const enrichedEvent = {
...event,
sessionId: demoSessionId || undefined,
};
console.log('[Tour Analytics]', enrichedEvent);
if (window.gtag) {
window.gtag('event', event.event, {
event_category: 'demo_tour',
event_label: event.step !== undefined ? `step_${event.step}` : undefined,
session_id: demoSessionId,
});
}
if (window.plausible) {
window.plausible(event.event, {
props: {
step: event.step,
session_id: demoSessionId,
},
});
}
} catch (error) {
console.error('Error tracking tour event:', error);
}
};
declare global {
interface Window {
gtag?: (...args: any[]) => void;
plausible?: (event: string, options?: { props?: Record<string, any> }) => void;
}
}

View File

@@ -0,0 +1,84 @@
import { TourState } from '../types';
const TOUR_STATE_KEY = 'bakery_demo_tour_state';
const TOUR_VERSION = '1.0.0';
export const getTourState = (): TourState | null => {
try {
const stored = sessionStorage.getItem(TOUR_STATE_KEY);
if (!stored) return null;
const state = JSON.parse(stored) as TourState;
if (state.tourVersion !== TOUR_VERSION) {
clearTourState();
return null;
}
return state;
} catch (error) {
console.error('Error reading tour state:', error);
return null;
}
};
export const saveTourState = (state: Partial<TourState>): void => {
try {
const currentState = getTourState() || {
currentStep: 0,
completed: false,
dismissed: false,
lastUpdated: Date.now(),
tourVersion: TOUR_VERSION,
};
const newState: TourState = {
...currentState,
...state,
lastUpdated: Date.now(),
tourVersion: TOUR_VERSION,
};
sessionStorage.setItem(TOUR_STATE_KEY, JSON.stringify(newState));
} catch (error) {
console.error('Error saving tour state:', error);
}
};
export const clearTourState = (): void => {
try {
sessionStorage.removeItem(TOUR_STATE_KEY);
} catch (error) {
console.error('Error clearing tour state:', error);
}
};
export const shouldStartTour = (): boolean => {
const tourState = getTourState();
const shouldStart = sessionStorage.getItem('demo_tour_should_start') === 'true';
console.log('[shouldStartTour] tourState:', tourState);
console.log('[shouldStartTour] shouldStart flag:', shouldStart);
// If explicitly marked to start, always start (unless already completed)
if (shouldStart) {
if (tourState && tourState.completed) {
console.log('[shouldStartTour] Tour already completed, not starting');
return false;
}
console.log('[shouldStartTour] Should start flag is true, starting tour');
return true;
}
// No explicit start flag, don't auto-start
console.log('[shouldStartTour] No start flag, not starting');
return false;
};
export const markTourAsStartPending = (): void => {
sessionStorage.setItem('demo_tour_should_start', 'true');
};
export const clearTourStartPending = (): void => {
sessionStorage.removeItem('demo_tour_should_start');
};
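
Taken together, the start-pending flag and the persisted tour state form a small cross-page handshake: the demo entry point marks the tour as pending, and the dashboard consumes the flag once its tour targets have rendered. A condensed sketch of that flow, assuming the helpers are consumed through the feature index and reusing the dashboard's 1.5 s delay (startTour itself comes from useDemoTour and is elided here):

```tsx
// Hypothetical import path; these helpers are re-exported by the feature index.
import { markTourAsStartPending, shouldStartTour, clearTourStartPending } from '../../features/demo-onboarding';

// 1) On successful demo session creation (e.g. on the /demo page):
markTourAsStartPending(); // sessionStorage['demo_tour_should_start'] = 'true'

// 2) On dashboard mount, when demo mode is active:
if (shouldStartTour()) {
  setTimeout(() => {
    // startTour(); // provided by useDemoTour()
    clearTourStartPending(); // consume the one-shot flag
  }, 1500);
}
```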

View File

@@ -1,4 +1,4 @@
-import React from 'react';
+import React, { useEffect } from 'react';
import { useNavigate } from 'react-router-dom';
import { useTranslation } from 'react-i18next';
import { PageHeader } from '../../components/layout';
@@ -10,6 +10,7 @@ import ProcurementPlansToday from '../../components/domain/dashboard/ProcurementPlansToday';
import ProductionPlansToday from '../../components/domain/dashboard/ProductionPlansToday';
import PurchaseOrdersTracking from '../../components/domain/dashboard/PurchaseOrdersTracking';
import { useTenant } from '../../stores/tenant.store';
+import { useDemoTour, shouldStartTour, clearTourStartPending } from '../../features/demo-onboarding';
import {
  AlertTriangle,
  Clock,
@@ -23,6 +24,25 @@ const DashboardPage: React.FC = () => {
  const { t } = useTranslation();
  const navigate = useNavigate();
  const { availableTenants } = useTenant();
+  const { startTour } = useDemoTour();
+  const isDemoMode = localStorage.getItem('demo_mode') === 'true';
+
+  useEffect(() => {
+    console.log('[Dashboard] Demo mode:', isDemoMode);
+    console.log('[Dashboard] Should start tour:', shouldStartTour());
+    console.log('[Dashboard] SessionStorage demo_tour_should_start:', sessionStorage.getItem('demo_tour_should_start'));
+    if (isDemoMode && shouldStartTour()) {
+      console.log('[Dashboard] Starting tour in 1.5s...');
+      const timer = setTimeout(() => {
+        console.log('[Dashboard] Executing startTour()');
+        startTour();
+        clearTourStartPending();
+      }, 1500);
+      return () => clearTimeout(timer);
+    }
+  }, [isDemoMode, startTour]);

  const handleAddNewBakery = () => {
    navigate('/app/onboarding?new=true');
@@ -107,12 +127,14 @@ const DashboardPage: React.FC = () => {
      />

      {/* Critical Metrics using StatsGrid */}
+      <div data-tour="dashboard-stats">
      <StatsGrid
        stats={criticalStats}
        columns={4}
        gap="lg"
        className="mb-6"
      />
+      </div>

      {/* Quick Actions - Add New Bakery */}
      {availableTenants && availableTenants.length > 0 && (
@@ -153,19 +175,24 @@ const DashboardPage: React.FC = () => {
      {/* Full width blocks - one after another */}
      <div className="space-y-6">
        {/* 1. Real-time alerts block */}
+        <div data-tour="real-time-alerts">
        <RealTimeAlerts />
+        </div>

        {/* 2. Purchase Orders Tracking block */}
        <PurchaseOrdersTracking />

        {/* 3. Procurement plans block */}
+        <div data-tour="procurement-plans">
        <ProcurementPlansToday
          onOrderItem={handleOrderItem}
          onViewDetails={handleViewDetails}
          onViewAllPlans={handleViewAllPlans}
        />
+        </div>

        {/* 4. Production plans block */}
+        <div data-tour="production-plans">
        <ProductionPlansToday
          onStartOrder={handleStartOrder}
          onPauseOrder={handlePauseOrder}
@@ -174,6 +201,7 @@ const DashboardPage: React.FC = () => {
        />
      </div>
    </div>
+    </div>
  );
};

View File

@@ -35,6 +35,11 @@ const InventoryPage: React.FC = () => {
  const tenantId = useTenantId();

+  // Debug tenant ID
+  console.log('🔍 [InventoryPage] Tenant ID from hook:', tenantId);
+  console.log('🔍 [InventoryPage] tenantId type:', typeof tenantId);
+  console.log('🔍 [InventoryPage] tenantId truthy?', !!tenantId);
+
  // Mutations
  const createIngredientMutation = useCreateIngredient();
  const softDeleteMutation = useSoftDeleteIngredient();

View File

@@ -43,7 +43,39 @@ const SubscriptionPage: React.FC = () => {
        subscriptionService.getAvailablePlans()
      ]);

-      setUsageSummary(usage);
+      // FIX: Handle demo mode or missing subscription data
+      if (!usage || !usage.usage) {
+        // If no usage data, likely a demo tenant - create mock data
+        const mockUsage: UsageSummary = {
+          plan: 'demo',
+          status: 'active',
+          monthly_price: 0,
+          next_billing_date: new Date(Date.now() + 30 * 24 * 60 * 60 * 1000).toISOString(),
+          usage: {
+            users: {
+              current: 1,
+              limit: 5,
+              unlimited: false,
+              usage_percentage: 20
+            },
+            locations: {
+              current: 1,
+              limit: 1,
+              unlimited: false,
+              usage_percentage: 100
+            },
+            products: {
+              current: 0,
+              limit: 50,
+              unlimited: false,
+              usage_percentage: 0
+            }
+          }
+        };
+        setUsageSummary(mockUsage);
+      } else {
+        setUsageSummary(usage);
+      }
      setAvailablePlans(plans);
    } catch (error) {
      console.error('Error loading subscription data:', error);

View File

@@ -5,6 +5,7 @@
import { Button } from '../../components/ui';
import { getDemoAccounts, createDemoSession, DemoAccount } from '../../api/services/demo';
import { apiClient } from '../../api/client';
import { Check, Clock, Shield, Play, Zap, ArrowRight, Store, Factory } from 'lucide-react';
+import { markTourAsStartPending } from '../../features/demo-onboarding';

export const DemoPage: React.FC = () => {
  const navigate = useNavigate();
@@ -38,9 +39,16 @@ export const DemoPage: React.FC = () => {
        demo_account_type: accountType as 'individual_bakery' | 'central_baker',
      });

+      console.log('✅ Demo session created:', session);
+
      // Store session ID in API client
      apiClient.setDemoSessionId(session.session_id);

+      // **CRITICAL FIX: Set the virtual tenant ID in API client**
+      // This ensures all API requests include the correct tenant context
+      apiClient.setTenantId(session.virtual_tenant_id);
+      console.log('✅ Set API client tenant ID:', session.virtual_tenant_id);
+
      // Store session info in localStorage for UI
      localStorage.setItem('demo_mode', 'true');
      localStorage.setItem('demo_session_id', session.session_id);
@@ -48,8 +56,34 @@ export const DemoPage: React.FC = () => {
      localStorage.setItem('demo_expires_at', session.expires_at);
      localStorage.setItem('demo_tenant_id', session.virtual_tenant_id);

-      // Navigate to dashboard
-      navigate('/app/dashboard');
+      // **CRITICAL FIX: Initialize tenant store with demo tenant**
+      // This ensures useTenantId() returns the correct virtual tenant ID
+      const { useTenantStore } = await import('../../stores/tenant.store');
+      const demoTenant = {
+        id: session.virtual_tenant_id,
+        name: session.demo_config?.name || `Demo ${accountType}`,
+        business_type: accountType === 'individual_bakery' ? 'bakery' : 'central_baker',
+        business_model: accountType,
+        address: session.demo_config?.address || 'Demo Address',
+        city: session.demo_config?.city || 'Madrid',
+        postal_code: '28001',
+        phone: null,
+        is_active: true,
+        subscription_tier: 'demo',
+        ml_model_trained: false,
+        last_training_date: null,
+        owner_id: 'demo-user',
+        created_at: new Date().toISOString(),
+      };
+      useTenantStore.getState().setCurrentTenant(demoTenant);
+      console.log('✅ Initialized tenant store with demo tenant:', demoTenant);
+
+      // Mark tour to start automatically
+      markTourAsStartPending();
+
+      // Navigate to setup page to wait for data cloning
+      navigate(`/demo/setup?session=${session.session_id}`);
    } catch (err: any) {
      setError(err?.message || 'Error al crear sesión demo');
      console.error('Error creating demo session:', err);

View File

@@ -0,0 +1,236 @@
import React, { useEffect, useState, useCallback } from 'react';
import { useNavigate, useSearchParams } from 'react-router-dom';
import { demoSessionAPI, SessionStatusResponse } from '@/api/services/demo';
import { DemoProgressIndicator } from '@/components/demo/DemoProgressIndicator';
import { DemoErrorScreen } from '@/components/demo/DemoErrorScreen';
import { Card, CardBody, ProgressBar, Button, LoadingSpinner } from '@/components/ui';
import { PublicLayout } from '@/components/layout';
const POLL_INTERVAL_MS = 1500; // Poll every 1.5 seconds
export const DemoSetupPage: React.FC = () => {
const [searchParams] = useSearchParams();
const navigate = useNavigate();
const sessionId = searchParams.get('session');
const [status, setStatus] = useState<SessionStatusResponse | null>(null);
const [error, setError] = useState<string | null>(null);
const [isRetrying, setIsRetrying] = useState(false);
const pollStatus = useCallback(async () => {
if (!sessionId) return;
try {
const statusData = await demoSessionAPI.getSessionStatus(sessionId);
setStatus(statusData);
// Redirect to dashboard if:
// 1. Status is 'ready' (all services succeeded)
// 2. Status is 'partial' or 'failed' BUT we have usable data (>100 records)
const hasUsableData = statusData.total_records_cloned > 100;
const shouldRedirect =
statusData.status === 'ready' ||
(statusData.status === 'partial' && hasUsableData) ||
(statusData.status === 'failed' && hasUsableData);
if (shouldRedirect) {
// Data is usable, redirect to dashboard
setTimeout(() => {
window.location.href = `/app/dashboard?session=${sessionId}`;
}, 500);
}
} catch (err) {
const errorMessage = err instanceof Error ? err.message : 'Unknown error occurred';
setError(errorMessage);
}
}, [sessionId]);
useEffect(() => {
if (!sessionId) {
navigate('/demo');
return;
}
// Initial poll
pollStatus();
// Set up polling interval
const intervalId = setInterval(pollStatus, POLL_INTERVAL_MS);
return () => {
clearInterval(intervalId);
};
}, [sessionId, navigate, pollStatus]);
const handleRetry = async () => {
if (!sessionId) return;
try {
setIsRetrying(true);
setError(null);
await demoSessionAPI.retryCloning(sessionId);
// Resume polling after retry
await pollStatus();
} catch (err) {
const errorMessage = err instanceof Error ? err.message : 'Retry failed';
setError(errorMessage);
} finally {
setIsRetrying(false);
}
};
const handleContinueAnyway = () => {
window.location.href = `/app/dashboard?session=${sessionId}`;
};
if (error && !status) {
return (
<DemoErrorScreen
error={error}
onRetry={handleRetry}
isRetrying={isRetrying}
/>
);
}
if (!status) {
return (
<PublicLayout
variant="centered"
headerProps={{
showThemeToggle: true,
showAuthButtons: false,
}}
>
<div className="flex flex-col items-center justify-center min-h-[60vh]">
<LoadingSpinner size="large" />
<p className="mt-4 text-[var(--text-secondary)]">
Inicializando entorno demo...
</p>
</div>
</PublicLayout>
);
}
// Only show error screen if failed with NO usable data
if (status.status === 'failed' && status.total_records_cloned <= 100) {
return (
<DemoErrorScreen
error="Demo session setup failed"
details={status.errors}
onRetry={handleRetry}
isRetrying={isRetrying}
/>
);
}
const estimatedTime = estimateRemainingTime(status);
const progressPercentage = calculateProgressPercentage(status);
return (
<PublicLayout
variant="centered"
headerProps={{
showThemeToggle: true,
showAuthButtons: false,
}}
>
<div className="max-w-2xl mx-auto p-8">
<Card className="shadow-xl">
<CardBody className="p-8">
<div className="text-center mb-6">
<h1 className="text-3xl font-bold text-[var(--text-primary)] mb-2">
🔄 Preparando tu Entorno Demo
</h1>
<p className="text-[var(--text-secondary)]">
Configurando tu sesión personalizada con datos de muestra...
</p>
</div>
{status.progress && <DemoProgressIndicator progress={status.progress} />}
<div className="mt-6">
<div className="flex justify-between items-center mb-2">
<span className="text-sm font-medium text-[var(--text-primary)]">
Progreso general
</span>
<span className="text-sm text-[var(--text-secondary)]">
{progressPercentage}%
</span>
</div>
<ProgressBar value={progressPercentage} variant="primary" />
</div>
{status.status === 'pending' && (
<div className="mt-4 text-center">
<p className="text-sm text-[var(--text-secondary)]">
Tiempo estimado restante: ~{estimatedTime} segundos
</p>
</div>
)}
{status.status === 'partial' && (
<div className="mt-4 p-4 bg-[var(--color-warning)]/10 border border-[var(--color-warning)] rounded-lg">
<p className="text-sm text-[var(--text-primary)] mb-3">
Algunos datos aún se están cargando. Puedes continuar con
funcionalidad limitada o esperar a que se carguen todos los datos.
</p>
<Button
onClick={handleContinueAnyway}
variant="warning"
size="sm"
className="w-full"
>
Continuar de todos modos
</Button>
</div>
)}
{status.status === 'failed' && status.total_records_cloned > 100 && (
<div className="mt-4 p-4 bg-[var(--color-info)]/10 border border-[var(--color-info)] rounded-lg">
<p className="text-sm text-[var(--text-primary)]">
Algunos servicios tuvieron problemas, pero hemos cargado{' '}
{status.total_records_cloned} registros exitosamente. ¡El demo está
completamente funcional!
</p>
</div>
)}
<div className="mt-6 text-center">
<p className="text-xs text-[var(--text-tertiary)]">
Total de registros clonados: {status.total_records_cloned}
</p>
</div>
</CardBody>
</Card>
</div>
</PublicLayout>
);
};
function estimateRemainingTime(status: SessionStatusResponse): number {
if (!status.progress) return 5;
const services = Object.values(status.progress);
const completedServices = services.filter((s) => s.status === 'completed').length;
const totalServices = services.length;
const remainingServices = totalServices - completedServices;
// Assume ~2 seconds per service
return Math.max(remainingServices * 2, 1);
}
function calculateProgressPercentage(status: SessionStatusResponse): number {
if (!status.progress) return 0;
const services = Object.values(status.progress);
const completedServices = services.filter(
(s) => s.status === 'completed' || s.status === 'failed'
).length;
const totalServices = services.length;
return Math.round((completedServices / totalServices) * 100);
}
export default DemoSetupPage;

View File

@@ -9,6 +9,7 @@ const LandingPage = React.lazy(() => import('../pages/public/LandingPage'));
const LoginPage = React.lazy(() => import('../pages/public/LoginPage'));
const RegisterPage = React.lazy(() => import('../pages/public/RegisterPage'));
const DemoPage = React.lazy(() => import('../pages/public/DemoPage'));
+const DemoSetupPage = React.lazy(() => import('../pages/public/DemoSetupPage'));
const DashboardPage = React.lazy(() => import('../pages/app/DashboardPage'));

// Operations pages
@@ -61,6 +62,7 @@ export const AppRouter: React.FC = () => {
        <Route path="/login" element={<LoginPage />} />
        <Route path="/register" element={<RegisterPage />} />
        <Route path="/demo" element={<DemoPage />} />
+        <Route path="/demo/setup" element={<DemoSetupPage />} />

        {/* Protected Routes with AppShell Layout */}
        <Route

View File

@@ -28,29 +28,54 @@ export const useTenantInitializer = () => {
    if (isDemoMode && demoSessionId) {
      const demoTenantId = localStorage.getItem('demo_tenant_id') || 'demo-tenant-id';

+      console.log('🔍 [TenantInitializer] Demo mode detected:', {
+        isDemoMode,
+        demoSessionId,
+        demoTenantId,
+        demoAccountType,
+        currentTenant: currentTenant?.id
+      });
+
      // Check if current tenant is the demo tenant and is properly set
      const isValidDemoTenant = currentTenant &&
        typeof currentTenant === 'object' &&
        currentTenant.id === demoTenantId;

      if (!isValidDemoTenant) {
+        console.log('🔧 [TenantInitializer] Setting up demo tenant...');
+
        const accountTypeName = demoAccountType === 'individual_bakery'
          ? 'Panadería San Pablo - Demo'
          : 'Panadería La Espiga - Demo';

-        // Create a mock tenant object matching TenantResponse structure
+        // Create a complete tenant object matching TenantResponse structure
        const mockTenant = {
          id: demoTenantId,
          name: accountTypeName,
          subdomain: `demo-${demoSessionId.slice(0, 8)}`,
-          plan_type: 'professional', // Use a valid plan type
+          business_type: demoAccountType === 'individual_bakery' ? 'bakery' : 'central_baker',
+          business_model: demoAccountType,
+          address: 'Demo Address',
+          city: 'Madrid',
+          postal_code: '28001',
+          phone: null,
          is_active: true,
+          subscription_tier: 'demo',
+          ml_model_trained: false,
+          last_training_date: null,
+          owner_id: 'demo-user',
          created_at: new Date().toISOString(),
+          updated_at: new Date().toISOString(),
        };

        // Set the demo tenant as current
        setCurrentTenant(mockTenant);
+
+        // **CRITICAL: Also set tenant ID in API client**
+        // This ensures API requests include the tenant ID header
+        import('../api/client').then(({ apiClient }) => {
+          apiClient.setTenantId(demoTenantId);
+          console.log('✅ [TenantInitializer] Set API client tenant ID:', demoTenantId);
+        });
      }
    }
  }, [isDemoMode, demoSessionId, demoAccountType, currentTenant, setCurrentTenant]);

View File

@@ -66,7 +66,11 @@ class AuthMiddleware(BaseHTTPMiddleware):
        if hasattr(request.state, "is_demo_session") and request.state.is_demo_session:
            if hasattr(request.state, "user") and request.state.user:
                logger.info(f"✅ Demo session authenticated for route: {request.url.path}")
-                # Demo middleware already validated and set user context, pass through
+                # Demo middleware already validated and set user context
+                # But we still need to inject context headers for downstream services
+                user_context = request.state.user
+                tenant_id = user_context.get("tenant_id") or getattr(request.state, "tenant_id", None)
+                self._inject_context_headers(request, user_context, tenant_id)
                return await call_next(request)

        # ✅ STEP 1: Extract and validate JWT token

View File

@@ -102,20 +102,38 @@ class DemoMiddleware(BaseHTTPMiddleware):
        # Get session info from demo service
        session_info = await self._get_session_info(session_id)

-        if session_info and session_info.get("status") == "active":
+        # Accept pending, ready, partial, failed (if data exists), and active (deprecated) statuses
+        # Even "failed" sessions can be usable if some services succeeded
+        valid_statuses = ["pending", "ready", "partial", "failed", "active"]
+        current_status = session_info.get("status") if session_info else None
+
+        if session_info and current_status in valid_statuses:
            # Inject virtual tenant ID
            request.state.tenant_id = session_info["virtual_tenant_id"]
            request.state.is_demo_session = True
            request.state.demo_account_type = session_info["demo_account_type"]
+            request.state.demo_session_status = current_status  # Track status for monitoring

            # Inject demo user context for auth middleware
+            # Map demo account type to the actual demo user IDs from seed_demo_users.py
+            DEMO_USER_IDS = {
+                "individual_bakery": "c1a2b3c4-d5e6-47a8-b9c0-d1e2f3a4b5c6",  # María García López
+                "central_baker": "d2e3f4a5-b6c7-48d9-e0f1-a2b3c4d5e6f7"  # Carlos Martínez Ruiz
+            }
+            demo_user_id = DEMO_USER_IDS.get(
+                session_info.get("demo_account_type", "individual_bakery"),
+                DEMO_USER_IDS["individual_bakery"]
+            )
+
            # This allows the request to pass through AuthMiddleware
            request.state.user = {
-                "user_id": session_info.get("user_id", "demo-user"),
+                "user_id": demo_user_id,  # Use actual demo user UUID
                "email": f"demo-{session_id}@demo.local",
                "tenant_id": session_info["virtual_tenant_id"],
+                "role": "owner",  # Demo users have owner role
                "is_demo": True,
-                "demo_session_id": session_id
+                "demo_session_id": session_id,
+                "demo_session_status": current_status
            }

            # Update activity
@@ -151,13 +169,15 @@
                }
            )
        else:
-            # Session expired or invalid
+            # Session expired, invalid, or in failed/destroyed state
+            logger.warning(f"Invalid demo session state", session_id=session_id, status=current_status)
            return JSONResponse(
                status_code=401,
                content={
                    "error": "session_expired",
                    "message": "Tu sesión demo ha expirado. Crea una nueva sesión para continuar.",
-                    "message_en": "Your demo session has expired. Create a new session to continue."
+                    "message_en": "Your demo session has expired. Create a new session to continue.",
+                    "session_status": current_status
                }
            )

View File

@@ -59,7 +59,8 @@ class UserProxy:
        try:
            # Get auth service URL (with service discovery if available)
            auth_url = await self._get_auth_service_url()
-            target_url = f"{auth_url}/api/v1/users/{path}"
+            # FIX: Auth service uses /api/v1/auth/ prefix, not /api/v1/users/
+            target_url = f"{auth_url}/api/v1/auth/{path}"

            # Prepare headers (remove hop-by-hop headers)
            headers = self._prepare_headers(dict(request.headers))

View File

@@ -1,55 +0,0 @@
apiVersion: batch/v1
kind: Job
metadata:
name: demo-clone-VIRTUAL_TENANT_ID
namespace: bakery-ia
labels:
app: demo-clone
component: runtime
spec:
ttlSecondsAfterFinished: 3600 # Clean up after 1 hour
backoffLimit: 2
template:
metadata:
labels:
app: demo-clone
spec:
restartPolicy: Never
containers:
- name: clone-data
image: bakery/inventory-service:latest # Uses inventory image which has all scripts
command: ["python", "/app/scripts/demo/clone_demo_tenant.py"]
env:
- name: VIRTUAL_TENANT_ID
value: "VIRTUAL_TENANT_ID"
- name: DEMO_ACCOUNT_TYPE
value: "DEMO_ACCOUNT_TYPE"
- name: INVENTORY_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: INVENTORY_DATABASE_URL
- name: SALES_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: SALES_DATABASE_URL
- name: ORDERS_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: ORDERS_DATABASE_URL
- name: TENANT_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: TENANT_DATABASE_URL
- name: LOG_LEVEL
value: "INFO"
resources:
requests:
memory: "256Mi"
cpu: "100m"
limits:
memory: "512Mi"
cpu: "500m"

View File

@@ -0,0 +1,63 @@
apiVersion: batch/v1
kind: Job
metadata:
name: demo-seed-recipes
namespace: bakery-ia
labels:
app: demo-seed
component: initialization
annotations:
"helm.sh/hook": post-install,post-upgrade
"helm.sh/hook-weight": "20"
spec:
ttlSecondsAfterFinished: 3600
template:
metadata:
labels:
app: demo-seed-recipes
spec:
initContainers:
- name: wait-for-recipes-migration
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 30 seconds for recipes-migration to complete..."
sleep 30
- name: wait-for-inventory-seed
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 15 seconds for demo-seed-inventory to complete..."
sleep 15
containers:
- name: seed-recipes
image: bakery/recipes-service:latest
command: ["python", "/app/scripts/demo/seed_demo_recipes.py"]
env:
- name: RECIPES_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: RECIPES_DATABASE_URL
- name: DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: RECIPES_DATABASE_URL
- name: DEMO_MODE
value: "production"
- name: LOG_LEVEL
value: "INFO"
resources:
requests:
memory: "256Mi"
cpu: "100m"
limits:
memory: "512Mi"
cpu: "500m"
restartPolicy: OnFailure
serviceAccountName: demo-seed-sa

View File

@@ -0,0 +1,63 @@
apiVersion: batch/v1
kind: Job
metadata:
name: demo-seed-sales
namespace: bakery-ia
labels:
app: demo-seed
component: initialization
annotations:
"helm.sh/hook": post-install,post-upgrade
"helm.sh/hook-weight": "25"
spec:
ttlSecondsAfterFinished: 3600
template:
metadata:
labels:
app: demo-seed-sales
spec:
initContainers:
- name: wait-for-sales-migration
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 30 seconds for sales-migration to complete..."
sleep 30
- name: wait-for-inventory-seed
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 15 seconds for demo-seed-inventory to complete..."
sleep 15
containers:
- name: seed-sales
image: bakery/sales-service:latest
command: ["python", "/app/scripts/demo/seed_demo_sales.py"]
env:
- name: SALES_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: SALES_DATABASE_URL
- name: DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: SALES_DATABASE_URL
- name: DEMO_MODE
value: "production"
- name: LOG_LEVEL
value: "INFO"
resources:
requests:
memory: "256Mi"
cpu: "100m"
limits:
memory: "512Mi"
cpu: "500m"
restartPolicy: OnFailure
serviceAccountName: demo-seed-sa

View File

@@ -0,0 +1,56 @@
apiVersion: batch/v1
kind: Job
metadata:
name: demo-seed-subscriptions
namespace: bakery-ia
labels:
app: demo-seed
component: initialization
annotations:
"helm.sh/hook": post-install,post-upgrade
"helm.sh/hook-weight": "15"
spec:
ttlSecondsAfterFinished: 3600
template:
metadata:
labels:
app: demo-seed-subscriptions
spec:
initContainers:
- name: wait-for-tenant-migration
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 30 seconds for tenant-migration to complete..."
sleep 30
- name: wait-for-tenant-seed
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 15 seconds for demo-seed-tenants to complete..."
sleep 15
containers:
- name: seed-subscriptions
image: bakery/tenant-service:latest
command: ["python", "/app/scripts/demo/seed_demo_subscriptions.py"]
env:
- name: TENANT_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: TENANT_DATABASE_URL
- name: LOG_LEVEL
value: "INFO"
resources:
requests:
memory: "256Mi"
cpu: "100m"
limits:
memory: "512Mi"
cpu: "500m"
restartPolicy: OnFailure
serviceAccountName: demo-seed-sa

View File

@@ -0,0 +1,63 @@
apiVersion: batch/v1
kind: Job
metadata:
name: demo-seed-suppliers
namespace: bakery-ia
labels:
app: demo-seed
component: initialization
annotations:
"helm.sh/hook": post-install,post-upgrade
"helm.sh/hook-weight": "20"
spec:
ttlSecondsAfterFinished: 3600
template:
metadata:
labels:
app: demo-seed-suppliers
spec:
initContainers:
- name: wait-for-suppliers-migration
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 30 seconds for suppliers-migration to complete..."
sleep 30
- name: wait-for-inventory-seed
image: busybox:1.36
command:
- sh
- -c
- |
echo "Waiting 15 seconds for demo-seed-inventory to complete..."
sleep 15
containers:
- name: seed-suppliers
image: bakery/suppliers-service:latest
command: ["python", "/app/scripts/demo/seed_demo_suppliers.py"]
env:
- name: SUPPLIERS_DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: SUPPLIERS_DATABASE_URL
- name: DATABASE_URL
valueFrom:
secretKeyRef:
name: database-secrets
key: SUPPLIERS_DATABASE_URL
- name: DEMO_MODE
value: "production"
- name: LOG_LEVEL
value: "INFO"
resources:
requests:
memory: "256Mi"
cpu: "100m"
limits:
memory: "512Mi"
cpu: "500m"
restartPolicy: OnFailure
serviceAccountName: demo-seed-sa

View File

@@ -36,7 +36,11 @@ resources:
  - jobs/demo-seed-rbac.yaml
  - jobs/demo-seed-users-job.yaml
  - jobs/demo-seed-tenants-job.yaml
+  - jobs/demo-seed-subscriptions-job.yaml
  - jobs/demo-seed-inventory-job.yaml
+  - jobs/demo-seed-recipes-job.yaml
+  - jobs/demo-seed-suppliers-job.yaml
+  - jobs/demo-seed-sales-job.yaml
  - jobs/demo-seed-ai-models-job.yaml

  # External data initialization job (v2.0)

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/alert-processor:dev
-        command: ["python", "/app/scripts/run_migrations.py", "alert_processor"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "alert_processor"]
        env:
        - name: ALERT_PROCESSOR_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/auth-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "auth"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "auth"]
        env:
        - name: AUTH_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      - name: migrate
        image: bakery/demo-session-service:latest
        imagePullPolicy: Never
-        command: ["python", "/app/scripts/run_migrations.py", "demo_session"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "demo_session"]
        env:
        - name: DEMO_SESSION_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/external-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "external"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "external"]
        env:
        - name: EXTERNAL_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/forecasting-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "forecasting"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "forecasting"]
        env:
        - name: FORECASTING_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/inventory-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "inventory"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "inventory"]
        env:
        - name: INVENTORY_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/notification-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "notification"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "notification"]
        env:
        - name: NOTIFICATION_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/orders-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "orders"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "orders"]
        env:
        - name: ORDERS_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/pos-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "pos"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "pos"]
        env:
        - name: POS_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/production-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "production"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "production"]
        env:
        - name: PRODUCTION_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/recipes-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "recipes"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "recipes"]
        env:
        - name: RECIPES_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/sales-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "sales"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "sales"]
        env:
        - name: SALES_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/suppliers-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "suppliers"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "suppliers"]
        env:
        - name: SUPPLIERS_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/tenant-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "tenant"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "tenant"]
        env:
        - name: TENANT_DATABASE_URL
          valueFrom:

View File

@@ -30,7 +30,7 @@
      containers:
      - name: migrate
        image: bakery/training-service:dev
-        command: ["python", "/app/scripts/run_migrations.py", "training"]
+        command: ["python", "/app/shared/scripts/run_migrations.py", "training"]
        env:
        - name: TRAINING_DATABASE_URL
          valueFrom:

View File

@@ -0,0 +1,10 @@
apiVersion: v1
kind: Secret
metadata:
name: demo-internal-api-key
namespace: bakery-ia
type: Opaque
stringData:
# IMPORTANT: Replace this with a secure randomly generated key in production
# Generate with: python3 -c "import secrets; print(secrets.token_urlsafe(32))"
INTERNAL_API_KEY: "REPLACE_WITH_SECURE_RANDOM_KEY_IN_PRODUCTION"

View File

@@ -1 +0,0 @@
"""Demo Data Seeding Scripts"""

View File

@@ -1,234 +0,0 @@
#!/usr/bin/env python3
"""
Clone Demo Tenant Data - Database Level
Clones all data from base template tenant to a virtual demo tenant across all databases
"""
import asyncio
import sys
import os
from sqlalchemy.ext.asyncio import create_async_engine, AsyncSession, async_sessionmaker
from sqlalchemy import select
import uuid
import structlog
# Add app to path for imports
sys.path.insert(0, '/app')
logger = structlog.get_logger()
# Base template tenant IDs
DEMO_TENANT_SAN_PABLO = "a1b2c3d4-e5f6-47a8-b9c0-d1e2f3a4b5c6"
DEMO_TENANT_LA_ESPIGA = "b2c3d4e5-f6a7-48b9-c0d1-e2f3a4b5c6d7"
async def clone_inventory_data(base_tenant_id: str, virtual_tenant_id: str):
"""Clone inventory database tables using ORM"""
database_url = os.getenv("INVENTORY_DATABASE_URL")
if not database_url:
logger.warning("INVENTORY_DATABASE_URL not set, skipping inventory data")
return 0
engine = create_async_engine(database_url, echo=False)
session_factory = async_sessionmaker(engine, class_=AsyncSession, expire_on_commit=False)
total_cloned = 0
try:
from app.models.inventory import Ingredient
async with session_factory() as session:
# Clone ingredients
result = await session.execute(
select(Ingredient).where(Ingredient.tenant_id == uuid.UUID(base_tenant_id))
)
base_ingredients = result.scalars().all()
logger.info(f"Found {len(base_ingredients)} ingredients to clone")
for ing in base_ingredients:
new_ing = Ingredient(
id=uuid.uuid4(),
tenant_id=uuid.UUID(virtual_tenant_id),
name=ing.name,
sku=ing.sku,
barcode=ing.barcode,
product_type=ing.product_type,
ingredient_category=ing.ingredient_category,
product_category=ing.product_category,
subcategory=ing.subcategory,
description=ing.description,
brand=ing.brand,
unit_of_measure=ing.unit_of_measure,
package_size=ing.package_size,
average_cost=ing.average_cost,
last_purchase_price=ing.last_purchase_price,
standard_cost=ing.standard_cost,
low_stock_threshold=ing.low_stock_threshold,
reorder_point=ing.reorder_point,
reorder_quantity=ing.reorder_quantity,
max_stock_level=ing.max_stock_level,
shelf_life_days=ing.shelf_life_days,
is_perishable=ing.is_perishable,
is_active=ing.is_active,
allergen_info=ing.allergen_info
)
session.add(new_ing)
total_cloned += 1
await session.commit()
logger.info(f"Cloned {total_cloned} ingredients")
except Exception as e:
logger.error(f"Failed to clone inventory data: {str(e)}", exc_info=True)
raise
finally:
await engine.dispose()
return total_cloned
async def clone_sales_data(base_tenant_id: str, virtual_tenant_id: str):
"""Clone sales database tables"""
database_url = os.getenv("SALES_DATABASE_URL")
if not database_url:
logger.warning("SALES_DATABASE_URL not set, skipping sales data")
return 0
# Sales cloning not implemented yet
logger.info("Sales data cloning not yet implemented")
return 0
async def clone_orders_data(base_tenant_id: str, virtual_tenant_id: str):
"""Clone orders database tables"""
database_url = os.getenv("ORDERS_DATABASE_URL")
if not database_url:
logger.warning("ORDERS_DATABASE_URL not set, skipping orders data")
return 0
# Orders cloning not implemented yet
logger.info("Orders data cloning not yet implemented")
return 0
async def create_virtual_tenant(virtual_tenant_id: str, demo_account_type: str):
"""Create the virtual tenant record in tenant database"""
database_url = os.getenv("TENANT_DATABASE_URL")
if not database_url:
logger.warning("TENANT_DATABASE_URL not set, skipping tenant creation")
return
engine = create_async_engine(database_url, echo=False)
session_factory = async_sessionmaker(engine, class_=AsyncSession, expire_on_commit=False)
try:
# Import after adding to path
from services.tenant.app.models.tenants import Tenant
async with session_factory() as session:
# Check if tenant already exists
result = await session.execute(
select(Tenant).where(Tenant.id == uuid.UUID(virtual_tenant_id))
)
existing = result.scalars().first()
if existing:
logger.info(f"Virtual tenant {virtual_tenant_id} already exists")
return
# Create virtual tenant
tenant = Tenant(
id=uuid.UUID(virtual_tenant_id),
name=f"Demo Session Tenant",
is_demo=True,
is_demo_template=False,
business_model=demo_account_type
)
session.add(tenant)
await session.commit()
logger.info(f"Created virtual tenant {virtual_tenant_id}")
except ImportError:
# Tenant model not available, skip
logger.warning("Could not import Tenant model, skipping virtual tenant creation")
except Exception as e:
logger.error(f"Failed to create virtual tenant: {str(e)}", exc_info=True)
finally:
await engine.dispose()
async def clone_demo_tenant(virtual_tenant_id: str, demo_account_type: str = "individual_bakery"):
"""
Main function to clone all demo data for a virtual tenant
Args:
virtual_tenant_id: The UUID of the virtual tenant to create
demo_account_type: Type of demo account (individual_bakery or central_baker)
"""
base_tenant_id = DEMO_TENANT_SAN_PABLO if demo_account_type == "individual_bakery" else DEMO_TENANT_LA_ESPIGA
logger.info(
"Starting demo tenant cloning",
virtual_tenant=virtual_tenant_id,
base_tenant=base_tenant_id,
demo_type=demo_account_type
)
try:
# Create virtual tenant record
await create_virtual_tenant(virtual_tenant_id, demo_account_type)
# Clone data from each database
stats = {
"inventory": await clone_inventory_data(base_tenant_id, virtual_tenant_id),
"sales": await clone_sales_data(base_tenant_id, virtual_tenant_id),
"orders": await clone_orders_data(base_tenant_id, virtual_tenant_id),
}
total_records = sum(stats.values())
logger.info(
"Demo tenant cloning completed successfully",
virtual_tenant=virtual_tenant_id,
total_records=total_records,
stats=stats
)
# Print summary for job logs
print(f"✅ Cloning completed: {total_records} total records")
print(f" - Inventory: {stats['inventory']} records")
print(f" - Sales: {stats['sales']} records")
print(f" - Orders: {stats['orders']} records")
return True
except Exception as e:
logger.error(
"Demo tenant cloning failed",
virtual_tenant=virtual_tenant_id,
error=str(e),
exc_info=True
)
print(f"❌ Cloning failed: {str(e)}")
return False
if __name__ == "__main__":
# Get virtual tenant ID from environment or CLI argument
virtual_tenant_id = os.getenv("VIRTUAL_TENANT_ID") or (sys.argv[1] if len(sys.argv) > 1 else None)
demo_type = os.getenv("DEMO_ACCOUNT_TYPE", "individual_bakery")
if not virtual_tenant_id:
print("Usage: python clone_demo_tenant.py <virtual_tenant_id>")
print(" or: VIRTUAL_TENANT_ID=<uuid> python clone_demo_tenant.py")
sys.exit(1)
# Validate UUID
try:
uuid.UUID(virtual_tenant_id)
except ValueError:
print(f"Error: Invalid UUID format: {virtual_tenant_id}")
sys.exit(1)
result = asyncio.run(clone_demo_tenant(virtual_tenant_id, demo_type))
sys.exit(0 if result else 1)

View File

@@ -1,278 +0,0 @@
"""
Demo AI Models Seed Script
Creates fake AI models for demo tenants to populate the models list
without having actual trained model files.
"""
import asyncio
import sys
import os
from uuid import UUID
from datetime import datetime, timezone, timedelta
from decimal import Decimal
# Add project root to path
sys.path.insert(0, os.path.abspath(os.path.join(os.path.dirname(__file__), "../..")))
from sqlalchemy import select
from shared.database.base import create_database_manager
import structlog
# Import models - these paths work both locally and in container
try:
# Container environment (training-service image)
from app.models.training import TrainedModel
except ImportError:
# Local environment
from services.training.app.models.training import TrainedModel
# Tenant model - define minimal version for container environment
try:
from services.tenant.app.models.tenants import Tenant
except ImportError:
# If running in training-service container, define minimal Tenant model
from sqlalchemy import Column, String, Boolean
from sqlalchemy.dialects.postgresql import UUID as PGUUID
from sqlalchemy.ext.declarative import declarative_base
Base = declarative_base()
class Tenant(Base):
__tablename__ = "tenants"
id = Column(PGUUID(as_uuid=True), primary_key=True)
name = Column(String)
is_demo = Column(Boolean)
is_demo_template = Column(Boolean)
logger = structlog.get_logger()
class DemoAIModelSeeder:
"""Seed fake AI models for demo tenants"""
def __init__(self):
self.training_db_url = os.getenv("TRAINING_DATABASE_URL")
self.tenant_db_url = os.getenv("TENANT_DATABASE_URL")
if not self.training_db_url or not self.tenant_db_url:
raise ValueError("Missing required database URLs")
self.training_db = create_database_manager(self.training_db_url, "demo-ai-seed")
self.tenant_db = create_database_manager(self.tenant_db_url, "demo-tenant-seed")
async def get_demo_tenants(self):
"""Get all demo tenants"""
async with self.tenant_db.get_session() as session:
result = await session.execute(
select(Tenant).where(Tenant.is_demo == True, Tenant.is_demo_template == True)
)
return result.scalars().all()
async def get_tenant_products(self, tenant_id: UUID):
"""
Get finished products for a tenant from inventory database.
We need to query the actual inventory to get real product UUIDs.
"""
try:
inventory_db_url = os.getenv("INVENTORY_DATABASE_URL")
if not inventory_db_url:
logger.warning("INVENTORY_DATABASE_URL not set, cannot get products")
return []
inventory_db = create_database_manager(inventory_db_url, "demo-inventory-check")
# Define minimal Ingredient model for querying
from sqlalchemy import Column, String, Enum as SQLEnum
from sqlalchemy.dialects.postgresql import UUID as PGUUID
from sqlalchemy.ext.declarative import declarative_base
import enum
Base = declarative_base()
class IngredientType(str, enum.Enum):
INGREDIENT = "INGREDIENT"
FINISHED_PRODUCT = "FINISHED_PRODUCT"
class Ingredient(Base):
__tablename__ = "ingredients"
id = Column(PGUUID(as_uuid=True), primary_key=True)
tenant_id = Column(PGUUID(as_uuid=True))
name = Column(String)
ingredient_type = Column(SQLEnum(IngredientType, name="ingredienttype"))
async with inventory_db.get_session() as session:
result = await session.execute(
select(Ingredient).where(
Ingredient.tenant_id == tenant_id,
Ingredient.ingredient_type == IngredientType.FINISHED_PRODUCT
).limit(10) # Get up to 10 finished products
)
products = result.scalars().all()
product_list = [
{"id": product.id, "name": product.name}
for product in products
]
logger.info(f"Found {len(product_list)} finished products for tenant",
tenant_id=str(tenant_id))
return product_list
except Exception as e:
logger.error("Error fetching tenant products", error=str(e), tenant_id=str(tenant_id))
return []
async def create_fake_model(self, session, tenant_id: UUID, product_info: dict):
"""Create a fake AI model entry for a product"""
now = datetime.now(timezone.utc)
training_start = now - timedelta(days=90)
training_end = now - timedelta(days=7)
fake_model = TrainedModel(
tenant_id=tenant_id,
inventory_product_id=product_info["id"],
model_type="prophet_optimized",
model_version="1.0-demo",
job_id=f"demo-job-{tenant_id}-{product_info['id']}",
# Fake file paths (files don't actually exist)
model_path=f"/fake/models/{tenant_id}/{product_info['id']}/model.pkl",
metadata_path=f"/fake/models/{tenant_id}/{product_info['id']}/metadata.json",
# Fake but realistic metrics
mape=Decimal("12.5"), # Mean Absolute Percentage Error
mae=Decimal("2.3"), # Mean Absolute Error
rmse=Decimal("3.1"), # Root Mean Squared Error
r2_score=Decimal("0.85"), # R-squared
training_samples=60, # 60 days of training data
# Fake hyperparameters
hyperparameters={
"changepoint_prior_scale": 0.05,
"seasonality_prior_scale": 10.0,
"holidays_prior_scale": 10.0,
"seasonality_mode": "multiplicative"
},
# Features used
features_used=["weekday", "month", "is_holiday", "temperature", "precipitation"],
# Normalization params (fake)
normalization_params={
"temperature": {"mean": 15.0, "std": 5.0},
"precipitation": {"mean": 2.0, "std": 1.5}
},
# Model status
is_active=True,
is_production=False, # Demo models are not production-ready
# Training data info
training_start_date=training_start,
training_end_date=training_end,
data_quality_score=Decimal("0.75"), # Good but not excellent
# Metadata
notes="Demo model - No actual trained file exists. For demonstration purposes only.",
created_by="demo-seed-script",
created_at=now,
updated_at=now,
last_used_at=None
)
session.add(fake_model)
return fake_model
async def seed_models_for_tenant(self, tenant: Tenant):
"""Create fake AI models for a demo tenant"""
logger.info("Creating fake AI models for demo tenant",
tenant_id=str(tenant.id),
tenant_name=tenant.name)
try:
# Get products for this tenant
products = await self.get_tenant_products(tenant.id)
async with self.training_db.get_session() as session:
models_created = 0
for product in products:
# Check if model already exists
result = await session.execute(
select(TrainedModel).where(
TrainedModel.tenant_id == tenant.id,
TrainedModel.inventory_product_id == product["id"]
)
)
existing_model = result.scalars().first()
if existing_model:
logger.info("Model already exists, skipping",
tenant_id=str(tenant.id),
product_id=product["id"])
continue
# Create fake model
model = await self.create_fake_model(session, tenant.id, product)
models_created += 1
logger.info("Created fake AI model",
tenant_id=str(tenant.id),
product_id=product["id"],
model_id=str(model.id))
await session.commit()
logger.info("Successfully created fake AI models for tenant",
tenant_id=str(tenant.id),
models_created=models_created)
except Exception as e:
logger.error("Error creating fake AI models for tenant",
tenant_id=str(tenant.id),
error=str(e))
raise
async def seed_all_demo_models(self):
"""Seed fake AI models for all demo tenants"""
logger.info("Starting demo AI models seeding")
try:
# Get all demo tenants
demo_tenants = await self.get_demo_tenants()
if not demo_tenants:
logger.warning("No demo tenants found")
return
logger.info(f"Found {len(demo_tenants)} demo tenants")
# Seed models for each tenant
for tenant in demo_tenants:
await self.seed_models_for_tenant(tenant)
logger.info("✅ Demo AI models seeding completed successfully",
tenants_processed=len(demo_tenants))
except Exception as e:
logger.error("❌ Demo AI models seeding failed", error=str(e))
raise
async def main():
"""Main entry point"""
logger.info("Demo AI Models Seed Script started")
try:
seeder = DemoAIModelSeeder()
await seeder.seed_all_demo_models()
logger.info("Demo AI models seed completed successfully")
except Exception as e:
logger.error("Demo AI models seed failed", error=str(e))
sys.exit(1)
if __name__ == "__main__":
asyncio.run(main())
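
Because the seeded `model_path` values point at `/fake/models/...` files that are never written to disk, any consumer that loads model artifacts has to tolerate missing files. A minimal defensive sketch (the loader name and structure are illustrative assumptions, not the project's actual forecasting API):

```python
import os
import pickle
from typing import Any, Optional


def load_model_or_none(model_path: str) -> Optional[Any]:
    """Return the unpickled model, or None when the artifact is missing (e.g. demo-seeded rows)."""
    if not os.path.exists(model_path):
        # Demo TrainedModel rows deliberately reference /fake/models/... paths
        return None
    with open(model_path, "rb") as fh:
        return pickle.load(fh)
```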

View File

@@ -1,338 +0,0 @@
#!/usr/bin/env python3
"""
Seed Demo Inventory Data
Populates comprehensive Spanish inventory data for both demo tenants
"""
import asyncio
import sys
from pathlib import Path
project_root = Path(__file__).parent.parent.parent
sys.path.insert(0, str(project_root))
import os
from sqlalchemy.ext.asyncio import create_async_engine, AsyncSession, async_sessionmaker
from sqlalchemy import select, delete
import structlog
import uuid
from datetime import datetime, timedelta, timezone
logger = structlog.get_logger()
# Demo tenant IDs
DEMO_TENANT_SAN_PABLO = "a1b2c3d4-e5f6-47a8-b9c0-d1e2f3a4b5c6"
DEMO_TENANT_LA_ESPIGA = "b2c3d4e5-f6a7-48b9-c0d1-e2f3a4b5c6d7"
async def seed_inventory_for_tenant(session, tenant_id: str, business_model: str):
"""Seed inventory data for a specific tenant"""
try:
from app.models.inventory import Ingredient, Stock, StockMovement
except ImportError:
from services.inventory.app.models.inventory import Ingredient, Stock, StockMovement
logger.info(f"Seeding inventory for {business_model}", tenant_id=tenant_id)
# Check if data already exists - if so, skip seeding to avoid duplicates
result = await session.execute(select(Ingredient).where(Ingredient.tenant_id == uuid.UUID(tenant_id)).limit(1))
existing = result.scalars().first()
if existing:
logger.info(f"Demo tenant {tenant_id} already has inventory data, skipping seed")
return
if business_model == "individual_bakery":
await seed_individual_bakery_inventory(session, tenant_id)
elif business_model == "central_baker_satellite":
await seed_central_baker_inventory(session, tenant_id)
async def seed_individual_bakery_inventory(session, tenant_id: str):
"""Seed inventory for individual bakery (produces locally)"""
try:
from app.models.inventory import Ingredient, Stock
except ImportError:
from services.inventory.app.models.inventory import Ingredient, Stock
tenant_uuid = uuid.UUID(tenant_id)
# Raw ingredients for local production
ingredients_data = [
# Flours
("Harina de Trigo 000", "INGREDIENT", "FLOUR", None, "KILOGRAMS", 25.0, 50.0, 200.0, 2.50, "Molinos del Valle"),
("Harina Integral", "INGREDIENT", "FLOUR", None, "KILOGRAMS", 15.0, 30.0, 100.0, 3.20, "Bio Natural"),
("Harina de Centeno", "INGREDIENT", "FLOUR", None, "KILOGRAMS", 10.0, 20.0, 50.0, 3.50, "Ecológica"),
# Yeasts
("Levadura Fresca", "INGREDIENT", "YEAST", None, "KILOGRAMS", 1.0, 2.5, 10.0, 8.50, "Levapan"),
("Levadura Seca Activa", "INGREDIENT", "YEAST", None, "KILOGRAMS", 0.5, 1.0, 5.0, 12.00, "Fleischmann"),
# Fats
("Mantequilla", "INGREDIENT", "FATS", None, "KILOGRAMS", 3.0, 8.0, 25.0, 6.80, "La Serenísima"),
("Aceite de Oliva Virgen Extra", "INGREDIENT", "FATS", None, "LITERS", 2.0, 5.0, 20.0, 15.50, "Cocinero"),
# Dairy and eggs
("Huevos Frescos", "INGREDIENT", "EGGS", None, "UNITS", 36, 60, 180, 0.25, "Granja San José"),
("Leche Entera", "INGREDIENT", "DAIRY", None, "LITERS", 5.0, 12.0, 50.0, 1.80, "La Serenísima"),
("Nata para Montar", "INGREDIENT", "DAIRY", None, "LITERS", 2.0, 5.0, 20.0, 3.50, "Central Lechera"),
# Sugars
("Azúcar Blanca", "INGREDIENT", "SUGAR", None, "KILOGRAMS", 8.0, 20.0, 100.0, 1.20, "Ledesma"),
("Azúcar Morena", "INGREDIENT", "SUGAR", None, "KILOGRAMS", 3.0, 8.0, 25.0, 2.80, "Orgánica"),
("Azúcar Glass", "INGREDIENT", "SUGAR", None, "KILOGRAMS", 2.0, 5.0, 20.0, 2.20, "Ledesma"),
# Salt and spices
("Sal Fina", "INGREDIENT", "SALT", None, "KILOGRAMS", 2.0, 5.0, 20.0, 0.80, "Celusal"),
("Canela en Polvo", "INGREDIENT", "SPICES", None, "GRAMS", 50, 150, 500, 0.08, "Alicante"),
("Vainilla en Extracto", "INGREDIENT", "SPICES", None, "MILLILITERS", 100, 250, 1000, 0.15, "McCormick"),
# Chocolates and additives
("Chocolate Negro 70%", "INGREDIENT", "ADDITIVES", None, "KILOGRAMS", 1.0, 3.0, 15.0, 8.50, "Valor"),
("Cacao en Polvo", "INGREDIENT", "ADDITIVES", None, "KILOGRAMS", 0.5, 2.0, 10.0, 6.50, "Nestlé"),
("Nueces Peladas", "INGREDIENT", "ADDITIVES", None, "KILOGRAMS", 0.5, 1.5, 8.0, 12.00, "Los Nogales"),
("Pasas de Uva", "INGREDIENT", "ADDITIVES", None, "KILOGRAMS", 1.0, 2.0, 10.0, 4.50, "Mendoza Premium"),
# Finished products (local production)
("Croissant Clásico", "FINISHED_PRODUCT", None, "CROISSANTS", "PIECES", 12, 30, 80, 1.20, None),
("Pan Integral", "FINISHED_PRODUCT", None, "BREAD", "PIECES", 8, 20, 50, 2.50, None),
("Napolitana de Chocolate", "FINISHED_PRODUCT", None, "PASTRIES", "PIECES", 10, 25, 60, 1.80, None),
("Pan de Masa Madre", "FINISHED_PRODUCT", None, "BREAD", "PIECES", 6, 15, 40, 3.50, None),
("Magdalena de Vainilla", "FINISHED_PRODUCT", None, "PASTRIES", "PIECES", 8, 20, 50, 1.00, None),
]
ingredient_map = {}
for name, product_type, ing_cat, prod_cat, uom, low_stock, reorder, reorder_qty, cost, brand in ingredients_data:
ing = Ingredient(
id=uuid.uuid4(),
tenant_id=tenant_uuid,
name=name,
product_type=product_type,
ingredient_category=ing_cat,
product_category=prod_cat,
unit_of_measure=uom,
low_stock_threshold=low_stock,
reorder_point=reorder,
reorder_quantity=reorder_qty,
average_cost=cost,
brand=brand,
is_active=True,
is_perishable=(ing_cat in ["DAIRY", "EGGS"] if ing_cat else False),
shelf_life_days=7 if ing_cat in ["DAIRY", "EGGS"] else (365 if ing_cat else 2),
created_at=datetime.now(timezone.utc)
)
session.add(ing)
ingredient_map[name] = ing
await session.commit()
# Create stock lots
now = datetime.now(timezone.utc)
# Harina de Trigo - Good stock
harina_trigo = ingredient_map["Harina de Trigo 000"]
session.add(Stock(
id=uuid.uuid4(),
tenant_id=tenant_uuid,
ingredient_id=harina_trigo.id,
production_stage="raw_ingredient",
current_quantity=120.0,
reserved_quantity=15.0,
available_quantity=105.0,
batch_number=f"HARINA-TRI-{now.strftime('%Y%m%d')}-001",
received_date=now - timedelta(days=5),
expiration_date=now + timedelta(days=360),
unit_cost=2.50,
total_cost=300.0,
storage_location="Almacén Principal - Estante A1",
is_available=True,
is_expired=False,
quality_status="good",
created_at=now
))
# Levadura Fresca - Low stock (critical)
levadura = ingredient_map["Levadura Fresca"]
session.add(Stock(
id=uuid.uuid4(),
tenant_id=tenant_uuid,
ingredient_id=levadura.id,
production_stage="raw_ingredient",
current_quantity=0.8,
reserved_quantity=0.3,
available_quantity=0.5,
batch_number=f"LEVAD-FRE-{now.strftime('%Y%m%d')}-001",
received_date=now - timedelta(days=2),
expiration_date=now + timedelta(days=5),
unit_cost=8.50,
total_cost=6.8,
storage_location="Cámara Fría - Nivel 2",
is_available=True,
is_expired=False,
quality_status="good",
created_at=now
))
# Croissants - Fresh batch
croissant = ingredient_map["Croissant Clásico"]
session.add(Stock(
id=uuid.uuid4(),
tenant_id=tenant_uuid,
ingredient_id=croissant.id,
production_stage="fully_baked",
current_quantity=35,
reserved_quantity=5,
available_quantity=30,
batch_number=f"CROIS-FRESH-{now.strftime('%Y%m%d')}-001",
received_date=now - timedelta(hours=4),
expiration_date=now + timedelta(hours=20),
unit_cost=1.20,
total_cost=42.0,
storage_location="Vitrina Principal - Nivel 1",
is_available=True,
is_expired=False,
quality_status="good",
created_at=now
))
await session.commit()
logger.info("Individual bakery inventory seeded")
async def seed_central_baker_inventory(session, tenant_id: str):
"""Seed inventory for central baker satellite (receives products)"""
try:
from app.models.inventory import Ingredient, Stock
except ImportError:
from services.inventory.app.models.inventory import Ingredient, Stock
tenant_uuid = uuid.UUID(tenant_id)
# Finished and par-baked products from central baker
ingredients_data = [
# Par-baked products (from the central bakery)
("Croissant Pre-Horneado", "FINISHED_PRODUCT", None, "CROISSANTS", "PIECES", 20, 50, 150, 0.85, "Obrador Central"),
("Pan Baguette Pre-Horneado", "FINISHED_PRODUCT", None, "BREAD", "PIECES", 15, 40, 120, 1.20, "Obrador Central"),
("Napolitana Pre-Horneada", "FINISHED_PRODUCT", None, "PASTRIES", "PIECES", 15, 35, 100, 1.50, "Obrador Central"),
("Pan de Molde Pre-Horneado", "FINISHED_PRODUCT", None, "BREAD", "PIECES", 10, 25, 80, 1.80, "Obrador Central"),
# Finished products (ready to sell)
("Croissant de Mantequilla", "FINISHED_PRODUCT", None, "CROISSANTS", "PIECES", 15, 40, 100, 1.20, "Obrador Central"),
("Palmera de Hojaldre", "FINISHED_PRODUCT", None, "PASTRIES", "PIECES", 10, 30, 80, 2.20, "Obrador Central"),
("Magdalena Tradicional", "FINISHED_PRODUCT", None, "PASTRIES", "PIECES", 12, 30, 80, 1.00, "Obrador Central"),
("Empanada de Atún", "FINISHED_PRODUCT", None, "OTHER_PRODUCTS", "PIECES", 8, 20, 60, 3.50, "Obrador Central"),
("Pan Integral de Molde", "FINISHED_PRODUCT", None, "BREAD", "PIECES", 10, 25, 75, 2.80, "Obrador Central"),
# A few basic ingredients
("Café en Grano", "INGREDIENT", "OTHER", None, "KILOGRAMS", 2.0, 5.0, 20.0, 18.50, "Lavazza"),
("Leche para Cafetería", "INGREDIENT", "DAIRY", None, "LITERS", 10.0, 20.0, 80.0, 1.50, "Central Lechera"),
("Azúcar para Cafetería", "INGREDIENT", "SUGAR", None, "KILOGRAMS", 3.0, 8.0, 30.0, 1.00, "Azucarera"),
]
ingredient_map = {}
for name, product_type, ing_cat, prod_cat, uom, low_stock, reorder, reorder_qty, cost, brand in ingredients_data:
ing = Ingredient(
id=uuid.uuid4(),
tenant_id=tenant_uuid,
name=name,
product_type=product_type,
ingredient_category=ing_cat,
product_category=prod_cat,
unit_of_measure=uom,
low_stock_threshold=low_stock,
reorder_point=reorder,
reorder_quantity=reorder_qty,
average_cost=cost,
brand=brand,
is_active=True,
is_perishable=True,
shelf_life_days=3,
created_at=datetime.now(timezone.utc)
)
session.add(ing)
ingredient_map[name] = ing
await session.commit()
# Create stock lots
now = datetime.now(timezone.utc)
# Par-baked croissants
croissant_pre = ingredient_map["Croissant Pre-Horneado"]
session.add(Stock(
id=uuid.uuid4(),
tenant_id=tenant_uuid,
ingredient_id=croissant_pre.id,
production_stage="par_baked",
current_quantity=75,
reserved_quantity=15,
available_quantity=60,
batch_number=f"CROIS-PAR-{now.strftime('%Y%m%d')}-001",
received_date=now - timedelta(days=1),
expiration_date=now + timedelta(days=4),
unit_cost=0.85,
total_cost=63.75,
storage_location="Congelador - Sección A",
is_available=True,
is_expired=False,
quality_status="good",
created_at=now
))
# Finished palmeras
palmera = ingredient_map["Palmera de Hojaldre"]
session.add(Stock(
id=uuid.uuid4(),
tenant_id=tenant_uuid,
ingredient_id=palmera.id,
production_stage="fully_baked",
current_quantity=28,
reserved_quantity=4,
available_quantity=24,
batch_number=f"PALM-{now.strftime('%Y%m%d')}-001",
received_date=now - timedelta(hours=3),
expiration_date=now + timedelta(hours=45),
unit_cost=2.20,
total_cost=61.6,
storage_location="Vitrina Pasteles - Nivel 2",
is_available=True,
is_expired=False,
quality_status="good",
created_at=now
))
await session.commit()
logger.info("Central baker satellite inventory seeded")
async def seed_demo_inventory():
"""Main seeding function"""
database_url = os.getenv("INVENTORY_DATABASE_URL")
if not database_url:
logger.error("INVENTORY_DATABASE_URL not set")
return False
engine = create_async_engine(database_url, echo=False)
session_factory = async_sessionmaker(engine, class_=AsyncSession, expire_on_commit=False)
try:
async with session_factory() as session:
# Seed both demo tenants
await seed_inventory_for_tenant(session, DEMO_TENANT_SAN_PABLO, "individual_bakery")
await seed_inventory_for_tenant(session, DEMO_TENANT_LA_ESPIGA, "central_baker_satellite")
logger.info("Demo inventory data seeded successfully")
return True
except Exception as e:
logger.error(f"Failed to seed inventory: {str(e)}")
import traceback
traceback.print_exc()
return False
finally:
await engine.dispose()
if __name__ == "__main__":
result = asyncio.run(seed_demo_inventory())
sys.exit(0 if result else 1)
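
A quick way to sanity-check the seeded data is to list the lots that sit at or below their low-stock threshold (the seed intentionally leaves "Levadura Fresca" in that state). A minimal verification sketch, assuming the same model import path the seed script falls back to and `INVENTORY_DATABASE_URL` pointing at the inventory database:

```python
import asyncio
import os
import uuid

from sqlalchemy import select
from sqlalchemy.ext.asyncio import AsyncSession, async_sessionmaker, create_async_engine

# Assumed import path, mirroring the seed script's fallback import
from services.inventory.app.models.inventory import Ingredient, Stock

DEMO_TENANT_SAN_PABLO = uuid.UUID("a1b2c3d4-e5f6-47a8-b9c0-d1e2f3a4b5c6")


async def list_low_stock() -> None:
    engine = create_async_engine(os.environ["INVENTORY_DATABASE_URL"], echo=False)
    factory = async_sessionmaker(engine, class_=AsyncSession, expire_on_commit=False)
    async with factory() as session:
        result = await session.execute(
            select(Ingredient.name, Stock.available_quantity, Ingredient.low_stock_threshold)
            .join(Stock, Stock.ingredient_id == Ingredient.id)
            .where(
                Ingredient.tenant_id == DEMO_TENANT_SAN_PABLO,
                Stock.available_quantity <= Ingredient.low_stock_threshold,
            )
        )
        for name, available, threshold in result.all():
            print(f"{name}: {available} available (threshold {threshold})")
    await engine.dispose()


if __name__ == "__main__":
    asyncio.run(list_low_stock())
```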

View File

@@ -1,144 +0,0 @@
#!/usr/bin/env python3
"""
Seed Demo Tenants
Creates base demo tenant templates with Spanish data
"""
import asyncio
import sys
from pathlib import Path
project_root = Path(__file__).parent.parent.parent
sys.path.insert(0, str(project_root))
import os
from sqlalchemy.ext.asyncio import create_async_engine, AsyncSession, async_sessionmaker
from sqlalchemy import select
import structlog
import uuid
from datetime import datetime, timezone
logger = structlog.get_logger()
# Demo tenant configurations
DEMO_TENANTS = [
{
"id": "a1b2c3d4-e5f6-47a8-b9c0-d1e2f3a4b5c6",
"name": "Panadería San Pablo - Demo",
"subdomain": "demo-sanpablo",
"business_type": "bakery",
"business_model": "individual_bakery",
"owner_id": "c1a2b3c4-d5e6-47a8-b9c0-d1e2f3a4b5c6", # María García
"address": "Calle Mayor, 15",
"city": "Madrid",
"postal_code": "28013",
"latitude": 40.4168,
"longitude": -3.7038,
"phone": "+34 912 345 678",
"email": "contacto@panaderiasanpablo.com",
"subscription_tier": "professional",
"is_active": True,
"is_demo": True,
"is_demo_template": True,
"ml_model_trained": True,
},
{
"id": "b2c3d4e5-f6a7-48b9-c0d1-e2f3a4b5c6d7",
"name": "Panadería La Espiga - Demo",
"subdomain": "demo-laespiga",
"business_type": "bakery",
"business_model": "central_baker_satellite",
"owner_id": "d2e3f4a5-b6c7-48d9-e0f1-a2b3c4d5e6f7", # Carlos Martínez
"address": "Avenida de la Constitución, 42",
"city": "Barcelona",
"postal_code": "08001",
"latitude": 41.3851,
"longitude": 2.1734,
"phone": "+34 913 456 789",
"email": "contacto@panaderialaespiga.com",
"subscription_tier": "enterprise",
"is_active": True,
"is_demo": True,
"is_demo_template": True,
"ml_model_trained": True,
}
]
async def seed_demo_tenants():
"""Seed demo tenants into tenant database"""
database_url = os.getenv("TENANT_DATABASE_URL")
if not database_url:
logger.error("TENANT_DATABASE_URL environment variable not set")
return False
logger.info("Connecting to tenant database", url=database_url.split("@")[-1])
engine = create_async_engine(database_url, echo=False)
session_factory = async_sessionmaker(engine, class_=AsyncSession, expire_on_commit=False)
try:
async with session_factory() as session:
try:
from app.models.tenants import Tenant
except ImportError:
from services.tenant.app.models.tenants import Tenant
for tenant_data in DEMO_TENANTS:
# Check if tenant already exists
result = await session.execute(
select(Tenant).where(Tenant.subdomain == tenant_data["subdomain"])
)
existing_tenant = result.scalar_one_or_none()
if existing_tenant:
logger.info(f"Demo tenant already exists: {tenant_data['subdomain']}")
continue
# Create new demo tenant
tenant = Tenant(
id=uuid.UUID(tenant_data["id"]),
name=tenant_data["name"],
subdomain=tenant_data["subdomain"],
business_type=tenant_data["business_type"],
business_model=tenant_data["business_model"],
owner_id=uuid.UUID(tenant_data["owner_id"]),
address=tenant_data["address"],
city=tenant_data["city"],
postal_code=tenant_data["postal_code"],
latitude=tenant_data.get("latitude"),
longitude=tenant_data.get("longitude"),
phone=tenant_data.get("phone"),
email=tenant_data.get("email"),
subscription_tier=tenant_data["subscription_tier"],
is_active=tenant_data["is_active"],
is_demo=tenant_data["is_demo"],
is_demo_template=tenant_data["is_demo_template"],
ml_model_trained=tenant_data.get("ml_model_trained", False),
created_at=datetime.now(timezone.utc),
updated_at=datetime.now(timezone.utc)
)
session.add(tenant)
logger.info(f"Created demo tenant: {tenant_data['name']}")
await session.commit()
logger.info("Demo tenants seeded successfully")
return True
except Exception as e:
logger.error(f"Failed to seed demo tenants: {str(e)}")
import traceback
traceback.print_exc()
return False
finally:
await engine.dispose()
if __name__ == "__main__":
result = asyncio.run(seed_demo_tenants())
sys.exit(0 if result else 1)
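
The two rows above are templates only; each demo session works against a per-session virtual tenant cloned from them. A rough sketch of what such a clone could look like (hypothetical helper, assuming the tenant service's `Tenant` model is importable as in the script above; the real `/internal/demo/clone` implementation is not shown in this diff):

```python
import uuid
from datetime import datetime, timezone

from services.tenant.app.models.tenants import Tenant  # assumed import path


def build_virtual_tenant(template: Tenant, virtual_tenant_id: uuid.UUID) -> Tenant:
    """Copy a demo template tenant into a session-scoped virtual tenant (illustrative only)."""
    now = datetime.now(timezone.utc)
    return Tenant(
        id=virtual_tenant_id,
        name=template.name,
        subdomain=f"demo-{virtual_tenant_id.hex[:8]}",  # unique per session
        business_type=template.business_type,
        business_model=template.business_model,
        owner_id=template.owner_id,
        address=template.address,
        city=template.city,
        postal_code=template.postal_code,
        subscription_tier=template.subscription_tier,
        is_active=True,
        is_demo=True,
        is_demo_template=False,  # virtual tenants are never templates
        created_at=now,
        updated_at=now,
    )
```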

View File

@@ -1,49 +0,0 @@
#!/usr/bin/env python3
"""
Manual demo data seeding script
Run this to populate the base demo template tenant with inventory data
"""
import asyncio
import sys
import os
# Add the project root to Python path
sys.path.insert(0, os.path.join(os.path.dirname(__file__), '..'))
async def seed_demo_data():
"""Seed demo data by running all seed scripts in order"""
from scripts.demo.seed_demo_users import main as seed_users
from scripts.demo.seed_demo_tenants import seed_demo_tenants as seed_tenants
from scripts.demo.seed_demo_inventory import seed_demo_inventory as seed_inventory
from scripts.demo.seed_demo_ai_models import main as seed_ai_models
print("🌱 Starting demo data seeding...")
try:
print("\n📝 Step 1: Seeding demo users...")
await seed_users()
print("✅ Demo users seeded successfully")
print("\n🏢 Step 2: Seeding demo tenants...")
await seed_tenants()
print("✅ Demo tenants seeded successfully")
print("\n📦 Step 3: Seeding demo inventory...")
await seed_inventory()
print("✅ Demo inventory seeded successfully")
print("\n🤖 Step 4: Seeding demo AI models...")
await seed_ai_models()
print("✅ Demo AI models seeded successfully")
print("\n🎉 All demo data seeded successfully!")
except Exception as e:
print(f"\n❌ Error during seeding: {e}")
import traceback
traceback.print_exc()
sys.exit(1)
if __name__ == "__main__":
asyncio.run(seed_demo_data())

View File

@@ -27,8 +27,7 @@ COPY --from=shared /shared /app/shared
# Copy application code
COPY services/alert_processor/ .
-# Copy scripts directory
-COPY scripts/ /app/scripts/
# Add shared libraries to Python path
ENV PYTHONPATH="/app:/app/shared:${PYTHONPATH:-}"

View File

@@ -27,8 +27,7 @@ COPY --from=shared /shared /app/shared
# Copy application code
COPY services/auth/ .
-# Copy scripts directory
-COPY scripts/ /app/scripts/
# Add shared libraries to Python path
ENV PYTHONPATH="/app:/app/shared:${PYTHONPATH:-}"

View File

@@ -20,6 +20,41 @@ logger = structlog.get_logger()
route_builder = RouteBuilder('demo')
async def _background_cloning_task(session_id: str, session_obj_id: UUID, base_tenant_id: str):
"""Background task for orchestrated cloning - creates its own DB session"""
from app.core.database import db_manager
from app.models import DemoSession
from sqlalchemy import select
# Create new database session for background task
async with db_manager.session_factory() as db:
try:
# Get Redis client
redis = await get_redis()
# Fetch the session from the database
result = await db.execute(
select(DemoSession).where(DemoSession.id == session_obj_id)
)
session = result.scalar_one_or_none()
if not session:
logger.error("Session not found for cloning", session_id=session_id)
return
# Create session manager with new DB session
session_manager = DemoSessionManager(db, redis)
await session_manager.trigger_orchestrated_cloning(session, base_tenant_id)
except Exception as e:
logger.error(
"Background cloning failed",
session_id=session_id,
error=str(e),
exc_info=True
)
@router.post(
route_builder.build_base_route("sessions", include_tenant_prefix=False),
response_model=DemoSessionResponse,
@@ -46,22 +81,19 @@ async def create_demo_session(
user_agent=user_agent
)
-# Trigger async data cloning job
+# Trigger async orchestrated cloning in background
-from app.services.k8s_job_cloner import K8sJobCloner
import asyncio
+from app.core.config import settings
+from app.models import DemoSession
-job_cloner = K8sJobCloner()
+# Get base tenant ID from config
+demo_config = settings.DEMO_ACCOUNTS.get(request.demo_account_type, {})
+base_tenant_id = demo_config.get("base_tenant_id", str(session.base_demo_tenant_id))
+# Start cloning in background task with session ID (not session object)
asyncio.create_task(
-job_cloner.clone_tenant_data(
-session.session_id,
-"",
-str(session.virtual_tenant_id),
-request.demo_account_type
-)
+_background_cloning_task(session.session_id, session.id, base_tenant_id)
)
-await session_manager.mark_data_cloned(session.session_id)
-await session_manager.mark_redis_populated(session.session_id)
# Generate session token
session_token = jwt.encode(
@@ -110,6 +142,61 @@ async def get_session_info(
return session.to_dict()
@router.get(
route_builder.build_resource_detail_route("sessions", "session_id", include_tenant_prefix=False) + "/status",
response_model=dict
)
async def get_session_status(
session_id: str = Path(...),
db: AsyncSession = Depends(get_db),
redis: RedisClient = Depends(get_redis)
):
"""
Get demo session provisioning status
Returns current status of data cloning and readiness.
Use this endpoint for polling (recommended interval: 1-2 seconds).
"""
session_manager = DemoSessionManager(db, redis)
status = await session_manager.get_session_status(session_id)
if not status:
raise HTTPException(status_code=404, detail="Session not found")
return status
@router.post(
route_builder.build_resource_detail_route("sessions", "session_id", include_tenant_prefix=False) + "/retry",
response_model=dict
)
async def retry_session_cloning(
session_id: str = Path(...),
db: AsyncSession = Depends(get_db),
redis: RedisClient = Depends(get_redis)
):
"""
Retry failed cloning operations
Only available for sessions in "failed" or "partial" status.
"""
try:
session_manager = DemoSessionManager(db, redis)
result = await session_manager.retry_failed_cloning(session_id)
return {
"message": "Cloning retry initiated",
"session_id": session_id,
"result": result
}
except ValueError as e:
raise HTTPException(status_code=400, detail=str(e))
except Exception as e:
logger.error("Failed to retry cloning", error=str(e))
raise HTTPException(status_code=500, detail=str(e))
@router.delete(
route_builder.build_resource_detail_route("sessions", "session_id", include_tenant_prefix=False),
response_model=dict
@@ -129,3 +216,24 @@ async def destroy_demo_session(
except Exception as e:
logger.error("Failed to destroy session", error=str(e))
raise HTTPException(status_code=500, detail=str(e))
@router.post(
route_builder.build_resource_detail_route("sessions", "session_id", include_tenant_prefix=False) + "/destroy",
response_model=dict
)
async def destroy_demo_session_post(
session_id: str = Path(...),
db: AsyncSession = Depends(get_db),
redis: RedisClient = Depends(get_redis)
):
"""Destroy demo session via POST (for frontend compatibility)"""
try:
session_manager = DemoSessionManager(db, redis)
await session_manager.destroy_session(session_id)
return {"message": "Session destroyed successfully", "session_id": session_id}
except Exception as e:
logger.error("Failed to destroy session", error=str(e))
raise HTTPException(status_code=500, detail=str(e))
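
The `/status` endpoint above is meant to be polled while cloning runs in the background. A minimal client-side sketch with `httpx` (the exact path prefix produced by `RouteBuilder` and the response's `status` field are assumptions based on this diff):

```python
import asyncio
import time

import httpx


async def wait_until_ready(base_url: str, session_id: str, timeout_s: float = 60.0) -> dict:
    """Poll the demo session status endpoint every ~1.5s until it leaves 'pending'."""
    deadline = time.monotonic() + timeout_s
    async with httpx.AsyncClient(timeout=5.0) as client:
        while True:
            resp = await client.get(f"{base_url}/api/demo/sessions/{session_id}/status")
            resp.raise_for_status()
            status = resp.json()
            if status.get("status") != "pending":
                return status  # ready, partial or failed
            if time.monotonic() > deadline:
                raise TimeoutError(f"Demo session {session_id} not ready after {timeout_s}s")
            await asyncio.sleep(1.5)  # recommended polling interval: 1-2 seconds
```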

View File

@@ -36,12 +36,14 @@ class Settings(BaseSettings):
"individual_bakery": { "individual_bakery": {
"email": "demo.individual@panaderiasanpablo.com", "email": "demo.individual@panaderiasanpablo.com",
"name": "Panadería San Pablo - Demo", "name": "Panadería San Pablo - Demo",
"subdomain": "demo-sanpablo" "subdomain": "demo-sanpablo",
"base_tenant_id": "a1b2c3d4-e5f6-47a8-b9c0-d1e2f3a4b5c6"
}, },
"central_baker": { "central_baker": {
"email": "demo.central@panaderialaespiga.com", "email": "demo.central@panaderialaespiga.com",
"name": "Panadería La Espiga - Demo", "name": "Panadería La Espiga - Demo",
"subdomain": "demo-laespiga" "subdomain": "demo-laespiga",
"base_tenant_id": "b2c3d4e5-f6a7-48b9-c0d1-e2f3a4b5c6d7"
} }
} }

View File

@@ -1,5 +1,5 @@
"""Demo Session Service Models""" """Demo Session Service Models"""
from .demo_session import DemoSession, DemoSessionStatus from .demo_session import DemoSession, DemoSessionStatus, CloningStatus
__all__ = ["DemoSession", "DemoSessionStatus"] __all__ = ["DemoSession", "DemoSessionStatus", "CloningStatus"]

View File

@@ -14,9 +14,21 @@ from shared.database.base import Base
class DemoSessionStatus(enum.Enum):
"""Demo session status"""
-ACTIVE = "active"
-EXPIRED = "expired"
-DESTROYED = "destroyed"
+PENDING = "pending" # Data cloning in progress
+READY = "ready" # All data loaded, safe to use
+FAILED = "failed" # One or more services failed completely
+PARTIAL = "partial" # Some services failed, others succeeded
+ACTIVE = "active" # User is actively using the session (deprecated, use READY)
+EXPIRED = "expired" # Session TTL exceeded
+DESTROYED = "destroyed" # Session terminated
+class CloningStatus(enum.Enum):
+"""Individual service cloning status"""
+NOT_STARTED = "not_started"
+IN_PROGRESS = "in_progress"
+COMPLETED = "completed"
+FAILED = "failed"
class DemoSession(Base):
@@ -37,16 +49,24 @@ class DemoSession(Base):
demo_account_type = Column(String(50), nullable=False) # 'individual_bakery', 'central_baker'
# Session lifecycle
-status = Column(SQLEnum(DemoSessionStatus, values_callable=lambda obj: [e.value for e in obj]), default=DemoSessionStatus.ACTIVE, index=True)
+status = Column(SQLEnum(DemoSessionStatus, values_callable=lambda obj: [e.value for e in obj]), default=DemoSessionStatus.PENDING, index=True)
created_at = Column(DateTime(timezone=True), default=lambda: datetime.now(timezone.utc), index=True)
expires_at = Column(DateTime(timezone=True), nullable=False, index=True)
last_activity_at = Column(DateTime(timezone=True), default=lambda: datetime.now(timezone.utc))
destroyed_at = Column(DateTime(timezone=True), nullable=True)
+# Cloning progress tracking
+cloning_started_at = Column(DateTime(timezone=True), nullable=True)
+cloning_completed_at = Column(DateTime(timezone=True), nullable=True)
+total_records_cloned = Column(Integer, default=0)
+# Per-service cloning status
+cloning_progress = Column(JSONB, default=dict) # {service_name: {status, records, started_at, completed_at, error}}
# Session metrics
request_count = Column(Integer, default=0)
-data_cloned = Column(Boolean, default=False)
-redis_populated = Column(Boolean, default=False)
+data_cloned = Column(Boolean, default=False) # Deprecated: use status instead
+redis_populated = Column(Boolean, default=False) # Deprecated: use status instead
# Session metadata
session_metadata = Column(JSONB, default=dict)
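
The `cloning_progress` JSONB column holds one entry per service. A small helper along these lines could keep it and `total_records_cloned` in sync (a sketch only; the actual `DemoSessionManager` methods are not part of this diff):

```python
from datetime import datetime, timezone
from typing import Optional


def record_service_progress(session, service_name: str, status: str,
                            records: int = 0, error: Optional[str] = None) -> None:
    """Merge one service's cloning result into DemoSession.cloning_progress (illustrative)."""
    progress = dict(session.cloning_progress or {})
    progress[service_name] = {
        "status": status,  # not_started / in_progress / completed / failed
        "records": records,
        "completed_at": datetime.now(timezone.utc).isoformat(),
        "error": error,
    }
    session.cloning_progress = progress  # reassign so SQLAlchemy detects the JSONB change
    session.total_records_cloned = sum(p.get("records", 0) for p in progress.values())
```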

View File

@@ -27,31 +27,55 @@ class DemoCleanupService:
async def cleanup_expired_sessions(self) -> dict:
"""
Find and cleanup all expired sessions
+Also cleans up sessions stuck in PENDING for too long (>5 minutes)
Returns:
Cleanup statistics
"""
+from datetime import timedelta
logger.info("Starting demo session cleanup")
now = datetime.now(timezone.utc)
+stuck_threshold = now - timedelta(minutes=5) # Sessions pending > 5 min are stuck
-# Find expired sessions
+# Find expired sessions (any status except EXPIRED and DESTROYED)
result = await self.db.execute(
select(DemoSession).where(
-DemoSession.status == DemoSessionStatus.ACTIVE,
+DemoSession.status.in_([
+DemoSessionStatus.PENDING,
+DemoSessionStatus.READY,
+DemoSessionStatus.PARTIAL,
+DemoSessionStatus.FAILED,
+DemoSessionStatus.ACTIVE # Legacy status, kept for compatibility
+]),
DemoSession.expires_at < now
)
)
expired_sessions = result.scalars().all()
+# Also find sessions stuck in PENDING
+stuck_result = await self.db.execute(
+select(DemoSession).where(
+DemoSession.status == DemoSessionStatus.PENDING,
+DemoSession.created_at < stuck_threshold
+)
+)
+stuck_sessions = stuck_result.scalars().all()
+# Combine both lists
+all_sessions_to_cleanup = list(expired_sessions) + list(stuck_sessions)
stats = {
"total_expired": len(expired_sessions),
+"total_stuck": len(stuck_sessions),
+"total_to_cleanup": len(all_sessions_to_cleanup),
"cleaned_up": 0,
"failed": 0,
"errors": []
}
-for session in expired_sessions:
+for session in all_sessions_to_cleanup:
try:
# Mark as expired
session.status = DemoSessionStatus.EXPIRED
@@ -128,6 +152,11 @@ class DemoCleanupService:
now = datetime.now(timezone.utc)
+# Count by status
+pending_count = len([s for s in all_sessions if s.status == DemoSessionStatus.PENDING])
+ready_count = len([s for s in all_sessions if s.status == DemoSessionStatus.READY])
+partial_count = len([s for s in all_sessions if s.status == DemoSessionStatus.PARTIAL])
+failed_count = len([s for s in all_sessions if s.status == DemoSessionStatus.FAILED])
active_count = len([s for s in all_sessions if s.status == DemoSessionStatus.ACTIVE])
expired_count = len([s for s in all_sessions if s.status == DemoSessionStatus.EXPIRED])
destroyed_count = len([s for s in all_sessions if s.status == DemoSessionStatus.DESTROYED])
@@ -135,13 +164,25 @@ class DemoCleanupService:
# Find sessions that should be expired but aren't marked yet
should_be_expired = len([
s for s in all_sessions
-if s.status == DemoSessionStatus.ACTIVE and s.expires_at < now
+if s.status in [
+DemoSessionStatus.PENDING,
+DemoSessionStatus.READY,
+DemoSessionStatus.PARTIAL,
+DemoSessionStatus.FAILED,
+DemoSessionStatus.ACTIVE
+] and s.expires_at < now
])
return {
"total_sessions": len(all_sessions),
-"active_sessions": active_count,
-"expired_sessions": expired_count,
-"destroyed_sessions": destroyed_count,
+"by_status": {
+"pending": pending_count,
+"ready": ready_count,
+"partial": partial_count,
+"failed": failed_count,
+"active": active_count, # Legacy
+"expired": expired_count,
+"destroyed": destroyed_count
+},
"pending_cleanup": should_be_expired
}
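
How `DemoCleanupService` gets scheduled is outside this diff; one straightforward option is a periodic asyncio loop like the sketch below (the constructor signature, module path, session-factory wiring and 60-second interval are assumptions):

```python
import asyncio

import structlog

from app.services.cleanup_service import DemoCleanupService  # assumed module path

logger = structlog.get_logger()


async def run_cleanup_loop(db_session_factory, redis, interval_s: int = 60) -> None:
    """Invoke DemoCleanupService.cleanup_expired_sessions() on a fixed interval (illustrative)."""
    while True:
        try:
            async with db_session_factory() as db:
                service = DemoCleanupService(db, redis)  # constructor args assumed
                stats = await service.cleanup_expired_sessions()
                logger.info("Demo cleanup pass finished", **stats)
        except Exception as exc:
            logger.error("Demo cleanup pass failed", error=str(exc))
        await asyncio.sleep(interval_s)
```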

View File

@@ -0,0 +1,330 @@
"""
Demo Data Cloning Orchestrator
Coordinates asynchronous cloning across microservices
"""
import asyncio
import httpx
import structlog
from datetime import datetime, timezone
from typing import Dict, Any, List, Optional
import os
from enum import Enum
from app.models.demo_session import CloningStatus
logger = structlog.get_logger()
class ServiceDefinition:
"""Definition of a service that can clone demo data"""
def __init__(self, name: str, url: str, required: bool = True, timeout: float = 10.0):
self.name = name
self.url = url
self.required = required # If True, failure blocks session creation
self.timeout = timeout
class CloneOrchestrator:
"""Orchestrates parallel demo data cloning across services"""
def __init__(self):
self.internal_api_key = os.getenv("INTERNAL_API_KEY", "dev-internal-key-change-in-production")
# Define services that participate in cloning
# URLs should be internal Kubernetes service names
self.services = [
ServiceDefinition(
name="tenant",
url=os.getenv("TENANT_SERVICE_URL", "http://tenant-service:8000"),
required=True, # Tenant must succeed - critical for session
timeout=5.0
),
ServiceDefinition(
name="inventory",
url=os.getenv("INVENTORY_SERVICE_URL", "http://inventory-service:8000"),
required=False, # Optional - provides ingredients/recipes
timeout=10.0
),
ServiceDefinition(
name="recipes",
url=os.getenv("RECIPES_SERVICE_URL", "http://recipes-service:8000"),
required=False, # Optional - provides recipes and production batches
timeout=15.0
),
ServiceDefinition(
name="suppliers",
url=os.getenv("SUPPLIERS_SERVICE_URL", "http://suppliers-service:8000"),
required=False, # Optional - provides supplier data and purchase orders
timeout=20.0 # Longer - clones many entities
),
ServiceDefinition(
name="sales",
url=os.getenv("SALES_SERVICE_URL", "http://sales-service:8000"),
required=False, # Optional - provides sales history
timeout=10.0
),
ServiceDefinition(
name="orders",
url=os.getenv("ORDERS_SERVICE_URL", "http://orders-service:8000"),
required=False, # Optional - provides customer orders & procurement
timeout=15.0 # Slightly longer - clones more entities
),
ServiceDefinition(
name="production",
url=os.getenv("PRODUCTION_SERVICE_URL", "http://production-service:8000"),
required=False, # Optional - provides production batches and quality checks
timeout=20.0 # Longer - clones many entities
),
ServiceDefinition(
name="forecasting",
url=os.getenv("FORECASTING_SERVICE_URL", "http://forecasting-service:8000"),
required=False, # Optional - provides historical forecasts
timeout=15.0
),
]
async def clone_all_services(
self,
base_tenant_id: str,
virtual_tenant_id: str,
demo_account_type: str,
session_id: str
) -> Dict[str, Any]:
"""
Orchestrate cloning across all services in parallel
Args:
base_tenant_id: Template tenant UUID
virtual_tenant_id: Target virtual tenant UUID
demo_account_type: Type of demo account
session_id: Session ID for tracing
Returns:
Dictionary with overall status and per-service results
"""
logger.info(
"Starting orchestrated cloning",
session_id=session_id,
virtual_tenant_id=virtual_tenant_id,
demo_account_type=demo_account_type,
service_count=len(self.services)
)
start_time = datetime.now(timezone.utc)
# Create tasks for all services
tasks = []
service_map = {}
for service_def in self.services:
task = asyncio.create_task(
self._clone_service(
service_def=service_def,
base_tenant_id=base_tenant_id,
virtual_tenant_id=virtual_tenant_id,
demo_account_type=demo_account_type,
session_id=session_id
)
)
tasks.append(task)
service_map[task] = service_def.name
# Wait for all tasks to complete (with individual timeouts)
results = await asyncio.gather(*tasks, return_exceptions=True)
# Process results
service_results = {}
total_records = 0
failed_services = []
required_service_failed = False
for task, result in zip(tasks, results):
service_name = service_map[task]
service_def = next(s for s in self.services if s.name == service_name)
if isinstance(result, Exception):
logger.error(
"Service cloning failed with exception",
service=service_name,
error=str(result)
)
service_results[service_name] = {
"status": CloningStatus.FAILED.value,
"records_cloned": 0,
"error": str(result),
"duration_ms": 0
}
failed_services.append(service_name)
if service_def.required:
required_service_failed = True
else:
service_results[service_name] = result
if result.get("status") == "completed":
total_records += result.get("records_cloned", 0)
elif result.get("status") == "failed":
failed_services.append(service_name)
if service_def.required:
required_service_failed = True
# Determine overall status
if required_service_failed:
overall_status = "failed"
elif failed_services:
overall_status = "partial"
else:
overall_status = "ready"
duration_ms = int((datetime.now(timezone.utc) - start_time).total_seconds() * 1000)
result = {
"overall_status": overall_status,
"total_records_cloned": total_records,
"duration_ms": duration_ms,
"services": service_results,
"failed_services": failed_services,
"completed_at": datetime.now(timezone.utc).isoformat()
}
logger.info(
"Orchestrated cloning completed",
session_id=session_id,
overall_status=overall_status,
total_records=total_records,
duration_ms=duration_ms,
failed_services=failed_services
)
return result
async def _clone_service(
self,
service_def: ServiceDefinition,
base_tenant_id: str,
virtual_tenant_id: str,
demo_account_type: str,
session_id: str
) -> Dict[str, Any]:
"""
Clone data from a single service
Args:
service_def: Service definition
base_tenant_id: Template tenant UUID
virtual_tenant_id: Target virtual tenant UUID
demo_account_type: Type of demo account
session_id: Session ID for tracing
Returns:
Cloning result for this service
"""
logger.info(
"Cloning service data",
service=service_def.name,
url=service_def.url,
session_id=session_id
)
try:
async with httpx.AsyncClient(timeout=service_def.timeout) as client:
response = await client.post(
f"{service_def.url}/internal/demo/clone",
params={
"base_tenant_id": base_tenant_id,
"virtual_tenant_id": virtual_tenant_id,
"demo_account_type": demo_account_type,
"session_id": session_id
},
headers={
"X-Internal-API-Key": self.internal_api_key
}
)
if response.status_code == 200:
result = response.json()
logger.info(
"Service cloning succeeded",
service=service_def.name,
records=result.get("records_cloned", 0),
duration_ms=result.get("duration_ms", 0)
)
return result
else:
error_msg = f"HTTP {response.status_code}: {response.text}"
logger.error(
"Service cloning failed",
service=service_def.name,
error=error_msg
)
return {
"service": service_def.name,
"status": "failed",
"records_cloned": 0,
"error": error_msg,
"duration_ms": 0
}
except asyncio.TimeoutError:
error_msg = f"Timeout after {service_def.timeout}s"
logger.error(
"Service cloning timeout",
service=service_def.name,
timeout=service_def.timeout
)
return {
"service": service_def.name,
"status": "failed",
"records_cloned": 0,
"error": error_msg,
"duration_ms": int(service_def.timeout * 1000)
}
except Exception as e:
logger.error(
"Service cloning exception",
service=service_def.name,
error=str(e),
exc_info=True
)
return {
"service": service_def.name,
"status": "failed",
"records_cloned": 0,
"error": str(e),
"duration_ms": 0
}
async def health_check_services(self) -> Dict[str, bool]:
"""
Check health of all cloning endpoints
Returns:
Dictionary mapping service names to availability status
"""
tasks = []
service_names = []
for service_def in self.services:
task = asyncio.create_task(self._check_service_health(service_def))
tasks.append(task)
service_names.append(service_def.name)
results = await asyncio.gather(*tasks, return_exceptions=True)
return {
name: (result is True)
for name, result in zip(service_names, results)
}
async def _check_service_health(self, service_def: ServiceDefinition) -> bool:
"""Check if a service's clone endpoint is available"""
try:
async with httpx.AsyncClient(timeout=2.0) as client:
response = await client.get(
f"{service_def.url}/internal/demo/clone/health",
headers={"X-Internal-API-Key": self.internal_api_key}
)
return response.status_code == 200
except Exception:
return False
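
For context, a sketch of how the session manager might drive `CloneOrchestrator` and fold its result back into the `DemoSession` record (module paths and the surrounding manager method are assumptions; only the orchestrator's return shape comes from the code above):

```python
from datetime import datetime, timezone

from app.models.demo_session import DemoSessionStatus
from app.services.clone_orchestrator import CloneOrchestrator  # assumed module path


async def run_orchestrated_cloning(db, session) -> None:
    """Run CloneOrchestrator for one demo session and persist the outcome (illustrative)."""
    orchestrator = CloneOrchestrator()
    session.cloning_started_at = datetime.now(timezone.utc)
    result = await orchestrator.clone_all_services(
        base_tenant_id=str(session.base_demo_tenant_id),
        virtual_tenant_id=str(session.virtual_tenant_id),
        demo_account_type=session.demo_account_type,
        session_id=session.session_id,
    )
    session.cloning_completed_at = datetime.now(timezone.utc)
    session.total_records_cloned = result["total_records_cloned"]
    session.cloning_progress = result["services"]
    # Map the orchestrator's overall_status onto the session lifecycle states
    session.status = {
        "ready": DemoSessionStatus.READY,
        "partial": DemoSessionStatus.PARTIAL,
        "failed": DemoSessionStatus.FAILED,
    }[result["overall_status"]]
    await db.commit()
```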

Some files were not shown because too many files have changed in this diff.