762 Commits

Author SHA1 Message Date
Urtzi Alfaro
aeff6b1537 Add new infra architecture 13 2026-01-21 23:16:19 +01:00
Urtzi Alfaro
66dfd50fbc Add new infra architecture 12 2026-01-21 16:21:24 +01:00
Urtzi Alfaro
2512de4173 Add new infra architecture 11 2026-01-20 22:05:10 +01:00
Urtzi Alfaro
0217ad83be Fix: align ingress base and overlays - single host per environment 2026-01-20 21:42:05 +01:00
Urtzi Alfaro
17508b1eac Fix: remove mail TLS from main ingress (handled by mailu ingress) 2026-01-20 21:38:54 +01:00
Urtzi Alfaro
1f65b7a48e Fix: set includeSelectors=false to avoid immutable selector conflicts 2026-01-20 21:35:12 +01:00
Urtzi Alfaro
dbf74fc1cb Fix kustomization: remove merge conflicts, fix paths, add gateway resource 2026-01-20 21:33:53 +01:00
Urtzi Alfaro
3b81b5f77e Add new infra architecture 10 2026-01-20 10:39:40 +01:00
Urtzi Alfaro
bc00bab061 Add new infra architecture 9 2026-01-20 07:20:56 +01:00
Urtzi Alfaro
52b8abdc0e Add new infra architecture 8 2026-01-19 22:28:53 +01:00
Urtzi Alfaro
012aca0d6a Add new infra architecture 7 2026-01-19 18:05:39 +01:00
Urtzi Alfaro
7d6845574c Add new infra architecture 6 2026-01-19 16:31:11 +01:00
Urtzi Alfaro
b78399da2c Add new infra architecture 5 2026-01-19 15:15:04 +01:00
Urtzi Alfaro
e96405b828 Add new infra architecture 4 2026-01-19 14:22:07 +01:00
Urtzi Alfaro
9edcc8c231 Add new infra architecture 3 2026-01-19 13:57:50 +01:00
Urtzi Alfaro
8461226a97 Add new infra architecture 2 2026-01-19 12:12:19 +01:00
Urtzi Alfaro
35f164f0cd Add new infra architecture 2026-01-19 11:55:17 +01:00
Urtzi Alfaro
21d35ea92b Add ci/cd and fix multiple pods issues 2026-01-18 09:02:27 +01:00
Urtzi Alfaro
3c4b5c2a06 Add MinIO support and frontend analytics 2026-01-17 22:42:40 +01:00
Urtzi Alfaro
fbc670ddb3 Improve demo tenant and user get 2026-01-17 09:19:42 +01:00
Urtzi Alfaro
4b65817b3e Add subscription feature 10 2026-01-16 23:52:26 +01:00
Urtzi Alfaro
3a7d57ef90 Add subscription feature 9 2026-01-16 20:25:45 +01:00
Urtzi Alfaro
fa7b62bd6c Add subscription feature 8 2026-01-16 16:09:32 +01:00
Urtzi Alfaro
5e01b34cc0 Add subscription feature 7 2026-01-16 15:21:11 +01:00
Urtzi Alfaro
4bafceed0d Add subscription feature 6 2026-01-16 15:19:34 +01:00
Urtzi Alfaro
6b43116efd Add subscription feature 5 2026-01-16 09:55:54 +01:00
Urtzi Alfaro
483a9f64cd Add subscription feature 4 2026-01-15 22:06:36 +01:00
Urtzi Alfaro
b674708a4c Add subscription feature 3 2026-01-15 20:45:49 +01:00
Urtzi Alfaro
a4c3b7da3f Add subscription feature 2 2026-01-14 13:15:48 +01:00
Urtzi Alfaro
6ddf608d37 Add subscription feature 2026-01-13 22:22:38 +01:00
Urtzi Alfaro
b931a5c45e Add improvements 2 2026-01-12 22:15:11 +01:00
Urtzi Alfaro
230bbe6a19 Add improvements 2026-01-12 14:24:14 +01:00
Urtzi Alfaro
6037faaf8c Fix user delete 2026-01-11 21:51:13 +01:00
Urtzi Alfaro
55bb1c6451 Refactor subscription layer 2026-01-11 21:40:04 +01:00
Urtzi Alfaro
54163843ec Fix some issues 2026-01-11 19:38:54 +01:00
Urtzi Alfaro
ce4f3aff8c Add equipment fail feature 2026-01-11 17:03:46 +01:00
Urtzi Alfaro
b66bfda100 Update pilot launch doc 2026-01-11 09:18:17 +01:00
Urtzi Alfaro
5533198cab Improve UI and token 2026-01-11 07:50:34 +01:00
Urtzi Alfaro
bf1db7cb9e New token arch 2026-01-10 21:45:37 +01:00
Urtzi Alfaro
cc53037552 Fix auth service login failure by correcting logging calls 2026-01-10 21:43:31 +01:00
Urtzi Alfaro
b089c216db Improve monitoring 6 2026-01-10 13:43:38 +01:00
Urtzi Alfaro
c05538cafb Improve monitoring 5 2026-01-09 23:14:12 +01:00
Urtzi Alfaro
22dab143ba Improve monitoring 4 2026-01-09 14:48:44 +01:00
Urtzi Alfaro
7ef85c1188 Add comprehensive SigNoz configuration guide and monitoring setup
Documentation includes:

1. OpAMP Root Cause Analysis:
   - Explains OpAMP (Open Agent Management Protocol) functionality
   - Documents how OpAMP was overwriting config with "nop" receivers
   - Provides two solution paths:
     * Option 1: Disable OpAMP (current solution)
     * Option 2: Fix OpAMP server configuration (recommended for prod)
   - References: SigNoz architecture and OTel collector docs

2. Database Receivers Configuration:
   - PostgreSQL: Complete setup for 21 database instances
     * SQL commands to create monitoring users
     * Proper pg_monitor role permissions
     * Environment variable configuration
   - Redis: Configuration with/without TLS
     * Uses existing redis-secrets
     * Optional TLS certificate generation
   - RabbitMQ: Management API setup
     * Uses existing rabbitmq-secrets
     * Port 15672 management interface

3. Automation Script:
   - create-pg-monitoring-users.sh
   - Creates monitoring user in all 21 PostgreSQL databases
   - Generates secure random password
   - Verifies permissions
   - Provides next-step commands
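
The steps listed for the automation script can be sketched as follows. This is a minimal Python illustration, not the contents of create-pg-monitoring-users.sh: the user name `monitoring`, the helper names, and the database names in the usage loop are assumptions made for the example. Only the use of the built-in `pg_monitor` role and a randomly generated password comes from the description above.

```python
import re
import secrets
import string

ALPHABET = string.ascii_letters + string.digits


def generate_password(length: int = 32) -> str:
    """Return a random alphanumeric password drawn from a CSPRNG."""
    return "".join(secrets.choice(ALPHABET) for _ in range(length))


def monitoring_user_sql(user: str, password: str) -> list[str]:
    """Build the SQL statements for one database.

    The statements create the user, grant the built-in pg_monitor role
    (PostgreSQL 10+), and check the resulting role membership.
    """
    # Restrict inputs so they can be embedded in SQL text without escaping.
    if not re.fullmatch(r"[a-z_][a-z0-9_]*", user):
        raise ValueError(f"unsafe user name: {user!r}")
    if not password or any(ch not in ALPHABET for ch in password):
        raise ValueError("password must be non-empty and alphanumeric")
    return [
        f"CREATE USER {user} WITH PASSWORD '{password}';",
        f"GRANT pg_monitor TO {user};",
        f"SELECT pg_has_role('{user}', 'pg_monitor', 'member');",
    ]


if __name__ == "__main__":
    password = generate_password()
    # Hypothetical database names; the real list has 21 entries.
    for database in ["auth_db", "tenant_db"]:
        print(f"-- {database}")
        for statement in monitoring_user_sql("monitoring", password):
            print(statement)
```

Generating the password once and reusing it for every database keeps a single credential to store in the Kubernetes secret; whether the real script does this is not stated in the commit.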

Resources Referenced:
- PostgreSQL: https://signoz.io/docs/integrations/postgresql/
- Redis: https://signoz.io/blog/redis-opentelemetry/
- RabbitMQ: https://signoz.io/blog/opentelemetry-rabbitmq-metrics-monitoring/
- OpAMP: https://signoz.io/docs/operate/configuration/
- OTel Config: https://signoz.io/docs/opentelemetry-collection-agents/opentelemetry-collector/configuration/

Current Infrastructure Discovered:
- 21 PostgreSQL databases (all services have dedicated DBs)
- 1 Redis instance (password in redis-secrets)
- 1 RabbitMQ instance (credentials in rabbitmq-secrets)

Next Implementation Steps:
1. Run create-pg-monitoring-users.sh script
2. Create Kubernetes secrets for monitoring credentials
3. Update signoz-values-dev.yaml with receivers
4. Enable receivers in metrics pipeline
5. Test and verify metric collection

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-09 12:15:58 +01:00
Urtzi Alfaro
1329bae784 Fix SigNoz OTel Collector configuration and disable OpAMP
Root Cause Analysis:
- OTel Collector was starting but OpAMP was overwriting config with "nop" receivers/exporters
- ClickHouse authentication was failing due to missing credentials in DSN strings
- Redis/PostgreSQL/RabbitMQ receivers had missing TLS certs causing startup failures

Changes:
1. Fixed ClickHouse Exporters:
   - Added admin credentials to clickhousetraces datasource
   - Added admin credentials to clickhouselogsexporter dsn
   - Now using: tcp://admin:27ff0399-0d3a-4bd8-919d-17c2181e6fb9@signoz-clickhouse:9000/

2. Disabled Unconfigured Receivers:
   - Commented out PostgreSQL receivers (no monitor users configured)
   - Commented out Redis receiver (TLS certificates not available)
   - Commented out RabbitMQ receiver (credentials not configured)
   - Updated metrics pipeline to use only OTLP receiver

3. OpAMP Disabled:
   - OpAMP was causing collector to use nop exporters/receivers
   - Cannot disable via Helm (extraArgs appends, doesn't replace)
   - Must apply kubectl patch after Helm install:
     kubectl patch deployment signoz-otel-collector --type=json -p='[{"op":"replace","path":"/spec/template/spec/containers/0/args","value":["--config=/conf/otel-collector-config.yaml","--feature-gates=-pkg.translator.prometheus.NormalizeName"]}]'
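
The one-line patch above is easier to review when the JSON is built programmatically. A minimal sketch in Python: `build_args_patch` and `kubectl_patch_command` are hypothetical helper names, while the deployment name, the JSON path, and the two argument values are taken from the command in this commit message.

```python
import json
import shlex

# Arguments the collector container should keep after the patch.
COLLECTOR_ARGS = [
    "--config=/conf/otel-collector-config.yaml",
    "--feature-gates=-pkg.translator.prometheus.NormalizeName",
]


def build_args_patch(args):
    """Return a JSON Patch document replacing the first container's args."""
    return [
        {
            "op": "replace",
            "path": "/spec/template/spec/containers/0/args",
            "value": list(args),
        }
    ]


def kubectl_patch_command(deployment, patch):
    """Return the kubectl invocation as an argument list (no shell quoting needed)."""
    payload = json.dumps(patch, separators=(",", ":"))
    return ["kubectl", "patch", "deployment", deployment, "--type=json", "-p", payload]


if __name__ == "__main__":
    command = kubectl_patch_command("signoz-otel-collector", build_args_patch(COLLECTOR_ARGS))
    # Print a copy-pasteable shell line; pass `command` to subprocess.run to execute it.
    print(shlex.join(command))
```

Because the operation is `replace`, any argument not listed in `COLLECTOR_ARGS` is dropped from the container, which is how the patch removes the arguments that Helm's extraArgs can only append to.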

Results:
- OTel Collector successfully receiving traces (97+ spans)
- Services connecting without UNAVAILABLE errors
- No ClickHouse authentication failures
- All pipelines active (traces, metrics, logs)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-09 11:51:03 +01:00
Urtzi Alfaro
43a3f35bd1 Improve monitoring 3 2026-01-09 11:18:20 +01:00
Urtzi Alfaro
8ca5d9c100 Improve monitoring 2 2026-01-09 07:26:11 +01:00
Urtzi Alfaro
4af860c010 Improve monitoring 2026-01-09 06:57:18 +01:00
Urtzi Alfaro
e8fda39e50 Improve metrics 2026-01-08 20:48:24 +01:00
Urtzi Alfaro
29d19087f1 Update monitoring packages to latest versions
- Updated all OpenTelemetry packages to latest versions:
  - opentelemetry-api: 1.27.0 → 1.39.1
  - opentelemetry-sdk: 1.27.0 → 1.39.1
  - opentelemetry-exporter-otlp-proto-grpc: 1.27.0 → 1.39.1
  - opentelemetry-exporter-otlp-proto-http: 1.27.0 → 1.39.1
  - opentelemetry-instrumentation-fastapi: 0.48b0 → 0.60b1
  - opentelemetry-instrumentation-httpx: 0.48b0 → 0.60b1
  - opentelemetry-instrumentation-redis: 0.48b0 → 0.60b1
  - opentelemetry-instrumentation-sqlalchemy: 0.48b0 → 0.60b1

- Removed prometheus-client==0.23.1 from all services
- Unified all services to use the same monitoring package versions
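
One way to confirm that every service pins the same monitoring package versions is to compare the `name==version` lines across requirements files. The following is a minimal sketch in Python; the function names and the service names in the usage example are hypothetical, and it only understands exact `==` pins.

```python
def parse_pins(text: str) -> dict[str, str]:
    """Extract exact pins (name==version) from requirements-style text."""
    pins = {}
    for raw_line in text.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments
        if "==" not in line:
            continue
        name, version = line.split("==", 1)
        pins[name.strip().lower()] = version.strip()
    return pins


def find_conflicts(services: dict[str, str]) -> dict[str, set[str]]:
    """Return packages that are pinned to more than one version."""
    seen: dict[str, set[str]] = {}
    for text in services.values():
        for name, version in parse_pins(text).items():
            seen.setdefault(name, set()).add(version)
    return {name: versions for name, versions in seen.items() if len(versions) > 1}


if __name__ == "__main__":
    # Hypothetical services; in practice, read each service's requirements file.
    services = {
        "auth": "opentelemetry-api==1.39.1\nopentelemetry-sdk==1.39.1\n",
        "orders": "opentelemetry-api==1.27.0\nopentelemetry-sdk==1.39.1\n",
    }
    for name, versions in sorted(find_conflicts(services).items()):
        print(f"{name}: {', '.join(sorted(versions))}")
```

An empty result from `find_conflicts` means all inspected services agree on every pinned package.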

Generated by Mistral Vibe.
Co-Authored-By: Mistral Vibe <vibe@mistral.ai>
2026-01-08 19:25:52 +01:00