Raynet One On-Premises Installation and Operations Guide

Implementing monitoring for your Raynet One deployment is essential to maintain operational health, detect issues early, and ensure high availability and performance.

To achieve this, we recommend an industry-standard stack based on Prometheus for data collection and Grafana for visualization.

Consider monitoring the following key aspects:

•Container Health: Monitor status and resource consumption (CPU, memory) to identify bottlenecks or crash loops before they impact users.

•Service Availability: Perform regular health checks for critical services (UI, backend API, Keycloak, MinIO, etc.). Ensure endpoints respond correctly via HTTP probes.

•Database Performance: Track query latency, connection pool usage, and error rates to identify scaling needs early.

•Storage Capacity: Monitor disk usage and growth trends for all persistent volumes to prevent data loss or service outages.

•Log Analysis: Implement centralized log collection (e.g., via Loki) to detect errors and anomalies across all services in real time.

•Network Performance: Track throughput and latency for internal and external traffic to ensure optimal data flow through reverse proxies and network overlays.

Effective monitoring reduces downtime and helps you respond proactively rather than reactively.

Proactive alerting:

Define thresholds for these metrics and set up automated alerts to ensure a rapid response to potential incidents.

Monitoring Recommendations

Monitoring Recommendations