Server Monitoring — Production Stack Stack Overview | Layer | Tool | Purpose | |-------|------|---------| | Metrics collection | Node Exporter | OS/hardware metrics from and | | Metrics scraping | Prometheus | Pull-based time-series database | | Visualization | Grafana | Dashboards and alerting UI | | Alerting | Alertmanager | Route, deduplicate, silence alerts | | Log shipping | Promtail | Tail logs → push to Loki | | Log aggregation | Loki | Log storage with label-based indexing | | Uptime (external) | UptimeRobot | External HTTP/TCP reachability checks | | Cron monitoring | healthchecks.io…