The ungleich monitoring infrastructure 2024 » History » Version 2
Nico Schottelius, 12/17/2023 11:50 AM
h1. The ungleich monitoring infrastructure 2024 (WIP)

h2. Intro

This is a work-in-progress update of [[The_ungleich_monitoring_infrastructure]]. The infrastructure is still based on Prometheus and the blackbox exporter, but now also makes use of Kubernetes-native objects.

h2. Monitoring definition

h3. External primary router/link monitoring

* Objective: find out from an external point of view whether the lines are functioning
* Implementation:
** Collection and alerting with Prometheus on place12
** blackbox exporter on place12
** blackbox exporter on place11
* Targets:
** ipv6/router1.place10/snr
** ipv4/router1.place10/snr
** ipv6/server12X.place10/snr
** ipv4/server12X.place10/snr
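
The target names above appear to follow a @<address family>/<host>.<place>/<check>@ pattern. As a minimal sketch under that assumption (this page does not define the pattern, nor what @snr@ stands for), the following Python splits such a name into its parts, so that the same list could be handed to each blackbox exporter. The @Target@ type and the @parse_target@ helper are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class Target(NamedTuple):
    """One monitoring target, split into its (assumed) components."""
    family: str  # "ipv4" or "ipv6"
    host: str    # e.g. "router1"
    place: str   # e.g. "place10"
    check: str   # trailing component, e.g. "snr" (meaning not defined on this page)


def parse_target(name: str) -> Target:
    """Parse a name such as 'ipv6/router1.place10/snr'.

    Assumes the layout <family>/<host>.<place>/<check>;
    raises ValueError for anything else.
    """
    parts = name.split("/")
    if len(parts) != 3:
        raise ValueError(f"expected 3 '/'-separated parts: {name!r}")
    family, fqdn, check = parts
    if family not in ("ipv4", "ipv6"):
        raise ValueError(f"unknown address family: {family!r}")
    host, sep, place = fqdn.partition(".")
    if not sep or not host or not place:
        raise ValueError(f"expected <host>.<place>: {fqdn!r}")
    return Target(family, host, place, check)


# Target names copied from the list above.
TARGETS = [
    "ipv6/router1.place10/snr",
    "ipv4/router1.place10/snr",
    "ipv6/server12X.place10/snr",
    "ipv4/server12X.place10/snr",
]

parsed = [parse_target(t) for t in TARGETS]
```

Keeping the address family as a separate field makes it straightforward to probe each host over IPv4 and IPv6 independently, which matches the paired entries in the target list.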

h3. External primary router

* Objective: find out whether a router is reachable via any path
* Implementation:
** Collection and alerting with Prometheus on place12
** blackbox exporter on place12
** blackbox exporter on place11
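
"Reachable via any path" can be read as: the router counts as up if at least one probe succeeds, from any blackbox exporter and over either address family. The Python below is a minimal sketch of that reading, not the rule actually deployed. The @ProbeResult@ type and the sample data are hypothetical; in Prometheus the same idea would more likely be expressed as an aggregation over the blackbox exporter's @probe_success@ metric.

```python
from typing import Iterable, NamedTuple


class ProbeResult(NamedTuple):
    exporter: str   # which blackbox exporter probed, e.g. "place11"
    family: str     # "ipv4" or "ipv6"
    success: bool   # True if the probe succeeded


def reachable_via_any_path(results: Iterable[ProbeResult]) -> bool:
    """A router counts as reachable if at least one probe succeeded."""
    return any(r.success for r in results)


# Hypothetical sample: only the IPv6 path seen from place11 works.
sample = [
    ProbeResult("place12", "ipv4", False),
    ProbeResult("place12", "ipv6", False),
    ProbeResult("place11", "ipv4", False),
    ProbeResult("place11", "ipv6", True),
]
```

Note that an empty result set evaluates to unreachable in this sketch, so missing probe data is treated the same as failing probes. Whether that is the desired behaviour is a design choice this page does not settle.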

h3. Test external monitoring

* Objective: find out whether the external monitoring is alive
* Implementation:
** Collection and alerting with Prometheus on place10
* Targets:
** ipv6/emonitor1.place12/prometheus
** ipv6/emonitor1.place12/blackbox
** ipv6/emonitor1.place12/alertmanager
** ipv6/vm1.place11/blackbox
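
Prometheus on place10 needs a scrape address for each external component listed above. As a sketch, the snippet below maps the last part of each target name to the upstream default port of that component (9090 for Prometheus, 9115 for the blackbox exporter, 9093 for Alertmanager). Whether this deployment actually uses the default ports is an assumption, and the host names are left unqualified because this page does not state the domain. @scrape_address@ is a hypothetical helper name.

```python
# Upstream default listen ports; the actual deployment may differ (assumption).
DEFAULT_PORTS = {
    "prometheus": 9090,
    "blackbox": 9115,
    "alertmanager": 9093,
}

# Target names copied from the list above.
TARGETS = [
    "ipv6/emonitor1.place12/prometheus",
    "ipv6/emonitor1.place12/blackbox",
    "ipv6/emonitor1.place12/alertmanager",
    "ipv6/vm1.place11/blackbox",
]


def scrape_address(target: str) -> str:
    """Turn '<family>/<host>.<place>/<component>' into '<host>.<place>:<port>'."""
    _family, host, component = target.split("/")
    return f"{host}:{DEFAULT_PORTS[component]}"


addresses = [scrape_address(t) for t in TARGETS]
```

An unknown component name raises a @KeyError@ here, which makes a typo in the target list fail loudly instead of silently dropping a monitored service.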