Task #7185
closed
Set up network monitoring system on new off-site VPS
0%
Description
Objective
- We want to have a live update similar to https://status.ungleich.ch/ running on the VPS.
- We want to do this via some open source monitoring.
- We want to check ping, http and maybe https reachability
- The information should be public
Hints
- Check on one of our VMs whether OpenNMS allows creating a public interface.
- Check out the list of systems at https://www.pcwdld.com/linux-network-monitor-software-and-tools#OpenNMS
- Another interesting project that we already have running is smokeping.
- Easiest way: reproduce what we already have:
  - set up prometheus
  - set up grafana
  - set up blackbox-exporter
  - create a public dashboard
-> Check out the tools and discuss with me in the chat which one to use.
Files
Updated by Nico Schottelius about 5 years ago
- Related to Task #7184: Create a mailing list for maintenance notifications added
Updated by Ahmed Bilal about 5 years ago
- Assignee changed from Ahmed Bilal to Nico Schottelius
I have set up prometheus + grafana + blackbox exporter on debian.sandbox.ahmedbilal.ungleich.cloud
You can see the Grafana at http://debian.sandbox.ahmedbilal.ungleich.cloud:3000
- username: admin
- password: admin
You can see the Prometheus at http://debian.sandbox.ahmedbilal.ungleich.cloud:9090
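For anyone checking the results without the Grafana dashboard, here is a minimal sketch of reading probe results from the Prometheus HTTP API. It assumes the query `probe_success` (the metric the blackbox exporter reports per probe) was sent to `/api/v1/query`; the sample response, host names and the helper name `failed_targets` are made up for illustration.

```python
import json

# Hypothetical response body in the shape returned by the Prometheus HTTP API
# for /api/v1/query?query=probe_success; targets and values are invented.
SAMPLE = json.dumps({
    "status": "success",
    "data": {"resultType": "vector", "result": [
        {"metric": {"instance": "router1.example.org"},
         "value": [1570814496.8, "1"]},
        {"metric": {"instance": "https://www.example.org"},
         "value": [1570814496.8, "0"]},
    ]},
})


def failed_targets(body: str) -> list:
    """Return the instance labels whose probe_success sample is not 1."""
    payload = json.loads(body)
    if payload.get("status") != "success":
        raise ValueError("query did not succeed")
    return [r["metric"].get("instance", "?")
            for r in payload["data"]["result"]
            if float(r["value"][1]) != 1.0]


print(failed_targets(SAMPLE))
```

In a real check the body would come from an HTTP request to the Prometheus instance instead of the inline sample.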
What should I do next, apart from deploying it to the VPS? I have assigned the DNS name post.ungleich.ch to the VPS.
Updated by Nico Schottelius about 5 years ago
You can also add another name to the VPS, like
monitoring.place11.ungleich.ch ;-)
Updated by Ahmed Bilal about 5 years ago
- Assignee changed from Nico Schottelius to Ahmed Bilal
Updated by Nico Schottelius about 5 years ago
Seems not to be solved:
monitoring.place11.ungleich.ch’s server IP address could not be found.
Updated by Nico Schottelius about 5 years ago
Ah, sorry, thought it was assigned to me!
Updated by Ahmed Bilal about 5 years ago
/etc/prometheus/blackbox.yml
modules:
  http_2xx:
    prober: http
    http:
  http_post_2xx:
    prober: http
    http:
      method: POST
  tcp_connect:
    prober: tcp
  pop3s_banner:
    prober: tcp
    tcp:
      query_response:
      - expect: "^+OK"
      tls: true
      tls_config:
        insecure_skip_verify: false
  ssh_banner:
    prober: tcp
    tcp:
      query_response:
      - expect: "^SSH-2.0-"
  irc_banner:
    prober: tcp
    tcp:
      query_response:
      - send: "NICK prober"
      - send: "USER prober prober prober :prober"
      - expect: "PING :([^ ]+)"
        send: "PONG ${1}"
      - expect: "^:[^ ]+ 001"
  icmp:
    prober: icmp
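To illustrate what a tcp `query_response` check with `expect` does, here is a rough sketch using the pattern from the ssh_banner module in blackbox.yml above. The exporter itself evaluates Go regular expressions; this only mimics the idea in Python, and the sample banners and the helper name `banner_matches` are assumptions for illustration.

```python
import re

# Pattern copied from the ssh_banner module in blackbox.yml.
SSH_EXPECT = re.compile(r"^SSH-2.0-")


def banner_matches(banner: str, pattern: re.Pattern = SSH_EXPECT) -> bool:
    """Return True if the line received from the server matches the pattern."""
    return pattern.search(banner) is not None


# Hypothetical banners: an SSH server greeting and an unrelated SMTP greeting.
print(banner_matches("SSH-2.0-OpenSSH_8.4"))
print(banner_matches("220 mail.example.org ESMTP"))
```

The first banner matches and the second does not, which roughly corresponds to the probe succeeding or failing.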
/etc/prometheus/prometheus
global:
  scrape_interval: 10s # can be overridden by setting scrape_interval in a job
  evaluation_interval: 30s # for rules

  # Attach these labels to any time series or alerts when communicating with
  # external systems (federation, remote storage, Alertmanager).
  external_labels:
    # monitor: 'place5-prod'

scrape_configs:
  - job_name: 'routers'
    metrics_path: /probe
    params:
      module: [icmp]
    static_configs:
      - targets:
        - router1.place6.ungleich.ch
        - router2.place6.ungleich.ch
        - router1.place5.ungleich.ch
        - router2.place5.ungleich.ch
        - router3.place5.ungleich.ch
        - router4.place5.ungleich.ch
    relabel_configs:
      - source_labels: [__address__]
        target_label: __param_target
      - source_labels: [__param_target]
        target_label: instance
      - target_label: __address__
        replacement: post.ungleich.ch:9115

  - job_name: 'core-services'
    metrics_path: /probe
    params:
      module: [http_2xx]
    static_configs:
      - targets:
        - https://ungleich.ch
        - https://monitoring.place5.ungleich.ch
        - https://monitoring.place6.ungleich.ch
    relabel_configs:
      - source_labels: [__address__]
        target_label: __param_target
      - source_labels: [__param_target]
        target_label: instance
      - target_label: __address__
        replacement: post.ungleich.ch:9115

rule_files:
  - /etc/prometheus/*.rules

alerting:
  alertmanagers:
    - consul_sd_configs:
      - server: '127.0.0.1:8500'
        services:
          - alertmanager
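The three relabel_configs steps are the least obvious part of this config, so here is a small sketch that mimics them for one target of the 'routers' job. It is a simulation under assumptions, not how Prometheus is implemented: the function names `relabel` and `scrape_url` are hypothetical, and the parameter order in the URL that Prometheus actually requests may differ.

```python
from urllib.parse import urlencode


def relabel(target: str, module: str, exporter: str) -> dict:
    """Mimic the three relabel steps for one static target."""
    # Before relabeling, the target is in __address__ and the configured
    # params are available as __param_<name> labels.
    labels = {"__address__": target, "__param_module": module}
    # Step 1: copy the original target into the ?target= URL parameter.
    labels["__param_target"] = labels["__address__"]
    # Step 2: keep the original target visible as the instance label.
    labels["instance"] = labels["__param_target"]
    # Step 3: point the actual scrape at the blackbox exporter.
    labels["__address__"] = exporter
    return labels


def scrape_url(labels: dict, metrics_path: str = "/probe") -> str:
    """Build the URL that would be requested after relabeling."""
    query = urlencode({"module": labels["__param_module"],
                       "target": labels["__param_target"]})
    return f"http://{labels['__address__']}{metrics_path}?{query}"


labels = relabel("router1.place6.ungleich.ch", "icmp", "post.ungleich.ch:9115")
print(scrape_url(labels))
```

So Prometheus never contacts the router directly: it asks the blackbox exporter on port 9115 to probe it, while the resulting time series still carries the router name as `instance`.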
Updated by Ahmed Bilal about 5 years ago
How to change admin password? https://grafana.com/docs/administration/cli/
Updated by Ahmed Bilal about 5 years ago
- File Ungleich-1570814496841.json added
Updated by Ahmed Bilal about 5 years ago
- File deleted (Ungleich-1570814496841.json)
Updated by Ahmed Bilal about 5 years ago
Monitoring system is accessible at https://monitoring.place11.ungleich.ch/
Updated by Ahmed Bilal about 5 years ago
- Assignee changed from Ahmed Bilal to Nico Schottelius
Updated by Nico Schottelius about 5 years ago
- Status changed from Seen to New
- Assignee changed from Nico Schottelius to ll nu
Well done. Balazs, can you confirm/ensure that you can
- reproduce the setup
- understand how monitoring is done there
- make changes to prometheus and grafana
Additionally, please create a new ticket for creating an email account on the external system and configure alertmanager as follows:
Send alerts to
- sre@ungleich.ch (our standard list; also add Ahmed here, if he isn't in it)
- sre@place11.ungleich.ch -> new IMAP mailbox that we can all add to our phones
Updated by ll nu almost 5 years ago
ABK is added to sre@
imap mailbox creation is pending
Updated by Nico Schottelius over 4 years ago
- Assignee changed from ll nu to Nico Schottelius