Project

General

Profile

Actions

Task #7191

closed

Setup time servers and monitor time differences

Added by Nico Schottelius over 4 years ago. Updated over 2 years ago.

Status:
Closed
Priority:
Low
Assignee:
Target version:
-
Start date:
09/30/2019
Due date:
% Done:

0%

Estimated time:
PM Check date:

Description

  • Setup ntpd on black* and red*
  • Configure hosts to synchronise clock with the local time servers
  • Raise an alert if time is > 10 second different to the average of the time servers
    • node_time_seconds in promtheus
Actions #1

Updated by Ahmed Bilal over 4 years ago

  • Status changed from New to Seen
Actions #2

Updated by Ahmed Bilal over 4 years ago

  • Assignee changed from Ahmed Bilal to Nico Schottelius

What is red* and black*?

Can I add their time difference monitoring at https://monitoring.place11.ungleich.ch

Actions #3

Updated by Nico Schottelius over 4 years ago

Ah - the time difference should probably better monitored on
monitoring1.place{5,6}, not on monitoring.place11.

monitoring.place11 should be mainly configured as "monitoring us from
outside".

writes:

Actions #4

Updated by Ahmed Bilal over 4 years ago

  • Assignee changed from Nico Schottelius to Ahmed Bilal
Actions #5

Updated by Ahmed Bilal over 4 years ago

Install ntp

apt install ntp
service ntp start

To enable, ntp collector run the following command on targets only if the file is not all commented out.

cat > /etc/default/prometheus-node-exporter << EOF
ARGS="--collector.ntp --collector.ntp.server ::1"
EOF

service prometheus-node-exporter restart

Actions #6

Updated by Ahmed Bilal over 4 years ago

I think that the prometheus query node_ntp_offset would be the one we needed. For more details, please look https://github.com/prometheus/node_exporter/blob/master/docs/TIME.md.

Actions #7

Updated by Ahmed Bilal over 4 years ago

  • Assignee changed from Ahmed Bilal to ll nu

Please verify whether the alerts are correct according to above mentioned criteria https://monitoring.place6.ungleich.ch/d/D4dhTzgik/ceph-cluster?orgId=1&panelId=59&fullscreen&edit&tab=alert

Actions #8

Updated by Ahmed Bilal over 4 years ago

  • Assignee changed from ll nu to Ahmed Bilal
Actions #9

Updated by Nico Schottelius over 2 years ago

  • Status changed from Seen to Closed
Actions

Also available in: Atom PDF