Project

General

Profile

Uptime objectives » History » Version 1

Nico Schottelius, 03/01/2019 03:38 PM

1 1 Nico Schottelius
h1. Uptime objectives
2
3
4
h2. Power Supply
5
6
* What: Power supply to all systems
7
* Setup:
8
** Core systems are connected to UPS that last between 7-30 minutes
9
** Virtualisation systems are not (yet) fully connected to UPS
10
* Expected outages
11
12
13
14
15
h2. Internal Network
16
17
* What: The connection between servers, routers and switches.
18
* Setup: All systems are connected twice internally, usually via fiber
19
* Expected outages
20
** Single switch outage: no outage, maybe short packet loss (LACP link detection might take some seconds)
21
** Double switch outage: full outage, manual replacement
22
* Uptime objectives
23
** 2019: >= 99.999%
24
** 2020: >= 99.9995%
25
** 2021: >= 99.9995%
26
27
28
29
h2. L2 external Network
30
31
* What: the network between the different locations
32
* Setup:
33
** Provided by local (electricity) companies.
34
** No additional active equipment / same as internal network
35
* Expected outages
36
** 1 in 2018 that could be bridged by Wifi
37
** If an outage happens, it's long (digging through the cable)
38
** But it happens very rarely
39
** Mid term geo redundant lines planned
40
* Uptime objectives
41
** 2019: >= 99.99%
42
** 2020: >= 99.995%
43
** 2021: >= 99.995%
44
45
46
h2. L3 external Network
47
48
* What: the external (uplink) networks
49
* Setup
50
** Currently one uplink by EDIG / HIAG
51
* Expected outages
52
** Based on 2018
53
** Partially unresponsive / unwilling to cooperate
54
** Multiple smaller, one bigger outage
55
**  2nd and 3rd line providers are evaluated / phased in
56
** Plan 2019 phase in 1 connection per DC + third at the hub
57
* Uptime objectives
58
** 2019: >= 99.9%
59
** 2020: >= 99.99%
60
** 2021: >= 99.995%
61
62
63
h2. Routers
64
65
66
67
h2. Servers
68
69
* What: Servers host VMs and in case of a defect VMs need to be restarted on a different server