Uptime objectives » History » Version 2
Nico Schottelius, 03/01/2019 03:38 PM
h1. Uptime objectives

{{toc}}

h2. Power Supply

* What: Power supply to all systems
* Setup:
** Core systems are connected to UPSes that last between 7 and 30 minutes
** Virtualisation systems are not (yet) fully connected to a UPS
* Expected outages

h2. Internal Network

* What: The connections between servers, routers and switches
* Setup: All systems are connected twice internally, usually via fiber
* Expected outages
** Single switch outage: no outage, possibly brief packet loss (LACP link detection might take a few seconds)
** Double switch outage: full outage, manual replacement required
* Uptime objectives
** 2019: >= 99.999%
** 2020: >= 99.9995%
** 2021: >= 99.9995%

h2. L2 external Network

* What: The network between the different locations
* Setup:
** Provided by local (electricity) companies
** No additional active equipment; same as the internal network
* Expected outages
** One outage in 2018, which could be bridged via Wi-Fi
** If an outage happens, it is long (a cable being dug through)
** But outages happen very rarely
** Geo-redundant lines are planned for the mid term
* Uptime objectives
** 2019: >= 99.99%
** 2020: >= 99.995%
** 2021: >= 99.995%

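Because L2 outages are rare but long, a single incident can decide whether the yearly objective is met. The following sketch checks this for a few outage durations. The durations are hypothetical examples, not measured values, and the helper names are illustrative only.

```python
HOURS_PER_YEAR = 365 * 24  # assumption: non-leap year


def achieved_availability_percent(outage_hours: float) -> float:
    """Availability over one year that contains a total of `outage_hours` of downtime."""
    return 100 * (1 - outage_hours / HOURS_PER_YEAR)


def meets_objective(outage_hours: float, objective_percent: float) -> bool:
    """True if a year with `outage_hours` of downtime still meets the objective."""
    return achieved_availability_percent(outage_hours) >= objective_percent


# Hypothetical outage durations in hours, checked against the 2019 objective.
for outage_hours in (0.5, 4, 24):
    achieved = achieved_availability_percent(outage_hours)
    verdict = "meets" if meets_objective(outage_hours, 99.99) else "misses"
    print(f"{outage_hours} h outage: {achieved:.4f}% ({verdict} 99.99%)")
```

In this model a half-hour outage still fits the 99.99% objective, while a single multi-hour repair does not, which is one way to read the motivation for geo-redundant lines.
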
h2. L3 external Network

* What: The external (uplink) networks
* Setup:
** Currently one uplink, provided by EDIG / HIAG
* Expected outages
** Based on 2018
** Provider was partially unresponsive / unwilling to cooperate
** Multiple smaller outages and one bigger outage
** 2nd and 3rd line providers are being evaluated / phased in
** Plan for 2019: phase in 1 connection per DC plus a third at the hub
* Uptime objectives
** 2019: >= 99.9%
** 2020: >= 99.99%
** 2021: >= 99.995%

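The effect of additional uplinks can be estimated with the usual parallel-availability formula. This is a simplified sketch: it assumes that uplinks fail independently and that failover is instantaneous, which is optimistic in practice. The per-uplink availability of 99.9% is only a placeholder borrowed from the 2019 objective, not a measured value.

```python
def combined_availability_percent(per_uplink_percent: float, uplinks: int) -> float:
    """Availability of `uplinks` parallel links, assuming independent failures."""
    unavailability = 1 - per_uplink_percent / 100
    # The service is down only if every uplink is down at the same time.
    return 100 * (1 - unavailability ** uplinks)


PER_UPLINK_PERCENT = 99.9  # placeholder assumption, not a measured value

for uplinks in (1, 2, 3):
    combined = combined_availability_percent(PER_UPLINK_PERCENT, uplinks)
    print(f"{uplinks} uplink(s): {combined:.7f}%")
```

Real uplinks often share failure causes (same building entry, same upstream carrier), so the figures from this model are best treated as an upper bound.
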
h2. Routers

h2. Servers

* What: Servers host VMs; in case of a defect, the VMs need to be restarted on a different server