Uptime objectives » History » Version 1
Nico Schottelius, 03/01/2019 03:38 PM
1 | 1 | Nico Schottelius | h1. Uptime objectives |
---|---|---|---|
2 | |||
3 | |||
4 | h2. Power Supply |
||
5 | |||
6 | * What: Power supply to all systems |
||
7 | * Setup: |
||
8 | ** Core systems are connected to UPS that last between 7-30 minutes |
||
9 | ** Virtualisation systems are not (yet) fully connected to UPS |
||
10 | * Expected outages |
||
11 | |||
12 | |||
13 | |||
14 | |||
15 | h2. Internal Network |
||
16 | |||
17 | * What: The connection between servers, routers and switches. |
||
18 | * Setup: All systems are connected twice internally, usually via fiber |
||
19 | * Expected outages |
||
20 | ** Single switch outage: no outage, maybe short packet loss (LACP link detection might take some seconds) |
||
21 | ** Double switch outage: full outage, manual replacement |
||
22 | * Uptime objectives |
||
23 | ** 2019: >= 99.999% |
||
24 | ** 2020: >= 99.9995% |
||
25 | ** 2021: >= 99.9995% |
||
26 | |||
27 | |||
28 | |||
29 | h2. L2 external Network |
||
30 | |||
31 | * What: the network between the different locations |
||
32 | * Setup: |
||
33 | ** Provided by local (electricity) companies. |
||
34 | ** No additional active equipment / same as internal network |
||
35 | * Expected outages |
||
36 | ** 1 in 2018 that could be bridged by Wifi |
||
37 | ** If an outage happens, it's long (digging through the cable) |
||
38 | ** But it happens very rarely |
||
39 | ** Mid term geo redundant lines planned |
||
40 | * Uptime objectives |
||
41 | ** 2019: >= 99.99% |
||
42 | ** 2020: >= 99.995% |
||
43 | ** 2021: >= 99.995% |
||
44 | |||
45 | |||
46 | h2. L3 external Network |
||
47 | |||
48 | * What: the external (uplink) networks |
||
49 | * Setup |
||
50 | ** Currently one uplink by EDIG / HIAG |
||
51 | * Expected outages |
||
52 | ** Based on 2018 |
||
53 | ** Partially unresponsive / unwilling to cooperate |
||
54 | ** Multiple smaller, one bigger outage |
||
55 | ** 2nd and 3rd line providers are evaluated / phased in |
||
56 | ** Plan 2019 phase in 1 connection per DC + third at the hub |
||
57 | * Uptime objectives |
||
58 | ** 2019: >= 99.9% |
||
59 | ** 2020: >= 99.99% |
||
60 | ** 2021: >= 99.995% |
||
61 | |||
62 | |||
63 | h2. Routers |
||
64 | |||
65 | |||
66 | |||
67 | h2. Servers |
||
68 | |||
69 | * What: Servers host VMs and in case of a defect VMs need to be restarted on a different server |