Uptime objectives » History » Version 2
Nico Schottelius, 03/01/2019 03:38 PM
h1. Uptime objectives

{{toc}}

h2. Power Supply

* What: Power supply to all systems
* Setup:
** Core systems are connected to UPSes that last between 7 and 30 minutes
** Virtualisation systems are not (yet) fully connected to a UPS
* Expected outages

h2. Internal Network

* What: The connections between servers, routers and switches.
* Setup: All systems are connected twice internally, usually via fiber
* Expected outages
** Single switch outage: no outage, possibly brief packet loss (LACP link failure detection can take a few seconds)
** Double switch outage: full outage, requires manual replacement
* Uptime objectives
** 2019: >= 99.999%
** 2020: >= 99.9995%
** 2021: >= 99.9995%

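To make the percentages above more tangible, the following minimal sketch converts an uptime objective into the downtime it permits per year. The function name @allowed_downtime_seconds@ and the choice of a 365-day year are assumptions made for illustration; the objectives themselves are the ones listed for the internal network.

```python
SECONDS_PER_YEAR = 365 * 24 * 60 * 60  # assumption: non-leap year


def allowed_downtime_seconds(uptime_percent, period_seconds=SECONDS_PER_YEAR):
    """Return the maximum downtime in seconds that an uptime objective permits."""
    if not 0.0 <= uptime_percent <= 100.0:
        raise ValueError("uptime_percent must be between 0 and 100")
    return period_seconds * (100.0 - uptime_percent) / 100.0


# Objectives copied from the Internal Network section.
internal_network_objectives = {2019: 99.999, 2020: 99.9995, 2021: 99.9995}

for year, objective in internal_network_objectives.items():
    budget_minutes = allowed_downtime_seconds(objective) / 60
    print(f"{year}: >= {objective}% allows about {budget_minutes:.1f} minutes of downtime per year")
```

Under these assumptions, 99.999% corresponds to roughly 5.3 minutes per year and 99.9995% to roughly 2.6 minutes per year.
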
h2. L2 external Network

* What: The network between the different locations
* Setup:
** Provided by local (electricity) companies.
** No additional active equipment / same as internal network
* Expected outages
** One outage in 2018, which could be bridged by Wi-Fi
** If an outage happens, it is long (for example, excavation work cutting through the cable)
** But it happens very rarely
** Geo-redundant lines are planned for the mid term
* Uptime objectives
** 2019: >= 99.99%
** 2020: >= 99.995%
** 2021: >= 99.995%

h2. L3 external Network

* What: The external (uplink) networks
* Setup:
** Currently one uplink by EDIG / HIAG
* Expected outages
** Based on 2018
** Uplink provider was partially unresponsive / unwilling to cooperate
** Multiple smaller outages and one bigger outage
** Second and third line providers are being evaluated / phased in
** Plan for 2019: phase in one connection per DC plus a third at the hub
* Uptime objectives
** 2019: >= 99.9%
** 2020: >= 99.99%
** 2021: >= 99.995%

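The three network sections above each carry their own objective. As a rough sketch, and assuming (this is not stated on this page) that a service needs all three layers at once and that their failures are independent, the combined objective is the product of the individual availabilities. The function name @combined_availability_percent@ is hypothetical.

```python
from math import prod

# Objectives in percent, copied from the three network sections.
OBJECTIVES = {
    "internal network": {2019: 99.999, 2020: 99.9995, 2021: 99.9995},
    "L2 external network": {2019: 99.99, 2020: 99.995, 2021: 99.995},
    "L3 external network": {2019: 99.9, 2020: 99.99, 2021: 99.995},
}


def combined_availability_percent(percentages):
    """Availability of components in series, assuming independent failures."""
    return 100.0 * prod(p / 100.0 for p in percentages)


for year in (2019, 2020, 2021):
    combined = combined_availability_percent(
        layer[year] for layer in OBJECTIVES.values()
    )
    print(f"{year}: combined network objective is about {combined:.4f}%")
```

With these assumptions the combined figure is always at or below the weakest layer, which is why the L3 external network dominates the 2019 result (roughly 99.89%).
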
h2. Routers

h2. Servers

* What: Servers host VMs; if a server fails, its VMs need to be restarted on a different server