Project

General

Profile

Actions

Task #5770

closed

Create a howto: how we maintain our disks and ceph cluster(s)

Added by Nico Schottelius over 3 years ago. Updated about 3 years ago.

Status:
Rejected
Priority:
Normal
Assignee:
Target version:
-
Start date:
10/05/2018
Due date:
10/08/2018
% Done:

0%

Estimated time:
PM Check date:

Description

  • Detailed explanation
  • Including commands of megacli
  • Including motivation

Disk handling

Our servers are using Perc H700 or H800 disk controllers. The tool megacli can be used to manage them.

Handling foreign configuration

When plugging a disk into a server that was configured / used in a different server before, we need to clear the previous configuration.

Listing all known foreign configuration using megacli -CfgForeign -Scan -aALL.

Remove/clear the foreign configuration using

# for instance -L5 -a0 or -L2 -a1
megacli -DiscardPreservedCache -L<slot-or-ID> -a<Controller>

Removal of a disk

If a (dead or intact) disk is being removed from the system, the system will HANG on the next reboot, complaining about it missing. For that reason AFTER removing a disk, we need to "clear the cache":


# Get list of caches that need to be cleared
megacli -GetPreservedCacheList -aALL

# Clear cache: -L0 -a1
megacli -DiscardPreservedCache -L<??> -a<controller>

How to move an OSD (Disk, SSD) to another server

  • Use ceph-osd-stop-disable from ungleich tools to stop / disable the OSD
  • Clear the raid controller information
  • Insert disk into new host
  • Clear the foreign configuration (megacli)
  • [add it to the system with megacli]
  • OSD should automatically come up afterwards
    • Check using ceph osd tree
Actions #1

Updated by Nico Schottelius over 3 years ago

  • Description updated (diff)
  • Status changed from New to In Progress
Actions #2

Updated by Nico Schottelius over 3 years ago

  • Description updated (diff)
Actions #3

Updated by Nico Schottelius over 3 years ago

  • Subject changed from Create a howto: how we maintain our ceph cluster(s) to Create a howto: how we maintain our disks and ceph cluster(s)
  • Description updated (diff)
Actions #4

Updated by Nico Schottelius over 3 years ago

  • Description updated (diff)
Actions #5

Updated by Nico Schottelius over 3 years ago

  • Description updated (diff)
Actions #7

Updated by Mirjana Rupar about 3 years ago

Samuel, what is the status of this task now, please?

Actions #8

Updated by Samuel Hailu about 3 years ago

  • This is not a task, this Wiki and it is related to my command-book which belongs to me.
  • But this this wiki has been used by @Roland Zweifel @Jin-Guk Kwon @auer
  • So it is more like open for everyone but not a task
  • it can be set to closed or progress or anything
Actions #9

Updated by Nico Schottelius about 3 years ago

  • Status changed from In Progress to Rejected
Actions

Also available in: Atom PDF