Prepare to replace controller

You prepare to replace a controller canister by verifying that the replacement controller canister has the correct FRU part number, backing up the configuration, and collecting support data. If the controller is still online, you must take it offline.

This task describes how to prepare to replace a controller canister in a controller shelf with two controllers.

Perform this task only if your storage array has two controllers.

  1. Unpack the new controller canister, and set it on a flat, static-free surface.
    Save the packing materials to use when shipping the failed controller canister.
  2. Locate the MAC address and FRU part number labels on the back of the controller canister.

    MAC address: The MAC address for management port 1 (“P1”). If you used DHCP to obtain the original controller's IP address, you will need this address to connect to the new controller.

    FRU part number: This number must match the replacement part number for the currently installed controller.

  3. From ThinkSystem System Manager, locate the replacement part number for the controller canister you are replacing.
    When a controller has a fault and needs to be replaced, the replacement part number is displayed in the Details area of the Recovery Guru. If you need to find this number manually, follow these steps:
    1. Select Hardware.
    2. Locate the controller shelf, which is marked with the controller icon .
    3. Click the controller icon.
    4. Select the controller, and click Next.
    5. On the Base tab, make a note of the Replacement Part Number for the controller.
  4. Confirm that the replacement part number for the failed controller is the same as the FRU part number for the replacement controller.
    Attention: Possible loss of data access – If the two part numbers are not the same, do not attempt this procedure. In addition, if the failed controller canister includes a host interface card (HIC), you must install that HIC into the new controller canister. The presence of mismatched controllers or HICs will cause the new controller to lock down when you bring it online.
  5. Back up the storage array's configuration database.

    If a problem occurs when you remove a controller, you can use the saved file to restore your configuration.

    1. Open a system terminal and change current directory to the same directory of SMcli executable.
    2. Execute the following SMcli command.
      save storageArray dbmDatabase sourceLocation=onboard contentType=all file="filename";
      In this command, filename is the file path and file name to which you want to save the database. Enclose the file name in backslash following double quotation marks (\"). For example:
       file="C:\Program Files\CLI\logs\" 

      This command does not automatically append a file extension to the saved file. You must specify a file extension when entering the file name.

      Note: Under PowerShell, enclose the file name following two quotation marks(""), for example file=\""\"".
      Note: For more information, see Command Line Interface (CLI) Reference.
  6. Collect support data for your storage array using ThinkSystem System Manager.
    1. Select Support > Support Center > Diagnostics.
    2. Select Collect Support Data.
    3. Click Collect.

    The file is saved in the Downloads folder for your browser with the name support-data.7z.

  7. If the controller is not already offline, take it offline now using either ThinkSystem System Manager or the SMcli tool.
      1. Select Hardware.
      2. If the graphic shows the drives, select Show back of shelf to show the controllers.
      3. Select the controller that you want to place offline.
      4. From the context menu, select Place offline, and confirm that you want to perform the operation.
      1. Open a system terminal and change current directory to the same directory of SMcli executable.
      2. Execute one of the following SMCli commands:
        For controller A: set controller [a] availability=offline;
        For controller B: set controller [b] availability=offline;
        Note: For more information, see Command Line Interface (CLI) Reference.
  8. Wait for System Manager to update the controller's status to offline.
    Attention: Do not begin any other operations until after the status has been updated.
  9. Select Recheck from the Recovery Guru, and confirm that the OK to remove field in the Details area displays Yes, indicating that it is safe to remove this component.

Go to Remove failed controller.