MAP43B0 Resolving SFI LPAR Dual Hard-Drive SRCs

SRCs in the range BE1110xx are used to report problems that were detected by the SFI LPAR dual hard-drive process. In some cases they will be logged normally during a repair action and in some cases they will appear in conjunction with other SRCs.

MAP43B0 Section-1

About this task

Note: A single hard drive failure can normally be repaired without the SFI LPAR being Quiesced or shut down. Do not shutdown or reboot the LPAR with the failing hard-drive during this repair unless directed by the maintenance package.

Procedure

  1. Review the list of open serviceable events and display the details for each. Note the SRCs, MAPs and FRUs listed. For further information on displaying serviceable event details go to MAP1210 Displaying and repairing a serviceable event..
  2. Use the table to determine the condition you have, and the action to perform:
    SRC Action
    BE111050 Description: The SFI LPAR hard-disk drives are not mirrored, and the automatic mirroring function has been disabled.

    Action: Go to step 4.

    BE111051 Description: An SFI LPAR hard-drive has failed.

    Action:

    Notes: This SRC may be logged if a hard-drive repair action was started but not completed.
    1. If a repair action against another serviceable event failed or was interrupted, close this serviceable event and retry the original repair.
    2. If there is another open serviceable event that lists a CEC enclosure hard-drive FRU, repair that serviceable event and close this one.
    3. If there are no other serviceable events that relate to hard-drive failures, contact your next level of support.
    Note: Next level should check the dual hard-drive status with lsdev -C | grep hdisk. If either hard drive displays status as defined, it should be replaced. To replace a FRU that is not listed in a serviceable event, use the HMC Exchange Parts menu option. See MAP1215 Replace a FRU.
    BE111052 Description: An SFI LPAR dual hard-drive Mirror operation failed.

    Action: Go to step 4.

    BE111053 Description: An SFI LPAR dual hard-drive Unmirror operation failed.

    Action:
    Notes: This SRC may have been created during a repair action.
    1. Retry the repair action.
    2. If the retry was unsuccessful, then contact your next level of support.
    BE111056 Description: The server partition dual hard disk drives are not installed in correct drive slots.

    Action: Install the replacement drive in the same location as the original drive. Confirm that drives are in the correct locations and retry the repair. If the problem continues, contact your next level of support.

    Additional details: Standard drive locations are D2 and D6 for systems 8408-E8E (storage models 982 and 988) and 8408-44E (storage model 988). Refer to CEC enclosure location codes (Models 982, 988). These systems have dual drive controllers that require standard drive locations (a drive must be attached to each controller).

    BE111057

    Description: The capacity of the disk drive being installed is less than the capacity of the original disk drive.

    Action: Use the repair process to replace the disk drive with one of proper capacity.

    BE111058 Description: An SFI LPAR dual hard-drive Mirroring cleanup operation failed.

    Action: Contact your next level of support.

    BE111059 Description: The partition's file system is full.

    Action: This can occur if a CD or DVD media is discovered in the CEC enclosure DVD drive. Remove the media. If no media is present, contact your next level of support.

    BE11105A Description: An illegal SFI dual hard-drive operation was attempted.

    Action: If all repair actions were being done through the maintenance package, then this is an unexpected condition. Contact your next level of support.

    BE11105B Description: An SFI LPAR dual hard-drive command failed.

    Action: Go to step 3.

    BE11105C Description: The mirrored SFI LPAR dual hard-drives cannot be synchronized.

    Action: Go to step 4.

    BE11105D Description: An SFI LPAR dual hard-drive drive bosboot command failed.

    Action: Go to step 3.

    BE11105F Description: An SFI LPAR dual hard-drive is failing. The other is not mirrored so cannot be used. The AIX and Licensed Internal Code will have to be reloaded. Go to MAP4020 Hard disk drive build process for both boot drives in a storage facility image LPAR.
    BE111060 Description: The SFI LPAR IML'd from the second hard disk drive in the bootlist, instead of the first.

    Action: Go to MAP43C0 SFI LPAR IML from Second Hard Disk Drive.

    BE11106F Description: One of the FHD logical volumes is failing.

    Action: Go to step 5.

  3. Was this problem logged while completing a repair action against one of the SFI LPAR hard-drives?
    • Yes, note the SRC that sent you here and close this problem. Retry the repair. If the repair fails again, contact your next level of support.
    • No, perform the following:
      • If there are other SFI LPAR hard-drive problems to repair, then repair those and close this serviceable event.
      • Contact your next level of support to check the status of the SFI LPAR hard drives and provide an action plan.
  4. Contact your next level of support. They will involve PFE to:
    • Use rsIdentifyDisk to verify that both hard-drives are in a good condition.
    • Use rsMirrorVg to mirror the hard-drives.
    • Use rsQueryVgState to verify that the Mirror operation completes successfully.
    Note: A utility will be provided in a Post-GA code level to enable the CE to check for a failing hard-drive and whether it is OK to initiate mirroring.
  5. Are there any open serviceable events for either disk drive in this CEC?
    • Yes, exit this MAP and repair that serviceable event now, which will automatically fix this problem also. Remember to close the serviceable event that sent you here.
    • No, contact your next level of support. They will involve PFE.
      • There are two CEC disk drives that use mirroring for everything except the FHD logical volumes.
      • The FHD logical volume on each physical disk drive is unique.
      • PFE will use the lsvg -l rootvg to check for the failing FHD LV. (fhd_lv00 is on hdisk0 and fhd_lv01 is on hdisk1).
      • PFE will use the rsCreateFHDLV <hdiskx> command to repair the broken FHD LV.