MAP4F20 SRCs BE40020x unavailable resource recovery

This procedure recovers an unavailable CEC enclosure or I/O enclosure to operational.

MAP4F20 Section-1

About this task

Find the system reference code (SRC) in the serviceable event that sent you here in Table 1 and do the action that is listed.

Table 1. Actions for BE40020x SRCs
SRC Definition Go to:
BE400201 CEC found unavailable during an I/O enclosure repair. MAP4F20 Section-2
BE400202 CEC found unavailable during a CEC RIO repair. MAP4F20 Section-3
BE400203 I/O enclosure found unavailable during a CEC enclosure repair. MAP4F20 Section-4
BE400204 I/O enclosure found unavailable during an I/O enclosure power/cooling repair. MAP4F20 Section-5
BE400207 Logical I/O enclosure found unavailable during a CEC enclosure repair. MAP4F20 Section-6

MAP4F20 Section-2

About this task

This procedure addresses a situation in which a failure, in an I/O enclosure, causes a CEC to become unavailable.

Use this procedure after the original I/O enclosure failure is successfully repaired. This procedure makes the CEC available and restores the storage facility to dual CEC operational.

  • You completed replacing one or more of the following I/O enclosure FRUs:
    • I/O enclosure PCIe/SPCN card
    • I/O enclosure I/O backplane assembly
    • I/O enclosure PCIe cable
  • The serviceable event that you repaired was automatically closed.
  • A new serviceable event was created with "SRC BE400201 = CEC found unavailable during an I/O enclosure repair." A CEC enclosure is unavailable, quiesced, or powered off.

Procedure

  1. Are there any open serviceable events with SRCs BE1E2167, BE1E2543, or BE1E2551?
    • Yes, do not repair these serviceable events; close them. These are expected when an I/O enclosure has serviceable events and a CEC enclosure is unavailable. Go to the next step.
    • No, go to the next step.
  2. Are there any other open serviceable events with CEC enclosure FRUs?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  3. The CEC enclosure must be reset by using the CEC enclosure processor module exchange procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not replace the FRU.
    1. Read all these sub-steps before going to sub-step b, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the CEC enclosure processor module from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.

MAP4F20 Section-3

About this task

This procedure addresses a situation in which a failure in a CEC enclosure-to-CEC enclosure RIO interface causes a CEC to become unavailable.

Use this procedure after the original CEC enclosure RIO interface failure is successfully repaired. This procedure makes the CEC available and restores the storage facility to dual CEC to operational.

  • You completed replacing a CEC enclosure RIO card FRU.
  • The serviceable event that you repaired was automatically closed.
  • A new serviceable event was created with "SRC BE400202 = CEC found unavailable during a CEC RIO repair." A CEC enclosure is unavailable or in service mode.

Do the following actions:

Procedure

  1. Are there any other open serviceable events, that you did not attempt to repair, with CEC enclosure FRUs?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  2. The CEC enclosure must be reset by using the CEC enclosure processor module exchange procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not replace the FRU.
    1. Read all these sub-steps before going to sub-step b, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the CEC enclosure processor module from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.

MAP4F20 Section-4

About this task

This procedure addresses situations in which a failure in a CEC enclosure causes an I/O enclosure to become unavailable.

Situation 1:

Use this procedure after the original CEC enclosure failure is successfully repaired. This procedure recovers the I/O enclosure to operational.
  • You completed replacing a CEC enclosure FRU.
  • The serviceable event for the CEC enclosure was automatically closed.
  • A new serviceable event was created with "SRC BE400203 = I/O enclosure found unavailable during a CEC enclosure repair." An I/O enclosure is unavailable, quiesced, or powered off.

Situation 2:

Use this procedure when the original CEC enclosure failure repair failed, for example, in a deactivation phase. This procedure recovers the I/O enclosure to operational.
  • A CEC enclosure FRU repair failed.
  • The serviceable event for the CEC enclosure is not closed.
  • A new serviceable event was created with "SRC BE400203 = I/O enclosure found unavailable during a CEC enclosure repair." An I/O enclosure is unavailable or in service mode.

Do the following actions:

Procedure

  1. Are there any other open serviceable events, that you did not attempt to repair, with these I/O enclosure FRUs: I/O enclosure backplane assembly or I/O enclosure PCIe/SPCN card?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  2. The I/O enclosure must be reset by using the I/O enclosure backplane assembly replace procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not replace the FRU. In this situation, it is not necessary to disconnect and reconnect the cables as part of the pseudo repair.
    1. Read all these sub-steps before going to sub-step b, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the I/O enclosure backplane assembly from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.
  3. If you are in MAP4F20 Section-4 because of situation 2 (at the top of Section-4), for example, a CEC enclosure FRU repair failed, retry the CEC enclosure FRU repair now. Use the CEC enclosure FRU location code from the original serviceable event FRU list and then use MAP1230 Replace a FRU without using a serviceable event to replace this FRU.

MAP4F20 Section-5

About this task

This procedure addresses a situation in which multiple power or cooling failures in an I/O enclosure causes the I/O enclosure to become unavailable.

Use this procedure after the I/O enclosure power or cooling failures are successfully repaired. This procedure recovers the I/O enclosure to operational.

  • You completed replacing one or more of the following I/O enclosure power or cooling FRUs:
    • I/O enclosure PCIe/SPCN card
    • I/O enclosure backplane assembly
    • I/O enclosure fan
    • I/O enclosure power supply
  • The serviceable events for the I/O enclosure power or cooling FRUs were automatically closed.
  • A new serviceable event was created with "SRC BE400204 = I/O enclosure found unavailable during an I/O enclosure power/cooling repair." An I/O enclosure is unavailable, quiesced, or powered off.

Procedure

  1. Are there any other open serviceable events, that you did not attempt to repair, with I/O enclosure FRUs?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  2. The I/O enclosure must be reset by using the I/O enclosure backplane assembly replace procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not replace the FRU. In this situation, it is not necessary to disconnect and reconnect the cables as part of the pseudo repair.
    1. Read all these sub-steps before going to sub-step b, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the I/O enclosure backplane assembly from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.

MAP4F20 Section-6

About this task

This procedure addresses situations in which a failure in a CEC enclosure causes a logical I/O enclosure to become unavailable.

The I/O enclosure (2U) is managed as two logical I/O enclosures, one containing locations C1, C3, C5 and C7; and the other containing locations C2, C4, C6 and C8.

Situation 1:

Use this procedure after the original CEC enclosure failure is successfully repaired. This procedure recovers the logical I/O enclosure to operational.
  • You completed replacing a CEC enclosure FRU.
  • The serviceable event for the CEC enclosure was automatically closed.
  • A new serviceable event was created with "SRC BE400207 = Logical I/O enclosure found unavailable during a CEC enclosure repair." A logical I/O enclosure is unavailable, quiesced, or powered off.

Situation 2:

Use this procedure when the original CEC enclosure failure repair failed, for example, in a deactivation phase. This procedure recovers the logical I/O enclosure to operational.
  • A CEC enclosure FRU repair failed.
  • The serviceable event for the CEC enclosure is not closed.
  • A new serviceable event was created with "SRC BE400207 = Logical I/O enclosure found unavailable during a CEC enclosure repair." A logical I/O enclosure is unavailable or in service mode.

Do the following actions:

Procedure

  1. Are there any other open serviceable events, that you did not attempt to repair, with these I/O enclosure FRUs: I/O enclosure (2U) adapter (PCIe and SAS device) or I/O enclosure (2U) adapter (PCIe)?
    • Yes, exit this MAP and repair the open serviceable events.
    • No, go to the next step.
  2. The logical I/O enclosure must be reset by using the I/O enclosure (2U) adapter (PCIe and SAS device) replace procedure. Return to the repair screen you followed to this point and do a pseudo repair of the FRU. A pseudo repair means that you use the normal FRU replacement procedures, but you do not replace the FRU. In this situation, it is not necessary to disconnect and reconnect the cables as part of the pseudo repair.
    1. Read all these sub-steps before going to sub-step b, which closes this information center window. To view this MAP in a separate information center window, click Help in the upper right corner of the main HMC GUI screen and navigate to the MAP.
    2. Click Close in the current service information window.
    3. One or more HMC repair screens might prompt you for the result of using the service procedure in the MAP. Select Problem not fixed.
    4. Select No when prompted for whether you exchanged any parts.
    5. Select Yes when prompted for whether you isolated the problem.
    6. Select the I/O enclosure (2U) adapter (PCIe and SAS device) from the FRU list. Click Next. If the FRU is not listed, select Show more FRUs. If it is still not listed, you must manually select the FRU by using the procedure MAP1230 Replace a FRU without using a serviceable event.
    7. The HMC begins the FRU exchange process for the selected FRU.
  3. If you are in MAP4F20 Section-6 because of situation 2 (at the top of MAP4F20 Section-6), for example, a CEC enclosure FRU repair failed, retry the CEC enclosure FRU repair now. Use the CEC enclosure FRU location code from the original serviceable event FRU list and then use MAP1230 Replace a FRU without using a serviceable event to replace this FRU.