Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations strongm on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

ssa adapter errors in errpt

Status
Not open for further replies.

billo102

Technical User
Mar 2, 2009
31
GB
Gettingt the following on one of our servers
---------------------------------------------------------------------------
LABEL: SSA_DEVICE_ERROR
IDENTIFIER: FE9E9357

Date/Time: Thu Aug 27 16:00:01 BST
Sequence Number: 8814
Machine Id: 00414D6A4C00
Node Id: clukua24
Class: H
Type: PERM
Resource Name: ssa1
Resource Class: adapter
Resource Type: ssa
Location: P2-I9
VPD:
Part Number.................025L5814
FRU Number..................025L5814
Serial Number...............C8388060
EC Level....................0000F23655
Manufacturer................IBM053
ROS Level and ID............5101
Loadable Microcode Level....04
Device Driver Level.........00
Displayable Message.........SSA-ADAPTER
Device Specific.(Z0)........DRAM=032
Device Specific.(Z1)........CACHE=0
Device Specific.(Z2)........000000062956231e

Description
DISK OPERATION ERROR

Probable Causes
DASD DEVICE

Failure Causes
DISK DRIVE

Recommended Actions
PERFORM PROBLEM DETERMINATION PROCEDURES

Detail Data
ERROR CODE
0441 0100 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838



Just wondering what this means exactly. I have checked inside our adapters and this comes up as an adapter. What could be failing exactly? It seems like our application on that machine is working ok.
 
It would appear that you have a failed disk drive. Phone IBM support and they will tell you how to diagnose the failing component.
 
Yep, disk drive failure - probably.

7133 Models D40 and T40
Serial Disk Systems
Service Guide
ftp://ftp.software.ibm.com/storage/7133/pdfs/7133_ServGde.pdf

From the error Detail Data:
0441 0100 0000 0000 etc.

We get an SRN of 44101

In the book this is 44PAA

44 is a drive (or loop) failure and PAA is the location of the failure in the loop.

P = 1 = Port A2 on the adapter ssa1 which is in pci slot I9 on the I/O planer p2 (location shows P2-I9)

AA = 01 = the second device you come to from that port A2 (0 would be the first device).


The book says:

Disk drive module (90%) (“Disk Drive
Modules and Dummy Disk Drive
Modules” on page 150).
SSA attachment (10%) (using-system
Installation and Service Guide).

So it could be a connection problem but is much more likely to be a disk failure.

The problem description:

Description: An SSA device on the loop is preventing the
completion of the loop configuration.
Action: If the SSA service aids are available, run a link
verification, or physical view, service aid to determine which
device is preventing configuration (see the user or
maintenance information for your using-system SSA
attachment). The device that is preventing configuration is
the device that is beyond the last-configured device on an
open SSA loop.
If the SSA service aids or functions are not available, note
the value of PAA in this SRN, and go to “Physical Location
of a Device” on page 59.

Additional Device Location Information
You can find the device by using the Port (P) and SSA-address (AA) values that are
provided by some service request numbers (SRNs). Examples of these SRNs are
43PAA, 44PAA, and 45PAA.
The Port (P) value can be a number 0 through 3. It indicates the port connector on the
SSA attachment:
0 = Connector A1
1 = Connector A2
2 = Connector B1
3 = Connector B2
The AA value is the decimal SSA-address value. It indicates the position of the device
that you are trying to find (counted around the SSA loop).
Use the PORT value to locate the relevant connector on the SSA attachment, then
follow the SSA cable to the first real device. Include other SSA attachments as real
devices if they are in the same SSA loop. Do not include dummy devices or bypass
cards.

You'll find the SSA service aids in diags:
diag command > task selection > ssa service aids.

The error is reported by the adapter but the SRN indicates a problem out in the SSA loop, SSA errros often report this way and there seems to be no indication of an adapter problem if this is the only ssa error you have.

As sjm2 said, call IBM or your hardware support service provider if you get stuck. Don't just ignore this error, your data could be at risk.

If there was more than one host connected to the disk drawer and the other system was switched off or disconnected you may see this error but you should still make sure you know the cause and that all is OK.

If you keep seeing the error then there is an outstanding problem.

If you only ever saw the error once when you know another system on this disk drawer was being worked on, powered off, etc. then you are probably OK to ignore it because the SSA health check should run every hour from cron.
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top