DouggieC
Technical User
- May 14, 2002
- 2
Hi All,
I have a Netra 440 server which has been showing an amber maintenance light for some time now. It first appeared when I upgraded the memory modules, but the new memory is working fine. All other components seem OK and the machine is up and running. I am completely baffled as to what the problem could be. Prtdiag shows no problems:
Last week I ran SunVTS for 8 hours to stress test the machine, hoping it would through up some errors, but all it shows is one bad disk read:
SunVTS Summary Test Report
Latest Test Session Start Time: 10/24/12 16:12:20
Latest Test Session End Time: 10/25/12 00:12:24
Hostname: hmi1.com
Logical Test Status
Disk: PASS
Environment: PASS
Ioports: PASS
Media: NO RESULT
Memory: PASS
Network: PASS
Processor: PASS
Graphics: PASS
Faults Detected
No Faults or Suspect Hardware Detected by FMA
SunVTS Messages
10/24/12 17:31:23 hmi1 SunVTS7.0ps11: VTSID 6005 Disk.diskmediatest.ERROR rdsk/c1t1d0: "I/O (read) request could not be completed successfully on block : 65024987, Error Message : I/O error"
Syslog Messages
Oct 24 09:49:24 hmi1 dtsession[13209]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
Oct 24 11:57:40 hmi1 dtsession[13209]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
Oct 24 12:19:53 hmi1 dtsession[1248]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
Oct 24 12:54:13 hmi1 dtsession[25206]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
End SunVTS Test Report
Could this bad read be responsible for the warning light, and if so should I replace the disk? If not, any other ideas as to what may be wrong? Any help would be gratefully received as I have run otu of ideas.
Thanks in advance
Doug
I have a Netra 440 server which has been showing an amber maintenance light for some time now. It first appeared when I upgraded the memory modules, but the new memory is working fine. All other components seem OK and the machine is up and running. I am completely baffled as to what the problem could be. Prtdiag shows no problems:
5ksh# ./prtdiag -v
System Configuration: Sun Microsystems sun4u Netra 440
System clock frequency: 177 MHZ
Memory size: 16GB
==================================== CPUs ====================================
E$ CPU CPU
CPU Freq Size Implementation Mask Status Location
--- -------- ---------- --------------------- ----- ------ --------
0 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
1 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
2 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
3 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
================================= IO Devices =================================
Bus Freq Slot + Name +
Type MHz Status Path Model
------ ---- ---------- ---------------------------- --------------------
pci 66 PCI5 SUNW,XVR-100 (display) SUNW,375-3290
okay /pci@1c,600000/SUNW,XVR-100@1
pci 66 MB pci108e,abba (network) SUNW,pci-ce
okay /pci@1c,600000/network@2
pci 33 MB isa/su (serial)
okay /pci@1e,600000/isa@7/serial@0,3f8
pci 33 MB isa/su (serial)
okay /pci@1e,600000/isa@7/serial@0,2e8
pci 33 MB isa/rmc-comm-rmc_comm (seria+
okay /pci@1e,600000/isa@7/rmc-comm@0,3e8
pci 33 MB pci10b9,5229 (ide)
okay /pci@1e,600000/ide@d
pci 66 MB pci108e,abba (network) SUNW,pci-ce
okay /pci@1f,700000/network@1
pci 66 MB scsi-pci1000,30 (scsi-2) LSI,1030
okay /pci@1f,700000/scsi@2
pci 66 MB scsi-pci1000,30 (scsi-2) LSI,1030
okay /pci@1f,700000/scsi@2,1
============================ Memory Configuration ============================
Segment Table:
-----------------------------------------------------------------------
Base Address Size Interleave Factor Contains
-----------------------------------------------------------------------
0x0 4GB 16 BankIDs 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
0x1000000000 4GB 16 BankIDs 16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31
0x2000000000 4GB 16 BankIDs 32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47
0x3000000000 4GB 16 BankIDs 48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63
Bank Table:
-----------------------------------------------------------
Physical Location
ID ControllerID GroupID Size Interleave Way
-----------------------------------------------------------
0 0 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
1 0 0 256MB
2 0 1 256MB
3 0 1 256MB
4 0 0 256MB
5 0 0 256MB
6 0 1 256MB
7 0 1 256MB
8 0 1 256MB
9 0 1 256MB
10 0 0 256MB
11 0 0 256MB
12 0 1 256MB
13 0 1 256MB
14 0 0 256MB
15 0 0 256MB
16 1 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
17 1 0 256MB
18 1 1 256MB
19 1 1 256MB
20 1 0 256MB
21 1 0 256MB
22 1 1 256MB
23 1 1 256MB
24 1 1 256MB
25 1 1 256MB
26 1 0 256MB
27 1 0 256MB
28 1 1 256MB
29 1 1 256MB
30 1 0 256MB
31 1 0 256MB
32 2 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
33 2 0 256MB
34 2 1 256MB
35 2 1 256MB
36 2 0 256MB
37 2 0 256MB
38 2 1 256MB
39 2 1 256MB
40 2 1 256MB
41 2 1 256MB
42 2 0 256MB
43 2 0 256MB
44 2 1 256MB
45 2 1 256MB
46 2 0 256MB
47 2 0 256MB
48 3 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
49 3 0 256MB
50 3 1 256MB
51 3 1 256MB
52 3 0 256MB
53 3 0 256MB
54 3 1 256MB
55 3 1 256MB
56 3 1 256MB
57 3 1 256MB
58 3 0 256MB
59 3 0 256MB
60 3 1 256MB
61 3 1 256MB
62 3 0 256MB
63 3 0 256MB
Memory Module Groups:
--------------------------------------------------
ControllerID GroupID Labels Status
--------------------------------------------------
0 0 C0/P0/B0/D0
0 0 C0/P0/B0/D1
0 1 C0/P0/B1/D0
0 1 C0/P0/B1/D1
1 0 C1/P0/B0/D0
1 0 C1/P0/B0/D1
1 1 C1/P0/B1/D0
1 1 C1/P0/B1/D1
2 0 C2/P0/B0/D0
2 0 C2/P0/B0/D1
2 1 C2/P0/B1/D0
2 1 C2/P0/B1/D1
3 0 C3/P0/B0/D0
3 0 C3/P0/B0/D1
3 1 C3/P0/B1/D0
3 1 C3/P0/B1/D1
=============================== usb Devices ===============================
Name Port#
------------ -----
hub 2
=============================== hub#2 Devices ===============================
Name Port#
------------ -----
keyboard 4
mouse 1
============================ Environmental Status ============================
Fan Status:
-------------------------------------------
Location Sensor Status
-------------------------------------------
FT0/F0 TACH okay
FT1/F0 TACH okay
FT2/F0 TACH okay
FT3/F0 F0 okay
PSU0 FF_FAN okay
PSU1 FF_FAN okay
PSU2 FF_FAN okay
PSU3 FF_FAN okay
Temperature sensors:
-----------------------------------------
Location Sensor Status
-----------------------------------------
C0/P0 T_CORE okay
C1/P0 T_CORE okay
C2/P0 T_CORE okay
C3/P0 T_CORE okay
PSU0 FF_OT okay
PSU1 FF_OT okay
PSU2 FF_OT okay
PSU3 FF_OT okay
------------------------------------
Current sensors:
----------------------------------------
Location Sensor Status
----------------------------------------
MB FF_SCSIA okay
MB FF_SCSIB okay
MB FF_POK okay
C0/P0 FF_POK okay
C1/P0 FF_POK okay
C2/P0 FF_POK okay
C3/P0 FF_POK okay
PSU0 FF_OC okay
PSU1 FF_OC okay
PSU2 FF_OC okay
PSU3 FF_OC okay
------------------------------------
Voltage sensors:
-----------------------------------
Location Sensor Status
-----------------------------------
MB V_+1V5 okay
MB V_VCCTM okay
MB V_NET0_1V2D okay
MB V_NET1_1V2D okay
MB V_NET0_1V2A okay
MB V_NET1_1V2A okay
MB V_+3V3 okay
MB V_+3V3STBY okay
MB/BAT V_BAT okay
MB V_SCSI_CORE okay
MB V_+5V okay
MB V_+12V okay
MB V_-12V okay
PSU0 P_PWR okay
PSU0 FF_POK okay
PSU0 FF_UV okay
PSU0 FF_OV okay
PSU1 P_PWR okay
PSU1 FF_POK okay
PSU1 FF_UV okay
PSU1 FF_OV okay
PSU2 P_PWR okay
PSU2 FF_POK okay
PSU2 FF_UV okay
PSU2 FF_OV okay
PSU3 P_PWR okay
PSU3 FF_POK okay
PSU3 FF_UV okay
PSU3 FF_OV okay
-----------------------------------------
Keyswitch:
-----------------------------------------
Location Keyswitch State
-----------------------------------------
SYS SYSCTRL NORMAL
--------------------------------------------------
Led State:
--------------------------------------------------------------
Location Led State Color
--------------------------------------------------------------
SYS ACT on green
SYS SERVICE on amber
SYS LOCATE off white
PSU0 POK on green
PSU0 SERVICE off amber
PSU0 OK2RM off blue
PSU1 POK on green
PSU1 SERVICE off amber
PSU1 OK2RM off blue
HDD0 SERVICE off amber
HDD0 OK2RM off blue
HDD1 SERVICE off amber
HDD1 OK2RM off blue
HDD2 SERVICE off amber
HDD2 OK2RM off blue
HDD3 SERVICE off amber
HDD3 OK2RM off blue
PSU2 POK on green
PSU2 SERVICE off amber
PSU2 OK2RM off blue
PSU3 POK on green
PSU3 SERVICE off amber
PSU3 OK2RM off blue
MB CRITICAL off red
MB MAJOR off red
MB MINOR off amber
MB USER off amber
FT0/F0 ACT on green
FT0/F0 FAULT off amber
FT1/F0 ACT on green
FT1/F0 FAULT off amber
FT2/F0 ACT on green
FT2/F0 FAULT off amber
=========================== FRU Operational Status ===========================
---------------------------------
Fru Operational Status:
---------------------------------
Location Status
---------------------------------
SC okay
HDD0 present
HDD1 present
HDD2 present
HDD3 present
PSU0 okay
PSU1 okay
PSU2 okay
PSU3 okay
================================ HW Revisions ================================
ASIC Revisions:
-------------------------------------------------------------------
Path Device Status Revision
-------------------------------------------------------------------
/pci@1c,600000 pci108e,a801 okay 4
/pci@1d,700000 pci108e,a801 okay 4
/pci@1e,600000 pci108e,a801 okay 4
/pci@1f,700000 pci108e,a801 okay 4
System PROM revisions:
----------------------
OBP 4.30.4.a 2010/01/06 14:45 Sun Fire V440,Netra 440
OBDIAG 4.30.4 2010/01/06 15:01
System Configuration: Sun Microsystems sun4u Netra 440
System clock frequency: 177 MHZ
Memory size: 16GB
==================================== CPUs ====================================
E$ CPU CPU
CPU Freq Size Implementation Mask Status Location
--- -------- ---------- --------------------- ----- ------ --------
0 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
1 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
2 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
3 1593 MHz 1MB SUNW,UltraSPARC-IIIi 3.4 on-line -
================================= IO Devices =================================
Bus Freq Slot + Name +
Type MHz Status Path Model
------ ---- ---------- ---------------------------- --------------------
pci 66 PCI5 SUNW,XVR-100 (display) SUNW,375-3290
okay /pci@1c,600000/SUNW,XVR-100@1
pci 66 MB pci108e,abba (network) SUNW,pci-ce
okay /pci@1c,600000/network@2
pci 33 MB isa/su (serial)
okay /pci@1e,600000/isa@7/serial@0,3f8
pci 33 MB isa/su (serial)
okay /pci@1e,600000/isa@7/serial@0,2e8
pci 33 MB isa/rmc-comm-rmc_comm (seria+
okay /pci@1e,600000/isa@7/rmc-comm@0,3e8
pci 33 MB pci10b9,5229 (ide)
okay /pci@1e,600000/ide@d
pci 66 MB pci108e,abba (network) SUNW,pci-ce
okay /pci@1f,700000/network@1
pci 66 MB scsi-pci1000,30 (scsi-2) LSI,1030
okay /pci@1f,700000/scsi@2
pci 66 MB scsi-pci1000,30 (scsi-2) LSI,1030
okay /pci@1f,700000/scsi@2,1
============================ Memory Configuration ============================
Segment Table:
-----------------------------------------------------------------------
Base Address Size Interleave Factor Contains
-----------------------------------------------------------------------
0x0 4GB 16 BankIDs 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
0x1000000000 4GB 16 BankIDs 16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31
0x2000000000 4GB 16 BankIDs 32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47
0x3000000000 4GB 16 BankIDs 48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63
Bank Table:
-----------------------------------------------------------
Physical Location
ID ControllerID GroupID Size Interleave Way
-----------------------------------------------------------
0 0 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
1 0 0 256MB
2 0 1 256MB
3 0 1 256MB
4 0 0 256MB
5 0 0 256MB
6 0 1 256MB
7 0 1 256MB
8 0 1 256MB
9 0 1 256MB
10 0 0 256MB
11 0 0 256MB
12 0 1 256MB
13 0 1 256MB
14 0 0 256MB
15 0 0 256MB
16 1 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
17 1 0 256MB
18 1 1 256MB
19 1 1 256MB
20 1 0 256MB
21 1 0 256MB
22 1 1 256MB
23 1 1 256MB
24 1 1 256MB
25 1 1 256MB
26 1 0 256MB
27 1 0 256MB
28 1 1 256MB
29 1 1 256MB
30 1 0 256MB
31 1 0 256MB
32 2 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
33 2 0 256MB
34 2 1 256MB
35 2 1 256MB
36 2 0 256MB
37 2 0 256MB
38 2 1 256MB
39 2 1 256MB
40 2 1 256MB
41 2 1 256MB
42 2 0 256MB
43 2 0 256MB
44 2 1 256MB
45 2 1 256MB
46 2 0 256MB
47 2 0 256MB
48 3 0 256MB 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
49 3 0 256MB
50 3 1 256MB
51 3 1 256MB
52 3 0 256MB
53 3 0 256MB
54 3 1 256MB
55 3 1 256MB
56 3 1 256MB
57 3 1 256MB
58 3 0 256MB
59 3 0 256MB
60 3 1 256MB
61 3 1 256MB
62 3 0 256MB
63 3 0 256MB
Memory Module Groups:
--------------------------------------------------
ControllerID GroupID Labels Status
--------------------------------------------------
0 0 C0/P0/B0/D0
0 0 C0/P0/B0/D1
0 1 C0/P0/B1/D0
0 1 C0/P0/B1/D1
1 0 C1/P0/B0/D0
1 0 C1/P0/B0/D1
1 1 C1/P0/B1/D0
1 1 C1/P0/B1/D1
2 0 C2/P0/B0/D0
2 0 C2/P0/B0/D1
2 1 C2/P0/B1/D0
2 1 C2/P0/B1/D1
3 0 C3/P0/B0/D0
3 0 C3/P0/B0/D1
3 1 C3/P0/B1/D0
3 1 C3/P0/B1/D1
=============================== usb Devices ===============================
Name Port#
------------ -----
hub 2
=============================== hub#2 Devices ===============================
Name Port#
------------ -----
keyboard 4
mouse 1
============================ Environmental Status ============================
Fan Status:
-------------------------------------------
Location Sensor Status
-------------------------------------------
FT0/F0 TACH okay
FT1/F0 TACH okay
FT2/F0 TACH okay
FT3/F0 F0 okay
PSU0 FF_FAN okay
PSU1 FF_FAN okay
PSU2 FF_FAN okay
PSU3 FF_FAN okay
Temperature sensors:
-----------------------------------------
Location Sensor Status
-----------------------------------------
C0/P0 T_CORE okay
C1/P0 T_CORE okay
C2/P0 T_CORE okay
C3/P0 T_CORE okay
PSU0 FF_OT okay
PSU1 FF_OT okay
PSU2 FF_OT okay
PSU3 FF_OT okay
------------------------------------
Current sensors:
----------------------------------------
Location Sensor Status
----------------------------------------
MB FF_SCSIA okay
MB FF_SCSIB okay
MB FF_POK okay
C0/P0 FF_POK okay
C1/P0 FF_POK okay
C2/P0 FF_POK okay
C3/P0 FF_POK okay
PSU0 FF_OC okay
PSU1 FF_OC okay
PSU2 FF_OC okay
PSU3 FF_OC okay
------------------------------------
Voltage sensors:
-----------------------------------
Location Sensor Status
-----------------------------------
MB V_+1V5 okay
MB V_VCCTM okay
MB V_NET0_1V2D okay
MB V_NET1_1V2D okay
MB V_NET0_1V2A okay
MB V_NET1_1V2A okay
MB V_+3V3 okay
MB V_+3V3STBY okay
MB/BAT V_BAT okay
MB V_SCSI_CORE okay
MB V_+5V okay
MB V_+12V okay
MB V_-12V okay
PSU0 P_PWR okay
PSU0 FF_POK okay
PSU0 FF_UV okay
PSU0 FF_OV okay
PSU1 P_PWR okay
PSU1 FF_POK okay
PSU1 FF_UV okay
PSU1 FF_OV okay
PSU2 P_PWR okay
PSU2 FF_POK okay
PSU2 FF_UV okay
PSU2 FF_OV okay
PSU3 P_PWR okay
PSU3 FF_POK okay
PSU3 FF_UV okay
PSU3 FF_OV okay
-----------------------------------------
Keyswitch:
-----------------------------------------
Location Keyswitch State
-----------------------------------------
SYS SYSCTRL NORMAL
--------------------------------------------------
Led State:
--------------------------------------------------------------
Location Led State Color
--------------------------------------------------------------
SYS ACT on green
SYS SERVICE on amber
SYS LOCATE off white
PSU0 POK on green
PSU0 SERVICE off amber
PSU0 OK2RM off blue
PSU1 POK on green
PSU1 SERVICE off amber
PSU1 OK2RM off blue
HDD0 SERVICE off amber
HDD0 OK2RM off blue
HDD1 SERVICE off amber
HDD1 OK2RM off blue
HDD2 SERVICE off amber
HDD2 OK2RM off blue
HDD3 SERVICE off amber
HDD3 OK2RM off blue
PSU2 POK on green
PSU2 SERVICE off amber
PSU2 OK2RM off blue
PSU3 POK on green
PSU3 SERVICE off amber
PSU3 OK2RM off blue
MB CRITICAL off red
MB MAJOR off red
MB MINOR off amber
MB USER off amber
FT0/F0 ACT on green
FT0/F0 FAULT off amber
FT1/F0 ACT on green
FT1/F0 FAULT off amber
FT2/F0 ACT on green
FT2/F0 FAULT off amber
=========================== FRU Operational Status ===========================
---------------------------------
Fru Operational Status:
---------------------------------
Location Status
---------------------------------
SC okay
HDD0 present
HDD1 present
HDD2 present
HDD3 present
PSU0 okay
PSU1 okay
PSU2 okay
PSU3 okay
================================ HW Revisions ================================
ASIC Revisions:
-------------------------------------------------------------------
Path Device Status Revision
-------------------------------------------------------------------
/pci@1c,600000 pci108e,a801 okay 4
/pci@1d,700000 pci108e,a801 okay 4
/pci@1e,600000 pci108e,a801 okay 4
/pci@1f,700000 pci108e,a801 okay 4
System PROM revisions:
----------------------
OBP 4.30.4.a 2010/01/06 14:45 Sun Fire V440,Netra 440
OBDIAG 4.30.4 2010/01/06 15:01
Last week I ran SunVTS for 8 hours to stress test the machine, hoping it would through up some errors, but all it shows is one bad disk read:
SunVTS Summary Test Report
Latest Test Session Start Time: 10/24/12 16:12:20
Latest Test Session End Time: 10/25/12 00:12:24
Hostname: hmi1.com
Logical Test Status
Disk: PASS
Environment: PASS
Ioports: PASS
Media: NO RESULT
Memory: PASS
Network: PASS
Processor: PASS
Graphics: PASS
Faults Detected
No Faults or Suspect Hardware Detected by FMA
SunVTS Messages
10/24/12 17:31:23 hmi1 SunVTS7.0ps11: VTSID 6005 Disk.diskmediatest.ERROR rdsk/c1t1d0: "I/O (read) request could not be completed successfully on block : 65024987, Error Message : I/O error"
Syslog Messages
Oct 24 09:49:24 hmi1 dtsession[13209]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
Oct 24 11:57:40 hmi1 dtsession[13209]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
Oct 24 12:19:53 hmi1 dtsession[1248]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
Oct 24 12:54:13 hmi1 dtsession[25206]: [ID 293258 user.error] libsldap: Status: 49 Mesg: openConnection: simple bind failed - Invalid credentials
End SunVTS Test Report
Could this bad read be responsible for the warning light, and if so should I replace the disk? If not, any other ideas as to what may be wrong? Any help would be gratefully received as I have run otu of ideas.
Thanks in advance
Doug