Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations strongm on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Spontaneous Reboot on RS6000 with AIX 5.2.0.0

Status
Not open for further replies.

majorbroncosfan

Programmer
Feb 23, 2001
121
US
We are currently running AIX 5.2.0.0 on an RS6000. From time to time, the machine spontaneously reboots itself. We are unsure as to why this is happening, but when we ask our support company what is causing it, all we get is that we need an OS patch (no direction, however, as to what that patch is).

Does anybody have any ideas about this?

Thanks in advance.

Jeff
 
What do you mean reboot and reboot? Were you able to reach the console and login or only during the IPL stage? Please give us more info... thanks
 
HI,

Please post the output of the following:

1.errpt|head -20
2.errpt -a|head -60
3.diag ->Diagnostic Routines->Problem Determination->Problem determination

This will analize the errpt for any possibly HW error.
Your problem does sound like a HW issue.

"Long live king Moshiach !"
 
Thank you so much for your help!

The output from #1 is:

IDENTIFIER TIMESTAMP T C RESOURCE_NAME DESCRIPTION
B6048838 0506000004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0506000004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0506000004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0506000004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0506000004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0506000004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0427143204 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0427143204 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0427143204 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0427143204 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0427143104 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0427143004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
B6048838 0427143004 P S SYSPROC SOFTWARE PROGRAM ABNORMALLY TERMINATED
A6DF45AA 0427141104 I O RMCdaemon The daemon is started.
DED8E752 0427140904 T H ent2 ETHERNET DOWN
1104AA28 0427140204 T S SYSPROC SYSTEM RESET INTERRUPT RECEIVED
9DBCFDEE 0427140904 T O errdemon ERROR LOGGING TURNED ON


From #2:
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Thu May 6 00:00:59 CDT
Sequence Number: 237
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
114306
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
0
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_bar_main
ADDITIONAL INFORMATION
IPI_disco 14
IPI_disco 80
IPIdelete 164
PAMS_dele 470
pams_exit B0
exit 88
??
??
??
??
??

And #3 returned to problems/issues found.

On a side note, where can I get documentation for AIX? I have used SCO for so long that many of the diagnostic tools and other tools are foreign to me.

Thanks again!

Jeff
 
Thanks,

But I see we are missing the most interesting part of the errpt - these prior to self-reboot.

Please post now:

errpt|head 40

"Long live king Moshiach !"
 
looks to me like the application is crashing and bringing down the OS with it, if i had to guess.

IBM Certified -- AIX 4.3 Obfuscation
 
The problem is likely to be an initiated process on your application. Try to boot the system to single user mode and observed the state.



 
Have you checked that auto-reboot isn't enabled in system hang detection

smit shpriocfg

--
| Mike Nixon
| Unix Admin
|
----------------------------
 
Thanks for all of the help!

errpt | head -40 gives me the same output

I looked into smit shpriocfg and auto-reboot isn't enabled.

My guess is also on the application. The vendor states that the OS needs to be upgraded to ML 2, but I'm guessing that won't stop this.
 
I still think that we could look deeper into this problem if we had "errpt -a" for 20 errors around the reboout time - 10 before and 10 after.

"Long live king Moshiach !"
 
O.K. Here it is...

---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Thu May 6 00:00:59 CDT
Sequence Number: 237
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
114306
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
0
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_bar_main
ADDITIONAL INFORMATION
IPI_disco 14
IPI_disco 80
IPIdelete 164
PAMS_dele 470
pams_exit B0
exit 88
??
??
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_bar SIG/11 FLDS/IPI_disco VALU/14 FLDS/IPIdelete
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Thu May 6 00:00:58 CDT
Sequence Number: 236
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
119612
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
1
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
html_server_main
ADDITIONAL INFORMATION
??
??
??
Unable to generate symptom string.
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Thu May 6 00:00:55 CDT
Sequence Number: 235
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
94434
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
1
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_utl_main
ADDITIONAL INFORMATION
ttyname_r 18
ttyslot 2C
getlogin_ 180
getlogin 18
??
??
??
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_utl SIG/11 FLDS/ttyname_r VALU/18 FLDS/ttyslot
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Thu May 6 00:00:54 CDT
Sequence Number: 234
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
83666
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
2
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_user_main
ADDITIONAL INFORMATION
_doprnt 28
_doprnt C
??
??
??
??
??
??
??
??
??
Unable to generate symptom string.
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Thu May 6 00:00:53 CDT
Sequence Number: 233
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
125346
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
1
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_res_main
ADDITIONAL INFORMATION
ttyslot 1C
getlogin_ 180
getlogin 18
??
??
??
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_res SIG/11 FLDS/ttyslot VALU/1c FLDS/getlogin_
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Thu May 6 00:00:50 CDT
Sequence Number: 232
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
104380
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
0
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_function_main
ADDITIONAL INFORMATION
IPI_disco 14
IPI_disco 80
IPIdelete 164
PAMS_dele 470
pams_exit B0
exit 88
??
??
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_fun SIG/11 FLDS/IPI_disco VALU/14 FLDS/IPIdelete
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Tue Apr 27 14:32:27 CDT
Sequence Number: 231
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
68220
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
3
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_utl_main
ADDITIONAL INFORMATION
ttyname_r 18
ttyslot 2C
getlogin_ 180
getlogin 18
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_utl SIG/11 FLDS/ttyname_r VALU/18 FLDS/ttyslot
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Tue Apr 27 14:32:26 CDT
Sequence Number: 230
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
54580
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
0
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_utl_main
ADDITIONAL INFORMATION
ttyname_r 18
ttyslot 2C
getlogin_ 180
getlogin 18
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_utl SIG/11 FLDS/ttyname_r VALU/18 FLDS/ttyslot
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Tue Apr 27 14:32:22 CDT
Sequence Number: 229
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
59700
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
1
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_user_main
ADDITIONAL INFORMATION
_doprnt 28
_doprnt C
??
??
??
??
??
Unable to generate symptom string.
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Tue Apr 27 14:32:02 CDT
Sequence Number: 228
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
55364
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
0
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
html_server_main
ADDITIONAL INFORMATION
getlogin_ 18
getlogin 18
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/html_serv SIG/11 FLDS/getlogin_ VALU/18 FLDS/getlogin
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Tue Apr 27 14:31:51 CDT
Sequence Number: 227
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
55350
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
2
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
mpps_server_main
ADDITIONAL INFORMATION
??
??
??
Unable to generate symptom string.
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Tue Apr 27 14:30:56 CDT
Sequence Number: 226
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
55302
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
3
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_res_main
ADDITIONAL INFORMATION
ttyslot 1C
getlogin_ 180
getlogin 18
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_res SIG/11 FLDS/ttyslot VALU/1c FLDS/getlogin_
---------------------------------------------------------------------------
LABEL: CORE_DUMP
IDENTIFIER: B6048838

Date/Time: Tue Apr 27 14:30:55 CDT
Sequence Number: 225
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: PERM
Resource Name: SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
SOFTWARE PROGRAM

User Causes
USER GENERATED SIGNAL

Recommended Actions
CORRECT THEN RETRY

Failure Causes
SOFTWARE PROGRAM

Recommended Actions
RERUN THE APPLICATION PROGRAM
IF PROBLEM PERSISTS THEN DO THE FOLLOWING
CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
11
USER'S PROCESS ID:
66868
FILE SYSTEM SERIAL NUMBER
11
INODE NUMBER
1536
PROCESSOR ID
2
CORE FILE NAME
/SMSCLN/log/core
PROGRAM NAME
tpsrv_res_main
ADDITIONAL INFORMATION
ttyslot 1C
getlogin_ 180
getlogin 18
??
??
??

Symptom Data
REPORTABLE
1
INTERNAL ERROR
0
SYMPTOM CODE
PCSS/SPI2 FLDS/tpsrv_res SIG/11 FLDS/ttyslot VALU/1c FLDS/getlogin_
---------------------------------------------------------------------------
LABEL: RMCD_INFO_0_ST
IDENTIFIER: A6DF45AA

Date/Time: Tue Apr 27 14:11:06 CDT
Sequence Number: 224
Machine Id: 0003748A4C00
Node Id: RMS26
Class: O
Type: INFO
Resource Name: RMCdaemon

Description
The daemon is started.

Probable Causes
The Resource Monitoring and Control daemon has been started.

User Causes
The startsrc -s ctrmc command has been executed or
the rmcctrl -s command has been executed.

Recommended Actions
Confirm that the daemon should be started.

Detail Data
DETECTING MODULE
RSCT,rmcd.c,1.36,199
ERROR ID
6eKora08/fX./l3316Ll4g0...................
REFERENCE CODE

---------------------------------------------------------------------------
LABEL: SYS_RESET
IDENTIFIER: 1104AA28

Date/Time: Tue Apr 27 14:02:54 CDT
Sequence Number: 221
Machine Id: 0003748A4C00
Node Id: RMS26
Class: S
Type: TEMP
Resource Name: SYSPROC

Description
SYSTEM RESET INTERRUPT RECEIVED

Probable Causes
SYSTEM RESET INTERRUPT

Detail Data
KEY MODE SWITCH POSITION AT BOOT TIME
normal
KEY MODE SWITCH POSITION CURRENTLY
normal
---------------------------------------------------------------------------
LABEL: ERRLOG_ON
IDENTIFIER: 9DBCFDEE

Date/Time: Tue Apr 27 14:09:56 CDT
Sequence Number: 220
Machine Id: 0003748A4C00
Node Id: RMS26
Class: O
Type: TEMP
Resource Name: errdemon

Description
ERROR LOGGING TURNED ON

Probable Causes
ERRDEMON STARTED AUTOMATICALLY

User Causes
/USR/LIB/ERRDEMON COMMAND

Recommended Actions
NONE

 
Well, i agree to the rest, sounds like application program to me that pushes your server to reboot. Anyways, i suspect you are currently using an in-house application here, otherwise, is your application running meets the prerequisite from the AIX side? Also, try to patch in the latest RML for 5.2 but dont commit yet, try and observe wether its going to help you..
good luck!
 
Maybe it's this one?

3) AIX 5.2 : High impact/highly pervasive (2004.05.13)

APAR Number: IY56376
Systems with many devices may experience reboot loop where
system starts booting, displays 517 on LED display, and
then reboots.

 
We have had the same trouble : spontanous server shutdown...and nog logging in errpt.
Appearantly a deadmanswitch was invoked, this doesn't put any entry in de errpt.We still don't have a clue what triggered the event, but that was the cause of the reboot

greetz

R.
 
the SYS_RESET is the interesting part. good luck figuring out what is sending the reset. maybe somebody is poking the wrong button when they want to eject a tape?

IBM Certified -- AIX 4.3 Obfuscation
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top