Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations strongm on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Novell 5.1 server crash 2

Status
Not open for further replies.

DaveBrown

Technical User
Jul 31, 2001
81
GB
Morning all and Happy new Year,
I have a dell PE2450 running NW 5.1 SP8 and it has started crashing every couple of days with Abends similar to the following:-

Abend 1 on P00: Server-5.00l: Nonmaskable Interrupt Processor Exception (Error code 000000B0)

Registers:
CS = 0008 DS = 0010 ES = 0010 FS = 0010 GS = 0010 SS = 0010
EAX = 00000001 EBX = BE57EA17 ECX = CC3690E0 EDX = D0005B80
ESI = 00000000 EDI = 00000000 EBP = 00000000 ESP = D076BF90
EIP = FC016C9C FLAGS = 00000046
SERVER.NLM|kQueCountNoCheck:
FC016C9C 8B442404 MOV EAX,[ESP+04]=D0005B80
EIP in SERVER.NLM at code start +00016C9Ch

The violation occurred while processing the following instruction:
SERVER.NLM|kQueCountNoCheck:
FC016C9C 8B442404 MOV EAX,[ESP+04]
FC016CA0 8B4010 MOV EAX,[EAX+10]
FC016CA3 C3 RET
SERVER.NLM|kFirstQLINKNoLock:
FC016CA4 8B442404 MOV EAX,[ESP+04]
FC016CA8 85C0 TEST EAX,EAX
FC016CAA 740C JZ FC016CB8
FC016CAC 81780C02070418 CMP [EAX+0C],18040702
FC016CB3 7517 JNZ FC016CCC
FC016CB5 8B00 MOV EAX,[EAX]
FC016CB7 C3 RET



Running process: Server 12 Process
Created by: NetWare Application
Thread Owned by NLM: SERVER.NLM
Stack pointer: D076BF88
OS Stack limit: D0764040
Scheduling priority: 67371008
Wait state: 50500F0 (Waiting for work)

The error is always a Abend 1 on P00: Server-5.00l: Nonmaskable Interrupt Processor Exception but sometimes the error code is 000000A0

Has anyone any ideas what I should do? - I've reseated the memory chips after the latest crash to see if that helps.
Dell Array manager doesn't indicate errors with the the discs or raid 5 array

Thanks for any help

Dave
 
Any NMI exception is usually related to a hardware failure of some kind..either that or an incompatilbe hadrware-kernel interface.
It's an old,old box and it's a cpu exception. You could look at the cooling subsystem in the box, then replace the first processor (P00) and then possibly replace the ps to see if the issue goes away.

Good luck.
 
I had a similiar issue and had to replace a power supply. Once done, that took care of it. It was actually the fan in the power supply that had failed and caused the problem. Good luck. I know it's no fun.

Iolair MacWalter
Director of IT
 
many thanks - I'll take a llok at some stats tomorrow
 
Hi All,
I've loaded DellOpen Manage System Administrator and this indicates a problem with memory DIMM B
Single bit warning error rate exceeded
Multi bit error encountered
I'll get the memory changed
Thanks again for all your assistance

Dave
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top