Hi!
I'm investigating an SP frame and associated nodes that haven't been supported for some years (the previous admin team having been made redundant to cut costs). There was a power outage some months ago and some nodes can't be reached.
Unfortunately my knowledge of SPs is postage-stamp in size.
spmon -G -d displays:
1. Checking server process
Process 14196 has accumulated 36 minutes and 13 seconds.
Check ok
2. Opening connection to server
Connection opened
Check ok
3. Querying frame(s)
1 frame(s)
Check ok
4. Checking frames
       Controller  Slot 17  Switch  Switch    Power supplies
Frame  Responds    Switch   Power   Clocking  A    B    C    D
------------------------------------------------------------
1      yes         yes      on      0         on   on   on   on
5. Checking nodes
--------------------------------- Frame 1 ---------------
Frame  Node    Node         Host/Swch      Key     Env   FrontPanel
Slot   Number  Type  Power  Responds       Switch  Fail  LCD/LED
---------------------------------------------------------
1      1       high  on     no    autojn   normal  no    LCD is blank
5      5       wide  on     no    no       normal  no    LEDblank
9      9       high  on     no    no       normal  no    LCD blank
13     13      wide  on     no    autojn   N/A     no    LCDsblank
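For anyone who wants to check a longer listing the same way, here is a minimal Python sketch that picks out nodes that are powered on but show "Host Responds" = no. It is not a PSSP tool: the function name `unresponsive_nodes` is hypothetical, and the column order (slot, node number, type, power, host responds, switch responds, key switch, env fail, front panel) is an assumption read off the output above.

```python
def unresponsive_nodes(spmon_text):
    """Return node numbers that are powered on but whose host does not respond.

    Assumes each node row has nine whitespace-separated columns in the order:
    slot, node number, type, power, host responds, switch responds,
    key switch, env fail, front panel text (which may contain spaces).
    """
    nodes = []
    for line in spmon_text.splitlines():
        # Split into at most 9 fields so the free-text panel column stays whole.
        fields = line.split(None, 8)
        # Node rows start with a numeric slot and a numeric node number;
        # headers, separators and the frame row fail this check.
        if len(fields) < 9 or not (fields[0].isdigit() and fields[1].isdigit()):
            continue
        power, host_responds = fields[3], fields[4]
        if power == "on" and host_responds == "no":
            nodes.append(int(fields[1]))
    return nodes


# Node rows copied from the spmon output in this post.
SAMPLE = """\
1 1 high on no autojn normal no LCD is blank
5 5 wide on no no normal no LEDblank
9 9 high on no no normal no LCD blank
13 13 wide on no autojn N/A no LCDsblank
"""

if __name__ == "__main__":
    print(unresponsive_nodes(SAMPLE))
```

Run against the four rows above, it reports every node, which matches what I see: power is on everywhere but no host responds.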
I'm concerned about the "Host Responds"=no. I can ping the nodes, but cannot log in. telnet connects but no login prompt is displayed. s1term to a node works, displaying a console login prompt, which works, but the password isn't processed (this might be an unrelated problem with a NIS domain message being constantly displayed, though "root" isn't in the NIS passwd map).
A command like cshutdown produces (in its log file):
"Node spnode1en0 is not running. Unable to rsh to node"
However I can start and stop nodes with spmon and change the key switch setting. SMIT verification routines report that "Information in the SDR indicates that the node is not up"
PSSP version 4.1 is installed. The Control Workstation and nodes have AIX 4.3 (yes!)
Can any SP whizz out there point me in the right direction as to what I should be seeing if all were OK, or can the problem be diagnosed from what I've got so far?
Many thanks
recl