Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations gkittelson on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

stuck bootinfo and platform_dump processes vs stuck system backup

Status
Not open for further replies.

trifo

MIS
May 9, 2002
269
HU
Hi!

I have a host holding one of most iporttant application of us. As I am trying to create a system backup, the process seems to be frozen and no reaction anymore in that terminal.

As I watch output of ps -ef, I see heaps of frozen processes of "bootinfo -t" and "platform_dump -c". And I see one new stuck "lsdev -Cc disk -F physloc -l hdisk2" every time I try to start an mksysb backup.

Tried to run lsdev with the above parameters and found that it runs OK without the "-F physloc" flag. But -F is just a formatting parameter!!!

Well, I am not allowed to reboot the host, also I am not really sure if it would boot normally using its disks.

Can you help me to recover from this situation?

Thankfully,

--Trifo
 
HI,

Please send the following info ,taken right after the last attemp/sticking:

1.errpt |head -20
2.errpt -a|head -60
3.diag -Be -c
4.diag -B -c
5.df -k
6.topas screen shot while the system is stuck (if possible)
thanks

Long live king Moshiach !
 

Hi!

Here is the information you requested. Not so funny. It seems that sendmail process terminated abnormally several times in the last weeks, although the process itself seems running since the last reboot (according to the pf -ef output). Also sendmail is handling requests on port 25.

Code:
c22# errpt | head -20
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
45C7A35B   0907232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0907232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0906232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0906232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0906081305 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0905232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0905232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0904232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0904232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0903232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0903232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0902232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0902232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0902081405 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0901232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0901232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0831232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0831232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
45C7A35B   0830232005 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED

Code:
c22# errpt -a | head -60
---------------------------------------------------------------------------
LABEL:          CORE_DUMP_FAILED
IDENTIFIER:     45C7A35B

Date/Time:       Wed Sep  7 23:20:14 DFT
Sequence Number: 893
Machine Id:      0003534A4C00
Node Id:         c22
Class:           S
Type:            PERM
Resource Name:   SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Probable Causes
INTERNAL SOFTWARE ERROR
SYSTEM RUNNING OUT OF PAGING SPACE

User Causes
USER GENERATED SIGNAL

Failure Causes
CORE DUMP FAILED - SEE A REASON CODE BELOW

        Recommended Actions
        DEFINE ADDITIONAL PAGING SPACE
        RERUN THE APPLICATION PROGRAM
        IF PROBLEM PERSISTS THEN DO THE FOLLOWING
        CONTACT APPROPRIATE SERVICE REPRESENTATIVE

Detail Data
SIGNAL NUMBER
          11
USER'S PROCESS ID:
                884792
REASON CODE
           1
USER ID
           1
PROCESSOR ID
           4
CORE FILE NAME

PROGRAM NAME
sendmail
---------------------------------------------------------------------------
LABEL:          CORE_DUMP_FAILED
IDENTIFIER:     45C7A35B

Date/Time:       Wed Sep  7 23:20:07 DFT
Sequence Number: 892
Machine Id:      0003534A4C00
Node Id:         c22
Class:           S
Type:            PERM
Resource Name:   SYSPROC

Description
SOFTWARE PROGRAM ABNORMALLY TERMINATED

Code:
Topas Monitor for host:    c22                  EVENTS/QUEUES    FILE/TTY
Thu Sep  8 10:57:33 2005   Interval:  2         Cswitch    2301  Readch  3052.7K
                                                Syscall    5030  Writech   10382
Kernel    2.4   |#                           |  Reads       851  Rawin         0
User     16.2   |#####                       |  Writes      107  Ttyout      424
Wait      0.0   |                            |  Forks         0  Igets         0
Idle     81.4   |#######################     |  Execs         0  Namei       103
                                                Runqueue    0.5  Dirblk        0
Network  KBPS   I-Pack  O-Pack   KB-In  KB-Out  Waitqueue   0.0
en1     381.4   1250.5  1194.0   172.2   209.3
lo0      80.3    232.0   231.5    40.2    40.1  PAGING           MEMORY
en4       1.2      6.5     6.0     0.6     0.6  Faults        8  Real,MB    5631
                                                Steals        0  % Comp     39.5
Disk    Busy%     KBPS     TPS KB-Read KB-Writ  PgspIn        0  % Noncomp  28.7
hdisk2    0.0      0.0     0.0     0.0     0.0  PgspOut       0  % Client    3.6
hdisk3    0.0      0.0     0.0     0.0     0.0  PageIn        0
                                                PageOut       0  PAGING SPACE
Name            PID  CPU%  PgSp Owner           Sios          0  Size,MB    8000
java        1163436   5.6 230.8 app                              % Used      0.6
java         393376   1.4  87.9 root            NFS (calls/sec)  % Free     99.3
httpd       2367726   0.5   3.8 app             ServerV2       0
httpd       1503482   0.5   3.5 app             ClientV2       0   Press:
httpd        872540   0.5   3.7 app             ServerV3       0   "h" for help
IBM.CSMA     331984   0.4   4.9 root            ClientV3       0   "q" to quit
httpd       2265300   0.4   3.7 app
httpd       1388792   0.4   3.7 app
httpd       2212068   0.4   3.7 app
httpd       1257600   0.4   3.7 app
httpd       1167370   0.4   3.6 app
httpd       1405026   0.4   3.7 app
httpd       1839118   0.4   3.7 app
httpd       1220792   0.2   3.6 app
java        1032286   0.2  26.3 app
httpd        843842   0.2   3.6 app
httpd       2490588   0.2   3.5 app
java        2457694   0.2  18.7 app
httpd       2343106   0.2   3.8 app
httpd       2248716   0.2   3.7 app
java         614550   0.2  51.4 app
httpd        897134   0.2   3.7 app
httpd       2187488   0.2   3.7 app
httpd       2097230   0.2   3.7 app
httpd       2043996   0.2   3.7 app

Code:
diag -Be -c  gets frozen
diag -B -c also gets frozen

df -k  shows no space problems.

 
Smells like some HW issue to me (freezing diag).
I'd now run an interactive diag and see where exactly I get stuck.
Was the topas taken WHILE the system is running the system backup ?

1.possibly some bad/missing driver on some device.
2."cfgmgr -v" can be interesting,watch out for any sticking/errors while it runs.
3.Possibly some memory chip is bad,normally diag should reveal it.
4.Please place the result of the following command:
lscfg -vp|grep -E "Size|Physical Location.*M"|awk '/Physical/{printf "%-35s",$0;getline}1'|grep Size

Long live king Moshiach !
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top