ValleyDweller
Technical User
Here is my environment config:
* 1 NetBackup Master Server, 4.5FP6_S1021
* 1 NetBackup Media Server, 4.5FP6_S1021
* 4 NetBackup Clients, 4.5FP6_S1021
* 1 STK L180 Changer, SCSI attached to NB Master.
* 2 IBM Ultrium LTO Gen2 drives, fibre connected to STK 3800 2Gbps SAN Switch.
4 client servers write through the Media Server. Master backs only itself up. Both LTO drives are SSO Shared between the Master & Media Server. The media server has ~700GB of user data on it...file server. Media Server has Emulex LP9002 HBA connecting to SAN Switch. also, this HBA is shared for Disk and Tape access.
I run the backup of this media server multistreamed as 9 jobs, multiplex 4. A max of 4 streams will run to each drive. When I do the backup this way, it always ends up hanging at different points during the backup, and the job cannot be killed using standard methods. I have to use the "Kill" command to get rid of the jobs. In the "All Log entries", I see the media server issuing a "WriteFile Failed" message. If I run the same backup to one drive, it works fine. I have tested this to each drive and they're both fine. When I kill the bpbkar, oprd, bpbrm, and bptm processes on the media server, the jobs die on the master, but the device file having been used for tape access seems tied up and I can't use the drive again until I reboot the server...if I could at least avoid having to reboot the server, that would be a big help. Note that this only happens to the streams running on one particular drive. the streams running on the other will complete just fine.
The Master is Windows 2000 A.S. and the Media Server, as well as the clients, are Windows Server 2K3. Note: I am using VNETD and the NOSHM touch file exists on the Media Server to allow the VNETD restore to function properly.
Can someone help? this has been going on for some time now and is really baffling me.
Thanks,
Pat.
* 1 NetBackup Master Server, 4.5FP6_S1021
* 1 NetBackup Media Server, 4.5FP6_S1021
* 4 NetBackup Clients, 4.5FP6_S1021
* 1 STK L180 Changer, SCSI attached to NB Master.
* 2 IBM Ultrium LTO Gen2 drives, fibre connected to STK 3800 2Gbps SAN Switch.
4 client servers write through the Media Server. Master backs only itself up. Both LTO drives are SSO Shared between the Master & Media Server. The media server has ~700GB of user data on it...file server. Media Server has Emulex LP9002 HBA connecting to SAN Switch. also, this HBA is shared for Disk and Tape access.
I run the backup of this media server multistreamed as 9 jobs, multiplex 4. A max of 4 streams will run to each drive. When I do the backup this way, it always ends up hanging at different points during the backup, and the job cannot be killed using standard methods. I have to use the "Kill" command to get rid of the jobs. In the "All Log entries", I see the media server issuing a "WriteFile Failed" message. If I run the same backup to one drive, it works fine. I have tested this to each drive and they're both fine. When I kill the bpbkar, oprd, bpbrm, and bptm processes on the media server, the jobs die on the master, but the device file having been used for tape access seems tied up and I can't use the drive again until I reboot the server...if I could at least avoid having to reboot the server, that would be a big help. Note that this only happens to the streams running on one particular drive. the streams running on the other will complete just fine.
The Master is Windows 2000 A.S. and the Media Server, as well as the clients, are Windows Server 2K3. Note: I am using VNETD and the NOSHM touch file exists on the Media Server to allow the VNETD restore to function properly.
Can someone help? this has been going on for some time now and is really baffling me.
Thanks,
Pat.