Here is some background into the continuing problem we are experiencing in getting the Backup System to function continuously without errors.
After the Quantum ATL M2500 was installed it worked fine for a period of 2 weeks. The weeks following we began to experience the following;
Failure to mount - none of the slots mounted, other times most slots mounted with 2 or 3 dismounted slots. When trying to mount error message "Failure to mount, door open..." message appears.
We then noticed LUN errors: "SCSCI LUN Bad, ha index x, device x, LUN 0"
Would not dynamically update the device list when moving tapes, had to dismount and mount, often resulting in dismount problems mentioned above.
Success in the mount was intermittent
We contacted Craig Tamlin who advised it was possibly the Arcserve Software
We opened a Service ticket with Computer Associates.
CA asked SCSI card details. We advised from Windows 2000 device Manager that SCSI card was Adaptec 3960D. CA advised SCSI card not supported by Arcserve, they closed ticket.
We checked further, physical model is 39160 which is supported by Arcserve. According to Adaptec 39160 and 3960D are the same card. CA reopened ticket. Updated Adaptec driver, physical model now shows correctly in Device Manager.
We then contacted Simon Tippet as suggested by Craig Tamlin we were asked to change the cables, try different channels on the 2 SCSI cards.
We have changed the cables, tried both channels on a single card. Tried channel a from both the SCSI cards, then channel b, all with similar result. That is LUN errors and mounting issues.
We then applied the latest patches to the SCSI Card and to Arcserve, including SP4.
Made change to IOMGR.ini to attempt stopping the LUN errors. Changed tape busy toleration time to 360 sec, still errors appeared. Set it to 0 to disable error reports, errors still appeared. set to 3600sec. Error occurs 10 to 15 minutes during successful mount process. Also changed checksparetapes time to once a week at 5:45 am instead of once a day at 2:00 pm.
LUN errors still appeared, however the Library did not experience any mount or dismount issues for the next week.
ARCServe device manager reported unreadable media. Tapes are new.
Monday 2/09/02 - "Failure to mount, door is open.." after dismounting all slots and trying to mount. Turning off server and tape library is the only way to get around this problem.
Bar Code serial numbers on some tapes are not reported in the BLO??? way but have generic labeling as 1000001??, we are therefore unable to determine which tapes are which unless counting the slots, ie in ARCServe Friday backup is in slot 53, go to tape library and count slots until 53 is found. This is the only way to determine where a particular tape is.
Tuesday 3/9/02 More unreadable media errors. Changed ABSL (Adapter/Bus/SCSCI/LUN) order since post in forum said this would solve the unreadable media error. It seemed to make things worse. Now experiencing Windows NT SCSI Port errors in Arcserve, as well as "The device \Device\SCSI\ADPU160m2, did not respond within the time out period" errors in Event viewer's system log.
Error appeared on tape library: Error code 26C3: Drive 4 has not responded to multiple times to unload.
Drive statistics - drive fetches bad: 9 on Tape library.
Changed iomgr.ini tape busy toleration time back to 180.
Switched off server and tape library - slots not mounting.
We are desperate to get this working as you can imagine. Edmee de Klerk
Technical Support
Fluor Australia