mayurkirti
IS-IT--Management
This is a really innovative project for me. We are a mid-size IT company with around 350 employees. As part of my job as a support engineer, I am working on a business continuity and disaster recovery project.
Here's where I need help: Our IT infrastructure is a combination of physical and virtual Windows servers. We mostly use Windows Server 2003, VMware, Cisco, and XP Pro in our environment. One problem an on-call engineer faces in the event of a disaster is shutting down the critical services successfully before the UPS runs out of power. A graceful shutdown keeps the hardware from powering down abruptly and avoids the damage that can result. I am trying to automate this process, taking into account the interdependencies between devices (clients, servers, networking equipment, virtualization hosts) and designing the correct sequence for powering down the services.
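To make the sequencing problem concrete, here is a minimal sketch of how I picture it, assuming the interdependencies can be written down as a "depends on" map. A topological sort over that map yields an order in which every service goes down before anything it depends on. The service names and the `shutdown_order` function are hypothetical, and `graphlib` needs Python 3.9 or later.

```python
from graphlib import TopologicalSorter


def shutdown_order(depends_on):
    """Return services ordered so each one shuts down before its dependencies.

    depends_on maps a service name to the services it needs in order to run.
    """
    sorter = TopologicalSorter()
    for service, deps in depends_on.items():
        sorter.add(service)
        for dep in deps:
            # For shutdown, the dependent service must come first,
            # so it is registered as a predecessor of its dependency.
            sorter.add(dep, service)
    # static_order() raises graphlib.CycleError on circular dependencies.
    return list(sorter.static_order())


# Hypothetical example: a web server on a database VM on a VMware host,
# which in turn needs the core switch.
example = {
    "web": ["db"],
    "db": ["esx-host"],
    "esx-host": ["core-switch"],
    "core-switch": [],
}
order = shutdown_order(example)
print(order)
```

A circular dependency makes the sort fail instead of producing a wrong order, which seems like the safer behavior for something that runs unattended.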
One challenge is the dynamic environment: servers change every day, and services are added and removed. I also have to factor in business needs so that, if necessary, I can shut down X non-critical services and still keep Y business-critical services running.
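The "shut down X, keep Y" requirement could be sketched along these lines, assuming each service lists what it depends on and the business-critical ones are flagged by hand. Everything a critical service needs has to stay up as well, so the keep set is the critical services plus all of their dependencies, direct and indirect. The function and service names are hypothetical.

```python
def services_to_keep(depends_on, critical):
    """Return the critical services plus everything they depend on."""
    keep = set()
    stack = list(critical)
    while stack:
        service = stack.pop()
        if service in keep:
            continue
        keep.add(service)
        # Follow dependencies so nothing a critical service needs is shed.
        stack.extend(depends_on.get(service, ()))
    return keep


def services_to_shed(depends_on, critical):
    """Return the services that can be shut down, sorted by name."""
    keep = services_to_keep(depends_on, critical)
    everything = set(depends_on)
    for deps in depends_on.values():
        everything.update(deps)
    return sorted(everything - keep)


# Hypothetical inventory: only mail is flagged as business-critical.
inventory = {
    "mail": ["ad"],
    "intranet": ["sql"],
    "sql": ["ad"],
    "ad": [],
    "test-vm": [],
}
shed = services_to_shed(inventory, {"mail"})
print(shed)
```

Since the environment changes daily, the inventory would have to come from something that is kept current rather than a hand-edited list, which is part of what I am asking about.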
I need some ideas and implementation suggestions for this project. If anyone has done this in the past or if this interests you, please drop some notes. I will appreciate it. Let me know if you need any other details.
Thanks,
Mayur