PSD,
I was thinking about the unlikely event where both NICs in one of my machines fail. (Our testing chaps will test this by pulling out both cables!)
I am in the process of writing 2 scripts to handle this, a network_down_complete_post script and a network_up_pre script.
Firstly, the network_down_complete_post script checks whether the network_down event is global (if [ "$3" = "-1" ]; then do nothing).
If it is not a global failure, it checks the local node name against the "nodename" passed in from the network_down_complete event ($3).
If they are the same, it touches a temporary file and then tests a number of times to see whether that file still exists. (The network_up_pre script will remove this file!)
If the file gets removed by the network_up_pre script then the script will exit, otherwise after the timeout period it will run a graceful takeover.
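To make the logic above concrete, here is a rough sketch of how the two scripts might fit together. It is only an illustration under assumptions: the flag file path, retry count, sleep interval, the use of hostname for the local node name, and the do_takeover placeholder are all hypothetical and would need replacing with whatever suits the cluster.

```shell
#!/bin/sh
# Sketch only. Every default below is an assumed value, not taken from the real scripts.
FLAG_FILE="${FLAG_FILE:-/tmp/network_down.flag}"  # hypothetical flag file path
RETRIES="${RETRIES:-12}"                          # hypothetical number of checks
INTERVAL="${INTERVAL:-5}"                         # hypothetical seconds between checks
LOCAL_NODE="${LOCAL_NODE:-$(hostname)}"           # assumption: node name equals hostname

# Placeholder: substitute the command that performs the graceful takeover.
do_takeover() {
    echo "takeover requested"
}

# Body of network_down_complete_post.
# $1 is the node name from the event (the post-event script sees it as $3).
network_down_post() {
    node="$1"

    # Global failure: do nothing.
    if [ "$node" = "-1" ]; then
        return 0
    fi

    # Failure reported for a different node: do nothing.
    if [ "$node" != "$LOCAL_NODE" ]; then
        return 0
    fi

    # Local failure: drop the flag file and wait for network_up_pre to clear it.
    touch "$FLAG_FILE" || return 1
    i=0
    while [ "$i" -lt "$RETRIES" ]; do
        if [ ! -e "$FLAG_FILE" ]; then
            return 0   # network came back, nothing more to do
        fi
        sleep "$INTERVAL"
        i=$((i + 1))
    done

    # One last check after the final sleep.
    if [ ! -e "$FLAG_FILE" ]; then
        return 0
    fi

    # Timed out: tidy up the flag (my choice, so it does not go stale) and take over.
    rm -f "$FLAG_FILE"
    do_takeover
}

# Body of network_up_pre: clearing the flag cancels the pending takeover.
network_up_pre() {
    rm -f "$FLAG_FILE"
}
```

The network_down_complete_post script would then end with network_down_post "$3", and the network_up_pre script with network_up_pre. Keeping the timeout (RETRIES x INTERVAL) comfortably longer than a brief cable glitch should avoid a takeover when the network recovers on its own.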
Does this make sense or am I on completely the wrong lines here?
Regards,
JP