检测myslq从库状态,跳过固定的错误号,每隔30秒检测一次,如果符合条件自动跳过或者是重启从库
1)取出mysql从库的关键字
1 [root@localhost scripts]# mysql -u root -p123qq.com3307 -S /data/3307/mysql.sock -e "show slave status\G"|grep -E "Running|Seconds_Behind_Master|Last_SQL_Errno"|awk '{print $NF}'2 Yes3 Yes4 05 0
2)把错误号定义在数组里面
3)while ture
根据思路调试出如下脚本
1 [root@localhost scripts]# cat check_mysql_slave.sh 2 #/bin/bash 3 #Date: 4 #Author: 5 #Mail: 917667797@qq.com 6 #Function: This scripts function is check mysql slave is ok 7 #Version: 1.1 8 qq="917667796@qq.com" 9 cmd="mysql -u root -p123qq.com3307 -S /data/3307/mysql.sock -e"10 ip=`ifconfig eth2|sed -n 's#^.*ddr:\(.*\) Bc.*$#\1#gp'`11 skip=`$cmd "stop slave;set global sql_slave_skip_counter=1;start slave;"`12 error_numb=(1158 1159 1008 1007 1062)13 while true14 do15 status=(`$cmd "show slave status\G"|grep -E "Running|Seconds_Behind_Master|Last_SQL_Errno"|awk '{print $NF}'`)16 17 if [ "${status[0]}" == "Yes" -a "${status[1]}" == "Yes" -a "${status[2]}" == "0" ]18 then19 echo "mysql slave is ok"20 else21 for ((n=0;n<${#error_numb[*]};n++))22 do23 if [ "${status[3]}" == "${error_numb[n]}" ];then24 ${skip}25 else26 $cmd "stop slave;start slave;"27 fi28 done29 echo "mysql salve is not ok"30 echo "${ip} mysql is not ok"|mail -s "mysql slave error at `date +%F%T`" $qq31 fi32 sleep 333 done
测试结果如下
[root@localhost scripts]# sh check_mysql_slave.sh
mysql salve is not ok
mysql slave is ok
mysql slave is ok
mysql slave is ok
mysql slave is ok
mysql slave is ok
mysql slave is ok
mysql slave is ok
mysql slave is ok
mysql slave is ok
mysql slave is ok