In nagios, when I force a host check, I can clearly see in the logs:
However it never actually runs the check-host-alive check, which is check_ping. It doesn't appear in the scheduling queue, and even though the host in not pingable, and running the command itself fails it still shows as being up.
I can grep back through my archive logs, and see where hosts have been reported down however the above host was down for almost 30 minutes and we still received no alert:
My host and check-host-alive settings are:
and check-host-alive is defined here:
Thoughts? asked 20 Jul '10, 16:04 rfelsburg ♦ |
The CATCH is it doesn't look like its running, because it turns out nagios only logs failed checks unless you tell it to --enable-DEBUG3 at build time. Took me a couple of days of sorting through to realize that I was only seeing SERVICE ALERT in log files. Still haven't figured out why it was reporting as up, but I'll get to that next. answered 21 Jul '10, 13:55 rfelsburg ♦ |