problems with nagios spinning on notification

David Ford david at blue-labs.org
Fri Jul 4 07:40:25 CEST 2008


When a service goes offline, nagios is supposed to tell nagios ircbot
about it.  The message gets to nagircbot but nagios is stuck in a dead
heat run at 100% cpu on read()

[1215140088.267404] [256.1] [pid=12801] Running command '/bin/echo
"PROBLEM DispatchBuddy Internal_PING_WAP_Dayroom CRITICAL $OUTPUT$
$DATETIME$" > /etc/nagios/nagiosbot.pipe &'...
[1215140088.274510] [064.1] [pid=12801] Making callbacks (type 10)...
[1215140088.284985] [001.0] [pid=20014] process_macros()
[1215140088.285654] [001.0] [pid=20014] process_macros()
[1215140088.285925] [001.0] [pid=20014] process_macros()
[1215140088.286113] [001.0] [pid=20014] process_macros()
[1215140088.286279] [001.0] [pid=20014] process_macros()
[1215140088.286455] [001.0] [pid=20014] process_macros()
[1215140088.290542] [001.0] [pid=20014] process_macros()
[1215140088.290890] [001.0] [pid=20014] process_macros()
[1215140088.291084] [001.0] [pid=20014] process_macros()


This was about 2.5 hours ago, the time now is +9400 seconds (2h36m)

nagios is doing nothing else but:

[...]
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
read(7, 0xbfa75b1c, 1023)               = -1 EAGAIN (Resource
temporarily unavailable)
[...]

Ranger data # ls -l /proc/12801/fd/7
lr-x------+ 1 root root 64 Jul  4 01:32 /proc/12801/fd/7 -> pipe:[65450357]

Ranger data # lsof|grep 65450357
nagios    12801     nagios    7r     FIFO        0,5            
65450357 pipe
sh        20016     nagios    8w     FIFO        0,5            
65450357 pipe

Ranger data # ps auxww|grep 20016
nagios   20016  0.0  0.0   2816   560 ?        S    Jul03   0:00 sh -c
/bin/echo "PROBLEM DispatchBuddy Internal_PING_WAP_Dayroom CRITICAL
$OUTPUT$ $DATETIME$" > /etc/nagios/nagiosbot.pipe &



nagios won't do anything further.  I have to restart it.

-------------------------------------------------------------------------
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list