New user with nagios 3.0.3 hanging.

Jose Luis Garcia garciam at teleline.es
Mon Jul 14 09:30:28 CEST 2008


Hello again,

I'm experiencing the same problem right now, and I'm trying to do some
forensics.

It's running on a 2-CPU system and the main daemon is using up one of them.

With truss I just get many, many lines like this:

/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1
/2:     poll(0xFEF79F88, 1, 500)                        = 1

And the only poll() call in the code is in utils.c, in the function

void * command_file_worker_thread(void *arg){

And the return value (1) matches this:

#define EPERM   1       /* Not super-user                       */

But even though an error like this should be written to nagios.log, nothing
is written there.
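
For what it's worth, here is a minimal sketch of how poll()'s return value is
documented to behave on POSIX systems: a positive value is the number of
descriptors with events pending, 0 is a timeout, and an error is reported as
-1 with the reason in errno. If that reading applies here, the "= 1" in the
truss output would mean "one descriptor is ready" rather than EPERM, which
might also be why nothing reaches nagios.log. The helper names poll_one,
drain_one_byte and describe_poll_result are made up for illustration and are
not Nagios code.

```c
#include <assert.h>
#include <errno.h>
#include <poll.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

/* Hypothetical helper: wait up to timeout_ms for fd to become readable.
 * Returns the raw poll() result: >0 is the number of descriptors with
 * events pending, 0 is a timeout, -1 is an error (reason in errno). */
int poll_one(int fd, int timeout_ms)
{
    struct pollfd pfd;

    assert(fd >= 0);
    pfd.fd = fd;
    pfd.events = POLLIN;
    pfd.revents = 0;
    return poll(&pfd, 1, timeout_ms);
}

/* Hypothetical helper: consume one byte so the descriptor stops being
 * reported as ready. Returns read()'s result. */
long drain_one_byte(int fd)
{
    char c;

    return (long)read(fd, &c, 1);
}

/* Hypothetical helper: say in words what a poll() result means. */
void describe_poll_result(int rc)
{
    if (rc > 0)
        printf("%d descriptor(s) ready\n", rc);
    else if (rc == 0)
        printf("timed out, nothing ready\n");
    else
        printf("poll failed: %s\n", strerror(errno));
}
```

With a pipe, poll_one() keeps returning 1 immediately for as long as unread
data is pending, and only goes back to timing out once the data has been
read. A loop that polls without consuming what is pending would therefore
spin like the trace above, although whether that is what happens in Nagios
here is only a guess.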

I have also checked the external command file

# ls -l /usr/local/nagios/var/rw/nagios.cmd
prw-rw----   1 nagios   nagcmd     47675 Jul 14 09:08 /usr/local/nagios/var/rw/nagios.cmd
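
The same check can be done from C with stat(), which is roughly what "ls -l"
reports. This is only a sketch: fifo_info and print_fifo_info are made-up
helper names, and POSIX leaves st_size unspecified for FIFOs, so reading the
47675 above as "bytes waiting to be read" is an assumption about how this
Solaris box behaves.

```c
#include <assert.h>
#include <stdio.h>
#include <sys/types.h>
#include <sys/stat.h>

/* Hypothetical helper: returns 1 if path is a FIFO, 0 if it is some
 * other file type, and -1 if stat() fails. On success *size_out gets
 * st_size, whose meaning for FIFOs is platform-specific. */
int fifo_info(const char *path, long *size_out)
{
    struct stat st;

    assert(path != NULL && size_out != NULL);
    if (stat(path, &st) != 0)
        return -1;
    *size_out = (long)st.st_size;
    return S_ISFIFO(st.st_mode) ? 1 : 0;
}

/* Hypothetical helper: print a one-line summary for path. */
void print_fifo_info(const char *path)
{
    long size = 0;
    int rc = fifo_info(path, &size);

    if (rc < 0)
        printf("%s: cannot stat\n", path);
    else
        printf("%s: %s, st_size=%ld\n", path,
               rc ? "FIFO" : "not a FIFO", size);
}
```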

And I've seen that this code is the same as in 3.0.1.

Any idea?

When I try to stop Nagios (/etc/init.d/nagios stop), I can't. I just get:

Warning - nagios did not exit in a timely manner

And 3 nagios processes are still running, so I had to "kill -9" them.

Thanks a lot

Jose Luis



2008/7/11 Jose Luis Garcia <garciam at teleline.es>:

>
> Hello,
>>
>> My name is Jose Luis and I'm writing from Spain.
>>
>> I have recently installed Nagios on a Solaris 9 box.
>>
>> While I was on version 3.0.1, for a month or so, everything worked
>> smoothly, but now, with version 3.0.3, Nagios has hung twice (about once
>> a week).
>>
>> The Nagios daemon is running, but no checks are performed and no external
>> commands are processed, so I have to restart it.
>>
>> Has anyone else experienced this behaviour?
>>
>> Any suggestions? We are thinking of setting up a cron check to detect this
>> situation (no entries in nagios.log), but perhaps we are experiencing some
>> kind of bug that should be solved in new releases.
>>
>> Thank you very much indeed
>>
>> Jose Luis
>>
>
>
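
The cron check suggested in the quoted message (noticing when nagios.log
stops getting new entries) could be sketched roughly as below. It uses the
log file's modification time as a stand-in for "no new entries", which is an
assumption; the helper names age_exceeds, log_is_stale and report_log, and
the threshold chosen by the caller, are made up for illustration.

```c
#include <assert.h>
#include <stdio.h>
#include <sys/types.h>
#include <sys/stat.h>
#include <time.h>

/* Hypothetical helper: 1 if mtime is more than max_age_secs before now. */
int age_exceeds(time_t now, time_t mtime, long max_age_secs)
{
    return difftime(now, mtime) > (double)max_age_secs ? 1 : 0;
}

/* Hypothetical helper: returns 1 if the file at path was last modified
 * more than max_age_secs ago, 0 if it is fresher, -1 if stat() fails. */
int log_is_stale(const char *path, long max_age_secs)
{
    struct stat st;

    assert(path != NULL && max_age_secs >= 0);
    if (stat(path, &st) != 0)
        return -1;
    return age_exceeds(time(NULL), st.st_mtime, max_age_secs);
}

/* Hypothetical helper: print a verdict and return a value a cron
 * wrapper could use as its exit status (0 = fresh, 1 = stale or error). */
int report_log(const char *path, long max_age_secs)
{
    int rc = log_is_stale(path, max_age_secs);

    if (rc < 0)
        printf("%s: cannot stat\n", path);
    else
        printf("%s: %s\n", path, rc ? "STALE" : "OK");
    return rc == 0 ? 0 : 1;
}
```

A cron job could call report_log() with the path to nagios.log and a
threshold somewhat longer than the longest quiet period expected on a
healthy system, then restart or alert on a non-zero result.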