New user with nagios 3.0.3 hanging.

Jose Luis Garcia garciam at teleline.es
Fri Aug 1 09:26:10 CEST 2008


Hello again,

Has anybody else experienced this behaviour?

I'm running into it every few days.

Thank you very much

Jose Luis

2008/7/14 Jose Luis Garcia <garciam at teleline.es>

> Hello again,
>
> I'm experiencing the same problem right now, and I'm trying to do some
> forensics.
>
> It's running on a 2-CPU system and the main daemon is consuming one of them.
>
> With truss I just get many, many lines like this:
>
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
> /2:     poll(0xFEF79F88, 1, 500)                        = 1
>
> And the only poll call in the code is in utils.c, in the function
>
> void * command_file_worker_thread(void *arg){
>
> And the return value (1) seems to match this
>
> #define EPERM   1       /* Not super-user                       */
>
> But although this error should be logged to nagios.log, nothing is
> written.
>
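A note on reading the truss output quoted above: the trailing "= 1" is poll()'s return value, which counts the descriptors that have events pending; it is not an errno. truss reports a failed call as "Err#1 EPERM", and errno is only meaningful when poll() returns -1. A poll() that returns 1 immediately every time would fit a descriptor that stays ready without ever being drained, which might explain the busy CPU, though that is a guess. The sketch below is a minimal illustration in C, not the Nagios code: describe_poll_result() and poll_pipe_with_pending_data() are hypothetical names, and the pipe merely stands in for the command file FIFO.

```c
#include <assert.h>
#include <errno.h>
#include <poll.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

/* Hypothetical helper: turn a poll() return value into a readable message.
 * rc > 0  : number of descriptors with pending events
 * rc == 0 : the timeout expired
 * rc == -1: error; only in this case does errno (e.g. EPERM) apply */
void describe_poll_result(int rc, int saved_errno, char *buf, size_t len)
{
    assert(buf != NULL && len > 0);
    if (rc > 0)
        snprintf(buf, len, "%d descriptor(s) ready", rc);
    else if (rc == 0)
        snprintf(buf, len, "timed out");
    else
        snprintf(buf, len, "error: %s", strerror(saved_errno));
}

/* Hypothetical demo: poll the read end of a pipe that already holds one
 * byte, with the same 500 ms timeout seen in the truss output.
 * Returns poll()'s return value, or -2 if the pipe could not be set up. */
int poll_pipe_with_pending_data(void)
{
    int fds[2];
    struct pollfd pfd;
    int rc;

    if (pipe(fds) != 0)
        return -2;
    if (write(fds[1], "x", 1) != 1) {
        close(fds[0]);
        close(fds[1]);
        return -2;
    }

    pfd.fd = fds[0];
    pfd.events = POLLIN;
    pfd.revents = 0;

    /* Data is already waiting, so poll() returns at once without sleeping. */
    rc = poll(&pfd, 1, 500);

    close(fds[0]);
    close(fds[1]);
    return rc;
}
```

Calling poll_pipe_with_pending_data() should return 1, the same value truss shows, with no error involved. If the data were never read, every later poll() on that descriptor would return 1 again straight away.
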
> I have also checked the external command file
>
> # ls -l /usr/local/nagios/var/rw/nagios.cmd
> prw-rw----   1 nagios   nagcmd     47675 Jul 14 09:08 /usr/local/nagios/var/rw/nagios.cmd
>
> And I've seen that this code is the same as in 3.0.1.
>
> Any idea?
>
> I can't stop nagios (/etc/init.d/nagios stop) either. I just get
>
> Warning - nagios did not exit in a timely manner
>
> And 3 nagios processes are still running. So I had to "kill -9" them
>
> Thanks a lot
>
> Jose Luis
>
>
>
> 2008/7/11 Jose Luis Garcia <garciam at teleline.es>:
>
>
>>> Hello,
>>>
>>> My name is Jose Luis and I'm writing from Spain.
>>>
>>> I have recently installed Nagios on a Solaris 9 box.
>>>
>>> While I was on version 3.0.1, for a month or so, everything worked
>>> smoothly, but now, with version 3.0.3, Nagios has "hung" twice (about
>>> once a week).
>>>
>>> The Nagios daemon is running, but checks are not performed and external
>>> commands are not processed, so I have to restart it.
>>>
>>> Has anyone else experienced this behaviour?
>>>
>>> Any suggestions? We are thinking of setting up a cron check to detect
>>> this situation (no entries in nagios.log), but perhaps we are experiencing
>>> some kind of bug that should be solved in new releases.
>>>
>>> Thank you very much indeed
>>>
>>> Jose Luis
>>>
>>
>>
>
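
The cron check mentioned in the first message (no recent entries in nagios.log) could be sketched roughly as below. This is only an illustration in C under stated assumptions: log_is_stale() and check_nagios_log() are hypothetical names, and the log path and the age threshold are supplied by the caller because the right values depend on the installation. The return codes follow the usual Nagios plugin convention (0 OK, 2 CRITICAL, 3 UNKNOWN).

```c
#include <assert.h>
#include <stdio.h>
#include <sys/stat.h>
#include <time.h>

/* Hypothetical helper: returns 1 if the file at path was last modified more
 * than max_age_seconds before now, 0 if it is fresher, -1 if stat() fails. */
int log_is_stale(const char *path, time_t now, double max_age_seconds)
{
    struct stat st;

    assert(path != NULL);
    if (stat(path, &st) != 0)
        return -1;
    return difftime(now, st.st_mtime) > max_age_seconds ? 1 : 0;
}

/* Hypothetical wrapper for a cron job: prints a one-line status and returns
 * a plugin-style exit code (0 OK, 2 CRITICAL, 3 UNKNOWN). */
int check_nagios_log(const char *path, double max_age_seconds)
{
    switch (log_is_stale(path, time(NULL), max_age_seconds)) {
    case 0:
        printf("OK - %s was updated recently\n", path);
        return 0;
    case 1:
        printf("CRITICAL - %s not updated for more than %.0f seconds\n",
               path, max_age_seconds);
        return 2;
    default:
        printf("UNKNOWN - cannot stat %s\n", path);
        return 3;
    }
}
```

A small main() could pass the log path and a threshold (for example 600 seconds, an arbitrary choice) to check_nagios_log() and exit with its result, so that a cron wrapper can restart the daemon when the exit code is 2. A quiet but healthy installation may also go a while without log entries, so the threshold needs to be chosen with that in mind.
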
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users