Kernel Oops Problem...

Scott Bye sbye at unifiedgroup.co.uk
Thu Jul 20 12:26:01 CEST 2006


Hi,
 
I just found it odd that it was only happening after Nagios was
installed and running. The box is already stripped of all non-essential
hardware - but that was all up and running fine prior to the Nagios
process being started.
 
I've tried different kernel versions, different kernel configurations,
and different distributions (and thus different patch sets), all with
the same results.
 
I'll probably leave the box up over the weekend without Nagios running
to see if the oopses stop, and use something to ensure a decent CPU
load.
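
For the load itself, something along these lines would probably do. It
is only a minimal sketch, assuming Python is available on the box; the
function names and the durations are made up for illustration.

```python
import multiprocessing
import time


def burn(seconds):
    """Busy-loop for roughly `seconds`; returns how many iterations ran."""
    deadline = time.monotonic() + seconds
    iterations = 0
    while time.monotonic() < deadline:
        iterations += 1
    return iterations


def start_load(seconds, workers=None):
    """Run one busy-loop process per CPU (or per `workers`) and wait for them."""
    count = workers or multiprocessing.cpu_count()
    procs = [
        multiprocessing.Process(target=burn, args=(seconds,))
        for _ in range(count)
    ]
    for proc in procs:
        proc.start()
    for proc in procs:
        proc.join()
    return count
```

Calling start_load(48 * 3600) from a script would keep every CPU busy
for about two days. This only exercises the CPU, not disk or network,
so it is not a stand-in for what Nagios itself does.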
 
Regards,
 
Scott


>>> Tomas Macek <maca02 at atlas.cz> 20/07/2006 10:51 >>>
I don't think it's a bug in Nagios, because Nagios has nothing to do
with the kernel.
You write that you have identical machines, so it is possible that the
kernel fails on both machines. Did you try updating the kernel?
Did you try switching off all possible hardware?

On Thu, 20 Jul 2006, Scott Bye wrote:

> Hi all,
>
> I'm not entirely sure this is the fault of Nagios, but I've got a
> couple of identical machines here that are all exhibiting the same
> symptoms. The boxes have been running fine for about a month or so
> with various background tasks, services and quite a bit of code
> editing/compilation being performed. Then I loaded Nagios onto them,
> and ever since then I've been getting kernel oopses. I've run a
> memtest and that came out fine after 100-odd passes, and the
> operating temperatures seem fine too.
>
> As a test I built another one on a completely different distribution,
> which also exhibited the same symptoms.
>
> I get a couple of these a day on all the machines; unfortunately they
> seem to be random and infrequent...
>
> [42956502.420000] Unable to handle kernel NULL pointer dereference at virtual address 00000286
> [42956502.430000]  printing eip:
> [42956502.440000] f6800001
> [42956502.450000] *pde = 00000000
> [42956502.460000] Oops: 0000 [#1]
> [42956502.460000] Modules linked in: ipv6 pl2303 af_packet tsdev usbserial 8139cp 8139too mii hsfserial hsfengine hsfbasic2 hsfosspec pcspkr serio_raw psmouse ohci_hcd ehci_hcd usbcore evdev
> [42956502.460000] CPU:    0
> [42956502.460000] EIP:    0060:[<f6800001>]    Tainted: P      VLI
> [42956502.460000] EFLAGS: 00010a07   (2.6.15.7-ubuntu1)
> [42956502.460000] EIP is at 0xf6800001
> [42956502.460000] eax: c389c00a   ebx: f6a2dd84   ecx: 00000286   edx: c18b7dc0
> [42956502.460000] esi: f6a2dddc   edi: f71d1200   ebp: 00000000   esp: f68cbf34
> [42956502.460000] ds: 007b   es: 007b   ss: 0068
> [42956502.460000] Process nagios (pid: 1841, threadinfo=f68ca000 task=f725d030)
> [42956502.460000] Stack: c014b785 f6a2dd84 f6a2dee4 f68cbf68 f6a2dd84 c014da68 f6a2dd84 f7fa2124
> [42956502.460000]        00000000 00000000 f68cbf64 00000000 000010c9 c044d7c0 f71d1200 f725d030
> [42956502.460000]        00000001 c0116cb6 f71d1200 f68ca000 c011b5a3 f71d1200 f725d030 b7ef2174
> [42956502.460000] Call Trace:
> [42956502.460000]  [<c014b785>] remove_vma+0x25/0x60
> [42956502.460000]  [<c014da68>] exit_mmap+0xc8/0xf0
> [42956502.460000]  [<c0116cb6>] mmput+0x26/0x70
> [42956502.460000]  [<c011b5a3>] do_exit+0xd3/0x370
> [42956502.460000]  [<c011b8b4>] do_group_exit+0x34/0x70
> [42956502.460000]  [<c010306b>] sysenter_past_esp+0x54/0x75
> [42956502.460000] Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 c0 <3b> 39 98 00 00 00 02 00 00 4d 3c 00 00 00 00 00 00 00 00 00 00
> [42956502.460000]  <1>Fixing recursive fault but reboot is needed!
>
> Regards,
>
> Scott
>
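
For anyone comparing several of these traces, the call-trace lines
follow the pattern [<address>] symbol+0xoffset/0xsize. Here is a rough
sketch for pulling those fields out with Python; the regular expression
and the function name are illustrative and not part of any kernel tool.

```python
import re

# Matches call-trace entries such as "[<c014b785>] remove_vma+0x25/0x60".
FRAME_RE = re.compile(
    r"\[<([0-9a-f]{8,16})>\]\s+(\w+)\+0x([0-9a-f]+)/0x([0-9a-f]+)"
)


def parse_call_trace(text):
    """Return (address, symbol, offset, size) tuples for each frame in text."""
    frames = []
    for match in FRAME_RE.finditer(text):
        addr, symbol, offset, size = match.groups()
        frames.append((int(addr, 16), symbol, int(offset, 16), int(size, 16)))
    return frames


# Three frames copied from the oops quoted in this thread.
sample = """
[42956502.460000]  [<c014b785>] remove_vma+0x25/0x60
[42956502.460000]  [<c014da68>] exit_mmap+0xc8/0xf0
[42956502.460000]  [<c0116cb6>] mmput+0x26/0x70
"""

for addr, symbol, offset, size in parse_call_trace(sample):
    # Address minus offset gives the start of the function.
    print(f"{symbol}: starts at {addr - offset:#x}, frame at +{offset:#x} of {size:#x}")
```

Collecting the symbol lists from each oops makes it easier to see
whether every crash goes through the same exit path or whether the
traces differ from one occurrence to the next.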

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net 
https://lists.sourceforge.net/lists/listinfo/nagios-users 
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null