SLES10 x64 RT kernel servers die after NRPE is executed

Sebastian Waitz s.waitz at rtsgroup.net
Mon Nov 1 15:06:25 CET 2010


Hey there,

I have a strange error happening on several different servers having NRPE installed.

The system hangs and need to be rebooted. After the server is back online the logs say:


Oct 19 03:13:11 glddb-rtd2 sudo:      rts : TTY=unknown ; PWD=/ ; USER=root ; COMMAND=/usr/bin/find / -noleaf -name core -type f
Oct 19 03:16:06 glddb-rtd2 kernel: Bad page state in process 'nrpe'
Oct 19 03:16:06 glddb-rtd2 kernel: page:ffff81032a92fd80 flags:0x0000000600000005 mapping:0000000000000000 mapcount:0 count:7
Oct 19 03:16:06 glddb-rtd2 kernel: Trying to fix it up, but a reboot is needed
Oct 19 03:16:06 glddb-rtd2 kernel: Backtrace:
Oct 19 03:16:06 glddb-rtd2 kernel:
Oct 19 03:16:06 glddb-rtd2 kernel: Call Trace:
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff80283680>] bad_page+0x60/0xa0
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff80284a85>] get_page_from_freelist+0x585/0x620
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff8027ec26>] find_get_page+0x86/0x190
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff80284b87>] __alloc_pages+0x67/0x380
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff8028f890>] __handle_mm_fault+0x7c0/0xb90
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff8046bf06>] do_page_fault+0x216/0x8e0
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff80293bd2>] split_vma+0x152/0x170
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff8021b74b>] flush_tlb_mm+0x5b/0x150
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff80295aa4>] sys_mprotect+0x194/0x850
Oct 19 03:16:06 glddb-rtd2 kernel:  [<ffffffff8046a06d>] error_exit+0x0/0x84
Oct 19 03:16:06 glddb-rtd2 kernel:
Oct 19 03:16:06 glddb-rtd2 kernel: general protection fault: 0000 [1] PREEMPT SMP

Oct 19 03:16:06 glddb-rtd2 kernel: last sysfs file: /devices/pci0000:00/0000:00:06.0/0000:05:00.1/irq





Here are some details about the system:
glddb-rtd2:~ # cat /etc/issue
Welcome to SUSE Linux Enterprise Server 10 SP2 (x86_64) - Kernel \r (\l).

glddb-rtd2:~ # uname -a
Linux glddb-rtd2 2.6.22.19-0.14-rt #1 SMP PREEMPT RT 2008-06-04 00:52:17 +0200 x86_64 x86_64 x86_64 GNU/Linux
glddb-rtd2:~ # /usr/local/nagios/bin/nrpe --version
/usr/local/nagios/bin/nrpe: unrecognized option `--version'

NRPE - Nagios Remote Plugin Executor
Copyright (c) 1999-2008 Ethan Galstad (nagios at nagios.org)
Version: 2.12

Last Modified: 03-10-2008





All servers have SLES 10 SP 2 x64 Realtime Kernel in common.

Any ideas if this is an NRPE or kernel problem/bug are appreciated. Or did someone face the similar/same problems?

If some information is missing, feel free to request. I'd be glad to add the info.

Thanks much for your input.




Sebastian Waitz
IT Operations Manager

[cid:rts-logo.png at 02aeae5248894d848dc6f606a972e216]

RTS Realtime Systems (Deutschland) AG, Rembrandtstrasse 13, D-60596 Frankfurt am Main
T: +49.69.61009.0 / F: +49.69.61009.181 / Hotline: +49.69.61009.100

Sitz: Frankfurt am Main - HRB 46523 Amtsgericht Frankfurt/M.
Vorstand: Steffen Gemuenden (Vorsitzender), Mark van Vugt
Aufsichtsratsvorsitzender: Engelbert Gemuenden

www.rtsgroup.net

This email and any attachments are for the exclusive and confidential use of the intended recipient. If you are not the intended recipient, or an employee or agent responsible for delivering this message to the intended recipient, please do not read, distribute or take action in reliance upon this message. If you have received this in error, please notify me immediately by return email and promptly delete this message and its attachments from your computer system.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20101101/921e12c3/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: rts-logo.png
Type: image/png
Size: 13233 bytes
Desc: rts-logo.png
URL: <https://www.monitoring-lists.org/archive/users/attachments/20101101/921e12c3/attachment.png>
-------------- next part --------------
------------------------------------------------------------------------------
Nokia and AT&T present the 2010 Calling All Innovators-North America contest
Create new apps & games for the Nokia N8 for consumers in  U.S. and Canada
$10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing
Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store 
http://p.sf.net/sfu/nokia-dev2dev
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list