check_disk and stale NFS

VALIYEV Ruslan valiyev at unicc.org
Mon Nov 5 16:56:15 CET 2012


Hi all,

I would like to monitor all local and non-local filesystems on a server + check for stale NFS mount points. And according to man, the option I need is the following:

-L, --stat-remote-fs
    Only check local filesystems against thresholds. Yet call stat on remote filesystems
    to test if they are accessible (e.g. to detect Stale NFS Handles)

However, it looks like check_disk is simply hanging when there's a stale NFS.

[root at server ~]# /usr/lib64/nagios/plugins/check_disk -L -t 5
...hang state...


Here's /etc/mtab

[root at server ~]# grep nfs /etc/mtab
sunrpc /var/lib/nfs/rpc_pipefs rpc_pipefs rw 0 0
10.111.11.10:/icc_hds_transfer /icc_hds_transfer nfs rw,bg,addr=10.111.11.10 0 0
10.111.11.10:/ora_disk_backup /ora_disk_backup nfs rw,bg,hard,nointr,rsize=32768,wsize=32768,tcp,nfsvers=3,timeo=600,actimeo=0,addr=10.111.11.10 0 0
10.111.11.10:/icc_fast_refresh /icc_fast_refresh nfs rw,bg,addr=10.111.11.10 0 0

And here's df in hang state

[root at server ~]# df -P
Filesystem         1024-blocks      Used Available Capacity Mounted on
/dev/sda2             11904620   3857608   7432528      35% /
/dev/mapper/Volsda1-LogVol02   7935392   4927660   2598132      66% /home
/dev/mapper/Volsda1-LogVol01   7935392    439296   7086496       6% /opt
/dev/sda5              7936256    151484   7375120       3% /tmp
/dev/sda3              7936288    927140   6599492      13% /var
/dev/sda1               350018     16903    315044       6% /boot
tmpfs                 16474872         0  16474872       0% /dev/shm
/dev/sdb1            287826944 187056292  86149900      69% /u01
/dev/sdc1            287826944 172147168 101059024      64% /oratemp
...hang state...

[root at server ~]# grep nfs /var/log/messages
Nov  5 16:40:05 server kernel: nfs: server 10.111.11.10 not responding, still trying

What am I doing wrong? I just tried the latest nagios-plugins snapshot and the same thing happened.


Best regards,
Ruslan Valiyev
.
..:
Linux System Administrator
Systems Support Unit (OTAS)
Operations and Technology Division
International Computing Centre (ICC)
Email: valiyev at unicc.org
Website: www.unicc.org

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20121105/90d51255/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
LogMeIn Central: Instant, anywhere, Remote PC access and management.
Stay in control, update software, and manage PCs from one command center
Diagnose problems and improve visibility into emerging IT issues
Automate, monitor and manage. Do more in less time with Central
http://p.sf.net/sfu/logmein12331_d2d
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list