Linux NFS development
 help / color / mirror / Atom feed
From: Thanos Chatziathanassiou <tchatzi-nz9JlX+3IF8@public.gmane.org>
To: "J. Bruce Fields" <bfields@fieldses.org>
Cc: linux-nfs@vger.kernel.org
Subject: Re: kernel 2.6 and simulated flock() with posix locks
Date: Fri, 29 Feb 2008 17:20:38 +0200	[thread overview]
Message-ID: <47C822C6.80608@arx.net> (raw)
In-Reply-To: <20080228223249.GD30782@fieldses.org>

[-- Attachment #1: Type: text/plain, Size: 3157 bytes --]

J. Bruce Fields wrote:
> On Mon, Feb 25, 2008 at 06:42:35PM +0200, Thanos Chatziathanassiou wrote:
>   
>> J. Bruce Fields wrote:
>>     
>>> On Mon, Feb 25, 2008 at 03:20:29PM +0200, Thanos Chatziathanassiou wrote:
>>>   
>>>       
>>>> Hi,
>>>>
>>>> I've been trying to replace kernel 2.4 in a web server mounting its Document Root via NFS with kernel 2.6 and faced a rather disturbing problem.
>>>> About 1/2 hour after starting, the server would stop serving requests though it seemed fine.
>>>> Earlier 2.6 kernels exhibited the ``do_vfs_lock: VFS is out of sync with lock manager!'' symptom, later (when this was changed to a dprintk()) just sat there.
>>>> No apparent error apart from apache compaining ``[error] server reached MaxClients setting, consider raising the MaxClients setting'', unable to serve any requests.
>>>>
>>>> This issue does not surface under 2.4, where everything works as expected.
>>>> I came across this  
>>>> (http://blog.notreally.org/articles/2007/12/19/modifying-a-live-linux-kernel/) 
>>>> where apparently they faced the same problem, but their solution 
>>>> (which seemed a little crude) resulted in apache spitting ``There are 
>>>> no available locks'' messages (or roughly this, translated from my 
>>>> regional settings).
>>>>
>>>> Is there any solution to this or a way to get 2.4 behavior under 2.6 ?
>>>>     
>>>>         
>>> I'm a little confused--how do you know that the problem you face is the
>>> same as the one described on the blog above?  Are you re-exporting NFS
>>> via Samba?
>>>
>>> --b.
>>>   
>>>       
>> Indeed I am. But I am willing to convince you ;) What kind of debug info  
>> would I need to collect to find out what really the problem is ?
>>     
>
> Can you give a more detailed explanation of the symptoms?  For example,
> when you say "the server would stop serving requests", are you referring
> to the web server or the nfs server?
sorry if I wasn't clear on this. this particular (stock 2.6.16.60) web 
server stops serving requests.
the nfs server (2.6.12.6 based) as well as other (2.4 based) web servers 
continue humming along just fine.
>   If you think the problem is that
> Apache is hanging on a lock, you should be able to verify that with
> strace or /proc/locks
well, /proc/locks doesn't tell much...
---snip---
www4:~# cat /proc/locks
1: FLOCK  ADVISORY  WRITE 2512 08:07:829070 0 EOF
2: POSIX  ADVISORY  READ  2459 08:07:1284232 0 EOF
3: POSIX  ADVISORY  WRITE 2454 08:07:829066 0 EOF
---snip---
process 2459 is
root      2459  0.0  0.0   1552   500 ?        S    16:07   0:00 ypbind 
(slave)
and 2454 is
root      2454  0.0  0.0   1532   448 ?        S    16:07   0:00 ypbind 
(master)
...I couldn't find 2512 (?) in the process table.

however,
straceing random httpd processes, yields:
---snip---
strace -p 22149
flock(11, LOCK_EX
---snip---

...which is understandably blocking
unfortunately, this child did not ever get to write what it was serving 
at the time to the access and/or error log, but we can (safely ?) assume 
it'd be some mod_perl script that called flock().

let me know if I can grab anything else
>  or a sysrq-T trace.
>
> --b.
>   


[-- Attachment #2: S/MIME Cryptographic Signature --]
[-- Type: application/x-pkcs7-signature, Size: 3229 bytes --]

      reply	other threads:[~2008-02-29 15:21 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-02-25 13:20 kernel 2.6 and simulated flock() with posix locks Thanos Chatziathanassiou
     [not found] ` <47C2C09D.2010203-nz9JlX+3IF8@public.gmane.org>
2008-02-25 16:38   ` J. Bruce Fields
2008-02-25 16:42     ` Thanos Chatziathanassiou
     [not found]       ` <47C2EFFB.40807-nz9JlX+3IF8@public.gmane.org>
2008-02-28 22:32         ` J. Bruce Fields
2008-02-29 15:20           ` Thanos Chatziathanassiou [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=47C822C6.80608@arx.net \
    --to=tchatzi-nz9jlx+3if8@public.gmane.org \
    --cc=bfields@fieldses.org \
    --cc=linux-nfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox