From: Wu Fengguang <fengguang.wu@intel.com>
To: David Miller <davem@davemloft.net>
Cc: "herbert-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org"
<herbert-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"linux-nfs@vger.kernel.org" <linux-nfs@vger.kernel.org>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>
Subject: Re: sk_lock: inconsistent {RECLAIM_FS-ON-W} -> {IN-RECLAIM_FS-W} usage
Date: Fri, 10 Jul 2009 16:00:17 +0800 [thread overview]
Message-ID: <20090710080017.GA24168@localhost> (raw)
In-Reply-To: <20090709.171355.09466097.davem@davemloft.net>
On Fri, Jul 10, 2009 at 08:13:55AM +0800, David Miller wrote:
> From: Wu Fengguang <fengguang.wu@intel.com>
> Date: Thu, 9 Jul 2009 21:17:46 +0800
>
> > @@ -2100,7 +2100,8 @@ void tcp_send_fin(struct sock *sk)
> > } else {
> > /* Socket is locked, keep trying until memory is available. */
> > for (;;) {
> > - skb = alloc_skb_fclone(MAX_TCP_HEADER, GFP_KERNEL);
> > + skb = alloc_skb_fclone(MAX_TCP_HEADER,
> > + sk->sk_allocation);
> > if (skb)
> > break;
> > yield();
>
> I think this specific case needs more thinking.
>
> If the allocation fails, and it's GFP_ATOMIC, we are going to yield()
> (which sleeps) and loop endlessly waiting for the allocation to
> succeed.
The _retried_ GFP_ATOMIC won't be much worse than GFP_KERNEL.
GFP_KERNEL can directly reclaim FS pages; GFP_ATOMIC will wake up
kswapd to do that. So after yield(), GFP_ATOMIC have good opportunity
to succeed if GFP_KERNEL could succeed.
The original GFP_KERNEL does have _a bit_ better chance to succeed,
but there are no guarantee. It could loop endlessly whether it be
GFP_KERNEL or GFP_ATOMIC.
btw, generally speaking, it would be more robust that NFS set
sk_allocation to GFP_NOIO, and let the networking code choose
whether to use plain sk_allocation or (sk_allocation & ~__GFP_WAIT).
The (sk_allocation & ~__GFP_WAIT) cases should be rare, but I guess
the networking code shall do it anyway, because sk_allocation defaults
to GFP_KERNEL. It seems that currently the networking code simply uses
a lot of GFP_ATOMIC, do they really mean "I cannot sleep"?
Thanks,
Fengguang
WARNING: multiple messages have this Message-ID (diff)
From: Wu Fengguang <fengguang.wu@intel.com>
To: David Miller <davem@davemloft.net>
Cc: "herbert@gondor.apana.org.au" <herbert@gondor.apana.org.au>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"linux-nfs@vger.kernel.org" <linux-nfs@vger.kernel.org>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>
Subject: Re: sk_lock: inconsistent {RECLAIM_FS-ON-W} -> {IN-RECLAIM_FS-W} usage
Date: Fri, 10 Jul 2009 16:00:17 +0800 [thread overview]
Message-ID: <20090710080017.GA24168@localhost> (raw)
In-Reply-To: <20090709.171355.09466097.davem@davemloft.net>
On Fri, Jul 10, 2009 at 08:13:55AM +0800, David Miller wrote:
> From: Wu Fengguang <fengguang.wu@intel.com>
> Date: Thu, 9 Jul 2009 21:17:46 +0800
>
> > @@ -2100,7 +2100,8 @@ void tcp_send_fin(struct sock *sk)
> > } else {
> > /* Socket is locked, keep trying until memory is available. */
> > for (;;) {
> > - skb = alloc_skb_fclone(MAX_TCP_HEADER, GFP_KERNEL);
> > + skb = alloc_skb_fclone(MAX_TCP_HEADER,
> > + sk->sk_allocation);
> > if (skb)
> > break;
> > yield();
>
> I think this specific case needs more thinking.
>
> If the allocation fails, and it's GFP_ATOMIC, we are going to yield()
> (which sleeps) and loop endlessly waiting for the allocation to
> succeed.
The _retried_ GFP_ATOMIC won't be much worse than GFP_KERNEL.
GFP_KERNEL can directly reclaim FS pages; GFP_ATOMIC will wake up
kswapd to do that. So after yield(), GFP_ATOMIC have good opportunity
to succeed if GFP_KERNEL could succeed.
The original GFP_KERNEL does have _a bit_ better chance to succeed,
but there are no guarantee. It could loop endlessly whether it be
GFP_KERNEL or GFP_ATOMIC.
btw, generally speaking, it would be more robust that NFS set
sk_allocation to GFP_NOIO, and let the networking code choose
whether to use plain sk_allocation or (sk_allocation & ~__GFP_WAIT).
The (sk_allocation & ~__GFP_WAIT) cases should be rare, but I guess
the networking code shall do it anyway, because sk_allocation defaults
to GFP_KERNEL. It seems that currently the networking code simply uses
a lot of GFP_ATOMIC, do they really mean "I cannot sleep"?
Thanks,
Fengguang
WARNING: multiple messages have this Message-ID (diff)
From: Wu Fengguang <fengguang.wu-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
To: David Miller <davem-fT/PcQaiUtIeIZ0/mPfg9Q@public.gmane.org>
Cc: "herbert-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org"
<herbert-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org>,
"linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
"linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
"netdev-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
<netdev-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: Re: sk_lock: inconsistent {RECLAIM_FS-ON-W} -> {IN-RECLAIM_FS-W} usage
Date: Fri, 10 Jul 2009 16:00:17 +0800 [thread overview]
Message-ID: <20090710080017.GA24168@localhost> (raw)
In-Reply-To: <20090709.171355.09466097.davem-fT/PcQaiUtIeIZ0/mPfg9Q@public.gmane.org>
On Fri, Jul 10, 2009 at 08:13:55AM +0800, David Miller wrote:
> From: Wu Fengguang <fengguang.wu-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
> Date: Thu, 9 Jul 2009 21:17:46 +0800
>
> > @@ -2100,7 +2100,8 @@ void tcp_send_fin(struct sock *sk)
> > } else {
> > /* Socket is locked, keep trying until memory is available. */
> > for (;;) {
> > - skb = alloc_skb_fclone(MAX_TCP_HEADER, GFP_KERNEL);
> > + skb = alloc_skb_fclone(MAX_TCP_HEADER,
> > + sk->sk_allocation);
> > if (skb)
> > break;
> > yield();
>
> I think this specific case needs more thinking.
>
> If the allocation fails, and it's GFP_ATOMIC, we are going to yield()
> (which sleeps) and loop endlessly waiting for the allocation to
> succeed.
The _retried_ GFP_ATOMIC won't be much worse than GFP_KERNEL.
GFP_KERNEL can directly reclaim FS pages; GFP_ATOMIC will wake up
kswapd to do that. So after yield(), GFP_ATOMIC have good opportunity
to succeed if GFP_KERNEL could succeed.
The original GFP_KERNEL does have _a bit_ better chance to succeed,
but there are no guarantee. It could loop endlessly whether it be
GFP_KERNEL or GFP_ATOMIC.
btw, generally speaking, it would be more robust that NFS set
sk_allocation to GFP_NOIO, and let the networking code choose
whether to use plain sk_allocation or (sk_allocation & ~__GFP_WAIT).
The (sk_allocation & ~__GFP_WAIT) cases should be rare, but I guess
the networking code shall do it anyway, because sk_allocation defaults
to GFP_KERNEL. It seems that currently the networking code simply uses
a lot of GFP_ATOMIC, do they really mean "I cannot sleep"?
Thanks,
Fengguang
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2009-07-10 8:00 UTC|newest]
Thread overview: 44+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-06-08 2:37 sk_lock: inconsistent {RECLAIM_FS-ON-W} -> {IN-RECLAIM_FS-W} usage Wu Fengguang
2009-06-08 2:37 ` Wu Fengguang
2009-06-08 4:55 ` KOSAKI Motohiro
2009-06-08 4:55 ` KOSAKI Motohiro
2009-06-08 4:55 ` KOSAKI Motohiro
2009-06-08 5:00 ` Wu Fengguang
2009-06-08 5:07 ` KOSAKI Motohiro
2009-06-08 5:07 ` KOSAKI Motohiro
2009-06-08 5:07 ` KOSAKI Motohiro
[not found] ` <20090608140529.4376.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-06-08 5:53 ` Wu Fengguang
2009-06-08 5:53 ` Wu Fengguang
2009-06-08 5:53 ` Wu Fengguang
2009-06-08 5:56 ` Wu Fengguang
2009-06-08 5:56 ` Wu Fengguang
2009-06-08 5:56 ` Wu Fengguang
2009-06-08 6:12 ` KOSAKI Motohiro
2009-06-08 6:12 ` KOSAKI Motohiro
2009-06-08 6:12 ` KOSAKI Motohiro
[not found] ` <20090608134428.4373.A69D9226-+CUm20s59erQFUHtdCDX3A@public.gmane.org>
2009-06-09 3:07 ` Wu Fengguang
2009-06-09 3:07 ` Wu Fengguang
2009-06-09 3:07 ` Wu Fengguang
2009-06-09 3:15 ` KOSAKI Motohiro
2009-06-09 3:15 ` KOSAKI Motohiro
2009-06-09 3:15 ` KOSAKI Motohiro
2009-07-06 10:52 ` Herbert Xu
2009-07-06 10:52 ` Herbert Xu
2009-07-06 10:52 ` Herbert Xu
2009-07-09 13:17 ` Wu Fengguang
2009-07-10 0:13 ` David Miller
2009-07-10 0:13 ` David Miller
2009-07-10 0:13 ` David Miller
2009-07-10 0:59 ` Herbert Xu
2009-07-10 0:59 ` Herbert Xu
2009-07-10 0:59 ` Herbert Xu
2009-07-10 8:00 ` Wu Fengguang [this message]
2009-07-10 8:00 ` Wu Fengguang
2009-07-10 8:00 ` Wu Fengguang
2009-07-10 8:02 ` Herbert Xu
[not found] ` <20090710080247.GA2693-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org>
2009-07-14 16:04 ` David Miller
2009-07-14 16:04 ` David Miller
2009-07-14 16:04 ` David Miller
2009-07-15 7:45 ` Wu Fengguang
2009-07-15 7:45 ` Wu Fengguang
2009-07-15 7:45 ` Wu Fengguang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090710080017.GA24168@localhost \
--to=fengguang.wu@intel.com \
--cc=davem@davemloft.net \
--cc=herbert-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-nfs@vger.kernel.org \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.