All of lore.kernel.org
 help / color / mirror / Atom feed
From: Peter Zijlstra <a.p.zijlstra@chello.nl>
To: Indan Zupancic <indan@nul.nu>
Cc: Daniel Phillips <phillips@google.com>,
	netdev@vger.kernel.org, linux-mm@kvack.org,
	linux-kernel@vger.kernel.org
Subject: Re: [RFC][PATCH 2/9] deadlock prevention core
Date: Wed, 09 Aug 2006 16:00:32 +0200	[thread overview]
Message-ID: <1155132032.12225.65.camel@twins> (raw)
In-Reply-To: <39903.81.207.0.53.1155131329.squirrel@81.207.0.53>

On Wed, 2006-08-09 at 15:48 +0200, Indan Zupancic wrote:
> On Wed, August 9, 2006 14:54, Peter Zijlstra said:
> > On Wed, 2006-08-09 at 14:02 +0200, Indan Zupancic wrote:
> >>  That avoids lots of checks and should guarantee that the
> >> accounting is correct, except in the case when the IFF_MEMALLOC flag is
> >> cleared and the counter is set to zero manually. Can't that be avoided and
> >> just let it decrease to zero naturally?
> >
> > That would put the atomic op on the free path unconditionally, I think
> > davem gets nightmares from that.
> 
> I confused SOCK_MEMALLOC with sk_buff::memalloc, sorry. What I meant was
> to unconditionally decrement the reserved usage only when memalloc is true
> on the free path. That way all skbs that increased the reserve also decrease
> it, and the counter should never go below zero.

OK, so far so good, except we loose the notion of getting memory back
from
regular skbs.

> Also as far as I can see it should be possible to replace all atomic
> "if (unlikely(dev_reserve_used(skb->dev)))" checks witha check if
> memalloc is set. That should make davem happy, as there aren't any
> atomic instructions left in hot paths.

dev_reserve_used() uses atomic_read() which isn't actually a LOCK'ed 
instruction, so that should not matter.

> If IFF_MEMALLOC is set new skbs set memalloc and increase the reserve.

Not quite, if IFF_MEMALLOC is set new skbs _could_ get memalloc set. We
only
fall back to alloc_pages() if the regular path fails to alloc. If the
skb
is backed by a page (as opposed to kmem_cache fluff) sk_buff::memalloc
is set.

> When the skb is being destroyed it doesn't matter if IFF_MEMALLOC is set
> or not, only if that skb used reserves and thus only the memalloc flag
> needs to be checked. This means that changing the IFF_MEMALLOC doesn't
> affect in-flight skbs but only newly created ones, and there's no need to
> update in-flight skbs whenever the flag is changed as all should go well.

Your reasoning is sound, except for these two point above...

<snip code>

> It seems that here SOCK_MEMALLOC is only set on the first socket.
> Shouldn't it be set on all sockets instead?

Ouch, where was my brain... Thanks!

Also, I've been thinking (more pain), should I not up the reserve for
each
SOCK_MEMALLOC socket.


WARNING: multiple messages have this Message-ID (diff)
From: Peter Zijlstra <a.p.zijlstra@chello.nl>
To: Indan Zupancic <indan@nul.nu>
Cc: Daniel Phillips <phillips@google.com>,
	netdev@vger.kernel.org, linux-mm@kvack.org,
	linux-kernel@vger.kernel.org
Subject: Re: [RFC][PATCH 2/9] deadlock prevention core
Date: Wed, 09 Aug 2006 16:00:32 +0200	[thread overview]
Message-ID: <1155132032.12225.65.camel@twins> (raw)
In-Reply-To: <39903.81.207.0.53.1155131329.squirrel@81.207.0.53>

On Wed, 2006-08-09 at 15:48 +0200, Indan Zupancic wrote:
> On Wed, August 9, 2006 14:54, Peter Zijlstra said:
> > On Wed, 2006-08-09 at 14:02 +0200, Indan Zupancic wrote:
> >>  That avoids lots of checks and should guarantee that the
> >> accounting is correct, except in the case when the IFF_MEMALLOC flag is
> >> cleared and the counter is set to zero manually. Can't that be avoided and
> >> just let it decrease to zero naturally?
> >
> > That would put the atomic op on the free path unconditionally, I think
> > davem gets nightmares from that.
> 
> I confused SOCK_MEMALLOC with sk_buff::memalloc, sorry. What I meant was
> to unconditionally decrement the reserved usage only when memalloc is true
> on the free path. That way all skbs that increased the reserve also decrease
> it, and the counter should never go below zero.

OK, so far so good, except we loose the notion of getting memory back
from
regular skbs.

> Also as far as I can see it should be possible to replace all atomic
> "if (unlikely(dev_reserve_used(skb->dev)))" checks witha check if
> memalloc is set. That should make davem happy, as there aren't any
> atomic instructions left in hot paths.

dev_reserve_used() uses atomic_read() which isn't actually a LOCK'ed 
instruction, so that should not matter.

> If IFF_MEMALLOC is set new skbs set memalloc and increase the reserve.

Not quite, if IFF_MEMALLOC is set new skbs _could_ get memalloc set. We
only
fall back to alloc_pages() if the regular path fails to alloc. If the
skb
is backed by a page (as opposed to kmem_cache fluff) sk_buff::memalloc
is set.

> When the skb is being destroyed it doesn't matter if IFF_MEMALLOC is set
> or not, only if that skb used reserves and thus only the memalloc flag
> needs to be checked. This means that changing the IFF_MEMALLOC doesn't
> affect in-flight skbs but only newly created ones, and there's no need to
> update in-flight skbs whenever the flag is changed as all should go well.

Your reasoning is sound, except for these two point above...

<snip code>

> It seems that here SOCK_MEMALLOC is only set on the first socket.
> Shouldn't it be set on all sockets instead?

Ouch, where was my brain... Thanks!

Also, I've been thinking (more pain), should I not up the reserve for
each
SOCK_MEMALLOC socket.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2006-08-09 14:04 UTC|newest]

Thread overview: 280+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-08-08 19:33 [RFC][PATCH 0/9] Network receive deadlock prevention for NBD Peter Zijlstra
2006-08-08 19:33 ` Peter Zijlstra
2006-08-08 19:33 ` [RFC][PATCH 1/9] pfn_to_kaddr() for UML Peter Zijlstra
2006-08-08 19:33   ` Peter Zijlstra
2006-08-08 19:33 ` [RFC][PATCH 2/9] deadlock prevention core Peter Zijlstra
2006-08-08 19:33   ` Peter Zijlstra
2006-08-08 20:57   ` Stephen Hemminger
2006-08-08 20:57     ` Stephen Hemminger
2006-08-08 21:05     ` Peter Zijlstra
2006-08-08 21:05       ` Peter Zijlstra
2006-08-09  1:33     ` Daniel Phillips
2006-08-09  1:33       ` Daniel Phillips
2006-08-09  1:38       ` David Miller
2006-08-09  1:38         ` David Miller, Daniel Phillips
2006-08-08 21:17   ` Thomas Graf
2006-08-08 21:17     ` Thomas Graf
2006-08-09  1:34     ` Daniel Phillips
2006-08-09  1:34       ` Daniel Phillips
2006-08-09  1:39       ` David Miller
2006-08-09  1:39         ` David Miller, Daniel Phillips
2006-08-09  5:47         ` Daniel Phillips
2006-08-09  5:47           ` Daniel Phillips
2006-08-09 13:19           ` Thomas Graf
2006-08-09 13:19             ` Thomas Graf
2006-08-09 14:07             ` Peter Zijlstra
2006-08-09 14:07               ` Peter Zijlstra
2006-08-09 16:18               ` Thomas Graf
2006-08-09 16:18                 ` Thomas Graf
2006-08-09 16:19                 ` Peter Zijlstra
2006-08-09 16:19                   ` Peter Zijlstra
2006-08-10  0:01                   ` David Miller
2006-08-10  0:01                     ` David Miller, Peter Zijlstra
2006-08-09 23:58               ` David Miller
2006-08-09 23:58                 ` David Miller, Peter Zijlstra
2006-08-10  6:25                 ` Peter Zijlstra
2006-08-10  6:25                   ` Peter Zijlstra
2006-08-11  4:24                 ` Stephen Hemminger
2006-08-11  4:24                   ` Stephen Hemminger
2006-08-13 21:22                 ` Daniel Phillips
2006-08-13 21:22                   ` Daniel Phillips
2006-08-13 23:49                   ` David Miller
2006-08-13 23:49                     ` David Miller, Daniel Phillips
2006-08-14  1:15                     ` Daniel Phillips
2006-08-14  1:15                       ` Daniel Phillips
2006-08-11  2:37     ` Rik van Riel
2006-08-11  2:37       ` Rik van Riel
2006-08-13 22:05       ` Daniel Phillips
2006-08-13 22:05         ` Daniel Phillips
2006-08-13 23:55         ` David Miller
2006-08-13 23:55           ` David Miller, Daniel Phillips
2006-08-14  1:31           ` Daniel Phillips
2006-08-14  1:31             ` Daniel Phillips
2006-08-14  1:53             ` Andrew Morton
2006-08-14  1:53               ` Andrew Morton
2006-08-14  4:40               ` Peter Zijlstra
2006-08-14  4:40                 ` Peter Zijlstra
2006-08-14  4:58                 ` Andrew Morton
2006-08-14  4:58                   ` Andrew Morton
2006-08-14  5:03                   ` Peter Zijlstra
2006-08-14  5:03                     ` Peter Zijlstra
2006-08-14  5:22                     ` Andrew Morton
2006-08-14  5:22                       ` Andrew Morton
2006-08-14  6:45                       ` Peter Zijlstra
2006-08-14  6:45                         ` Peter Zijlstra
2006-08-14  7:07                         ` Andrew Morton
2006-08-14  7:07                           ` Andrew Morton
2006-08-14  8:15                           ` Peter Zijlstra
2006-08-14  8:15                             ` Peter Zijlstra
2006-08-14  8:25                             ` Evgeniy Polyakov
2006-08-14  8:25                               ` Evgeniy Polyakov
2006-08-14  8:35                               ` Peter Zijlstra
2006-08-14  8:35                                 ` Peter Zijlstra
2006-08-14  8:33                           ` David Miller
2006-08-14  8:33                             ` David Miller, Andrew Morton
2006-08-17  4:27                           ` Daniel Phillips
2006-08-17  4:27                             ` Daniel Phillips
2006-08-14  7:17                         ` Neil Brown
2006-08-14  7:17                           ` Neil Brown
2006-08-14  7:31                           ` Evgeniy Polyakov
2006-08-14  7:31                             ` Evgeniy Polyakov
2006-08-17  3:58                   ` Daniel Phillips
2006-08-17  3:58                     ` Daniel Phillips
2006-08-17  5:57                     ` Andrew Morton
2006-08-17  5:57                       ` Andrew Morton
2006-08-17 23:53                       ` Daniel Phillips
2006-08-17 23:53                         ` Daniel Phillips
2006-08-18  0:24                         ` Rik van Riel
2006-08-18  0:24                           ` Rik van Riel
2006-08-18  0:35                         ` Daniel Phillips
2006-08-18  0:35                           ` Daniel Phillips
2006-08-18  1:14                         ` Neil Brown
2006-08-18  1:14                           ` Neil Brown
2006-08-18  6:05                         ` Andrew Morton
2006-08-18  6:05                           ` Andrew Morton
2006-08-18 21:22                           ` Daniel Phillips
2006-08-18 21:22                             ` Daniel Phillips
2006-08-18 22:34                             ` Andrew Morton
2006-08-18 22:34                               ` Andrew Morton
2006-08-18 23:44                               ` Daniel Phillips
2006-08-18 23:44                                 ` Daniel Phillips
2006-08-19  2:44                                 ` Andrew Morton
2006-08-19  2:44                                   ` Andrew Morton
2006-08-19  4:14                                   ` Network receive stall avoidance (was [PATCH 2/9] deadlock prevention core) Daniel Phillips
2006-08-19  4:14                                     ` Daniel Phillips
2006-08-19  7:28                                     ` Andrew Morton
2006-08-19  7:28                                       ` Andrew Morton
2006-08-19 15:06                                   ` [RFC][PATCH 2/9] deadlock prevention core Rik van Riel
2006-08-19 15:06                                     ` Rik van Riel
2006-08-20  1:33                                     ` Andre Tomt
2006-08-20  1:33                                       ` Andre Tomt
2006-08-19 16:53                                   ` Ray Lee
2006-08-19 16:53                                     ` Ray Lee
2006-08-21 13:27                                   ` Philip R. Auld
2006-08-21 13:27                                     ` Philip R. Auld
2006-08-25 10:47                                     ` Pavel Machek
2006-08-25 10:47                                       ` Pavel Machek
2006-08-21 13:38                                 ` Jens Axboe
2006-08-21 13:38                                   ` Jens Axboe
2006-08-08 22:10   ` David Miller
2006-08-08 22:10     ` David Miller
2006-08-09  1:35     ` Daniel Phillips
2006-08-09  1:35       ` Daniel Phillips
2006-08-09  1:41       ` David Miller
2006-08-09  1:41         ` David Miller, Daniel Phillips
2006-08-09  5:44         ` Daniel Phillips
2006-08-09  5:44           ` Daniel Phillips
2006-08-09  7:00           ` Peter Zijlstra
2006-08-09  7:00             ` Peter Zijlstra
     [not found]   ` <42414.81.207.0.53.1155080443.squirrel@81.207.0.53>
2006-08-09  0:25     ` Daniel Phillips
2006-08-09  0:25       ` Daniel Phillips
2006-08-09 12:02       ` Indan Zupancic
2006-08-09 12:02         ` Indan Zupancic
2006-08-09 12:54         ` Peter Zijlstra
2006-08-09 12:54           ` Peter Zijlstra
2006-08-09 13:48           ` Indan Zupancic
2006-08-09 13:48             ` Indan Zupancic
2006-08-09 14:00             ` Peter Zijlstra [this message]
2006-08-09 14:00               ` Peter Zijlstra
2006-08-09 18:34               ` Indan Zupancic
2006-08-09 18:34                 ` Indan Zupancic
2006-08-09 19:45                 ` Peter Zijlstra
2006-08-09 19:45                   ` Peter Zijlstra
2006-08-09 20:19                   ` Peter Zijlstra
2006-08-09 20:19                     ` Peter Zijlstra
2006-08-10  1:21                   ` Indan Zupancic
2006-08-10  1:21                     ` Indan Zupancic
2006-08-09 16:05   ` -v2 " Peter Zijlstra
2006-08-09 16:05     ` Peter Zijlstra
2006-08-08 19:33 ` [RFC][PATCH 3/9] e1000 driver conversion Peter Zijlstra
2006-08-08 19:33   ` Peter Zijlstra
2006-08-08 20:50   ` Auke Kok
2006-08-08 20:50     ` Auke Kok
2006-08-08 20:59     ` Peter Zijlstra
2006-08-08 20:59       ` Peter Zijlstra
2006-08-08 22:32     ` David Miller
2006-08-08 22:32       ` David Miller, Auke Kok
2006-08-08 22:42       ` Auke Kok
2006-08-08 22:42         ` Auke Kok
2006-08-08 19:34 ` [RFC][PATCH 4/9] e100 " Peter Zijlstra
2006-08-08 19:34   ` Peter Zijlstra
2006-08-08 20:13   ` Auke Kok
2006-08-08 20:13     ` Auke Kok
2006-08-08 20:18     ` Peter Zijlstra
2006-08-08 20:18       ` Peter Zijlstra
2006-08-08 19:34 ` [RFC][PATCH 5/9] r8169 " Peter Zijlstra
2006-08-08 19:34   ` Peter Zijlstra
2006-08-08 19:34 ` [RFC][PATCH 6/9] tg3 " Peter Zijlstra
2006-08-08 19:34   ` Peter Zijlstra
2006-08-08 19:34 ` [RFC][PATCH 7/9] UML eth " Peter Zijlstra
2006-08-08 19:34   ` Peter Zijlstra
2006-08-08 19:34 ` [RFC][PATCH 8/9] 3c59x " Peter Zijlstra
2006-08-08 19:34   ` Peter Zijlstra
2006-08-08 23:07   ` Jeff Garzik
2006-08-08 23:07     ` Jeff Garzik
2006-08-09  5:51     ` Daniel Phillips
2006-08-09  5:51       ` Daniel Phillips
2006-08-09  5:55       ` David Miller
2006-08-09  5:55         ` David Miller, Daniel Phillips
2006-08-09  6:30         ` Jeff Garzik
2006-08-09  6:30           ` Jeff Garzik
2006-08-09  7:03           ` Peter Zijlstra
2006-08-09  7:03             ` Peter Zijlstra
2006-08-09  7:20             ` Jeff Garzik
2006-08-09  7:20               ` Jeff Garzik
2006-08-13 19:38         ` Daniel Phillips
2006-08-13 19:38           ` Daniel Phillips
2006-08-13 19:53           ` Jeff Garzik
2006-08-13 19:53             ` Jeff Garzik
2006-08-08 19:34 ` [RFC][PATCH 9/9] deadlock prevention for NBD Peter Zijlstra
2006-08-08 19:34   ` Peter Zijlstra
2006-08-09  5:46 ` [RFC][PATCH 0/9] Network receive " Evgeniy Polyakov
2006-08-09  5:46   ` Evgeniy Polyakov
2006-08-09  5:52   ` Daniel Phillips
2006-08-09  5:52     ` Daniel Phillips
2006-08-09  5:56     ` David Miller
2006-08-09  5:56       ` David Miller, Daniel Phillips
2006-08-09  5:53   ` David Miller
2006-08-09  5:53     ` David Miller, Evgeniy Polyakov
2006-08-09  5:55     ` Evgeniy Polyakov
2006-08-09  5:55       ` Evgeniy Polyakov
2006-08-09 12:37   ` Peter Zijlstra
2006-08-09 12:37     ` Peter Zijlstra
2006-08-09 13:07     ` Evgeniy Polyakov
2006-08-09 13:07       ` Evgeniy Polyakov
2006-08-09 13:32       ` Peter Zijlstra
2006-08-09 13:32         ` Peter Zijlstra
2006-08-09 19:29         ` Evgeniy Polyakov
2006-08-09 19:29           ` Evgeniy Polyakov
2006-08-09 23:54         ` David Miller
2006-08-09 23:54           ` David Miller, Peter Zijlstra
2006-08-10  6:06           ` Peter Zijlstra
2006-08-10  6:06             ` Peter Zijlstra
2006-08-13 20:16             ` Daniel Phillips
2006-08-13 20:16               ` Daniel Phillips
2006-08-14  5:13               ` Evgeniy Polyakov
2006-08-14  5:13                 ` Evgeniy Polyakov
2006-08-14  6:45                 ` Peter Zijlstra
2006-08-14  6:45                   ` Peter Zijlstra
2006-08-14  6:54                   ` Evgeniy Polyakov
2006-08-14  6:54                     ` Evgeniy Polyakov
2006-08-17  4:49                     ` Daniel Phillips
2006-08-17  4:49                       ` Daniel Phillips
2006-08-17  4:48                 ` Daniel Phillips
2006-08-17  4:48                   ` Daniel Phillips
2006-08-17  5:36                   ` Evgeniy Polyakov
2006-08-17  5:36                     ` Evgeniy Polyakov
2006-08-17 18:01                     ` Daniel Phillips
2006-08-17 18:01                       ` Daniel Phillips
2006-08-17 18:42                       ` Evgeniy Polyakov
2006-08-17 18:42                         ` Evgeniy Polyakov
2006-08-17 19:15                         ` Peter Zijlstra
2006-08-17 19:15                           ` Peter Zijlstra
2006-08-17 19:48                           ` Evgeniy Polyakov
2006-08-17 19:48                             ` Evgeniy Polyakov
2006-08-17 23:24                             ` Daniel Phillips
2006-08-17 23:24                               ` Daniel Phillips
2006-08-18  7:16                               ` Evgeniy Polyakov
2006-08-18  7:16                                 ` Evgeniy Polyakov
2006-08-12  3:42         ` Rik van Riel
2006-08-12  3:42           ` Rik van Riel
2006-08-12  8:47           ` Evgeniy Polyakov
2006-08-12  8:47             ` Evgeniy Polyakov
2006-08-12  9:19             ` Peter Zijlstra
2006-08-12  9:19               ` Peter Zijlstra
2006-08-12  9:37               ` Evgeniy Polyakov
2006-08-12  9:37                 ` Evgeniy Polyakov
2006-08-12 10:18                 ` Peter Zijlstra
2006-08-12 10:18                   ` Peter Zijlstra
2006-08-12 10:42                   ` Evgeniy Polyakov
2006-08-12 10:42                     ` Evgeniy Polyakov
2006-08-12 10:51                     ` Evgeniy Polyakov
2006-08-12 10:51                       ` Evgeniy Polyakov
2006-08-12 11:40                     ` Peter Zijlstra
2006-08-12 11:40                       ` Peter Zijlstra
2006-08-12 11:53                       ` Evgeniy Polyakov
2006-08-12 11:53                         ` Evgeniy Polyakov
2006-08-13  0:46                   ` David Miller
2006-08-13  0:46                     ` David Miller, Peter Zijlstra
2006-08-13  1:11                     ` Rik van Riel
2006-08-13  1:11                       ` Rik van Riel
2006-08-12 14:40                 ` Rik van Riel
2006-08-12 14:40                   ` Rik van Riel
2006-08-12 14:49                   ` Evgeniy Polyakov
2006-08-12 14:49                     ` Evgeniy Polyakov
2006-08-12 14:56                     ` Rik van Riel
2006-08-12 14:56                       ` Rik van Riel
2006-08-12 15:08                       ` Evgeniy Polyakov
2006-08-12 15:08                         ` Evgeniy Polyakov
2006-08-12 15:22                         ` Peter Zijlstra
2006-08-12 15:22                           ` Peter Zijlstra
2006-08-14  0:56                         ` Daniel Phillips
2006-08-14  0:56                           ` Daniel Phillips
2006-08-13  0:46                 ` David Miller
2006-08-13  0:46                   ` David Miller, Evgeniy Polyakov
2006-08-13  9:06                   ` Evgeniy Polyakov
2006-08-13  9:06                     ` Evgeniy Polyakov
2006-08-13  9:52                     ` Evgeniy Polyakov
2006-08-13  9:52                       ` Evgeniy Polyakov
2006-08-15 19:17 ` Pavel Machek
2006-08-15 19:17   ` Pavel Machek

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1155132032.12225.65.camel@twins \
    --to=a.p.zijlstra@chello.nl \
    --cc=indan@nul.nu \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=netdev@vger.kernel.org \
    --cc=phillips@google.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.