netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: James Bottomley <James.Bottomley@HansenPartnership.com>
To: Andi Kleen <ak@muc.de>
Cc: Rik van Riel <riel@redhat.com>,
	Dmitry Yusupov <dmitry_yus@yahoo.com>,
	mpm@selenic.com, andrea@suse.de, michaelc@cs.wisc.edu,
	open-iscsi@googlegroups.com, ksummit-2005-discuss@thunk.org,
	netdev@oss.sgi.com
Subject: Re: [Ksummit-2005-discuss] Summary of 2005 Kernel Summit Proposed Topics
Date: Tue, 29 Mar 2005 09:56:48 -0600	[thread overview]
Message-ID: <1112111808.5510.16.camel@mulgrave> (raw)
In-Reply-To: <20050329152008.GD63268@muc.de>

On Tue, 2005-03-29 at 17:20 +0200, Andi Kleen wrote:
> > Actually, not in 2.6 ... we had the same issue in SCSI using mempools
> > for sglist allocation.  All of the mempool alocation paths now take gfp_
> > flags, so you can specify GFP_ATOMIC for interrupt context.
> 
> Just does not work when you are actually short of memory.
> 
> Just think a second on how a mempool works: In the extreme
> case when it cannot allocate system memory anymore it has
> to wait for someone else to free a memory block into the mempool,
> then pass it on to the next allocator etc. Basically 
> it is a direct bypass pipeline for memory to pass memory
> directly from one high priority user to another. This only
> works with sleeping. Otherwise you could not handle an arbitary
> number of users with a single mempool.
> 
> So to get a reliable mempool you have to sleep on allocation.

But that's not what we use them for.  You are confusing reliability with
forward progress.

In SCSI we use GFP_ATOMIC mempools in order to make forward progress.
All the paths are coded to expect a failure (in which case we requeue).
For forward progress what we need is the knowledge that there are n
resources out there dedicated to us.  When they return they get
reallocated straight to us and we can restart the queue processing
(there's actually a SCSI trigger that does this).

For receive mempools, the situation is much the same; if you have n
reserved buffers, then you have to drop the n+1 th packet.  However, the
resources will free up and go back to your mempool, and eventually you
accept the packet on retransmit.

The killer scenario (and why we require a mempool) is that someone else
gets the memory before you but then becomes blocked on another
allocation, so now you have no more allocations to allow forward
progress.

James


> > The object isn't to make the queues *reliable* it's to ensure the system
> > can make forward progress.  So all we're trying to ensure is that the
> > sockets used to service storage have some probability of being able to
> > send and receive packets during low memory.
> 
> For that it is enough to make the sender reliable. Retransmit
> takes care of the rest.

No ... we cannot get down to the situation where GFP_ATOMIC always
fails.  Now we have no receive capacity at all and the system deadlocks.

> > In your scenario, if we're out of memory and the system needs several
> > ACK's to the swap device for pages to be released to the system, I don't
> > see how we make forward progress since without a reserved resource to
> > allocate from how does the ack make it up the stack to the storage
> > driver layer?
> 
> Typically because the RX ring of the driver has some packets left.
> 
> Also since TCP is very persistent and there is some memory
> activity left you will have at least occasionally a time slot
> where a GFP_ATOMIC allocation can succeed.

That's what I think a mempool is required to guarantee.  Without it,
there are scenarios where GFP_ATOMIC always fails.

James

  reply	other threads:[~2005-03-29 15:56 UTC|newest]

Thread overview: 88+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <4241D106.8050302@cs.wisc.edu>
     [not found] ` <20050324101622S.fujita.tomonori@lab.ntt.co.jp>
     [not found]   ` <1111628393.1548.307.camel@beastie>
     [not found]     ` <20050324113312W.fujita.tomonori@lab.ntt.co.jp>
     [not found]       ` <1111633846.1548.318.camel@beastie>
     [not found]         ` <20050324215922.GT14202@opteron.random>
     [not found]           ` <424346FE.20704@cs.wisc.edu>
     [not found]             ` <20050324233921.GZ14202@opteron.random>
     [not found]               ` <20050325034341.GV32638@waste.org>
     [not found]                 ` <20050327035149.GD4053@g5.random>
2005-03-27  5:48                   ` [Ksummit-2005-discuss] Summary of 2005 Kernel Summit Proposed Topics Matt Mackall
2005-03-27  6:04                     ` Andrea Arcangeli
2005-03-27  6:38                       ` Matt Mackall
2005-03-27 14:50                         ` Andrea Arcangeli
2005-03-27  6:33                     ` Dmitry Yusupov
2005-03-27  6:46                       ` David S. Miller
2005-03-27  7:05                         ` Dmitry Yusupov
2005-03-27  7:57                           ` David S. Miller
2005-03-27  8:18                             ` Dmitry Yusupov
2005-03-27 18:26                               ` Mike Christie
2005-03-27 18:31                                 ` David S. Miller
2005-03-27 19:58                                   ` Matt Mackall
2005-03-27 21:49                                   ` Dmitry Yusupov
2005-03-27 18:47                                 ` Dmitry Yusupov
2005-03-27 21:14                         ` Alex Aizman
     [not found]                         ` <20050327211506.85EDA16022F6@mx1.suse.de>
2005-03-28  0:15                           ` Andrea Arcangeli
2005-03-28  3:54                         ` Rik van Riel
2005-03-28  4:34                           ` David S. Miller
2005-03-28  4:50                             ` Rik van Riel
2005-03-28  6:58                           ` Alex Aizman
2005-03-28 16:12                           ` Andi Kleen
2005-03-28 16:22                             ` Andrea Arcangeli
2005-03-28 16:24                             ` Rik van Riel
2005-03-29 15:11                               ` Andi Kleen
2005-03-29 15:29                                 ` Rik van Riel
2005-03-29 17:03                                 ` Matt Mackall
2005-03-28 16:28                             ` James Bottomley
2005-03-29 15:20                               ` Andi Kleen
2005-03-29 15:56                                 ` James Bottomley [this message]
2005-03-29 17:19                                 ` Dmitry Yusupov
2005-03-29 21:08                                   ` jamal
2005-03-29 22:00                                     ` Rik van Riel
2005-03-29 22:17                                       ` Matt Mackall
2005-03-29 23:30                                         ` jamal
2005-03-29 23:00                                       ` jamal
2005-03-29 23:25                                         ` Matt Mackall
2005-03-30  0:30                                           ` H. Peter Anvin
2005-03-30 15:24                                         ` Andi Kleen
2005-03-29 22:03                                     ` Rick Jones
2005-03-29 23:13                                       ` jamal
2005-03-30  2:28                                         ` Alex Aizman
     [not found]                                         ` <E1DGSwp-0004ZE-00@thunker.thunk.org>
2005-03-30 17:16                                           ` Grant Grundler
2005-03-30 18:46                                         ` Dmitry Yusupov
2005-03-30 15:22                                     ` Andi Kleen
2005-03-30 15:33                                       ` Andrea Arcangeli
2005-03-30 15:38                                         ` Rik van Riel
2005-03-30 15:39                                         ` Andi Kleen
2005-03-30 15:44                                           ` Andrea Arcangeli
2005-03-30 15:50                                             ` Rik van Riel
2005-03-30 16:04                                               ` James Bottomley
2005-03-30 17:48                                                 ` H. Peter Anvin
2005-03-30 16:02                                             ` Andi Kleen
2005-03-30 16:15                                               ` Andrea Arcangeli
2005-03-30 16:55                                                 ` jamal
2005-03-30 18:42                                                   ` Rik van Riel
2005-03-30 19:28                                                 ` Alex Aizman
2005-03-31 11:41                                                 ` Andi Kleen
2005-03-31 12:12                                                   ` Rik van Riel
2005-03-31 18:59                                                     ` Andi Kleen
2005-03-31 19:04                                                       ` Rik van Riel
2005-03-31 15:35                                                   ` Grant Grundler
2005-03-31 19:15                                                   ` Alex Aizman
2005-03-31 19:34                                                   ` Andi Kleen
2005-03-31 19:39                                                     ` Rik van Riel
2005-03-31 11:45                                                 ` Andi Kleen
2005-03-31 11:50                                                 ` Andi Kleen
2005-03-31 17:09                                                   ` Andrea Arcangeli
2005-03-31 22:05                                                     ` Dmitry Yusupov
2005-03-30 17:24                                       ` Matt Mackall
2005-03-30 17:39                                         ` Dmitry Yusupov
2005-03-30 20:10                                           ` Mike Christie
2005-03-30 17:07                                     ` Grant Grundler
2005-03-30  5:12                                   ` H. Peter Anvin
2005-03-28 16:37                             ` Dmitry Yusupov
2005-03-28 19:45                         ` Roland Dreier
2005-03-28 20:32                           ` Topic: Remote DMA network technologies Gerrit Huizenga
2005-03-28 20:36                             ` Roland Dreier
     [not found]                           ` <1112042936.5088.22.camel@beastie>
2005-03-28 22:32                             ` [Ksummit-2005-discuss] Summary of 2005 Kernel Summit Proposed Topics Benjamin LaHaise
2005-03-29  3:19                               ` Linux support for RDMA (was: [Ksummit-2005-discuss] Summary of 2005 Kernel Summit Proposed Topics) Roland Dreier
2005-03-30 16:00                                 ` Benjamin LaHaise
2005-03-31  1:08                                   ` Linux support for RDMA H. Peter Anvin
2005-04-02 18:08                               ` [Ksummit-2005-discuss] Summary of 2005 Kernel Summit Proposed Topics Dmitry Yusupov
2005-04-02 19:13                                 ` Ming Zhang
2005-04-04  6:31                                 ` Grant Grundler
2005-04-04 18:57                                 ` Rick Jones
2005-03-29  3:14                             ` Linux support for RDMA (was: [Ksummit-2005-discuss] Summary of 2005 Kernel Summit Proposed Topics) Roland Dreier
     [not found] <42472259.2866086e.3169.318fSMTPIN_ADDED@mx.googlegroups.com>
2005-03-27 21:18 ` [Ksummit-2005-discuss] Summary of 2005 Kernel Summit Proposed Topics Alex Aizman
2005-03-27 21:53 Alex Aizman

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1112111808.5510.16.camel@mulgrave \
    --to=james.bottomley@hansenpartnership.com \
    --cc=ak@muc.de \
    --cc=andrea@suse.de \
    --cc=dmitry_yus@yahoo.com \
    --cc=ksummit-2005-discuss@thunk.org \
    --cc=michaelc@cs.wisc.edu \
    --cc=mpm@selenic.com \
    --cc=netdev@oss.sgi.com \
    --cc=open-iscsi@googlegroups.com \
    --cc=riel@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).