Re: A bug in read operation for /dev/zero and a proposed fix.

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Andrew Morton <akpm@linux-foundation.org>
To: Salman Qazi <sqazi@google.com>
Cc: linux-kernel@vger.kernel.org,
	Nick Piggin <nickpiggin@yahoo.com.au>,
	Linus Torvalds <torvalds@linux-foundation.org>
Subject: Re: A bug in read operation for /dev/zero and a proposed fix.
Date: Thu, 4 Jun 2009 13:50:50 -0700	[thread overview]
Message-ID: <20090604135050.ceb6bf18.akpm@linux-foundation.org> (raw)
In-Reply-To: <alpine.DEB.1.00.0906041330170.32557@bumblebee1.mtv.corp.google.com>

On Thu, 4 Jun 2009 13:32:55 -0700 (PDT)
Salman Qazi <sqazi@google.com> wrote:

> While running 20 parallel instances of dd as follows:
> 
> #!/bin/bash
> 
> for i in `seq 1 20`; do
>          dd if=/dev/zero of=/export/hda3/dd_$i bs=1073741824 count=1 &
> done
> wait
> 
> on a 16G machine, we noticed that rather than just killing the
> processes, the entire kernel went down.  Stracing dd reveals that it first
> does an mmap2, which makes 1GB worth of zero page mappings.  Then it 
> performs
> a read on those pages from /dev/zero, and finally it performs a write. 
> The
> machine died during the reads.  Looking at the code, it was noticed that
> /dev/zero's read operation had been changed at some point from giving
> zero page mappings to actually zeroing the page.  The zeroing of the
> pages causes physical pages to be allocated to the process.

erk, Nick broke dd(1):

  commit 557ed1fa2620dc119adb86b34c614e152a629a80
  Author: Nick Piggin <npiggin@suse.de>
  Date:   Tue Oct 16 01:24:40 2007 -0700

      remove ZERO_PAGE


This is the first report I've seen of problems arising from that
change.

>  But, when
> the process exhausts all the memory that it can, the kernel cannot kill
> it, as it is still in the kernel mode allocating more memory. 
> Consequently,
> the kernel eventually crashes.
> 
> To fix this, I propose that when a fatal signal is pending during
> /dev/zero read operation, we simply return and let the user process die.
> Here is a patch that does that.
> 
> Signed-off-by: Salman Qazi <sqazi@google.com>
> ---
> diff --git a/drivers/char/mem.c b/drivers/char/mem.c
> index 8f05c38..2ffa36e 100644
> --- a/drivers/char/mem.c
> +++ b/drivers/char/mem.c
> @@ -696,6 +696,11 @@ static ssize_t read_zero(struct file * file, char __user * buf,
>   			break;
>   		buf += chunk;
>   		count -= chunk;
> +		/* The exit code here doesn't actually matter, as userland
> +		 * will never see it.
> +		 */
> +		if (fatal_signal_pending(current))
> +			return -ENOMEM;
>   		cond_resched();
>   	}
>   	return written ? written : -EFAULT;

OK.  I think.

It's presumptuous to return -ENOMEM: we don't _know_ that this signal
came from the oom-killer.  It would be better to return -EINTR here.

next prev parent reply	other threads:[~2009-06-04 20:51 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-06-04 20:32 A bug in read operation for /dev/zero and a proposed fix Salman Qazi
2009-06-04 20:50 ` Andrew Morton [this message]
2009-06-04 20:56   ` Salman Qazi
2009-06-04 21:01   ` Linus Torvalds
2009-06-04 21:05     ` Linus Torvalds

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090604135050.ceb6bf18.akpm@linux-foundation.org \
    --to=akpm@linux-foundation.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=nickpiggin@yahoo.com.au \
    --cc=sqazi@google.com \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.