From: "Serge E. Hallyn" <serue-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
To: sukadev-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org,
Nadia Derbey <Nadia.Derbey-6ktuUTfB/bM@public.gmane.org>
Cc: Linux Containers <containers-qjLDD68F18O7TbgM5vRIOg@public.gmane.org>
Subject: Re: [lxc-dev] [BUG][cryo]: underflow in semundo_release() ?
Date: Fri, 13 Jun 2008 09:36:32 -0500 [thread overview]
Message-ID: <20080613143632.GA24858@us.ibm.com> (raw)
In-Reply-To: <20080613002213.GA27654-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
Quoting sukadev-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org (sukadev-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org):
> Serge E. Hallyn [serue-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org] wrote:
> | Quoting sukadev-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org (sukadev-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org):
> | > Serge E. Hallyn [serue-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org] wrote:
> | > | > The last few messages on stdout of the restart are:
> | > | >
> | > | > DEBUG (cr.c::1141) next memseg_t is: start bfe5c000 end bfe71000 prot 3 flag 50 offset 3221139456 fnam [stack]
> | > | > DEBUG (cr.c::1187) Delete segment bfa35000 - bfa4a000
> | > | > DEBUG (cr.c::1189) Restore segment bfe5c000 - bfe71000
> | > |
> | > | The stack segments are not the same. How can that be? Did you turn off
> | > | stack randomization?
> | >
> | > Grr, I did not. After I randomized, it seems to work on -lxc4 as well.
> | >
> | > Rather than warn, can we have cryo fail if stack is not randomized ?
> | > (its almost sure to fail anyway).
> |
> | I guess we may as well, because I guess the error message shows up at
> | the top of a long list of output so you'll never see it.
> |
> | Will change it.
>
> I run into this bug (twice so far) even with randomize_va_space set to 0.
>
> I run the attached pipe2.c program, ckpt and restart. The restart started
> out fine (printed "i is 10") then I hit CTRL-C.
>
> Last commit in my git tree:
>
> commit 84d005031a8a17bdca62dc541c296a3bea74658c
> (which adds 'exit(1)' to Dave's following commit)
> 96bb0ed3351c2e4268dade4416e1acbff7dab152
>
> qemu login: BUG: atomic counter underflow at:
>
> BUG: atomic counter underflow at:
> Pid: 2252, comm: pipe2 Not tainted 2.6.26-rc2-mm1-lxc4 #2
> Pid: 2252, comm: pipe2 Not tainted 2.6.26-rc2-mm1-lxc4 #2
> [<c01d9d7b>] [<c01d9d7b>] semundo_release+0x28/0x4a
> semundo_release+0x28/0x4a
> [<c01514c5>] [<c01514c5>] __fput+0x93/0x13b
> __fput+0x93/0x13b
> [<c0151827>] [<c0151827>] fput+0x2d/0x32
> fput+0x2d/0x32
> [<c014f044>] [<c014f044>] filp_close+0x50/0x5a
> filp_close+0x50/0x5a
> [<c01146e0>] [<c01146e0>] put_files_struct+0x7c/0xbe
> put_files_struct+0x7c/0xbe
> [<c0114759>] [<c0114759>] exit_files+0x37/0x3c
> exit_files+0x37/0x3c
> [<c01156ec>] [<c01156ec>] do_exit+0x1e4/0x589
> do_exit+0x1e4/0x589
> [<c0115aef>] [<c0115aef>] do_group_exit+0x5e/0x86
> do_group_exit+0x5e/0x86
> [<c011d126>] [<c011d126>] get_signal_to_deliver+0x2e0/0x31e
> get_signal_to_deliver+0x2e0/0x31e
> [<c0102215>] [<c0102215>] do_notify_resume+0x91/0x6dd
> do_notify_resume+0x91/0x6dd
> [<c0126d31>] [<c0126d31>] ? ? getnstimeofday+0x37/0xb7
> getnstimeofday+0x37/0xb7
> [<c01f63a8>] [<c01f63a8>] ? ? copy_to_user+0x2a/0x36
> copy_to_user+0x2a/0x36
> [<c012490d>] [<c012490d>] ? ? update_rmtp+0x49/0x5b
> update_rmtp+0x49/0x5b
> [<c0124d0d>] [<c0124d0d>] ? ? hrtimer_nanosleep+0x57/0x95
> hrtimer_nanosleep+0x57/0x95
> [<c012491f>] [<c012491f>] ? ? hrtimer_wakeup+0x0/0x1c
> hrtimer_wakeup+0x0/0x1c
> [<c0124d8d>] [<c0124d8d>] ? ? sys_nanosleep+0x42/0x53
> sys_nanosleep+0x42/0x53
> [<c0102c16>] [<c0102c16>] work_notifysig+0x13/0x19
> work_notifysig+0x13/0x19
> [<c02f0000>] [<c02f0000>] ? ? serial_pci_guess_board+0xb0/0x141
> serial_pci_guess_board+0xb0/0x141
Suka,
yeah I get this with that kernel too (though mine looks very different).
But I don't with a more recent -mm. Certainly seems like either a
exit-vs-signal or exit-vs-semundo bug. So there are three
possibilities:
1. it was a -mm bug which Oleg squashed - in which case it's
fixed.
2. it was a semundo bug which Manfred fixed with his recent
semundo rcu-ification
2. it was a Pierre bug with the semundo-rcu for c/r. In which
case, Nadia is having to rework that set anyway on top of
Manfred's patch. So let's make sure to do this test when
Kathy releases the next -lxc with Nadia's new version of
the semundo-rcu patchset.
thanks,
-serge
parent reply other threads:[~2008-06-13 14:36 UTC|newest]
Thread overview: expand[flat|nested] mbox.gz Atom feed
[parent not found: <20080613002213.GA27654-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20080613143632.GA24858@us.ibm.com \
--to=serue-r/jw6+rmf7hqt0dzr+alfa@public.gmane.org \
--cc=Nadia.Derbey-6ktuUTfB/bM@public.gmane.org \
--cc=containers-qjLDD68F18O7TbgM5vRIOg@public.gmane.org \
--cc=sukadev-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.