From: Ian Kent <raven@themaw.net>
To: Mike Kazantsev <mk.fraggod@gmail.com>
Cc: linux-btrfs@vger.kernel.org,
Gustavo Sverzut Barbieri <barbieri@profusion.mobi>
Subject: Re: Reproducible kernel (2.6.36) oops with several simultaneus btrfs mounts
Date: Mon, 15 Nov 2010 09:01:45 +0800 [thread overview]
Message-ID: <1289782905.3248.9.camel@localhost> (raw)
In-Reply-To: <20101113201536.29617c0d@sacrilege>
On Sat, 2010-11-13 at 20:15 +0500, Mike Kazantsev wrote:
> Good day.
>
>
> I'm experiencing a kernel oops when systemd tries to fsck and mount
> several btrfs filesystems pretty much simultaneously on boot.
> Oops is highly reproducible for me and causes system to hang, sometimes
> triggering some kind of oops-loop, dumping backtraces into console
> until the power is killed.
>
> I've mentioned systemd (init system, like sysvinit or upstart), because
> I haven't encountered the issue until I've installed it, and then I've
> got it right on the first (successful) systemd boot.
> Also, looks like I'm not alone in this, since the issue was raised on
> systemd-devel mailing list:
> http://thread.gmane.org/gmane.comp.sysutils.systemd.devel/704
> http://article.gmane.org/gmane.comp.sysutils.systemd.devel/721
>
> Since I've used vm (qemu-kvm) replica of physical machine to test
> systemd migration, that's where I've first encountered it.
>
> Symptoms are exactly the same on real hardware, so I doubt it's related
> to my specs, but since vm is nearly identical (rsync'ed from) to the
> real setup, guess it might be related to some particular initrd / lvm /
> whatever setup.
>
> I believe I've seen it first with 2.6.36-rc8, and now wih 2.6.36
> mainline kernel. Haven't tried 2.6.35, because systemd seem to rely on
> newer kernel features.
> Uname -a (I use same kernel for physical machine and vm):
> Linux sacrilege 2.6.36-fg.roam #9 SMP PREEMPT Wed Oct 27 14:22:03 YEKST 2010 i686 GNU/Linux
>
> Keywords: btrfs, systemd, init, boot, fsck, mount, oops, hang, loop, 2.6.36
>
>
>
> Oops message (both links lead to the same data):
> http://fraggod.net/share/systemd_btrfs_oops/oops.txt
> http://paste.pocoo.org/raw/290857/
Yes, this was reported on this list recently against a 2.6.35 based
kernel.
I know what causes it and I'm working on it but I'm not yet sure of the
best way to fix it.
>
>
>
> There's also a kernel/initrd/disk-image combo, which demonstrates the
> issue. It's i686 (32-bit) exherbo linux setup with all fs's on lvm
> volumes.
>
> Multiple btrfs mounts are a bit archaic and unnecessary here, and I'll
> probably get rid of these in a nearby future, but guess that's not the
> reason it shouldn't work or crash like that.
> http://fraggod.net/share/systemd_btrfs_oops/vm-kernel-2.6.36.img
> http://fraggod.net/share/systemd_btrfs_oops/vm-initrd.lzma
> http://fraggod.net/share/systemd_btrfs_oops/vm-disk.qcow2.xz
>
> Also, you can get all these via bittorrent (I may be able to add a few
> extra seeds there, for greater download speeds):
> http://fraggod.net/share/systemd_btrfs_oops/systemd_btrfs_oops_vm.torrent
> http://linuxtracker.org/download.php?id=a9f34f3c871b4d177dc1f8384bd2bb3f261a1297&f=systemd_btrfs_oops_vm.torrent
>
> I've cleaned disk image from most of the unrelated stuff (it was a
> desktop setup, after all), but it's still 250M download (with xz
> compression) and 1.5G uncompressed.
>
> I can reliably reproduce the issue with the following commands:
> qemu-system-x86_64 -kernel vm-kernel-2.6.36.img -initrd vm-initrd.lzma\
> -append 'ro root=/dev/ram0 lvroot=LABEL=root lvetc=LABEL=etc console=ttyS0'\
> -drive file=vm-disk.qcow2,if=virtio -nographic -monitor null -serial pty &
> screen /dev/pty/X
> (to attach to pty device, echoed by qemu)
>
> You can omit -nographic, -serial and -monitor qemu options and
> "console=" cmdline to run qemu with sdl window.
>
> If it doesn't crash and gets to getty login prompt, try killing vm (so
> filesystems won't be cleanly unmounted, although it doesn't seem to be
> the cause for me) and restarting it with the same command.
>
>
> Kernel configuration (I use this config for both vm-guest kernel and
> for the real hardware, which hosts vm):
> http://fraggod.net/share/systemd_btrfs_oops/kconfig.txt
>
>
> I'll probably also be able to attach sequence of actions executed by
> systemd (leading to this crash) a bit later.
> If there's any additional information I can provide or any test I
> should run on the setup, I'd be happy to do so.
>
>
> Thank you for your attention.
>
>
prev parent reply other threads:[~2010-11-15 1:01 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-11-13 15:15 Reproducible kernel (2.6.36) oops with several simultaneus btrfs mounts Mike Kazantsev
2010-11-15 1:01 ` Ian Kent [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1289782905.3248.9.camel@localhost \
--to=raven@themaw.net \
--cc=barbieri@profusion.mobi \
--cc=linux-btrfs@vger.kernel.org \
--cc=mk.fraggod@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).