All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Norbert Scheibner" <scno@gmx.net>
To: linux-btrfs@vger.kernel.org
Subject: panic after remove of device during rebalance
Date: Tue, 02 Feb 2010 12:20:58 +0100	[thread overview]
Message-ID: <20100202112058.193030@gmx.net> (raw)

Hi,
During some btrfs-tests for my own on a btrfs-volume started with 5 devices of different size, some snapshots and subvolumes and a few large files, I removed one device after another (always rebalancing after remove) til I ended up with 3.

I use the latest btrfs-tools snapshot and the 2.6.32 kernel with debian patches for sid.

btrfs-show then said:
Label: none  uuid: ca5e7037-a65c-45d8-b954-f64ab0799964
        Total devices 3 FS bytes used 6.01GB
        devid    5 size 623.25GB used 0.00 path /dev/md15
        devid    3 size 93.13GB used 9.01GB path /dev/md13
        devid    1 size 9.31GB used 9.01GB path /dev/md11

Then I removed number 3.

./btrfs-vol -r /dev/md13 /home/samba/temp/btrfs-tests/
ioctl returns 0
./btrfs-show
Label: none  uuid: ca5e7037-a65c-45d8-b954-f64ab0799964
        Total devices 3 FS bytes used 6.01GB
        devid    3 size 93.13GB used 9.01GB path /dev/sdc4
        devid    5 size 623.25GB used 8.31GB path /dev/md15
        devid    1 size 9.31GB used 8.31GB path /dev/md11

(/dev/sdc4 is the underlying device under /dev/md13, which I removed, I don't know why it still shows up as /dev/sdc4, but that happened before with the other devices I removed, so I didn't bother)

Now I startet to rebalance.

After 30 minutes or so ps ax still said:
17995 pts/3    S+     0:16 ./btrfs-vol -b /home/samba/temp/btrfs-tests/

After an hour ps ax said
17995 pts/3    R+    68:31 ./btrfs-vol -b /home/samba/temp/btrfs-tests/
and btrfs-vol consumes 100% of 1 CPU and can not be killed.

And thats what ./btrfsck /dev/md11 produced
fs tree 256 refs 1 not found
        unresolved ref root 257 dir 256 index 8 namelen 8 name subvol00 error 600
found 6449324032 bytes used err is 1
total csum bytes: 6291456
total tree bytes: 6873088
total fs tree bytes: 36864
btree space waste bytes: 159776
file data blocks allocated: 10737418240
 referenced 10737418240

subvol00 is a subvolume I created and deleted before. The error 600 was there before I started removing devices.

Thats what I found in the logs:

Feb  2 10:40:27 server kernel: [250931.124172] ------------[ cut here ]------------
Feb  2 10:40:27 server kernel: [250931.124239] kernel BUG at fs/btrfs/inode.c:788!
Feb  2 10:40:27 server kernel: [250931.124304] invalid opcode: 0000 [#1] SMP
Feb  2 10:40:27 server kernel: [250931.124371] last sysfs file: /sys/class/hwmon/hwmon0/temp1_input
Feb  2 10:40:27 server kernel: [250931.124440] Modules linked in: btrfs zlib_deflate crc32c libcrc32c autofs4 cpufreq_powersave cpufreq_ondemand cpufreq_stats ipt_REJECT ipt_MASQUERADE xt_TCPMSS xt_mac ipt_REDIRECT xt_DSCP xt_tcpudp xt_state xt_length ipt_LOG xt_limit iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 iptable_mangle iptable_filter ip_tables x_tables nf_conntrack_tftp nf_conntrack_sip nf_conntrack_proto_udplite nf_conntrack_pptp nf_conntrack_proto_gre nf_conntrack_netlink nfnetlink nf_conntrack_irc nf_conntrack_h323 nf_conntrack_ftp nf_conntrack ppp_async crc_ccitt ppp_generic slhc ipv6 nls_utf8 isofs loop powernow_k8 freq_table cpufreq_userspace video backlight ftdi_sio pl2303 asus_atk0110 output wmi usbserial snd_pcm snd_timer snd soundcore snd_page_alloc processor edac_
 core button i2c_nforce2 pcspkr i2c_core evdev ext3 jbd mbcache dm_mirror dm_region_hash dm_log dm_snapshot dm_mod raid1 md_mod ata_generic pata_amd sd_mod amd74xx ahci libata forcedeth firewire_ohci firewire_core crc_itu_t ide_pci_generic ohci_hcd sky2 scsi_mod ehci_hcd ide_core thermal fan thermal_sys hwmon [last unloaded: scsi_wait_scan]
Feb  2 10:40:27 server kernel: [250931.125004]
Feb  2 10:40:27 server kernel: [250931.125004] Pid: 17936, comm: flush-btrfs-6 Not tainted (2.6.32 #1) System Product Name
Feb  2 10:40:27 server kernel: [250931.125004] EIP: 0060:[<f8a9c948>] EFLAGS: 00010286 CPU: 2
Feb  2 10:40:27 server kernel: [250931.125004] EIP is at cow_file_range+0x5f8/0x610 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004] EAX: ffffffe4 EBX: ffffffff ECX: 00008989 EDX: 00000001
Feb  2 10:40:27 server kernel: [250931.125004] ESI: 0000000e EDI: 00001000 EBP: 00000000 ESP: d3c0dc18
Feb  2 10:40:27 server kernel: [250931.125004]  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
Feb  2 10:40:27 server kernel: [250931.125004] Process flush-btrfs-6 (pid: 17936, ti=d3c0c000 task=c431e070 task.ti=d3c0c000)
Feb  2 10:40:27 server kernel: [250931.125004] Stack:
Feb  2 10:40:27 server kernel: [250931.125004]  02770000 00000000 00001000 00000000 00000000 00000000 85400000 0000000e
Feb  2 10:40:27 server kernel: [250931.125004] <0> ffffffff ffffffff d3c0dc8b 00000001 00000000 c8e6dab0 c243dea0 c8e6dbcc
Feb  2 10:40:27 server kernel: [250931.125004] <0> 00001000 c8e6dab4 ce603800 d8593db4 02770000 00000000 00001000 00000000
Feb  2 10:40:27 server kernel: [250931.125004] Call Trace:
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8a9d696>] ? run_delalloc_range+0x3d6/0x440 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8abbcb8>] ? __extent_writepage+0x938/0xae0 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8ab89f0>] ? end_bio_extent_writepage+0x0/0x200 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8ab7ad0>] ? extent_write_cache_pages+0x170/0x270 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8ab7c28>] ? extent_writepages+0x58/0x80 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8abb380>] ? __extent_writepage+0x0/0xae0 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8ab5510>] ? flush_write_bio+0x0/0x10 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8a9aa00>] ? btrfs_get_extent+0x0/0xbc0 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8a9a88c>] ? btrfs_writepages+0x1c/0x30 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<f8a9a870>] ? btrfs_writepages+0x0/0x30 [btrfs]
Feb  2 10:40:27 server kernel: [250931.125004]  [<c109bdfa>] ? do_writepages+0x1a/0x40
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10e20de>] ? writeback_single_inode+0xbe/0x310
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10e2e40>] ? writeback_inodes_wb+0x380/0x530
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10e30f8>] ? wb_writeback+0x108/0x1c0
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10e330f>] ? wb_do_writeback+0x9f/0x180
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10e343b>] ? bdi_writeback_task+0x4b/0x80
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10aa877>] ? bdi_start_fn+0x67/0xc0
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10aa810>] ? bdi_start_fn+0x0/0xc0
Feb  2 10:40:27 server kernel: [250931.125004]  [<c10504e4>] ? kthread+0x74/0x80
Feb  2 10:40:27 server kernel: [250931.125004]  [<c1050470>] ? kthread+0x0/0x80
Feb  2 10:40:27 server kernel: [250931.125004]  [<c100381f>] ? kernel_thread_helper+0x7/0x18
Feb  2 10:40:27 server kernel: [250931.125004] Code: 00 81 c3 00 10 00 00 83 d6 00 0f ac f3 0c 01 1a 8b 84 24 a8 00 00 00 c7 00 01 00 00 00 e9 46 fe ff ff 0f 0b eb fe 90 8d 74 26 00 <0f> 0b eb fe 8d 74 26 00 31 db 31 f6 e9 bd fb ff ff 0f 0b eb fe
Feb  2 10:40:27 server kernel: [250931.125004] EIP: [<f8a9c948>] cow_file_range+0x5f8/0x610 [btrfs] SS:ESP 0068:d3c0dc18
Feb  2 10:40:27 server kernel: [250931.133712] ---[ end trace 2f81334be95a397c ]---

             reply	other threads:[~2010-02-02 11:20 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-02-02 11:20 Norbert Scheibner [this message]
2010-02-02 14:47 ` panic after remove of device during rebalance Norbert Scheibner
2010-02-02 19:06   ` Goffredo Baroncelli
2010-02-03 14:33   ` Norbert Scheibner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20100202112058.193030@gmx.net \
    --to=scno@gmx.net \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.