kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1

public inbox for linux-btrfs@vger.kernel.org
 help / color / mirror / Atom feed

* kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1
@ 2009-10-07 10:34 Stefan Hajnoczi
  2009-10-07 15:25 ` Chris Mason
  0 siblings, 1 reply; 5+ messages in thread
From: Stefan Hajnoczi @ 2009-10-07 10:34 UTC (permalink / raw)
  To: linux-btrfs

I am testing btrfs on Linux 2.6.31.1 and am repeatedly hitting the
following issue in the btrfs-endio-wri thread:

------------[ cut here ]------------
kernel BUG at fs/btrfs/file.c:528!
invalid opcode: 0000 [#1] PREEMPT SMP
last sysfs file: /sys/block/dm-0/range
CPU 0
Modules linked in: nls_utf8 hfsplus nfs nfsd lockd nfs_acl auth_rpcgss
exportfs deflate ctr cast5 crypto_null ccm serpent blowfish twofish
twofish_common ecb xcbc cbc md5 sha256_generic sha512_generic
des_generic aes_x86_64 aes_generic ipcomp6 ipcomp xfrm_ipcomp ah6 ah4
esp6 esp4 xfrm4_tunnel tunnel4 xfrm4_mode_tunnel xfrm4_mode_transport
xfrm6_mode_transport xfrm6_mode_beet xfrm6_mode_tunnel xfrm6_tunnel
tunnel6 af_key autofs4 hidp l2cap bluetooth rfkill sunrpc tun bridge
stp ipv6 cpufreq_ondemand acpi_cpufreq btrfs zlib_deflate dm_multipath
video output sbs sbshc battery acpi_memhotplug ac lp kvm_intel kvm
snd_hda_codec_analog sg dcdbas snd_hda_intel snd_hda_codec sr_mod
cdrom snd_seq_dummy serio_raw snd_seq_oss snd_seq_midi_event snd_seq
snd_seq_device snd_pcm_oss snd_mixer_oss rtc_cmos rtc_core snd_pcm
parport_pc parport rtc_lib floppy snd_timer button snd soundcore
i2c_i801 snd_page_alloc e1000e i2c_core shpchp dm_snapshot dm_zero
dm_mirror dm_region_hash dm_log dm_mod ahci libata sd_mod scsi_mod
ext3 jbd uhci_hcd ohci_hcd ehci_hcd [last unloaded: microcode]
Pid: 10093, comm: btrfs-endio-wri Not tainted 2.6.31.1 #1 OptiPlex 755
RIP: 0010:[<ffffffffa03552d5>]  [<ffffffffa03552d5>]
btrfs_drop_extents+0x6d5/0x907 [btrfs]
RSP: 0018:ffff88006cf6dc10  EFLAGS: 00010282
RAX: 00000000ffffffef RBX: 0000000223338fff RCX: 00000000000006d5
RDX: 0000000000000001 RSI: 0000000000000000 RDI: 0000000100000001
RBP: 0000000000000000 R08: ffff880069c2ab50 R09: ffff880069c2ab50
R10: 0000000000000004 R11: 0000000000000021 R12: ffff88003d5d6490
R13: ffff880069c2ab50 R14: 000000013231c000 R15: ffff880020e6bc18
FS:  0000000000000000(0000) GS:ffff880001ba6000(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 00000000009f6c60 CR3: 000000006b977000 CR4: 00000000000026f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
Process btrfs-endio-wri (pid: 10093, threadinfo ffff88006cf6c000, task
ffff88006cd267b0)
Stack:
 0000000000001d8d 00000001264ef000 fffffffffffffff6 0000000132320000
<0> 0000000132320000 000000013231c000 ffff880020e6bd98 ffff88006cdae000
<0> ffff8800454ff8e0 0101ffff00000003 0000000223339000 0000000223339000
Call Trace:
 [<ffffffffa034c254>] ? insert_reserved_file_extent+0xa4/0x24c [btrfs]
 [<ffffffffa03620a4>] ? lock_extent+0x38/0x80 [btrfs]
 [<ffffffffa03505bf>] ? btrfs_finish_ordered_io+0x184/0x23d [btrfs]
 [<ffffffffa036422c>] ? end_bio_extent_writepage+0x9b/0x186 [btrfs]
 [<ffffffffa036b6bb>] ? worker_loop+0x5f/0x1ec [btrfs]
 [<ffffffffa036b65c>] ? worker_loop+0x0/0x1ec [btrfs]
 [<ffffffff81056d4a>] ? kthread+0x8b/0x95
 [<ffffffff8100cb5a>] ? child_rip+0xa/0x20
 [<ffffffff81056cbf>] ? kthread+0x0/0x95
 [<ffffffff8100cb50>] ? child_rip+0x0/0x20
Code: 00 00 00 35 00 00 00 41 b9 01 00 00 00 4c 8d 84 24 fc 00 00 00
4c 89 ea 48 8b 74 24 38 48 8b 7c 24 40 e8 a5 4c fe ff 85 c0 74 04 <0f>
0b eb fe 49 8b 6d 00 49 63 75 40 48 89 ef 48 6b f6 19 48 83
RIP  [<ffffffffa03552d5>] btrfs_drop_extents+0x6d5/0x907 [btrfs]
 RSP <ffff88006cf6dc10>
---[ end trace 5dd7bea4ce87888f ]---

Specifics of my setup:
1. The filesystem is used exclusively over NFS.
2. The filesystem contains only a few files, most are large (~9 GB).
3. I use bcp(1) to make copy-on-write copies of the large files.
4. The filesystem is 40 GB and currently 15 GB (37%) are in use.
5. The filesystem is on an LVM volume.

Any idea what is going on here and whether this is already fixed?

Thanks,
Stefan

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1
  2009-10-07 10:34 kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1 Stefan Hajnoczi
@ 2009-10-07 15:25 ` Chris Mason
  2009-10-07 15:35   ` Stefan Hajnoczi
  0 siblings, 1 reply; 5+ messages in thread
From: Chris Mason @ 2009-10-07 15:25 UTC (permalink / raw)
  To: Stefan Hajnoczi; +Cc: linux-btrfs

On Wed, Oct 07, 2009 at 11:34:54AM +0100, Stefan Hajnoczi wrote:
> I am testing btrfs on Linux 2.6.31.1 and am repeatedly hitting the
> following issue in the btrfs-endio-wri thread:
> 
> ------------[ cut here ]------------
> kernel BUG at fs/btrfs/file.c:528!
> invalid opcode: 0000 [#1] PREEMPT SMP
> last sysfs file: /sys/block/dm-0/range
> Pid: 10093, comm: btrfs-endio-wri Not tainted 2.6.31.1 #1 OptiPlex 755
> RIP: 0010:[<ffffffffa03552d5>]  [<ffffffffa03552d5>]
> btrfs_drop_extents+0x6d5/0x907 [btrfs]
> RSP: 0018:ffff88006cf6dc10  EFLAGS: 00010282
> RAX: 00000000ffffffef RBX: 0000000223338fff RCX: 00000000000006d5
> RDX: 0000000000000001 RSI: 0000000000000000 RDI: 0000000100000001
> RBP: 0000000000000000 R08: ffff880069c2ab50 R09: ffff880069c2ab50
> R10: 0000000000000004 R11: 0000000000000021 R12: ffff88003d5d6490
> R13: ffff880069c2ab50 R14: 000000013231c000 R15: ffff880020e6bc18
> FS:  0000000000000000(0000) GS:ffff880001ba6000(0000) knlGS:0000000000000000
> CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 00000000009f6c60 CR3: 000000006b977000 CR4: 00000000000026f0
> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
> Process btrfs-endio-wri (pid: 10093, threadinfo ffff88006cf6c000, task
> ffff88006cd267b0)
> Stack:
>  0000000000001d8d 00000001264ef000 fffffffffffffff6 0000000132320000
> <0> 0000000132320000 000000013231c000 ffff880020e6bd98 ffff88006cdae000
> <0> ffff8800454ff8e0 0101ffff00000003 0000000223339000 0000000223339000
> Call Trace:
>  [<ffffffffa034c254>] ? insert_reserved_file_extent+0xa4/0x24c [btrfs]
>  [<ffffffffa03620a4>] ? lock_extent+0x38/0x80 [btrfs]
>  [<ffffffffa03505bf>] ? btrfs_finish_ordered_io+0x184/0x23d [btrfs]
>  [<ffffffffa036422c>] ? end_bio_extent_writepage+0x9b/0x186 [btrfs]
>  [<ffffffffa036b6bb>] ? worker_loop+0x5f/0x1ec [btrfs]
>  [<ffffffffa036b65c>] ? worker_loop+0x0/0x1ec [btrfs]
>  [<ffffffff81056d4a>] ? kthread+0x8b/0x95
>  [<ffffffff8100cb5a>] ? child_rip+0xa/0x20
>  [<ffffffff81056cbf>] ? kthread+0x0/0x95
>  [<ffffffff8100cb50>] ? child_rip+0x0/0x20
> Code: 00 00 00 35 00 00 00 41 b9 01 00 00 00 4c 8d 84 24 fc 00 00 00
> 4c 89 ea 48 8b 74 24 38 48 8b 7c 24 40 e8 a5 4c fe ff 85 c0 74 04 <0f>
> 0b eb fe 49 8b 6d 00 49 63 75 40 48 89 ef 48 6b f6 19 48 83
> RIP  [<ffffffffa03552d5>] btrfs_drop_extents+0x6d5/0x907 [btrfs]
>  RSP <ffff88006cf6dc10>
> ---[ end trace 5dd7bea4ce87888f ]---
> 
> Specifics of my setup:
> 1. The filesystem is used exclusively over NFS.
> 2. The filesystem contains only a few files, most are large (~9 GB).
> 3. I use bcp(1) to make copy-on-write copies of the large files.
> 4. The filesystem is 40 GB and currently 15 GB (37%) are in use.
> 5. The filesystem is on an LVM volume.
> 
> Any idea what is going on here and whether this is already fixed?

This oops means that we're trying to insert an extent that already
exists.  I think it is related to the bug in the file clone ioctl that
Sage recently fixed.  The fix is in the master branch of the
btrfs-unstable tree.

So, I'd say step one is to make a backup of this data.

Are you able to figure out which of the files is being written at the
time of the oops?  If not we can easily add a message to help nail it
down.

Either way, I'd copy the file that is triggering the problem to a new
file and delete the old one.

-chris


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1
  2009-10-07 15:25 ` Chris Mason
@ 2009-10-07 15:35   ` Stefan Hajnoczi
  2009-10-07 15:40     ` Chris Mason
  0 siblings, 1 reply; 5+ messages in thread
From: Stefan Hajnoczi @ 2009-10-07 15:35 UTC (permalink / raw)
  To: Chris Mason, Stefan Hajnoczi, linux-btrfs

On Wed, Oct 7, 2009 at 4:25 PM, Chris Mason <chris.mason@oracle.com> wr=
ote:
> This oops means that we're trying to insert an extent that already
> exists. =A0I think it is related to the bug in the file clone ioctl t=
hat
> Sage recently fixed. =A0The fix is in the master branch of the
> btrfs-unstable tree.

Thanks for explaining this.  How can I track when this hits vanilla lin=
ux-2.6?

> So, I'd say step one is to make a backup of this data.
>
> Are you able to figure out which of the files is being written at the
> time of the oops? =A0If not we can easily add a message to help nail =
it
> down.
>
> Either way, I'd copy the file that is triggering the problem to a new
> file and delete the old one.

The data wasn't critical, for now I have switched back to ext3.  I
really like the COW copy feature so I intend to get back on btrfs when
I have time to rebuild a kernel with the fix in it.

Stefan
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" =
in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1
  2009-10-07 15:35   ` Stefan Hajnoczi
@ 2009-10-07 15:40     ` Chris Mason
  2009-10-07 16:00       ` Stefan Hajnoczi
  0 siblings, 1 reply; 5+ messages in thread
From: Chris Mason @ 2009-10-07 15:40 UTC (permalink / raw)
  To: Stefan Hajnoczi; +Cc: linux-btrfs

On Wed, Oct 07, 2009 at 04:35:37PM +0100, Stefan Hajnoczi wrote:
> On Wed, Oct 7, 2009 at 4:25 PM, Chris Mason <chris.mason@oracle.com> =
wrote:
> > This oops means that we're trying to insert an extent that already
> > exists. =A0I think it is related to the bug in the file clone ioctl=
 that
> > Sage recently fixed. =A0The fix is in the master branch of the
> > btrfs-unstable tree.
>=20
> Thanks for explaining this.  How can I track when this hits vanilla l=
inux-2.6?

It is already included in 2.6.32-rc1.  The master branch of the btrfs
unstable tree is currently based on 2.6.31, so you can just pull it int=
o
a 2.6.31 kernel and you'll get all of the current fixes.

>=20
> > So, I'd say step one is to make a backup of this data.
> >
> > Are you able to figure out which of the files is being written at t=
he
> > time of the oops? =A0If not we can easily add a message to help nai=
l it
> > down.
> >
> > Either way, I'd copy the file that is triggering the problem to a n=
ew
> > file and delete the old one.
>=20
> The data wasn't critical, for now I have switched back to ext3.  I
> really like the COW copy feature so I intend to get back on btrfs whe=
n
> I have time to rebuild a kernel with the fix in it.

Ok great.  Please let us know if you're able to trigger it again.

-chris

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" =
in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1
  2009-10-07 15:40     ` Chris Mason
@ 2009-10-07 16:00       ` Stefan Hajnoczi
  0 siblings, 0 replies; 5+ messages in thread
From: Stefan Hajnoczi @ 2009-10-07 16:00 UTC (permalink / raw)
  To: Chris Mason, Stefan Hajnoczi, linux-btrfs

On Wed, Oct 7, 2009 at 4:40 PM, Chris Mason <chris.mason@oracle.com> wr=
ote:
> Ok great. =A0Please let us know if you're able to trigger it again.

Thanks for your help.  I'll build a new kernel and let you know if it
happens again.

Stefan
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" =
in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2009-10-07 16:00 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-10-07 10:34 kernel BUG at fs/btrfs/file.c:528 on Linux 2.6.31.1 Stefan Hajnoczi
2009-10-07 15:25 ` Chris Mason
2009-10-07 15:35   ` Stefan Hajnoczi
2009-10-07 15:40     ` Chris Mason
2009-10-07 16:00       ` Stefan Hajnoczi

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox