From mboxrd@z Thu Jan 1 00:00:00 1970 From: Matt McCutchen Subject: PROBLEM: "BUG" and hang with reiserfs, ACLs Date: Tue, 08 Jul 2008 15:17:36 -0400 Message-ID: <1215544656.2307.23.camel@localhost> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="=-l1e+dX9cMl0LYEO+8cJl" Return-path: Sender: reiserfs-devel-owner@vger.kernel.org List-ID: To: reiserfs-devel@vger.kernel.org --=-l1e+dX9cMl0LYEO+8cJl Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable When I run the rsync 3.0.3 test suite on a newly created reiserfs filesystem mounted with ACLs enabled, I get a kernel BUG (stack trace below) and all processes accessing the filesystem hang. This happens often but not on every single run of the test suite. When I reboot, reiserfsck reports no corruption on the filesystem after replaying the journal. The filesystem is in an LVM logical volume where the physical volume is LUKS-encrypted, in case that matters. The problem does not occur if ACLs are disabled, in which case the test suite skips some ACL-related tests. I don't know what the rsync test suite is doing to trigger the problem. Nothing else that I do on my computer does. I tried running the test suite under strace but could not reproduce the problem; possibly it is timing-dependent. =EF=BB=BFI can reproduce the problem with the recent vanilla kernel v2.6.26-rc9-29-gb2798bf but not with 2.6.24. Bisection points to 3227e14c as the commit that introduced the problem. I used Fedora-derived kernel configurations for all tests. To reproduce the problem, run the following on a newly created reiserfs filesystem with ACLs enabled: git clone git://git.samba.org/rsync.git cd rsync && ./configure && make test The stack trace: BUG: unable to handle kernel paging request at f3f40378 IP: [] __list_add+0x6/0x4a *pde =3D 347f2163 *pte =3D 33f40160=20 Oops: 0000 [#1] SMP DEBUG_PAGEALLOC Modules linked in: ipv6 cpufreq_ondemand acpi_cpufreq nls_utf8 fuse ext2 mb= cache loop arc4 ecb iwl3945 snd_hda_intel mac80211 snd_seq_dummy sr_mod snd= _seq_oss dcdbas i2c_i801 joydev snd_seq_midi_event cdrom i2c_core iTCO_wdt = tg3 snd_seq iTCO_vendor_support cfg80211 video snd_seq_device snd_pcm_oss o= utput snd_mixer_oss snd_pcm snd_timer snd_page_alloc snd_hwdep snd soundcor= e wmi battery ac usb_storage pata_acpi ata_generic ata_piix xts gf128mul ae= s_i586 aes_generic dm_crypt crypto_blkcipher dm_snapshot dm_zero dm_mirror = dm_log dm_mod reiserfs uhci_hcd ohci_hcd ehci_hcd Pid: 15276, comm: sh Not tainted (2.6.26-rc9 #1) EIP: 0060:[] EFLAGS: 00010286 CPU: 0 EIP is at __list_add+0x6/0x4a EAX: f3fc0374 EBX: f3fc0374 ECX: f3f40374 EDX: f8c33180 ESI: f4bebc10 EDI: 00008fd8 EBP: f4beba80 ESP: f4beba7c DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068 Process sh (pid: 15276, ti=3Df4beb000 task=3Df7a017e0 task.ti=3Df4beb000) Stack: 00000006 f4beba88 c05091ff f4bebb40 f88a26d8 00008000 00001000 00000= 000=20 00000000 00000000 00000000 f8c23044 f8c23044 00000286 00000001 f4beb= c84=20 f6fb94f0 f3fc0403 f4bebb44 f4bebb44 f88b8935 00000001 00000011 f53e9= 000=20 Call Trace: [] ? list_add+0xa/0xf [] ? reiserfs_allocate_blocknrs+0x998/0xa90 [reiserfs] [] ? search_for_position_by_key+0x14f/0x2bf [reiserfs] [] ? reiserfs_get_block+0x480/0x1104 [reiserfs] [] ? _spin_unlock+0x1d/0x20 [] ? __block_prepare_write+0x147/0x344 [] ? block_write_begin+0x72/0xca [] ? reiserfs_get_block+0x0/0x1104 [reiserfs] [] ? reiserfs_write_begin+0x115/0x188 [reiserfs] [] ? reiserfs_get_block+0x0/0x1104 [reiserfs] [] ? generic_file_buffered_write+0xd9/0x50b [] ? mnt_drop_write+0x1e/0xc2 [] ? __generic_file_aio_write_nolock+0x3e6/0x41e [] ? mutex_lock_nested+0x269/0x271 [] ? generic_file_aio_write+0x69/0xbd [] ? do_sync_write+0xab/0xe9 [] ? autoremove_wake_function+0x0/0x33 [] ? reiserfs_file_write+0x6e/0x77 [reiserfs] [] ? reiserfs_file_write+0x0/0x77 [reiserfs] [] ? vfs_write+0x8a/0x12e [] ? sys_write+0x3b/0x60 [] ? sysenter_past_esp+0x78/0xd1 =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D Code: 72 c0 e8 ef 2f 16 00 0f 0b 83 c4 0c eb fe 89 58 04 89 03 c7 42 04 00 = 02 20 00 c7 02 00 01 10 00 8b 5d fc c9 c3 55 89 e5 53 89 c3 <8b> 41 04 39 d= 0 74 14 51 50 52 68 04 c0 72 c0 e8 b7 2f 16 00 0f=20 EIP: [] __list_add+0x6/0x4a SS:ESP 0068:f4beba7c ---[ end trace eb0099a3ced003a3 ]--- =EF=BB=BFI have posted all the files mentioned in linux/REPORTING-BUGS, including full dmesg output, at: http://mattmccutchen.net/private/reiserfs-bug/ =EF=BB=BFI previously opened a Fedora bug for this problem at: https://bugzilla.redhat.com/show_bug.cgi?id=3D453699 Keywords: reiserfs, ACLs Regards, Matt --=-l1e+dX9cMl0LYEO+8cJl Content-Type: application/pgp-signature; name=signature.asc Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (GNU/Linux) iEYEABECAAYFAkhzvU0ACgkQC+xSYN/RlftCBgCguGHJllkc4I0pykAoWG1bgQG6 /ncAn3lGFAdANUtwR2ChYGTiChLfm+oi =Ak/Y -----END PGP SIGNATURE----- --=-l1e+dX9cMl0LYEO+8cJl--