From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: with ECARTIS (v1.0.0; list xfs); Mon, 06 Oct 2008 15:24:26 -0700 (PDT)
Received: from cuda.sgi.com (cuda2.sgi.com [192.48.168.29])
    by oss.sgi.com (8.12.11.20060308/8.12.11/SuSE Linux 0.7) with ESMTP id m96MONiI028381
    for ; Mon, 6 Oct 2008 15:24:24 -0700
Received: from mail1.g-b.net (localhost [127.0.0.1])
    by cuda.sgi.com (Spam Firewall) with ESMTP id 03CF74BECFE
    for ; Mon, 6 Oct 2008 15:26:01 -0700 (PDT)
Received: from mail1.g-b.net (mail1.g-b.net [64.191.66.247])
    by cuda.sgi.com with ESMTP id KcyzlRuDYNkLqpLU
    for ; Mon, 06 Oct 2008 15:26:01 -0700 (PDT)
Message-ID: <48EA9076.5030800@g-b.net>
Date: Mon, 06 Oct 2008 23:25:58 +0100
From: Albert Graham
Reply-To: albert.graham@g-b.net
MIME-Version: 1.0
Subject: Re: XFS Regression Issue in kernel 2.6.26.3
References: <48E51BF5.3080100@g-b.net> <20081002235211.GI30001@disturbed>
In-Reply-To: <20081002235211.GI30001@disturbed>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Sender: xfs-bounce@oss.sgi.com
Errors-to: xfs-bounce@oss.sgi.com
List-Id: xfs
To: Albert Graham , xfs@oss.sgi.com

Dave Chinner wrote:
> On Thu, Oct 02, 2008 at 08:07:33PM +0100, Albert Graham wrote:
>> Hi Guys,
>>
>> I use Fedora 8 as my MythTV backend server which uses XFS. Since
>> upgrading from kernel-2.6.25 to kernel-2.6.26 I've been getting the
>> following errors (see below).
>
> The freespace btrees are getting out of sync for some reason.
>
> That is, when we go to allocate an extent, we have to update two
> free space btrees. This shutdown:
>
>> XFS internal error XFS_WANT_CORRUPTED_RETURN at line 280 of file fs/xfs/xfs_alloc.c. Caller 0xf88e0018
>
> Indicates the extent being allocated was not found in one of the
> two trees.
>
> This corruption could have happened a long time ago in the past, and
> it may simply be coincidental that you've tripped over this at
> roughly the same time you upgraded the kernel.
>
> Can you run xfs_check on the filesystem to determine if a freespace
> tree is corrupted (post the output if it is), then run xfs_repair
> to rebuild them?
>
> Cheers,
>
> Dave.

Hi Dave,

Thanks for responding, I've only just spotted your mail.

You are correct on all assumptions.

1. The disk was almost full - 8G free of 1024G
2. I do think this happened some time ago and the kernel update was
   coincidental - previous kernels gave the same response.
3. xfs_repair did fix it (the output was lost as I repaired it before
   reading your response - but see below)

In my case I have a 1TB file/dd image using XFS which is sitting on a
7TB XFS formatted disk.

The 1TB file was where the error was occurring. This file was attached
via losetup (/dev/loop7); however, when I tried to mount it I got a
segmentation fault and the following message:

/usr/sbin/xfs_check: line 28: 3230 Floating point exception xfs_db$DBOPTS -i -p xfs_check -c "check$OPTS" $1

I did search Google for it (and Google saved it for me, which is how I
recalled it).

I also got a kernel oops/trace which was logged via syslog, so that is
posted below - hope this helps.

I had to use "xfs_repair -L /dev/loop7" to fix it as I was unable to
replay the log by mounting. All seems fine now :)

I'm not able to show the output of the repair, but it was only about
20 lines long, mostly stating:

ag1
ag2
some other stuff..
ag3
etc..

Hope this helps - and thanks for your response.

Albert.

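For anyone following along, the recovery sequence described above can be sketched roughly as the script below. This is only an illustration under stated assumptions: the image path is hypothetical (it is not given in this mail), the `xfs_repair -n` pass is an added precaution that was not part of what was actually run, and the script only prints the commands unless DRY_RUN=0 is set. Note that `xfs_repair -L` zeroes the log, so it discards any metadata updates that had not yet been replayed; it is a last resort for when mounting to replay the log is not possible.

```shell
#!/bin/sh
# Sketch of the recovery steps described in the mail above.
# Dry run by default: commands are printed, not executed.
IMAGE=${IMAGE:-/path/to/image.dd}   # hypothetical path, not from the mail
LOOPDEV=${LOOPDEV:-/dev/loop7}      # loop device named in the mail
DRY_RUN=${DRY_RUN:-1}

run() {
    # Print the command in dry-run mode, otherwise execute it.
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

recover_xfs_image() {
    # Attach the image file to the loop device.
    run losetup "$LOOPDEV" "$IMAGE"
    # Consistency check, as requested earlier in the thread.
    run xfs_check "$LOOPDEV"
    # Assumption: a no-modify pass first, to see what would be changed.
    run xfs_repair -n "$LOOPDEV"
    # Last resort when the log cannot be replayed by mounting:
    # zero the log and repair.
    run xfs_repair -L "$LOOPDEV"
}

recover_xfs_image
```

Run as-is it just lists the four commands; setting DRY_RUN=0 (as root, with the filesystem unmounted) would execute them.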
syslog output
==============
Oct 6 08:09:43 agraham kernel: BUG: unable to handle kernel NULL pointer dereference at 0000005c
Oct 6 08:09:43 agraham kernel: IP: [] :xfs:xfs_alloc_increment+0xd9/0x155
Oct 6 08:09:43 agraham kernel: *pdpt = 0000000036451001 *pde = 000000023d1fc067
Oct 6 08:09:43 agraham kernel: Oops: 0000 [#1] SMP
Oct 6 08:09:43 agraham kernel: Modules linked in: sha256_generic aes_i586 aes_generic cbc dm_crypt crypto_blkcipher aoe nvidia(P)(U) tun saa7134_alsa loop nfsd lockd nfs_acl auth_rpcgss exportfs coretemp w83627ehf w83791d hwmon_vid hwmon sunrpc ipv6 bridge ext3 jbd mbcache dm_mirror dm_log dm_multipath dm_mod snd_emu10k1_synth snd_emux_synth sr_mod cdrom snd_seq_virmidi snd_seq_midi_emul ata_generic tuner_simple tuner_types tuner snd_emu10k1 snd_ac97_codec ac97_bus snd_usb_audio snd_usb_lib snd_rawmidi pata_acpi snd_hda_intel snd_seq_dummy snd_seq_oss snd_seq_midi_event saa7134 snd_seq videodev snd_pcm_oss v4l1_compat snd_mixer_oss compat_ioctl32 v4l2_common videobuf_dma_sg videobuf_core snd_pcm ir_kbd_i2c snd_seq_device ir_common snd_timer snd_util_mem tveeprom snd_page_alloc pata_jmicron snd_hwdep firewire_ohci firewire_core pcspkr snd sky2 crc_itu_t iTCO_wdt i2c_i801 iTCO_vendor_support soundcore sg i2c_core ata_piix sata_promise ahci libata 3w_9xxx sd_mod scsi_mod xfs uhci_hcd ohci_hcd ehci_hcd [last unloade
Oct 6 08:09:43 agraham kernel: d: nvidia]
Oct 6 08:09:43 agraham kernel:
Oct 6 08:09:43 agraham kernel: Pid: 15036, comm: mount Tainted: P (2.6.26.3-14.fc8PAE #1)
Oct 6 08:09:43 agraham kernel: EIP: 0060:[] EFLAGS: 00010246 CPU: 3
Oct 6 08:09:43 agraham kernel: EIP is at xfs_alloc_increment+0xd9/0x155 [xfs]
Oct 6 08:09:43 agraham kernel: EAX: 00000000 EBX: f7d3af58 ECX: ee8ce000 EDX: 00000003
Oct 6 08:09:43 agraham kernel: ESI: f7d3af30 EDI: 00000002 EBP: f6d69b74 ESP: f6d69b58
Oct 6 08:09:43 agraham kernel: DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
Oct 6 08:09:43 agraham kernel: Process mount (pid: 15036, ti=f6d69000 task=f5063200 task.ti=f6d69000)
Oct 6 08:09:43 agraham kernel: Stack: f6d69bf8 00000000 f7d3af95 00000000 000000fe 000000fd 00000000 f6d69c08
Oct 6 08:09:43 agraham kernel: f88e1fb8 2c0018f8 ee8ce800 00000008 00000000 f6d69ba4 f6d69c14 00000000
Oct 6 08:09:43 agraham kernel: f7d3a000 00000000 ee8d3200 ee8ce000 f6d69bc0 f51d8a80 0001f9c1 ee8ce000
Oct 6 08:09:43 agraham kernel: Call Trace:
Oct 6 08:09:43 agraham kernel: [] ? xfs_alloc_delrec+0x3b4/0x991 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_alloc_delete+0x23/0x72 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_free_ag_extent+0x36d/0x574 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_free_extent+0xa9/0xc5 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xlog_grant_log_space+0x1ad/0x1f2 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? kmem_zone_alloc+0x4a/0x90 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xlog_recover_process_efi+0x185/0x1df [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xlog_recover_process_efis+0x42/0x73 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xlog_recover_finish+0x15/0x9c [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_log_mount_finish+0x2c/0x35 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_mountfs+0x46a/0x607 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? kmem_zalloc+0x10/0x38 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_mru_cache_create+0xdf/0x10f [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_mount+0x25f/0x2fa [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_fs_fill_super+0xa7/0x1bf [xfs]
Oct 6 08:09:43 agraham kernel: [] ? get_sb_bdev+0xca/0x110
Oct 6 08:09:43 agraham kernel: [] ? xfs_fs_get_sb+0x13/0x15 [xfs]
Oct 6 08:09:43 agraham kernel: [] ? xfs_fs_fill_super+0x0/0x1bf [xfs]
Oct 6 08:09:43 agraham kernel: [] ? vfs_kern_mount+0x81/0xf7
Oct 6 08:09:43 agraham kernel: [] ? do_kern_mount+0x32/0xb9
Oct 6 08:09:43 agraham kernel: [] ? do_new_mount+0x46/0x71
Oct 6 08:09:43 agraham kernel: [] ? do_mount+0x1b0/0x1ce
Oct 6 08:09:43 agraham kernel: [] ? __get_free_pages+0x1b/0x21
Oct 6 08:09:43 agraham kernel: [] ? copy_mount_options+0x27/0x10b
Oct 6 08:09:43 agraham kernel: [] ? sys_mount+0x64/0x9b
Oct 6 08:09:43 agraham kernel: [] ? sysenter_past_esp+0x78/0xb1
Oct 6 08:09:43 agraham kernel: =======================
Oct 6 08:09:43 agraham kernel: Code: 83 c8 02 39 d0 74 0e b9 02 00 00 00 89 fa 89 f0 e8 09 51 01 00 ff 45 ec 47 83 c3 04 0f b6 46 6c 39 c7 7c b4 8b 44 be 24 89 45 f0 <8b> 58 5c eb 5f 8b 46 04 8b 54 be 44 8b 88 98 01 00 00 c1 e2 02
Oct 6 08:09:43 agraham kernel: EIP: [] xfs_alloc_increment+0xd9/0x155 [xfs] SS:ESP 0068:f6d69b58
Oct 6 08:09:43 agraham kernel: ---[ end trace 90b4802fd284c865 ]---