public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
* Re: [xfs] Calltrace in 2.6.27 kernel]
       [not found] <491C9687.6040305@poczta.onet.pl>
@ 2008-11-14  8:49 ` aluno3
  2008-11-14 16:30   ` Christoph Hellwig
  0 siblings, 1 reply; 3+ messages in thread
From: aluno3 @ 2008-11-14  8:49 UTC (permalink / raw)
  To: hch; +Cc: xfs

Hi

I ran addr2line for both vmlinux and for first call trace (2.6.27)
add2line returned:

addr2line -e ./vmlinux c029553d
fs/xfs/xfs_log.c:3528

in source:

        /*
         * If this happens during log recovery, don't worry about
         * locking; the log isn't open for business yet.
         */
        if (!log ||
            log->l_flags & XLOG_ACTIVE_RECOVERY) {
                mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
                XFS_BUF_DONE(mp->m_sb_bp);                            
//3528 line
                return 0;
        }



and for second call trace (2.6.27.5 + patch for XFS from 2.6.28rc)
add2line returned:

addr2line -e ./vmlinux c029571d
fs/xfs/xfs_log.c:3561


in source:

         /*
         * If this happens during log recovery, don't worry about
         * locking; the log isn't open for business yet.
         */
        if (!log ||
            log->l_flags & XLOG_ACTIVE_RECOVERY) {
                mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
                XFS_BUF_DONE(mp->m_sb_bp);                        //3561
line
                return 0;
        }


>
> On Thu, Nov 13, 2008 at 03:22:58PM +0100, aluno3@poczta.onet.pl wrote:
>   
>> Hi
>>
>> I tested kernel 2.6.27 with stress test using fsstress,dd,LVM and
>> snapshots. After a few hours and overflow of snapshot I got call trace:
>>
>> device-mapper: snapshots: Invalidating snapshot: Unable to allocate
>> exception.
>>     
>
> This is a message from device mapper telling your that it got an EIO
> in pending_complete()
>
>   
>> I/O error in filesystem ("dm-49") meta-data dev dm-49 block
>> 0x1d4c3b0       ("xlog_recover_do..(read#2)")
>>  error 5 buf count 8192
>> XFS: log mount/recovery failed: error 5
>> XFS: log mount failed
>>     
>
> And now XFS complains that it got this error passed up, so far so good.
>
>   
>> BUG: unable to handle kernel NULL pointer dereference at 0000002c
>> IP: [<c029553d>] xfs_log_force_umount+0x3d/0x170
>> *pdpt = 00000000345cf001 *pde = 0000000000000000
>>     
>
> But ut should not crash.  Can you run 
>
> 	addr2line -e /path/to/your/kernel xfs_log_force_umount+0x3d/0x170
>
> (you'll need a kernel with debuginfo for that)
>
>
>   

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [xfs] Calltrace in 2.6.27 kernel]
  2008-11-14  8:49 ` [xfs] Calltrace in 2.6.27 kernel] aluno3
@ 2008-11-14 16:30   ` Christoph Hellwig
  2008-11-18 20:58     ` aluno3
  0 siblings, 1 reply; 3+ messages in thread
From: Christoph Hellwig @ 2008-11-14 16:30 UTC (permalink / raw)
  To: aluno3@poczta.onet.pl; +Cc: hch, xfs

On Fri, Nov 14, 2008 at 09:49:47AM +0100, aluno3@poczta.onet.pl wrote:
> Hi
> 
> I ran addr2line for both vmlinux and for first call trace (2.6.27)
> add2line returned:
> 
> addr2line -e ./vmlinux c029553d
> fs/xfs/xfs_log.c:3528
> 
> in source:
> 
>         /*
>          * If this happens during log recovery, don't worry about
>          * locking; the log isn't open for business yet.
>          */
>         if (!log ||
>             log->l_flags & XLOG_ACTIVE_RECOVERY) {
>                 mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
>                 XFS_BUF_DONE(mp->m_sb_bp);                            
> //3528 line

Thanks a lot.  I think I figured out what happens:

 - we fail the buffer read in xlog_recover_do_inode_trans, and thus
   abort the mount.
 - but before that log recovery has started delayed writeback of inode
   buffers
 - during the mount error handling we call xfs_freesb, which NULLs
   mp->m_sb_bp after the inode has been unmounted
 - but the device close tries to flush all delayed buffers after that,
   and the inode buffer triggers the filesystem shutdown case because
   it also had an I/O error.
 - Now we try to mark the superblock buffer done, but it's not there
   anymore.

The easy fix is to just check for a NULL mp->m_sb_bp before marking it
done.  Note that this will only fix the XFS oops, but not the underling
DM issue, for which I'd ask on the device mapper list.


Index: xfs-2.6/fs/xfs/xfs_log.c
===================================================================
--- xfs-2.6.orig/fs/xfs/xfs_log.c	2008-11-14 17:28:55.000000000 +0100
+++ xfs-2.6/fs/xfs/xfs_log.c	2008-11-14 17:29:20.000000000 +0100
@@ -3525,7 +3525,8 @@ xfs_log_force_umount(
 	if (!log ||
 	    log->l_flags & XLOG_ACTIVE_RECOVERY) {
 		mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
-		XFS_BUF_DONE(mp->m_sb_bp);
+		if (mp->m_sb_bp)
+			XFS_BUF_DONE(mp->m_sb_bp);
 		return 0;
 	}
 
@@ -3546,7 +3547,8 @@ xfs_log_force_umount(
 	spin_lock(&log->l_icloglock);
 	spin_lock(&log->l_grant_lock);
 	mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
-	XFS_BUF_DONE(mp->m_sb_bp);
+	if (mp->m_sb_bp)
+		XFS_BUF_DONE(mp->m_sb_bp);
 	/*
 	 * This flag is sort of redundant because of the mount flag, but
 	 * it's good to maintain the separation between the log and the rest

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [xfs] Calltrace in 2.6.27 kernel]
  2008-11-14 16:30   ` Christoph Hellwig
@ 2008-11-18 20:58     ` aluno3
  0 siblings, 0 replies; 3+ messages in thread
From: aluno3 @ 2008-11-18 20:58 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: xfs

Hi

I tested this patch through a few days and I did not get call trace from
kernel.Will this patch be included to official kernel? Thanks and best


Christoph Hellwig wrote:
> On Fri, Nov 14, 2008 at 09:49:47AM +0100, aluno3@poczta.onet.pl wrote:
>   
>> Hi
>>
>> I ran addr2line for both vmlinux and for first call trace (2.6.27)
>> add2line returned:
>>
>> addr2line -e ./vmlinux c029553d
>> fs/xfs/xfs_log.c:3528
>>
>> in source:
>>
>>         /*
>>          * If this happens during log recovery, don't worry about
>>          * locking; the log isn't open for business yet.
>>          */
>>         if (!log ||
>>             log->l_flags & XLOG_ACTIVE_RECOVERY) {
>>                 mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
>>                 XFS_BUF_DONE(mp->m_sb_bp);                            
>> //3528 line
>>     
>
> Thanks a lot.  I think I figured out what happens:
>
>  - we fail the buffer read in xlog_recover_do_inode_trans, and thus
>    abort the mount.
>  - but before that log recovery has started delayed writeback of inode
>    buffers
>  - during the mount error handling we call xfs_freesb, which NULLs
>    mp->m_sb_bp after the inode has been unmounted
>  - but the device close tries to flush all delayed buffers after that,
>    and the inode buffer triggers the filesystem shutdown case because
>    it also had an I/O error.
>  - Now we try to mark the superblock buffer done, but it's not there
>    anymore.
>
> The easy fix is to just check for a NULL mp->m_sb_bp before marking it
> done.  Note that this will only fix the XFS oops, but not the underling
> DM issue, for which I'd ask on the device mapper list.
>
>
> Index: xfs-2.6/fs/xfs/xfs_log.c
> ===================================================================
> --- xfs-2.6.orig/fs/xfs/xfs_log.c	2008-11-14 17:28:55.000000000 +0100
> +++ xfs-2.6/fs/xfs/xfs_log.c	2008-11-14 17:29:20.000000000 +0100
> @@ -3525,7 +3525,8 @@ xfs_log_force_umount(
>  	if (!log ||
>  	    log->l_flags & XLOG_ACTIVE_RECOVERY) {
>  		mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
> -		XFS_BUF_DONE(mp->m_sb_bp);
> +		if (mp->m_sb_bp)
> +			XFS_BUF_DONE(mp->m_sb_bp);
>  		return 0;
>  	}
>  
> @@ -3546,7 +3547,8 @@ xfs_log_force_umount(
>  	spin_lock(&log->l_icloglock);
>  	spin_lock(&log->l_grant_lock);
>  	mp->m_flags |= XFS_MOUNT_FS_SHUTDOWN;
> -	XFS_BUF_DONE(mp->m_sb_bp);
> +	if (mp->m_sb_bp)
> +		XFS_BUF_DONE(mp->m_sb_bp);
>  	/*
>  	 * This flag is sort of redundant because of the mount flag, but
>  	 * it's good to maintain the separation between the log and the rest
>
>   

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2008-11-18 21:01 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <491C9687.6040305@poczta.onet.pl>
2008-11-14  8:49 ` [xfs] Calltrace in 2.6.27 kernel] aluno3
2008-11-14 16:30   ` Christoph Hellwig
2008-11-18 20:58     ` aluno3

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox