All of lore.kernel.org
 help / color / mirror / Atom feed
From: Vincent ETIENNE <ve@vetienne.net>
To: Vincent ETIENNE <vetienne@aprogsys.com>,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
	Alexander Viro <viro@zeniv.linux.org.uk>,
	ocfs2-devel@oss.oracle.com
Subject: [Ocfs2-devel] kernel BUG at fs/buffer.c:2886! Linux 3.5.0
Date: Mon, 30 Jul 2012 18:30:29 -0000	[thread overview]
Message-ID: <5016D2C0.6090708@vetienne.net> (raw)
In-Reply-To: <20120730075333.GC4025@dhcp-172-17-9-228.mtv.corp.google.com>




On 30/07/2012 09:53, Joel Becker wrote:
> On Mon, Jul 30, 2012 at 09:45:14AM +0200, Vincent ETIENNE wrote:
>> Le 30/07/2012 08:30, Joel Becker a ?crit :
>>> On Sat, Jul 28, 2012 at 12:18:30AM +0200, Vincent ETIENNE wrote:
>>>> Hello
>>>>
>>>> Get this on first write made ( by deliver sending mail to inform of the
>>>> restart of services  )
>>>> Home partition (the one receiving the mail) is based on ocfs2 created
>>>> from drbd block device in primary/primary mode
>>>> These drbd devices are based on lvm.
>>>>
>>>> system is running linux-3.5.0, identical symptom with linux 3.3 and 3.2
>>>> but working with linux 3.0 kernel
>>>>
>>>> reproduced on two machines ( so different hardware involved on this one
>>>> software md raid on SATA, on second one areca hardware raid card )
>>>> but the 2 machines are the one sharing this partition ( so share the
>>>> same data )
>>> 	Hmm.  Any chance you can bisect this further?
>> Will try to. Will take a few days as the server is in production ( but
>> used as backup so...)
>>
>>>> Jul 27 23:41:41 jupiter2 kernel: [  351.169213] ------------[ cut here
>>>> ]------------
>>>> Jul 27 23:41:41 jupiter2 kernel: [  351.169261] kernel BUG at
>>>> fs/buffer.c:2886!
>>> 	This is:
>>>
>>> 	BUG_ON(!buffer_mapped(bh));
>>>
>>> in submit_bh().
>>>
>>> system_call_fastpath+0x16/0x1b
>>> 	This stack trace is from 3.5, because of the location of the
>>> BUG.  The call path in the trace suggests the code added by Al's ea022d,
>>> but you say it breaks in 3.2 and 3.3 as well.  Can you give me a trace
>>> from 3.2?
>> For a 3.2 kernel i get this stack trace. Different trace form 3.5 but
>> exactly at the same moment. and for the same reasons.
>> Seems to be less immmediate than with 3.5 but more a subjective
>> imrpession than something based on fact. ( it takes a few seconds after
>> deliver is started to have the bug )
> Totally different stack trace.  Not in symlink code, but instead in
> fallocate.  Weird.  I wonder if you are hitting two things.  Bisection
> will definitely help.

Yes could be, that would explain the 2 stack trace ( and the different
timing observed )
Bisection is in progress. The fallocate bug is certainly already
corrected ( info sent by
sunil.mushran at gmail.com but unavailable on the list for the moment  ?)

------

The fallocate() oops is probably the same that is fixed by this patch.
https://oss.oracle.com/git/?p=smushran/linux-2.6.git;a=commit;h=a2118b301104a24381b414bc93371d666fe8d43a


Is in the list of patches that are ready to be pushed.
https://oss.oracle.com/git/?p=smushran/linux-2.6.git;a=shortlog;h=mw-3.4-mar15

----

But not sure it will correct all i observed. So i will continue to
bisect to confirm/infirm.
( But i seems to have lost network on my server after a reboot and so no
more access before tomorrow , I have certainly forget to do make
modules_install before installing new kernel ... Being stupid is not
very helpful... ) . I hope to finish the bisection tomorrow or wednesday.
 
Thanks a lot for the support.
> Joel
>
>

WARNING: multiple messages have this Message-ID (diff)
From: Vincent ETIENNE <ve@vetienne.net>
To: Vincent ETIENNE <vetienne@aprogsys.com>,
	linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
	Alexander Viro <viro@zeniv.linux.org.uk>,
	ocfs2-devel@oss.oracle.com
Subject: Re: kernel BUG at fs/buffer.c:2886! Linux 3.5.0
Date: Mon, 30 Jul 2012 20:30:24 +0200	[thread overview]
Message-ID: <5016D2C0.6090708@vetienne.net> (raw)
In-Reply-To: <20120730075333.GC4025@dhcp-172-17-9-228.mtv.corp.google.com>




On 30/07/2012 09:53, Joel Becker wrote:
> On Mon, Jul 30, 2012 at 09:45:14AM +0200, Vincent ETIENNE wrote:
>> Le 30/07/2012 08:30, Joel Becker a écrit :
>>> On Sat, Jul 28, 2012 at 12:18:30AM +0200, Vincent ETIENNE wrote:
>>>> Hello
>>>>
>>>> Get this on first write made ( by deliver sending mail to inform of the
>>>> restart of services  )
>>>> Home partition (the one receiving the mail) is based on ocfs2 created
>>>> from drbd block device in primary/primary mode
>>>> These drbd devices are based on lvm.
>>>>
>>>> system is running linux-3.5.0, identical symptom with linux 3.3 and 3.2
>>>> but working with linux 3.0 kernel
>>>>
>>>> reproduced on two machines ( so different hardware involved on this one
>>>> software md raid on SATA, on second one areca hardware raid card )
>>>> but the 2 machines are the one sharing this partition ( so share the
>>>> same data )
>>> 	Hmm.  Any chance you can bisect this further?
>> Will try to. Will take a few days as the server is in production ( but
>> used as backup so...)
>>
>>>> Jul 27 23:41:41 jupiter2 kernel: [  351.169213] ------------[ cut here
>>>> ]------------
>>>> Jul 27 23:41:41 jupiter2 kernel: [  351.169261] kernel BUG at
>>>> fs/buffer.c:2886!
>>> 	This is:
>>>
>>> 	BUG_ON(!buffer_mapped(bh));
>>>
>>> in submit_bh().
>>>
>>> system_call_fastpath+0x16/0x1b
>>> 	This stack trace is from 3.5, because of the location of the
>>> BUG.  The call path in the trace suggests the code added by Al's ea022d,
>>> but you say it breaks in 3.2 and 3.3 as well.  Can you give me a trace
>>> from 3.2?
>> For a 3.2 kernel i get this stack trace. Different trace form 3.5 but
>> exactly at the same moment. and for the same reasons.
>> Seems to be less immmediate than with 3.5 but more a subjective
>> imrpession than something based on fact. ( it takes a few seconds after
>> deliver is started to have the bug )
> Totally different stack trace.  Not in symlink code, but instead in
> fallocate.  Weird.  I wonder if you are hitting two things.  Bisection
> will definitely help.

Yes could be, that would explain the 2 stack trace ( and the different
timing observed )
Bisection is in progress. The fallocate bug is certainly already
corrected ( info sent by
sunil.mushran@gmail.com but unavailable on the list for the moment  ?)

------

The fallocate() oops is probably the same that is fixed by this patch.
https://oss.oracle.com/git/?p=smushran/linux-2.6.git;a=commit;h=a2118b301104a24381b414bc93371d666fe8d43a


Is in the list of patches that are ready to be pushed.
https://oss.oracle.com/git/?p=smushran/linux-2.6.git;a=shortlog;h=mw-3.4-mar15

----

But not sure it will correct all i observed. So i will continue to
bisect to confirm/infirm.
( But i seems to have lost network on my server after a reboot and so no
more access before tomorrow , I have certainly forget to do make
modules_install before installing new kernel ... Being stupid is not
very helpful... ) . I hope to finish the bisection tomorrow or wednesday.
 
Thanks a lot for the support.
> Joel
>
>

  parent reply	other threads:[~2012-07-30 18:30 UTC|newest]

Thread overview: 28+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-07-27 22:18 kernel BUG at fs/buffer.c:2886! Linux 3.5.0 Vincent ETIENNE
2012-07-27 22:18 ` [Ocfs2-devel] " Vincent ETIENNE
2012-07-30  6:30 ` Joel Becker
2012-07-30  6:30   ` Joel Becker
2012-07-30  7:45   ` Vincent ETIENNE
2012-07-30  7:45     ` [Ocfs2-devel] " Vincent ETIENNE
2012-07-30  7:45     ` Vincent ETIENNE
2012-07-30  7:53     ` [Ocfs2-devel] " Joel Becker
2012-07-30  7:53       ` Joel Becker
2012-07-30 15:59       ` [Ocfs2-devel] " Sunil Mushran
2012-07-30 15:59         ` Sunil Mushran
2012-07-30 18:30       ` Vincent ETIENNE [this message]
2012-07-30 18:30         ` [Ocfs2-devel] " Vincent ETIENNE
2012-08-01 16:51         ` Vincent ETIENNE
2012-08-01 16:51           ` Vincent ETIENNE
2012-08-01 20:43         ` [Ocfs2-devel] " Vincent ETIENNE
2012-08-01 20:43           ` Vincent ETIENNE
2012-08-01 20:46           ` [Ocfs2-devel] " Vincent ETIENNE
2012-08-01 20:46             ` Vincent ETIENNE
2012-08-02  7:21             ` [Ocfs2-devel] " Vincent ETIENNE
2012-08-02  7:21               ` Vincent ETIENNE
2012-08-02 19:28               ` [Ocfs2-devel] " Vincent ETIENNE
2012-08-02 19:28                 ` Vincent ETIENNE
2012-08-02 21:08                 ` [Ocfs2-devel] " Sunil Mushran
2012-08-02 21:08                   ` Sunil Mushran
2012-08-03  7:22                   ` [Ocfs2-devel] " Vincent ETIENEN
2012-08-03  7:22                     ` Vincent ETIENEN
2012-08-03 16:16                     ` [Ocfs2-devel] " Sunil Mushran

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5016D2C0.6090708@vetienne.net \
    --to=ve@vetienne.net \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=ocfs2-devel@oss.oracle.com \
    --cc=vetienne@aprogsys.com \
    --cc=viro@zeniv.linux.org.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.