From: Ben Hutchings <ben@decadent.org.uk>
To: Jameson Graef Rollins <jrollins@finestructure.net>
Cc: 624343@bugs.debian.org, NeilBrown <neilb@suse.de>,
linux-raid@vger.kernel.org
Subject: Re: Bug#624343: linux-image-2.6.38-2-amd64: frequent message "bio too big device md0 (248 > 240)" in kern.log
Date: Fri, 29 Apr 2011 05:39:40 +0100 [thread overview]
Message-ID: <1304051980.3105.46.camel@localhost> (raw)
In-Reply-To: <20110427161901.27049.31001.reportbug@servo.factory.finestructure.net>
[-- Attachment #1: Type: text/plain, Size: 2973 bytes --]
On Wed, 2011-04-27 at 09:19 -0700, Jameson Graef Rollins wrote:
> Package: linux-2.6
> Version: 2.6.38-3
> Severity: normal
>
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA256
>
> As you can see from the kern.log snippet below, I am seeing frequent
> messages reporting "bio too big device md0 (248 > 240)".
>
> I run what I imagine is a fairly unusual disk setup on my laptop,
> consisting of:
>
> ssd -> raid1 -> dm-crypt -> lvm -> ext4
>
> I use the raid1 as a backup. The raid1 operates normally in degraded
> mode. For backups I then hot-add a usb hdd, let the raid1 sync, and
> then fail/remove the external hdd.
Well, this is not expected to work. Possibly the hot-addition of a disk
with different bio restrictions should be rejected. But I'm not sure,
because it is safe to do that if there is no mounted filesystem or
stacking device on top of the RAID.
I would recommend using filesystem-level backup (e.g. dirvish or
backuppc). Aside from this bug, if the SSD fails during a RAID resync
you will be left with an inconsistent and therefore useless 'backup'.
> I started noticing these messages after my last sync. I have not
> rebooted since.
>
> I found a bug report on the launchpad that describes an almost
> identical situation:
>
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/320638
>
> The reporter seemed to be concerned that their may be data loss
> happening. I have not yet noticed any, but of course I'm terrified
> that it's happening and I just haven't found it yet. Unfortunately
> the bug was closed with a "Won't Fix" without any resolution.
>
> Is this a kernel bug, or is there something I can do to remedy the
> situation? I haven't tried to reboot yet to see if the messages stop.
> I'm obviously most worried about data loss. Please advise!
The block layer correctly returns an error after logging this message.
If it's due to a read operation, the error should be propagated up to
the application that tried to read. If it's due to a write operation, I
would expect the error to result in the RAID becoming desynchronised.
In some cases it might be propagated to the application that tried to
write.
If the error is somehow discarded then there *is* a kernel bug with the
risk of data loss.
> I am starting to suspect that these messages are in face associated with
> data loss on my system. I have witnessed these messages occur during
> write operations to the disk, and I have also started to see some
> strange behavior on my system. dhclient started acting weird after
> these messages appeared (not holding on to leases) and I started to
> notice database exceptions in my mail client.
>
> Interestingly, the messages seem to have gone away after reboot. I will
> watch closely to see if they return after my next raid1 sync.
Ben.
--
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.
[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 828 bytes --]
next parent reply other threads:[~2011-04-29 4:39 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20110427161901.27049.31001.reportbug@servo.factory.finestructure.net>
2011-04-29 4:39 ` Ben Hutchings [this message]
2011-05-01 22:06 ` Bug#624343: linux-image-2.6.38-2-amd64: frequent message "bio too big device md0 (248 > 240)" in kern.log Jameson Graef Rollins
2011-05-02 0:00 ` Ben Hutchings
2011-05-02 0:22 ` NeilBrown
2011-05-02 2:47 ` Guy Watkins
2011-05-02 5:07 ` Daniel Kahn Gillmor
2011-05-02 9:08 ` David Brown
2011-05-02 10:00 ` NeilBrown
2011-05-02 10:32 ` David Brown
2011-05-02 14:56 ` David Brown
2011-05-02 0:42 ` Daniel Kahn Gillmor
2011-05-02 1:04 ` Ben Hutchings
2011-05-02 1:17 ` Jameson Graef Rollins
2011-05-02 9:05 ` David Brown
2011-05-02 9:11 ` David Brown
2011-05-02 16:38 ` Jameson Graef Rollins
2011-05-02 18:54 ` David Brown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1304051980.3105.46.camel@localhost \
--to=ben@decadent.org.uk \
--cc=624343@bugs.debian.org \
--cc=jrollins@finestructure.net \
--cc=linux-raid@vger.kernel.org \
--cc=neilb@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).