All of lore.kernel.org
 help / color / mirror / Atom feed
From: Henry de Valence <hdevalence@hdevalence.ca>
To: linux-btrfs <linux-btrfs@vger.kernel.org>
Subject: Itermittent data corruption and dmesg spam
Date: Tue, 22 Oct 2013 23:58:33 -0400	[thread overview]
Message-ID: <6844836.rMEZUtNbVg@noether> (raw)

[-- Attachment #1: Type: text/plain, Size: 1797 bytes --]

Hi all,

Two questions:

First, I have a ton of lines in dmesg like

[  123.664465] incomplete page read in btrfs with offset 2048 and length 2048
[  123.835761] incomplete page read in btrfs with offset 512 and length 3584

What does this mean? I tried searching on Google but all I got was the commit 
that added the code that prints these messages. Should I be worried?

Second, I’m having some intermittent data corruption issues, and I’m not 
really sure how to pin down the cause. Sometimes, I’ll get errors trying to 
read a file due to a failed checksum, but when I run btrfs scrub, it reports 
that everything is OK. For instance, this time I booted, I get a line in dmesg 
saying

btrfs: bdev /dev/bcache0 errs: wr 0, rd 0, flush 0, corrupt 16, gen 0

but when I run btrfs scrub I get:

scrub status for 56118d27-c9a8-483c-afaa-e429d59884e9
     scrub started at Tue Oct 22 22:46:17 2013 and finished after 2802 seconds
     total bytes scrubbed: 426.03GB with 0 errors

My setup is a btrfs partition on a bcache device, which has a new-ish hard 
drive as the backing store and a partition on an older SSD as the cache. The 
bcache documentation suggests that sequential reads bypass the cache device. 
Is it possible that I have some bad blocks on my SSD, which cause the errors 
and data corruption, but the data corruption doesn’t show up with btrfs scrub 
because the disk accesses in the scrub are bypassing the cache?

Does anyone know how I could test this theory, or otherwise try to determine 
the source of the problems?

For what it’s worth, I ran smartctl on both my hard drive and my SSD, and it 
didn’t detect anything.

My btrfs version is Btrfs v0.20-rc1-358-g194aa4a on Linux 3.11.3 (Arch).

Thanks,
Henry de Valence

[-- Attachment #2: This is a digitally signed message part. --]
[-- Type: application/pgp-signature, Size: 836 bytes --]

             reply	other threads:[~2013-10-23  3:58 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-10-23  3:58 Henry de Valence [this message]
2013-10-23 13:09 ` Itermittent data corruption and dmesg spam Duncan
2013-10-23 15:39 ` Chris Murphy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=6844836.rMEZUtNbVg@noether \
    --to=hdevalence@hdevalence.ca \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.