public inbox for linux-btrfs@vger.kernel.org
 help / color / mirror / Atom feed
From: Jens Axboe <jens.axboe@oracle.com>
To: Daniel J Blueman <daniel.blueman@gmail.com>
Cc: Markus Trippelsdorf <markus@trippelsdorf.de>,
	linux-btrfs@vger.kernel.org
Subject: Re: btrfs csum failed on git .pack file
Date: Wed, 9 Sep 2009 10:26:36 +0200	[thread overview]
Message-ID: <20090909082636.GP18599@kernel.dk> (raw)
In-Reply-To: <6278d2220909090118o6cdadb38ic38f2991d4fd1e10@mail.gmail.com>

On Wed, Sep 09 2009, Daniel J Blueman wrote:
> On Wed, Sep 9, 2009 at 8:01 AM, Jens Axboe<jens.axboe@oracle.com> wrote:
> > On Wed, Sep 09 2009, Markus Trippelsdorf wrote:
> >> On Tue, Sep 08, 2009 at 10:32:14PM +0200, Jens Axboe wrote:
> >> > On Tue, Sep 08 2009, Markus Trippelsdorf wrote:
> >> > > On Tue, Sep 08, 2009 at 10:00:42PM +0200, Jens Axboe wrote:
> >> > > > On Mon, Sep 07 2009, Markus Trippelsdorf wrote:
> >> > > > > Just got this error today in my dmesg:
> >> > > > > btrfs csum failed ino 1483065 off 158482432 csum 4283543305 private 43905798
> >> > > > >
> >> > > > > linux % find . -inum 1483065
> >> > > > > ./.git/objects/pack/pack-f9251bcc6a8afe3c92193e14d1d742f2f0182ce5.pack
> >> > > > >
> >> > > > > It's the main pack file from my git linux kernel tree:
> >> > > > >
> >> > > >
> >> > > > Hmm, I ran into something very similar. Care to check what the corrupted
> >> > > > block of data looks like (and how big it is)?
> >> > >
> >> > > I've already deleted the file in question unfortunately.
> >> > > On IRC Chris decided that either bad RAM or a harddrive error was the
> >> > > most likely reason for this chechsum mismatch.
> >> >
> >> > Darn, that's too bad. The corruption issue I had was also in a git pack
> >> > file. It was fine one day, bad the next. Turned out to be 16kb of 0xff
> >> > in the file, and I blamed it on the (cheap) SSD drive that hosted the
> >> > local git repo. It's still the most likely explanation given the nature
> >> > of the problem, however it would have been really interesting to see
> >> > what corruption you had.
> >>
> >> If by cheap SSD drive you mean an Indilinx Barefoot based one, we might
> >> be using the same hardware (30GB Vertex in my case).
> >
> > Spooky, yes indeed that's the very same drive I'm using. Also see my
> > postings on this very issue here, top two entries:
> >
> > http://axboe.livejournal.com/
> >
> > So that pretty much looks like it reaffirms some of my suspicions. Is
> > the drive in a laptop that you suspend and resume?
> 
> If you're on firmware < 1.30, the changlog includes some fixes which
> may be relevant, eg if "block 0" is relative, or you're
> suspending/resuming:
> 
> - Race condition occurred during soft reset handler
> - If read fail occurs during reading stamp information, firmware
> corrupted block 0.
> - Power off recovery had bug in certain circumstances
> 
> http://www.ocztechnologyforum.com/forum/showthread.php?t=57516

The issue is pretty much moot at this point, since OCZ support were not
really interested in providing any sort of real technical support to
find out what really caused this issue. My main worry was reliability of
these cheaper SSD drives, and that worry is still not resolved. If you
read the blog entries, I do comment on the apparently scary basic bugs
taht are still being fixed on the Indilinx controllers. I do expect some
basic level of data integrity from a consumer product and at least some
interest in resolving weird corruption issues if things go wrong. Since
OCZ cannot provide anything like that, I have a hard time recommending
these drives for anything but very casual use. Fast, cheap, reliable.
Pick any two.

My drive was running 1.10 at the time of the problem.

-- 
Jens Axboe


  reply	other threads:[~2009-09-09  8:26 UTC|newest]

Thread overview: 26+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-09-07 20:35 btrfs csum failed on git .pack file Markus Trippelsdorf
2009-09-08 20:00 ` Jens Axboe
2009-09-08 20:22   ` Markus Trippelsdorf
2009-09-08 20:32     ` Jens Axboe
2009-09-08 20:55       ` Tomasz Torcz
2009-09-09  6:55       ` Markus Trippelsdorf
2009-09-09  7:01         ` Jens Axboe
2009-09-09  7:23           ` Markus Trippelsdorf
2009-09-09  7:29             ` Jens Axboe
2009-09-09  8:18           ` Daniel J Blueman
2009-09-09  8:26             ` Jens Axboe [this message]
2009-09-09  8:37               ` Daniel J Blueman
2009-09-09 11:19                 ` Chris Mason
2009-09-09 21:01           ` Oliver Mattos
2009-09-10 10:49             ` Bryan Østergaard
2009-09-08 21:53     ` Tracy Reed
2009-09-09  7:28       ` Gregory Maxwell
2009-09-17  5:05   ` Markus Trippelsdorf
2009-09-17  6:44     ` Jens Axboe
2009-09-17  9:04       ` Markus Trippelsdorf
2009-09-17  9:05         ` Jens Axboe
2009-09-17 12:15           ` Markus Trippelsdorf
2009-09-17 13:58             ` Markus Trippelsdorf
2009-09-17 17:00             ` Zach Brown
2009-09-17 17:10               ` Markus Trippelsdorf
2009-09-17 17:50                 ` Tomasz Torcz

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090909082636.GP18599@kernel.dk \
    --to=jens.axboe@oracle.com \
    --cc=daniel.blueman@gmail.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=markus@trippelsdorf.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox