git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [OT] git fsck uncovers hardware/kernel problem?
@ 2010-01-20  2:54 John Koleszar
  0 siblings, 0 replies; only message in thread
From: John Koleszar @ 2010-01-20  2:54 UTC (permalink / raw)
  To: git

Hi all,

I don't think that this is a git issue, but I've only reproduced it so 
far using git and I know there are people here that can point me in the 
right direction, so please forgive the off-topic post.

I was repairing a repo today that I thought was corrupt, and when I 
finished, it fsck'd ok but then ran into another corruption error when 
gc'ing it. I realized these errors were intermittent. I ran into this 
situation once before (same machine) and was time-pressed, so I rebooted 
and the problem went away. This time I have a little free time to 
investigate.

It looks like somehow my buffer cache is getting inconsistent with 
what's actually on the disk. I'm looking for input as to whether this is 
a kernel problem, bad memory, dieing hard disk, etc. This machine has 
been stable, and I've had no unexplained crashes or other noticeable 
data corruption. No errors from the kernel, no SMART errors. Only 
notable thing about the repos I was working on when this happened is 
that they were large -- exceeding the size of physical RAM. Lots of 
binary objects. The one I'm looking at now is about 3.8GB packed.

Please see below -- any pointers would be appreciated, including to a 
more appropriate list.

$ git fsck --full
error: inflate: data stream error (incorrect data check)
error: corrupt loose object '060a8f0255a4a0b76197056e959bb0ce37b90dae'
fatal: object 060a8f0255a4a0b76197056e959bb0ce37b90dae is corrupted

$ echo 3 | sudo tee /proc/sys/vm/drop_caches
Password:
3

$ git fsck --fullerror: inflate: data stream error (incorrect data check)
error: corrupt loose object '0905cc800a08a24c8d83132c4d346d8b47403a73'
fatal: object 0905cc800a08a24c8d83132c4d346d8b47403a73 is corrupted
# Note this is a different object

$ git fsck --fullerror: inflate: data stream error (incorrect data check)
error: corrupt loose object '0905cc800a08a24c8d83132c4d346d8b47403a73'
fatal: object 0905cc800a08a24c8d83132c4d346d8b47403a73 is corrupted
# but running again gives a consistent error until dropping caches again

$ echo 3 | sudo tee /proc/sys/vm/drop_caches
3

$ git fsck --full
error: inflate: data stream error (incorrect data check)
error: corrupt loose object '3e9a88bfea886dd4bd4af69870e33f115bc80985'
fatal: object 3e9a88bfea886dd4bd4af69870e33f115bc80985 is corrupted
# fsck progresses much farther this time.

$ sha1sum objects/3e/9a88bfea886dd4bd4af69870e33f115bc80985
acffc0e434297e5115b2491101d0a0260a20a1aa  
objects/3e/9a88bfea886dd4bd4af69870e33f115bc80985

$ echo 3 | sudo tee /proc/sys/vm/drop_caches
3

$ sha1sum objects/3e/9a88bfea886dd4bd4af69870e33f115bc80985
83a1291ecb9e2fd9ded509aefba2a3c474d259c2  
objects/3e/9a88bfea886dd4bd4af69870e33f115bc80985
# sha1sum is different, so this issue isn't limited to git

$ git cat-file blob 3e9a88bfea886dd4bd4af69870e33f115bc80985>/dev/null
# git's integrity checks pass on this file now.

$ uname -a
Linux cp-jk-linux 2.6.30-gentoo-r5 #3 SMP Wed Nov 4 12:53:38 EST 2009 
x86_64 Intel(R) Core(TM)2 CPU 6600 @ 2.40GHz GenuineIntel GNU/Linux

---
Thanks,
John

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2010-01-20  3:10 UTC | newest]

Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-01-20  2:54 [OT] git fsck uncovers hardware/kernel problem? John Koleszar

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).