From mboxrd@z Thu Jan 1 00:00:00 1970 From: Francisco Javier Cabello Subject: Re: data corruption with 2.4.25 and datalogging patches Date: Fri, 14 Jul 2006 14:03:30 +0200 Message-ID: <200607141403.33613.fjcabello@visual-tools.com> References: <200607120816.11292.fjcabello@visual-tools.com> <200607141026.03556.fjcabello@visual-tools.com> <1152877739.6407.37.camel@tribesman.namesys.com> Mime-Version: 1.0 Content-Type: multipart/signed; boundary="nextPart5364953.cj0hsElVIh"; protocol="application/pgp-signature"; micalg=pgp-sha1 Content-Transfer-Encoding: 7bit Return-path: list-help: list-unsubscribe: list-post: Errors-To: flx@namesys.com In-Reply-To: <1152877739.6407.37.camel@tribesman.namesys.com> List-Id: To: "Vladimir V. Saveliev" Cc: reiserfs-list@namesys.com --nextPart5364953.cj0hsElVIh Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Yes. I have a sef of system with the same main board, memory,=20 microprocessor... They are identical. The difference is the conditions wher= e=20 they are working. Perhaps the cpu load average is difference, the amount of= =20 data they are writting, the number of power failure... I am going to send you the output of reiserfsck of some the systems. Regards, Paco On Friday, 14 de July de 2006 13:48, Vladimir V. Saveliev wrote: > Hello > > On Fri, 2006-07-14 at 10:25 +0200, Francisco Javier Cabello wrote: > > Hello, > > I am almost sure that unclean shutdowns happen in those systems. We have > > tried to reproduce removing power each 5 minutes and the filesystem > > wasn't suffering corruption. Perhaps it's related, but I don't know. > > > > I have talked about 'Datalogging patches' because it's the only thing > > different from our system. > > sorry, I am confused. Am I correct that you have set of systems and they > all run similar load on the same kernel and only ~10% of them encounter > reiserfs corruptions? Do they have identical hardware? > > > I have searched a lot and few people have > > corruption with reiserfs standalone... so, it may be datalogging patche= s. > > > > what do you need from reiserfsck? I guess the output of 'reiserfsck > > --check device' > > yes. There is -l option to redirect output to log file. > > > of perhaps you need the output of reiserfsck --rebuild tree. > > > > > > Regards, > > > > Paco > > > > On Thursday, 13 de July de 2006 16:34, Vladimir V. Saveliev wrote: > > > Hello > > > > > > On Wed, 2006-07-12 at 08:16 +0200, Francisco Javier Cabello wrote: > > > > Hello, > > > > My company develops video recorder system. Basically we work with > > > > linux boxes running kernel 2.4.25. The system captures analogue > > > > video, and after processing and compressing, digital video is stor= ed > > > > to hard disk. We are recording continuously (24x7). > > > > > > > > We have realized that more or less a 10% of our systems are sufferi= ng > > > > data corruption in the reiserfs partition. > > > > > > Did unclean shutdowns take place on those systems? > > > If you let us see what does reiserfsck report in those cases that cou= ld > > > help to understand what is is happening. > > > > > > > Sometimes it's possible to fix it > > > > running 'reiserfsck --rebuild-tree' but not always. > > > > More information: > > > > -Kernel 2.4.25 + v4l2 patches > > > > -Reiserfsprogs 3.6.19 > > > > -Datalogging patches. > > > > (http://mirror.mcs.anl.gov/suse-people/mason/patches/data-logging/2= =2E4 > > > >.25/ ) > > > > > > > > I have checked datalogging patches from Reiserfs website and they > > > > seem equal to suse ones. > > > > > > > > I don't have any idea of what it's happening. The disk bandwidth is > > > > not so high (300-500kb/sec). The disk is always full at 90% (we have > > > > a process deleting old video). > > > > > > > > I have been thinking about removing Dataloggin patches but I would > > > > like to have serious reason. It's not easy to check that the problem > > > > is solved because we are not able to reproduce the error in our > > > > headquarter. > > > > > > > > Regards, > > > > > > > > Paco =2D-=20 One of my most productive days was throwing away 1000 lines of code (Ken=20 Thompson) =2D---------------- PGP fingerprint: AF69 62B4 97EB F5BB 2C60 B802 568A E122 BBBE 5820 PGP Key available at http://pgp.mit.edu =2D---------------- --nextPart5364953.cj0hsElVIh Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.4 (GNU/Linux) iD8DBQBEt4gVVorhIru+WCARAliZAJwK7e7WzxrD2f0h0yTvt/Rp9qFSmACfeBz8 /xzj2Ofj+9F2iaTJA5clpKs= =FAiM -----END PGP SIGNATURE----- --nextPart5364953.cj0hsElVIh--