From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from relay.sgi.com (relay2.corp.sgi.com [137.38.102.29]) by oss.sgi.com (Postfix) with ESMTP id 8F6757F59 for ; Sun, 18 Aug 2013 22:55:27 -0500 (CDT) Received: from cuda.sgi.com (cuda1.sgi.com [192.48.157.11]) by relay2.corp.sgi.com (Postfix) with ESMTP id 4A23A304032 for ; Sun, 18 Aug 2013 20:55:24 -0700 (PDT) Received: from benjamin.baylink.com (rrcs-24-129-180-187.se.biz.rr.com [24.129.180.187]) by cuda.sgi.com with ESMTP id hzWYTtZwnHPYqOI2 for ; Sun, 18 Aug 2013 20:55:22 -0700 (PDT) Received: from localhost (localhost.localdomain [127.0.0.1]) by benjamin.baylink.com (Postfix) with ESMTP id 3FAA21F003F5 for ; Sun, 18 Aug 2013 23:55:22 -0400 (EDT) Received: from benjamin.baylink.com ([127.0.0.1]) by localhost (benjamin.baylink.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id zoWmBprgEIRl for ; Sun, 18 Aug 2013 23:55:20 -0400 (EDT) Received: from 145-169-1-30.pools.spcsdns.net (66-87-123-145.pools.spcsdns.net [66.87.123.145]) by benjamin.baylink.com (Postfix) with ESMTPSA id 5FE2E1F003E7 for ; Sun, 18 Aug 2013 23:55:20 -0400 (EDT) References: <11558272.4016.1376861936621.JavaMail.root@benjamin.baylink.com> <5211455B.3000409@hardwarefreak.com> In-Reply-To: <5211455B.3000409@hardwarefreak.com> MIME-Version: 1.0 Subject: Re: XFS recovery resumes... From: Jay Ashworth Date: Sun, 18 Aug 2013 23:55:18 -0400 Message-ID: <6f8bb504-c786-4bc0-9327-ee58a40d99e2@email.android.com> List-Id: XFS Filesystem from SGI List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: multipart/mixed; boundary="===============9015677013621210074==" Errors-To: xfs-bounces@oss.sgi.com Sender: xfs-bounces@oss.sgi.com To: xfs@oss.sgi.com --===============9015677013621210074== Content-Type: multipart/alternative; boundary="----G8X6HRAC0KEWMQ1WIFD7AIY4DDY0U2" ------G8X6HRAC0KEWMQ1WIFD7AIY4DDY0U2 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Still the same outage from 2 weeks ago, Stan; my script had nothing to do with breaking the FSs. Was a zorched power supply, almost certainly. And in fact, after 32 years adminning *nix boxes for a living, yes, I do expect that if any userland program can /corrupt/ FS internals without twiddling with /dev/sdX, either the FS is broken or the hardware is. In this case I'm quite certain it /was/ the hardware, and 85-90% confident it's fixed now. Cheers, -jra -- Sent from my Android phone with K-9 Mail. Please excuse my brevity. Stan Hoeppner wrote: On 8/18/2013 4:38 PM, Jay Ashworth wrote: > I'm trying to dedupe the two large XFS filesystems on which I have DVR > recordings, so that I can walk around amongst the available HDDs and create > new filesystems under everything. > > Every time I rm a file, the filesystem blows up, and the driver shuts it > down. > > Some background: > > At the moment, I have 2 devices, /dev/sdd1 mounted on /appl/media4, and > /dev/sda1 mounted on /appl/media5, and a large script, created by hand- > hacking the output of a perl dupe finder script. > > The large script was mangled so that it would remove anything that was a > dupe from media4, unless the file was an unlabeled lost+found on media5, > and had a name on media4. In that case, I removed the file on media5, and > then moved it from media4 to media5. > > After the hand-hacking on the script, I sorted it to do all the rm's first, > and then all the mv's, to make sure free space when up before it went down. > > And, of course, when I ran the script, it caused the XFS driver to cough and > die, leading to error 5s and gnashing of teeth. If this script is the catalyst of your XFS problems, it seems logical that you would include said script in your trouble report, yet you did not. It's a bit foolish to assume you can't break a Linux subsystem with a poorly written program and/or in combination with a platform that isn't up to the task being asked of it. As Joe mentioned having too little RAM could be part of this problem. -- Stan ------G8X6HRAC0KEWMQ1WIFD7AIY4DDY0U2 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit Still the same outage from 2 weeks ago, Stan; my script had nothing to do with breaking the FSs. Was a zorched power supply, almost certainly.

And in fact, after 32 years adminning *nix boxes for a living, yes, I do expect that if any userland program can /corrupt/ FS internals without twiddling with /dev/sdX, either the FS is broken or the hardware is.

In this case I'm quite certain it /was/ the hardware, and 85-90% confident it's fixed now.

Cheers,
-jra
--
Sent from my Android phone with K-9 Mail. Please excuse my brevity.

Stan Hoeppner <stan@hardwarefreak.com> wrote:
On 8/18/2013 4:38 PM, Jay Ashworth wrote:
> I'm trying to dedupe the two large XFS filesystems on which I have DVR
> recordings, so that I can walk around amongst the available HDDs and create
> new filesystems under everything.
>
> Every time I rm a file, the filesystem blows up, and the driver shuts it
> down.
>
> Some background:
>
> At the moment, I have 2 devices, /dev/sdd1 mounted on /appl/media4, and
> /dev/sda1 mounted on /appl/media5, and a large script, created by hand-
> hacking the output of a perl dupe finder script.
>
> The large script was mangled so that it would remove anything that was a
> dupe from media4, unless the file was an unlabeled lost+found on media5,
> and had a name on media4. In that case, I removed the file on media5, and
> ; then moved it from media4 to media5.
>
> After the hand-hacking on the script, I sorted it to do all the rm's first,
> and then all the mv's, to make sure free space when up before it went down.
>
> And, of course, when I ran the script, it caused the XFS driver to cough and
> die, leading to error 5s and gnashing of teeth.

If this script is the catalyst of your XFS problems, it seems logical
that you would include said script in your trouble report, yet you did
not. It's a bit foolish to assume you can't break a Linux subsystem
with a poorly written program and/or in combination with a platform that
isn't up to the task being asked of it. As Joe mentioned having too
little RAM could be part of this problem.

--
Stan


------G8X6HRAC0KEWMQ1WIFD7AIY4DDY0U2-- --===============9015677013621210074== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ xfs mailing list xfs@oss.sgi.com http://oss.sgi.com/mailman/listinfo/xfs --===============9015677013621210074==--