From: Wakko Warner <wakko@animx.eu.org>
To: Guennadi Liakhovetski <g.liakhovetski@gmx.de>
Cc: linux-kernel@vger.kernel.org
Subject: Re: Strange lockup with 2.6.0
Date: Fri, 9 Jan 2004 11:45:19 -0500
Message-ID: <20040109114519.A7074@animx.eu.org>
In-Reply-To: <Pine.LNX.4.44.0401091653340.19686-100000@poirot.grange>; from Guennadi Liakhovetski on Fri, Jan 09, 2004 at 05:02:02PM +0100
> > > > I usually do a backup of each filesystem simply using tar. I attempted to
> > > > backup a machine I had that's running 2.6.0 and it hard locked.
> > >
> > > Are sysrq-keys enabled? If so, could you catch the tar backtrace during
> > > the lock-up (ALT-SysRq-t)? What was the latest kernel-version that worked?
> >
> > Yes, but the machine hard locks. sysrq does not work. I have a small
>
> __THAT__ hard...:-)
Yup. That hard.
> > utility I wrote that will set the state of the parport (I used this to tell
> > if it locks up) using outb to the port (This does not affect it in any way,
> > it will lockup w/o it running)
>
> You mean it just toggles a bit periodically?
I have a set of LEDs (12) attached to the parport, and this program writes to
it in a way that makes the 'on' LED bounce every .25 seconds. I'll send
you the program if you're interested.
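
In the meantime, here is a rough sketch of the bounce pattern only. It is not
the actual utility: the real one does the port writes with outb, and this just
computes which LED is lit at each step. The function names and the 8 + 4 split
of the 12 LEDs across the data and control lines are assumptions made for
illustration.

```python
from itertools import cycle, islice


def bounce_positions(n_leds):
    """Return an endless iterator of LED indices sweeping back and forth."""
    if n_leds < 1:
        raise ValueError("need at least one LED")
    # Forward sweep 0..n-1, then back down without repeating either end.
    sweep = list(range(n_leds)) + list(range(n_leds - 2, 0, -1))
    return cycle(sweep)


def split_mask(pos):
    """Split a one-hot LED position into (data_byte, control_nibble).

    Assumption: LEDs 0-7 sit on the eight data lines and LEDs 8-11 on
    the four control lines. Any inversion of control lines is ignored.
    """
    mask = 1 << pos
    return mask & 0xFF, (mask >> 8) & 0x0F


def frames(n_leds, count):
    """Return the first `count` (data, control) pairs of the bounce."""
    return [split_mask(p) for p in islice(bounce_positions(n_leds), count)]
```

A driver loop would take one pair from frames() every 0.25 seconds and write
it out to the port. If the lit LED stops moving, the box is locked.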
> > > Can you just try to write some data over NFS? Would it lock if you write 1
> >
> > I am constantly accessing NFS with this machine. Read and write. It was
>
> How much data at one go (max)?
Dunno. I've never given it that much thought. I have the completed
backup on the jaz. I can attempt to dump it to the server to see if that
makes a difference.
> > only when I backed it up with tar. In the event it doesn't lock, tar
> > crashes w/o error/warning (over NFS).
>
> So, it locks not always?
Most of the time, yes it does. I'd say 90% of the time it hard locks. If
it doesn't and I attempt it again, it always hard locks (with one exception).
I've done the tests numerous times.
> > > byte or 1K or 1M? Does it lock immediately as you start the backup or
> >
> > It locks up usually at one point, but not always.
>
> Since you could backup to Jazz, looks like your filesystem is ok, NFS also
> works in principle...
Before one test, I did: cp /dev/sda /dev/null
to see if it has any problems with the disk. It was fine.
> > > after some time (you could start some process in the background
> > > periodically printing some info on the terminal, like vmstat, cat
> > > /proc/interrupts, free, tcpdump on both ends to a file...) Can you try NFS
> >
> > I can do this I think. It's fun when running with init being bash. It will
> > take some time to do since I can't scroll backwards.
>
> You could also attach a serial console and direct the output there (then
> you also can scroll).
I thought about this. Hopefully compiling in serial doesn't add another
variable to this. I currently have serial compiled as a module.
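
For the periodic-info idea quoted above, something along these lines might do
when there is no scrollback. This is only a sketch: the function name and the
log path are made up, and it assumes the log lives on a local disk rather than
on the NFS mount under test. Each snapshot is fsynced so that the last entries
have a chance of surviving a hard lock.

```python
import os
import time


def append_snapshot(sources, log_path):
    """Append the current contents of each source file to log_path."""
    with open(log_path, "a") as log:
        log.write("=== %s ===\n" % time.strftime("%H:%M:%S"))
        for src in sources:
            log.write("--- %s ---\n" % src)
            try:
                with open(src) as f:
                    log.write(f.read())
            except OSError as err:
                # Keep going so one unreadable file does not lose the rest.
                log.write("(unreadable: %s)\n" % err)
        # Push the data to disk before the next interval.
        log.flush()
        os.fsync(log.fileno())
```

Calling append_snapshot(["/proc/interrupts", "/proc/meminfo"], "/root/lockup.log")
in a loop with a short sleep while tar runs would leave a record of the last
few seconds before the lock.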
> > > over TCP? Are other machines, where backup works, also running 2.6,
> >
> > I can try TCP, but I'm not sure about the server accepting TCP (was there a
> > compile time option for NFSD to use TCP?) These 2 machines are the only
>
> Yes.
I did not compile the server with TCP support.
> > ones I have on 2.6.
> >
> > > 10/100mbps?
> >
> > 100 FD always.
>
> Why I am interested in your experiences is that I also have a problem
> transferring large (several M) files over NFS when the server is 2.6 and
> both ends have 100 FD. (You can see my posts this week about 2.6 NFS.) And
> in my case TCP fixed it. But I never had hard-locks, just cp hung in
> D, and tcpdump showed timed out reassembly on the receiving side. But I
> was reading from the server.
That's interesting. I hope it doesn't matter if the server is a diskless
machine. Interesting you mention the server being 2.6. The NFS I did above
was to a different (also diskless) server. The 2.6 one I threw a hard disk
on so I could do backups of all my machines (and w/o shutting another down).
Out of the 5 machines on this network, only 2 have usable IDE ports (one has
none, one's a laptop, one is full of cdroms which is the machine that's
hanging on me).
On a side note, I have a 2.4.x (x>=20) server using knfsd with nohide on
directories. A 2.4.x client can see those contents; a 2.6.x client can't w/o
mounting each one individually.
--
Lab tests show that use of micro$oft causes cancer in lab animals