All of lore.kernel.org
 help / color / mirror / Atom feed
From: Tejun Heo <htejun@gmail.com>
To: Robert Hancock <hancockr@shaw.ca>
Cc: Mark Lord <liml@rtr.ca>, Jeff Garzik <jeff@garzik.org>,
	IDE/ATA development list <linux-ide@vger.kernel.org>,
	Allen Martin <AMartin@nvidia.com>, Peer Chen <pchen@nvidia.com>,
	Kuan Luo <kluo@nvidia.com>
Subject: Re: disabling sata_nv ADMA for 2.6.24
Date: Tue, 08 Jan 2008 12:08:02 +0900	[thread overview]
Message-ID: <4782E912.1050204@gmail.com> (raw)
In-Reply-To: <4782E78F.9050205@shaw.ca>

Robert Hancock wrote:
> Tejun Heo wrote:
>> Tejun Heo wrote:
>>> Robert Hancock wrote:
>>>>> Okay, just succeeded on the current #upstream-fixes, attaching the
>>>>> log.
>>>>>  The machine is a brick after the crash.
>>>> I assume the cable got reconnected at 325 seconds? It looks like that
>>>> was during error handling for the previous unplug?
>>> I don't remember too well (the console was more than two meters away and
>>> I was just keeping disconnecting and reconnecting.  I noticed the
>>> machine was frozen after I came back to console, so...
>>>
>>>> [  314.987885] ata3: timeout waiting for ADMA IDLE, stat=0x400
>>>> [  314.993556] ata3: timeout waiting for ADMA LEGACY, stat=0x400
>>>> [  315.009915] ata3.00: exception Emask 0x10 SAct 0x1 SErr 0x1910000
>>>> action 0xa frozen
>>>> [  315.017708] ata3.00: ADMA status 0x00000402: , hot unplug
>>>> [  315.017714] ata3: SError: { PHYRdyChg Dispar LinkSeq TrStaTrns }
>>>> [  315.029239] ata3.00: cmd 60/01:00:92:d7:12/00:00:05:00:00/40 tag 0
>>>> ncq 512 in
>>>> [  315.029240]          res 40/00:04:92:d7:12/00:04:92:d7:12/40 Emask
>>>> 0x10 (ATA bus error)
>>>> [  315.029243] ata3.00: status: { DRDY }
>>>> [  315.048236] ata3: hard resetting link
>>>> [  315.774982] ata3: SATA link down (SStatus 0 SControl 300)
>>>> [  315.780498] ata3: failed to recover some devices, retrying in 5 secs
>>>> [  320.788427] ata3: hard resetting link
>>>> [  325.242220] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
>>>>
>>>> Not sure if the port would be frozen at this point or not?
>>>>
>>>> It would be useful to add some printks to narrow down at what point the
>>>> lockup happens. If it's a loop, interrupt storm or something then we
>>>> can
>>>> likely fix it, but if the controller's just locking up then we may be
>>>> out of luck..
>>> I think it's machine hard lock up.  NMI watchdog doesn't get triggered.
> 
> Is NMI watchdog actually working on this machine?
> 
> [   34.466899] testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears
> to be stuck (0->0)!
> [   34.555056] WARNING: CPU#1: NMI appears to be stuck (0->0)!

Oops, missed that.  I'll see whether there's IRQ storm going on.

>> Ah.. another thing.  Sometimes when I swap two drives, sata_nv fails to
>> detect the new drive.  If I pull out the plug and replug it, it then
>> recognizes the new drive.
> 
> No output in that case, I assume?

It seems what happens is sata_nv EH loses hotplug events during
hardreset is going on.  This is a bit tricky.  I'm not sure whether it's
sata_nv's fault or other drivers are working out of dumb luck.  I'll
reproduce the problem and post the log when I get some time.

Thanks.

-- 
tejun

  reply	other threads:[~2008-01-08  3:08 UTC|newest]

Thread overview: 41+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-01-07  9:25 disabling sata_nv ADMA for 2.6.24 Tejun Heo
2008-01-07 15:15 ` Mark Lord
2008-01-07 15:35   ` [PATCH #upstream-fixes] sata_nv: disable ADMA mode by default Tejun Heo
2008-01-10  5:58     ` Jeff Garzik
2008-01-10  6:29       ` Tejun Heo
2008-01-07 23:35   ` disabling sata_nv ADMA for 2.6.24 Robert Hancock
2008-01-07 23:56     ` Tejun Heo
2008-01-08  0:12       ` Robert Hancock
2008-01-08  1:01         ` Tejun Heo
2008-01-08  1:16           ` Tejun Heo
2008-01-08  2:29             ` Robert Hancock
2008-01-08  2:53               ` Tejun Heo
2008-01-08  2:55                 ` Tejun Heo
2008-01-08  3:01                   ` Robert Hancock
2008-01-08  3:08                     ` Tejun Heo [this message]
2008-01-08  9:58                       ` Tejun Heo
2008-01-08 14:40                         ` Robert Hancock
2008-01-09  1:58                           ` Tejun Heo
2008-01-09  2:00                             ` Tejun Heo
2008-01-09  3:50                               ` Robert Hancock
2008-01-09  5:09                                 ` Tejun Heo
2008-01-10  0:33                                   ` Robert Hancock
2008-01-10  6:59                                     ` Tejun Heo
2008-01-11  7:54                                     ` fixed a bug of adma in rhel4u5 with HDS7250SASUN500G Kuan Luo
2008-01-11 14:29                                       ` Robert Hancock
2008-01-11 21:57                                         ` David Milburn
2008-01-12  1:07                                       ` Robert Hancock
2008-01-14  3:08                                         ` Kuan Luo
2008-01-14  5:20                                           ` Robert Hancock
2008-01-14  6:23                                             ` Kuan Luo
2008-01-23  9:32                                             ` sata_nv and 2.6.24 (was Re: fixed a bug of adma in rhel4u5 with HDS7250SASUN500G.) Jeff Garzik
2008-01-23 14:44                                               ` Robert Hancock
2008-01-24  1:42                                                 ` Jeff Garzik
2008-01-24  1:53                                                   ` Robert Hancock
2008-01-24  0:43                                           ` fixed a bug of adma in rhel4u5 with HDS7250SASUN500G Robert Hancock
2008-01-24  3:20                                             ` Kuan Luo
2008-01-28 23:50                                               ` Robert Hancock
2008-01-29  2:48                                                 ` Kuan Luo
2008-01-29  2:48                                                   ` Kuan Luo
2008-01-29  4:59                                                 ` Kuan Luo
2008-01-29  4:59                                                   ` Kuan Luo

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4782E912.1050204@gmail.com \
    --to=htejun@gmail.com \
    --cc=AMartin@nvidia.com \
    --cc=hancockr@shaw.ca \
    --cc=jeff@garzik.org \
    --cc=kluo@nvidia.com \
    --cc=liml@rtr.ca \
    --cc=linux-ide@vger.kernel.org \
    --cc=pchen@nvidia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.