linux-ide.vger.kernel.org archive mirror
From: Tejun Heo <htejun@gmail.com>
To: Kasper Sandberg <lkml@metanurb.dk>
Cc: Gene Heskett <gene.heskett@gmail.com>,
	Mikael Pettersson <mikpe@it.uu.se>,
	Peter Zijlstra <a.p.zijlstra@chello.nl>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Linux ide Mailing list <linux-ide@vger.kernel.org>
Subject: Re: Problem with ata layer in 2.6.24
Date: Sat, 02 Feb 2008 16:13:36 +0900
Message-ID: <47A41820.2070907@gmail.com>
In-Reply-To: <1201582875.12795.15.camel@localhost>

Kasper Sandberg wrote:
> to put some timeline perspective into this.
> i believe it was in 2005 i assembled the system, and when i realized it
> was faulty, on the old ide driver, i stopped using it - that might have been
> in the beginning of 2006. then for almost a year i wasn't using it, hoping
> to somehow fix it, but in january 2007 i think it was, at least in the
> very beginning of 2007, i hit upon the idea of trying libata, and ever
> since the system has been running 24/7 - hitting these errors around 2
> times a day.
> 
> i have reported my problems to lkml multiple times, but nothing has
> happened. i also tried to approach jgarzik directly, but he was not
> interested.
> 
> i really hope this can be solved now, it's a huge problem
> 
> my fileserver has an asus k8v motherboard, with a via chipset (k8t880 i
> think it is, or something like it). currently using the promise
> controller again (strangely enough all the timeouts seem to happen here,
> and when the ITE was on, there, not the onboard one), in conjunction
> with the onboard via.

Timeouts are nasty to debug.  They can be caused by a whole range of
different problems including transmission errors, bad power, a faulty
drive, a mishandled media error, IRQ misrouting, or a dumb hardware bug.
A timeout basically means 'uh... I told the controller to do something
but it never called me back'.

If you see timeouts on multiple devices connected to different
controllers, chances are that the problem lies somewhere else.  The
most likely culprit is bad power.  Please...

* Post the output of 'lspci -nn' and the kernel log, including the full
boot log and the error messages.

* Try to isolate the problem, i.e. does removing some of the drives fix
the problem?  If the problem is localized to a certain device, what
happens if you move it?  Does the problem follow the drive or stay
with the port?  If the failing drives are SATA, it's a good idea to
power some of the failing drives with a separate PSU and see whether
anything is different.
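
To see whether the timeouts follow a drive or stay with a port, it can
help to tally the timeout messages per ataN port before and after moving
a drive.  The following is only a rough sketch: the function name and
the sample lines are made up for illustration, and real kernel messages
differ between versions and drivers, so the pattern will likely need
adjusting to what your log actually contains.

```python
import re
from collections import Counter

# Matches a libata port/device prefix such as "ata3:" or "ata3.00:".
ATA_PREFIX = re.compile(r"\b(ata\d+)(?:\.\d+)?:")


def count_timeouts_by_port(log_lines):
    """Count log lines that mention a timeout, grouped by ataN port."""
    counts = Counter()
    for line in log_lines:
        match = ATA_PREFIX.search(line)
        if match and "timeout" in line.lower():
            counts[match.group(1)] += 1
    return counts


# Illustrative lines only (an assumption, not taken from the reporter's log).
sample = [
    "ata3.00: qc timeout (cmd 0xc8)",
    "ata3: soft resetting port",
    "ata5.00: qc timeout (cmd 0xca)",
    "ata3.00: qc timeout (cmd 0xc8)",
]

if __name__ == "__main__":
    for port, hits in sorted(count_timeouts_by_port(sample).items()):
        print(port, hits)
```

Running the same tally on logs taken before and after swapping two
drives shows whether the counts move with the drive or stay on the port.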

Trying to isolate the hardware problem tells us more about the error
condition, and even when the problem turns out not to be a hardware
problem, it gives us much deeper insight into it and clues about
where to look.

Thanks.

-- 
tejun
