All of lore.kernel.org
 help / color / mirror / Atom feed
From: Adrian Bunk <bunk@stusta.de>
To: Haakon Riiser <haakon.riiser@fys.uio.no>, jgarzik@pobox.com
Cc: linux-kernel@vger.kernel.org, linux-ide@vger.kernel.org
Subject: SATA: Panic in SCSI subsystem
Date: Wed, 4 Jan 2006 23:26:13 +0100	[thread overview]
Message-ID: <20060104222613.GW3831@stusta.de> (raw)
In-Reply-To: <20060102133507.GA1664@fox.lan>

Jeff, do you have any idea what might cause Haakon's problem?

TIA
Adrian

On Mon, Jan 02, 2006 at 02:35:07PM +0100, Haakon Riiser wrote:
> First of all, a little information on my current system:
> 
>   Distro: Fedora Core 3 (fully updated)
>   Kernel: 2.6.12-1.1381_FC3 (the bug also occurs on the latest 2.6.14 kernel
>           from Fedora Core 4)
>   Motherboard: Asus K8V SE Deluxe (Socket 754, VIA K8T800)
>   CPU: Sempron 3000+
>   Memory: 2 x TwinMOS 256 MB DDR400 (CL2.5)
>   PATA disk: Maxtor 6Y160P0
>   SATA disks: 4 x 250 GB Seagate Barracuda 7200.8 SATA NCQ
>   SATA controller: Promise SATA-II 150 TX4 PCI
> 
> The system boots from the PATA disk, which among other things
> contains the OS.  The four SATA disks are set up in a software
> RAID5 array using mdadm, and this is where I store all my important
> data.
> 
> Until recently, this file server contained a 650 MHz Pentium III
> CPU and an Asus P3B-F motherboard.  This configuration worked
> flawlessly.
> 
> After upgrading the motherboard/CPU/memory on my file server to
> the configuration listed on top, I started getting kernel panics
> in the SCSI subsystem.  Once it happens, the machine freezes
> completely, so I only have the logs that appear on screen.
> The following is copied down by hand, so I left out the full
> memory adresses preceeding each line in the stack trace:
> 
>   ata7: called with no error (51)!
>   Kernel panic - not syncing: drivers/scsi/scsi.c:297: spin_is_locked
>    on uninitialized spinlock deadc014. (Not tainted)
> 
>   panic+0x42/0x1c0
>   scsi_put_command+0x19c/0x2bf [scsi_mod]
>   scsi_next_command+0xc/0x14 [scsi_mod]
>   scsi_end_request+0xed/0x200 [scsi_mod]
>   mempool_free+0x67/0x1aa
>   scsi_io_completion+0x151/0x4d4 [scsi_mod]
>   sd_rw_intr+0x156/0x30f [sd_mod]
>   scsi_finish_command+0x8e/0xc1 [scsi_mod]
>   net_rx_action+0xbb/0x2bc
>   scsi_softirq+0x9d/0xcd [scsi_mod]
>   __do_softirq+0x3e/0x8a
>   do_softirq+0x39/0x40
>   =======================================
>   do_IRQ+0x53/0x85
>   common_interrupt+0x1a/0x20
>   acpi_processor_idle+0x01/0x27f
>   acpi_processor_idle+0x101/0x27f
>   cpu_idle+0x3c/0x51
>   start_kernel+0x175/0x1b1
>   unknown_bootoption+0x0/0x1b0
> 
> The above log isn't the same every time, but it always includes
> SCSI related stuff like scsi_finish_command.  I upgraded around
> a month ago, and until yesterday, it only crashed three or four
> times.  Now, after the last crash, the time between crashes has
> been reduced from about a week to about five minutes -- the system
> is completely unusable!  I noticed that every time I boot the
> system, the RAID array starts the resyncing process.  It always
> crashes before finishing, forcing it to start from scratch on
> every boot.  I'm guessing that the stress from the resync operation
> is what's causing it to crash within five minutes.  I don't know
> what prompted it to start resyncing in the first place, though.
> (Btw, how dangerous is a crash in the resync process?  Is there
> a high risk of data corruption?)
> 
> Until I know more, I have downgraded to my old CPU/motherboard.
> It's still resyncing, so I don't know if the problem has gone away,
> but it's been up for almost 100 minutes, and I still haven't seen a
> single "ataN: called with no error (51)!" that usually preceeds the
> kernel panic, so I think it's still working with my old hardware.
> 
> One last thing:  The new motherboard actually has four SATA
> connectors (two that connects to an onboard Promise TX2 controller,
> and two that connects to an onboard VIA controller), so I tried
> taking out my Promise TX4 card, and connecting the disks to the
> onboard connectors.  This didn't make any difference -- the system
> panics in exactly the same way as before.
> 
> Any help is greatly appreciated!
> 
> -- 
>  Haakon

      reply	other threads:[~2006-01-04 22:26 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-01-02 13:35 Panic in SCSI subsystem Haakon Riiser
2006-01-04 22:26 ` Adrian Bunk [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20060104222613.GW3831@stusta.de \
    --to=bunk@stusta.de \
    --cc=haakon.riiser@fys.uio.no \
    --cc=jgarzik@pobox.com \
    --cc=linux-ide@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.