From: Onis
Subject: sata_mv dropping disks
Date: Fri, 19 May 2006 00:31:31 +0300
Message-ID: <20060518213131.GA10777@virasto.com>
To: linux-ide@vger.kernel.org

Hello,

I got warnings while rebuilding an md raid5 array. The controller is an
88SX5081 with 8 x Maxtor 300GB 7V300F0 drives. I've run badblocks -w on
all disks, and smartctl doesn't report errors.

----
BUG: warning at drivers/scsi/sata_mv.c:1884/mv_channel_reset()

Call Trace:
  {mv_channel_reset+238} {mv_stop_and_reset+55}
  {mv_interrupt+631} {handle_IRQ_event+44}
  {__do_IRQ+176} {do_IRQ+66}
  {ret_from_intr+0} {get_request_wait+35}
  {xor_sse_5+191} {compute_block+221}
  {generic_make_request+495} {handle_stripe+7840}
  {raid5d+349} {prepare_to_wait+24}
  {keventd_create_kthread+0} {md_thread+300}
  {autoremove_wake_function+0} {autoremove_wake_function+0}
  {md_thread+0} {kthread+217}
  {child_rip+8} {keventd_create_kthread+0}
  {kthread+0} {child_rip+0}

BUG: warning at drivers/scsi/sata_mv.c:1904/__msleep()

Call Trace:
  {__mv_phy_reset+241} {mv_channel_reset+250}
  {mv_interrupt+631} {handle_IRQ_event+44}
  {__do_IRQ+176} {do_IRQ+66}
  {ret_from_intr+0} {get_request_wait+35}
  {xor_sse_5+191} {compute_block+221}
  {generic_make_request+495} {handle_stripe+7840}
  {raid5d+349} {prepare_to_wait+24}
  {keventd_create_kthread+0} {md_thread+300}
  {autoremove_wake_function+0} {autoremove_wake_function+0}
  {md_thread+0} {kthread+217}
  {child_rip+8} {keventd_create_kthread+0}
  {kthread+0} {child_rip+0}

ata4: translated ATA stat/err 0x50/01 to SCSI SK/ASC/ASCQ 0x3/13/00
ata4: status=0x50 { DriveReady SeekComplete }
ata4: error=0x01 { AddrMarkNotFound }
sata_mv: PCI ERROR; PCI IRQ cause=0x28000020

What does "PCI IRQ cause=0x28000020" mean?

A few minutes after that, the rebuild stopped:

----
sd 6:0:0:0: SCSI error: return code = 0x40000
end_request: I/O error, dev sdg, sector 403739536
sd 6:0:0:0: SCSI error: return code = 0x40000
end_request: I/O error, dev sdg, sector 403739544
sd 6:0:0:0: SCSI error: return code = 0x40000
end_request: I/O error, dev sdg, sector 403739552
md: md0: sync done.

# cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4]
md0 : active raid5 sda[0] sdg[8](F) sdh[7] sdf[5] sde[4] sdd[3] sdc[2] sdb[1]
      2051400960 blocks level 5, 128k chunk, algorithm 2 [8/7] [UUUUUU_U]

# hdparm -I /dev/sdg
/dev/sdg:
 HDIO_DRIVE_CMD(identify) failed: Input/output error

I'm also getting a lot of these on all ports at boot; smartctl triggers
them as well:

----
ata3: translated ATA stat/err 0xd0/00 to SCSI SK/ASC/ASCQ 0xb/47/00
ata3: status=0xd0 { Busy }
ata1: translated ATA stat/err 0xd0/00 to SCSI SK/ASC/ASCQ 0xb/47/00
ata1: status=0xd0 { Busy }
...

System
------
* Tyan Thunder S2882 Dual Opteron 240
* Marvell Technology Group Ltd. MV88SX5081 8-port SATA I PCI-X Controller
* 8 x Maxtor Maxline III 300GB SATA2
* Debian Sarge AMD64
* kernel 2.6.17-rc4-mm1

- Onis
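
For reading the hex values in the logs above, here is a minimal Python
sketch. It assumes the standard ATA status register layout (the kernel's
"DriveReady" and "SeekComplete" labels correspond to DRDY and DSC). The
helper names decode_ata_status and set_bits are made up for illustration.
For the PCI IRQ cause value it only lists which bit positions are set; what
each bit means is defined by the Marvell controller documentation and the
sata_mv driver, and is not decoded here.

```python
# Standard ATA status register bits, highest bit first.
ATA_STATUS_BITS = {
    0x80: "BSY",
    0x40: "DRDY",
    0x20: "DF",
    0x10: "DSC",
    0x08: "DRQ",
    0x04: "CORR",
    0x02: "IDX",
    0x01: "ERR",
}


def decode_ata_status(status):
    """Return the names of the bits set in an ATA status byte (hypothetical helper)."""
    return [name for mask, name in ATA_STATUS_BITS.items() if status & mask]


def set_bits(value):
    """Return the positions of the bits set in an integer, lowest first (hypothetical helper)."""
    return [i for i in range(value.bit_length()) if (value >> i) & 1]


print(decode_ata_status(0x50))  # ['DRDY', 'DSC']
print(decode_ata_status(0xD0))  # ['BSY', 'DRDY', 'DSC']
print(set_bits(0x28000020))     # [5, 27, 29]
```

Note that the kernel prints only "Busy" for 0xd0 even though DRDY and DSC
are also set in that byte.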