From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@bugzilla.kernel.org Subject: [Bug 13594] New: SMART responses for SATA disks on SAS get interpreted as errors Date: Sun, 21 Jun 2009 17:26:29 GMT Message-ID: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: QUOTED-PRINTABLE Return-path: Received: from demeter.kernel.org ([140.211.167.39]:36341 "EHLO demeter.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751266AbZFUR00 convert rfc822-to-8bit (ORCPT ); Sun, 21 Jun 2009 13:26:26 -0400 Received: from demeter.kernel.org (localhost.localdomain [127.0.0.1]) by demeter.kernel.org (8.14.2/8.14.2) with ESMTP id n5LHQThU028527 for ; Sun, 21 Jun 2009 17:26:29 GMT Sender: linux-scsi-owner@vger.kernel.org List-Id: linux-scsi@vger.kernel.org To: linux-scsi@vger.kernel.org http://bugzilla.kernel.org/show_bug.cgi?id=3D13594 Summary: SMART responses for SATA disks on SAS get interpret= ed as errors Product: IO/Storage Version: 2.5 Kernel Version: 2.6.30-rc6 Platform: All OS/Version: Linux Tree: Mainline Status: NEW Severity: normal Priority: P1 Component: SCSI AssignedTo: linux-scsi@vger.kernel.org ReportedBy: sgunderson@bigfoot.com Regression: No Hi, I just bought a LSI SAS3081E-R which I use against a Supermicro backpla= ne to drive ten Seagate SATA disks (7200.11, 750GB and 1.5GB). I'm using the standard Linux Fusion MPT device driver (CONFIG_FUSION_SAS) under Linux 2.6.30-rc6. Everything seems to work pretty well, with one exception: W= hen I use SMART against the drives (say, smartctl -a /dev/sda) the kernel com= plains with: [ 811.091916] sd 0:0:0:0: [sda] Sense Key : Recovered Error [current= ] [descriptor] [ 811.099807] Descriptor sense data with sense descriptors (in hex): [ 811.106175] 72 01 00 1d 00 00 00 0e 09 0c 00 00 00 00 00 0= 0 [ 811.113262] 00 4f 00 c2 00 50 [ 811.117379] sd 0:0:0:0: [sda] Add. Sense: ATA pass through informa= tion available I've tried upgrading to the newest firmware (1.28.02.00, 05-MAY-2009), = but all that changed is that the hex dump was added to the error message. Whenever this happens, it appears like all the disks =E2=80=9Chiccup=E2= =80=9D and the kernel loses contact with the controller for a small while. If too many of the= se happen at once, eventually disks start falling off RAIDs, and the entir= e machine goes down. It looks to me as if these messages should simply no= t be treated as errors by the kernel -- smartctl explicitly asks for a respo= nse even if the command doesn't fail (by setting CK_COND), so the response proba= bly shouldn't be taken as an error. --=20 Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=3Demail ------- You are receiving this mail because: ------- You are the assignee for the bug.-- To unsubscribe from this list: send the line "unsubscribe linux-scsi" i= n the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html