From mboxrd@z Thu Jan 1 00:00:00 1970 From: Tejun Heo Subject: Re: Serious bug in sata_sil module in 2.6.19.2? Date: Tue, 06 Feb 2007 23:44:50 +0900 Message-ID: <45C89462.2090904@gmail.com> References: <45C1BFE0.4080004@effenberger.org> <45C1FBFF.3050706@gmail.com> <45C27A4D.8070802@effenberger.org> <45C82071.5080808@gmail.com> <45C85C3B.4060303@effenberger.org> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-15 Content-Transfer-Encoding: 7bit Return-path: Received: from wx-out-0506.google.com ([66.249.82.224]:15715 "EHLO wx-out-0506.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751998AbXBFOo5 (ORCPT ); Tue, 6 Feb 2007 09:44:57 -0500 Received: by wx-out-0506.google.com with SMTP id h31so1896072wxd for ; Tue, 06 Feb 2007 06:44:56 -0800 (PST) In-Reply-To: <45C85C3B.4060303@effenberger.org> Sender: linux-ide-owner@vger.kernel.org List-Id: linux-ide@vger.kernel.org To: Florian Effenberger Cc: jgarzik@pobox.com, linux-ide@vger.kernel.org Florian Effenberger wrote: > Hi there, > >> That definitely looks like libata error messages but can't tell anything >> other than that from it. It could be cause of system hang and the weird >> screen or just another symptom of another problem. >> >> Is it possible for you to connect a serial console or configure >> netconsole (Documentation/networking/netconsole.txt) such that the >> messages are preserved after such hang occurs? Also, please turn on >> PRINTK_TIME (Kernel Hacking -> Show timing information on printks) so >> that we can tell what happens when. To make the info more useful, you >> can log into the machine from another machine and run something like >> "while true; do sleep 1; date; done" on it such that you can tel exactly >> when the machine went down. > > thanks a lot for your feedback. It seems we solved the problem: it was > the power supplying unit! We change some stuff (memory, cleaned the > machine and so on), but after changing the power supplying unit, > everything worked fine. So I guess the new kernel just had a little bit > more power consumption than the previous one. > > Thanks a lot for your kind help, and sorry for the false alert! :-) Yeap, when power quality degrades, the first thing that breaks is SATA, so that explains the error message. There have been several SATA bug reports which turned out to be PSU problems. Good to have another obvious data point on that. :-) -- tejun