* [1/6] 2.6.21-rc2: known regressions [not found] <Pine.LNX.4.64.0702272105220.12485@woody.linux-foundation.org> @ 2007-03-05 1:50 ` Adrian Bunk 2007-03-05 2:26 ` Andrew Morton ` (3 more replies) 2007-03-05 1:50 ` [2/6] " Adrian Bunk 1 sibling, 4 replies; 45+ messages in thread From: Adrian Bunk @ 2007-03-05 1:50 UTC (permalink / raw) To: Linus Torvalds, Andrew Morton Cc: Ayaz Abdulla, maxk, Jeff Garzik, Marcel Holtmann, linux-pm, Linux Kernel Mailing List, Sid Boyce, netdev, bluez-devel, Pavel Machek, Matt Mackall, Mark Lord, Johannes Berg, Albert Hopkins This email lists some known regressions in 2.6.21-rc2 compared to 2.6.20 that are not yet fixed in Linus' tree. If you find your name in the Cc header, you are either submitter of one of the bugs, maintainer of an affectected subsystem or driver, a patch of you caused a breakage or I'm considering you in any other way possibly involved with one or more of these issues. Due to the huge amount of recipients, please trim the Cc when answering. Subject : kwin dies silently References : http://lkml.org/lkml/2007/2/28/112 Submitter : Sid Boyce <g3vbv@blueyonder.co.uk> Status : unknown Subject : resume: slab error in verify_redzone_free(): cache `size-512': memory outside object was overwritten References : http://lkml.org/lkml/2007/2/24/41 Submitter : Pavel Machek <pavel@ucw.cz> Handled-By : Marcel Holtmann <marcel@holtmann.org> Status : unknown Subject : bluetooth hardlocks References : http://lkml.org/lkml/2007/3/2/85 Submitter : Pavel Machek <pavel@ucw.cz> Status : unknown Subject : Bluetooth RFComm locks up the machine (device_move() related) References : http://lkml.org/lkml/2007/3/4/64 Submitter : Mark Lord <lkml@rtr.ca> Caused-By : Marcel Holtmann <marcel@holtmann.org> commit c1a3313698895d8ad4760f98642007bf236af2e8 Status : unknown Subject : kref refcounting breakage References : http://lkml.org/lkml/2007/3/2/67 Submitter : Andrew Morton <akpm@linux-foundation.org> Handled-By : Greg KH <greg@kroah.com> Status : unknown Subject : wireless breakage (ipw2200, iwconfig, NetworkManager) References : http://lkml.org/lkml/2007/3/4/135 Submitter : Matt Mackall <mpm@selenic.com> Caused-By : Greg Kroah-Hartman <gregkh@suse.de> (?) commit 43cb76d91ee85f579a69d42bc8efc08bac560278 (?) Handled-By : Johannes Berg <johannes@sipsolutions.net> Status : unknown Subject : forcedeth: skb_over_panic References : http://bugzilla.kernel.org/show_bug.cgi?id=8058 Submitter : Albert Hopkins <kernel@marduk.letterboxes.org> Status : unknown ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-05 1:50 ` [1/6] 2.6.21-rc2: known regressions Adrian Bunk @ 2007-03-05 2:26 ` Andrew Morton 2007-03-05 3:35 ` Greg KH ` (2 subsequent siblings) 3 siblings, 0 replies; 45+ messages in thread From: Andrew Morton @ 2007-03-05 2:26 UTC (permalink / raw) To: Adrian Bunk Cc: Ayaz Abdulla, Pavel Machek, Garzik, Jeff, Marcel Holtmann, Matt Mackall, linux-pm, List, Sid Boyce, netdev, bluez-devel, maxk, Linux, Mark Lord, Johannes Berg, Linus Torvalds, Albert Hopkins On Mon, 5 Mar 2007 02:50:31 +0100 Adrian Bunk <bunk@stusta.de> wrote: > This email lists some known regressions in 2.6.21-rc2 compared to 2.6.20 > that are not yet fixed in Linus' tree. We seem to have broken an unusually large amount of stuff this time. partial post-mortem: - The ACPICA merge landed in -mm super-late: basically it was in mainline a week afterwards and saw only a single -mm release. Part of the reason for this short period in -mm was that ACPICA had its paws all over x86_64 code and conflicted badly with significant changes in the x86_64 tree. That happens sometimes. But when it does, the mess lands in my lap rather than in the laps of the perpetrators. Lesson: keep the code well-factored so that different subsystems don't soil each others' kennels. - The hrtimers/dynticks stuff is simply hard: timekeeping, low-level x86, even APICs. These are areas in which things break a lot, so churning it was inevitably going to cause problems. Lesson: none, I think. Low-level x86 support is just hard, and changing it breaks things. So that accounts for _some_ of the damage, but I wonder if there's more to it than that. ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-05 1:50 ` [1/6] 2.6.21-rc2: known regressions Adrian Bunk 2007-03-05 2:26 ` Andrew Morton @ 2007-03-05 3:35 ` Greg KH 2007-03-06 0:55 ` Johannes Berg 2007-03-05 4:01 ` Mark Lord 2007-03-07 11:06 ` Jeff Garzik 3 siblings, 1 reply; 45+ messages in thread From: Greg KH @ 2007-03-05 3:35 UTC (permalink / raw) To: Adrian Bunk Cc: Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Sid Boyce, Pavel Machek, Marcel Holtmann, linux-pm, maxk, bluez-devel, Mark Lord, Matt Mackall, Johannes Berg, Albert Hopkins, Ayaz Abdulla, Jeff Garzik, netdev On Mon, Mar 05, 2007 at 02:50:31AM +0100, Adrian Bunk wrote: > Subject : kref refcounting breakage > References : http://lkml.org/lkml/2007/3/2/67 > Submitter : Andrew Morton <akpm@linux-foundation.org> > Handled-By : Greg KH <greg@kroah.com> > Status : unknown I'm working on tracking this down still... > Subject : wireless breakage (ipw2200, iwconfig, NetworkManager) > References : http://lkml.org/lkml/2007/3/4/135 > Submitter : Matt Mackall <mpm@selenic.com> > Caused-By : Greg Kroah-Hartman <gregkh@suse.de> (?) > commit 43cb76d91ee85f579a69d42bc8efc08bac560278 (?) > Handled-By : Johannes Berg <johannes@sipsolutions.net> > Status : unknown I really think this is a CONFIG_SYSFS_DEPRECATED issue (not being set), but want to get Matt confirm either way before saying this is a real issue or not. thanks, greg k-h ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-05 3:35 ` Greg KH @ 2007-03-06 0:55 ` Johannes Berg 0 siblings, 0 replies; 45+ messages in thread From: Johannes Berg @ 2007-03-06 0:55 UTC (permalink / raw) To: Greg KH Cc: Ayaz Abdulla, Pavel Machek, Jeff Garzik, netdev, Marcel Holtmann, linux-pm, Linux Kernel Mailing List, Adrian Bunk, bluez-devel, maxk, Matt Mackall, Mark Lord, Andrew Morton, Sid Boyce, Linus Torvalds, Albert Hopkins [-- Attachment #1.1: Type: text/plain, Size: 826 bytes --] On Sun, 2007-03-04 at 19:35 -0800, Greg KH wrote: > > Subject : wireless breakage (ipw2200, iwconfig, NetworkManager) > > References : http://lkml.org/lkml/2007/3/4/135 > > Submitter : Matt Mackall <mpm@selenic.com> > > Caused-By : Greg Kroah-Hartman <gregkh@suse.de> (?) > > commit 43cb76d91ee85f579a69d42bc8efc08bac560278 (?) > > Handled-By : Johannes Berg <johannes@sipsolutions.net> > > Status : unknown > > I really think this is a CONFIG_SYSFS_DEPRECATED issue (not being set), > but want to get Matt confirm either way before saying this is a real > issue or not. I just tested on my quad powermac that also runs a stock Debian/unstable and changing that option makes all the difference. Nevermind that it defaults to yes and one needs to turn it off explicitly... johannes [-- Attachment #1.2: This is a digitally signed message part --] [-- Type: application/pgp-signature, Size: 190 bytes --] [-- Attachment #2: Type: text/plain, Size: 0 bytes --] ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-05 1:50 ` [1/6] 2.6.21-rc2: known regressions Adrian Bunk 2007-03-05 2:26 ` Andrew Morton 2007-03-05 3:35 ` Greg KH @ 2007-03-05 4:01 ` Mark Lord 2007-03-05 4:34 ` Greg KH 2007-03-07 11:06 ` Jeff Garzik 3 siblings, 1 reply; 45+ messages in thread From: Mark Lord @ 2007-03-05 4:01 UTC (permalink / raw) To: Adrian Bunk Cc: Ayaz Abdulla, Pavel Machek, Jeff Garzik, Marcel Holtmann, linux-pm, Linux Kernel Mailing List, Sid Boyce, netdev, bluez-devel, maxk, Matt Mackall, Andrew Morton, Linus Torvalds, Johannes Berg, Albert Hopkins Adrian Bunk wrote: > > Subject : Bluetooth RFComm locks up the machine (device_move() related) > References : http://lkml.org/lkml/2007/3/4/64 > Submitter : Mark Lord <lkml@rtr.ca> > Caused-By : Marcel Holtmann <marcel@holtmann.org> > commit c1a3313698895d8ad4760f98642007bf236af2e8 > Status : unknown A 2-line patch exists for fs/sysfs/dir.c to address this. Waiting on Greg to apply it or substitute something prettier. ;) Cheers ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-05 4:01 ` Mark Lord @ 2007-03-05 4:34 ` Greg KH 2007-03-05 12:42 ` Marcel Holtmann 0 siblings, 1 reply; 45+ messages in thread From: Greg KH @ 2007-03-05 4:34 UTC (permalink / raw) To: Mark Lord Cc: Adrian Bunk, Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Sid Boyce, Pavel Machek, Marcel Holtmann, linux-pm, maxk, bluez-devel, Matt Mackall, Johannes Berg, Albert Hopkins, Ayaz Abdulla, Jeff Garzik, netdev On Sun, Mar 04, 2007 at 11:01:33PM -0500, Mark Lord wrote: > Adrian Bunk wrote: > > > >Subject : Bluetooth RFComm locks up the machine (device_move() related) > >References : http://lkml.org/lkml/2007/3/4/64 > >Submitter : Mark Lord <lkml@rtr.ca> > >Caused-By : Marcel Holtmann <marcel@holtmann.org> > > commit c1a3313698895d8ad4760f98642007bf236af2e8 > >Status : unknown > > A 2-line patch exists for fs/sysfs/dir.c to address this. > Waiting on Greg to apply it or substitute something prettier. ;) I want to see if Marcel agrees with it, as he did the original patch in that area. thanks, greg k-h ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-05 4:34 ` Greg KH @ 2007-03-05 12:42 ` Marcel Holtmann 0 siblings, 0 replies; 45+ messages in thread From: Marcel Holtmann @ 2007-03-05 12:42 UTC (permalink / raw) To: Greg KH Cc: Ayaz Abdulla, Pavel Machek, Jeff Garzik, netdev, linux-pm, Linux Kernel Mailing List, Adrian Bunk, bluez-devel, maxk, Matt Mackall, Mark Lord, Andrew Morton, Sid Boyce, Linus Torvalds, Johannes Berg, Albert Hopkins Hi Greg, > > >Subject : Bluetooth RFComm locks up the machine (device_move() related) > > >References : http://lkml.org/lkml/2007/3/4/64 > > >Submitter : Mark Lord <lkml@rtr.ca> > > >Caused-By : Marcel Holtmann <marcel@holtmann.org> > > > commit c1a3313698895d8ad4760f98642007bf236af2e8 > > >Status : unknown > > > > A 2-line patch exists for fs/sysfs/dir.c to address this. > > Waiting on Greg to apply it or substitute something prettier. ;) > > I want to see if Marcel agrees with it, as he did the original patch in > that area. I am not deep enough in the sysfs code to tell you if Mark's change it correct or not. It looks however fully reasonable to me. From the higher level perspective of the device_move() usage the RFCOMM code looks correct and has been tested before I submitted it for inclusion. Regards Marcel ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-05 1:50 ` [1/6] 2.6.21-rc2: known regressions Adrian Bunk ` (2 preceding siblings ...) 2007-03-05 4:01 ` Mark Lord @ 2007-03-07 11:06 ` Jeff Garzik 2007-03-07 22:17 ` Albert Hopkins 3 siblings, 1 reply; 45+ messages in thread From: Jeff Garzik @ 2007-03-07 11:06 UTC (permalink / raw) To: Adrian Bunk, Albert Hopkins, Ayaz Abdulla Cc: Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Sid Boyce, Pavel Machek, Marcel Holtmann, linux-pm, maxk, bluez-devel, Mark Lord, Greg KH, Matt Mackall, Johannes Berg, netdev Adrian Bunk wrote: > Subject : forcedeth: skb_over_panic > References : http://bugzilla.kernel.org/show_bug.cgi?id=8058 > Submitter : Albert Hopkins <kernel@marduk.letterboxes.org> > Status : unknown I think this is fixed by the recent forcedeth 3-patch patchset? Can Albert or others confirm? Jeff ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [1/6] 2.6.21-rc2: known regressions 2007-03-07 11:06 ` Jeff Garzik @ 2007-03-07 22:17 ` Albert Hopkins 0 siblings, 0 replies; 45+ messages in thread From: Albert Hopkins @ 2007-03-07 22:17 UTC (permalink / raw) To: Jeff Garzik Cc: Adrian Bunk, Ayaz Abdulla, Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Sid Boyce, Pavel Machek, Marcel Holtmann, linux-pm, maxk, bluez-devel, Mark Lord, Greg KH, Matt Mackall, Johannes Berg, netdev On Wed, 2007-03-07 at 06:06 -0500, Jeff Garzik wrote: > Adrian Bunk wrote: > > Subject : forcedeth: skb_over_panic > > References : http://bugzilla.kernel.org/show_bug.cgi?id=8058 > > Submitter : Albert Hopkins <kernel@marduk.letterboxes.org> > > Status : unknown > > > I think this is fixed by the recent forcedeth 3-patch patchset? Can > Albert or others confirm? > > Jeff > I'll reflect this in th bug report, but I recently tried the forcedeth driver in 2.6.21-rc3 and still get an oops. -- Albert W. Hopkins ^ permalink raw reply [flat|nested] 45+ messages in thread
* [2/6] 2.6.21-rc2: known regressions [not found] <Pine.LNX.4.64.0702272105220.12485@woody.linux-foundation.org> 2007-03-05 1:50 ` [1/6] 2.6.21-rc2: known regressions Adrian Bunk @ 2007-03-05 1:50 ` Adrian Bunk 2007-03-07 11:09 ` Jeff Garzik 2007-03-08 12:31 ` Michael S. Tsirkin 1 sibling, 2 replies; 45+ messages in thread From: Adrian Bunk @ 2007-03-05 1:50 UTC (permalink / raw) To: Linus Torvalds, Andrew Morton Cc: Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, Michael S. Tsirkin, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, greg, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input This email lists some known regressions in 2.6.21-rc2 compared to 2.6.20 that are not yet fixed in Linus' tree. If you find your name in the Cc header, you are either submitter of one of the bugs, maintainer of an affectected subsystem or driver, a patch of you caused a breakage or I'm considering you in any other way possibly involved with one or more of these issues. Due to the huge amount of recipients, please trim the Cc when answering. Subject : ThinkPad doesn't resume from suspend to RAM References : http://lkml.org/lkml/2007/2/27/80 http://lkml.org/lkml/2007/2/28/348 Submitter : Jens Axboe <jens.axboe@oracle.com> Jeff Chua <jeff.chua.linux@gmail.com> Status : unknown Subject : beeps get longer after suspend References : http://lkml.org/lkml/2007/2/26/276 Submitter : Pavel Machek <pavel@ucw.cz> Status : unknown Subject : ThinkPad T60: no screen after suspend to RAM References : http://lkml.org/lkml/2007/2/22/391 Submitter : Michael S. Tsirkin <mst@mellanox.co.il> Status : unknown Subject : ThinkPad Z60m: usb mouse stops working after suspend to ram References : http://lkml.org/lkml/2007/2/21/413 http://lkml.org/lkml/2007/2/28/172 Submitter : Arkadiusz Miskiewicz <arekm@maven.pl> Caused-By : Konstantin Karasyov <konstantin.a.karasyov@intel.com> commit 0a6139027f3986162233adc17285151e78b39cac Handled-By : Konstantin Karasyov <konstantin.a.karasyov@intel.com> Status : problem is being debugged Subject : AE_NOT_FOUND ACPI messages References : http://bugzilla.kernel.org/show_bug.cgi?id=8066 http://lkml.org/lkml/2007/2/27/263 Submitter : Thomas Meyer <thomas.mey@web.de> Meelis Roos <mroos@linux.ee> Handled-By : Alexey Starikovskiy <alexey.y.starikovskiy@linux.intel.com> Patch : http://bugzilla.kernel.org/show_bug.cgi?id=8066 Status : patch available Subject : Asus M6Ne notebook: ACPI: First battery is not detected References : http://bugzilla.kernel.org/show_bug.cgi?id=8080 Submitter : Janosch Machowinski <jmachowinski@gmx.de> Status : unknown Subject : AT keyboard only works with pci=noacpi References : http://lkml.org/lkml/2007/3/3/68 Submitter : Ash Milsted <thatistosayiseenem@gawab.com> Status : unknown ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-05 1:50 ` [2/6] " Adrian Bunk @ 2007-03-07 11:09 ` Jeff Garzik 2007-03-07 16:10 ` Linus Torvalds 2007-03-08 12:03 ` Ash Milsted 2007-03-08 12:31 ` Michael S. Tsirkin 1 sibling, 2 replies; 45+ messages in thread From: Jeff Garzik @ 2007-03-07 11:09 UTC (permalink / raw) To: Adrian Bunk Cc: Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, Michael S. Tsirkin, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, greg, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input Adrian Bunk wrote: > Subject : AT keyboard only works with pci=noacpi > References : http://lkml.org/lkml/2007/3/3/68 > Submitter : Ash Milsted <thatistosayiseenem@gawab.com> > Status : unknown sounds like a BIOS bug, even though it appears to be a regression? ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-07 11:09 ` Jeff Garzik @ 2007-03-07 16:10 ` Linus Torvalds 2007-03-08 12:03 ` Ash Milsted 1 sibling, 0 replies; 45+ messages in thread From: Linus Torvalds @ 2007-03-07 16:10 UTC (permalink / raw) To: Jeff Garzik, Ash Milsted Cc: Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, Michael S. Tsirkin, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, dmitry.torokhov, linux-input On Wed, 7 Mar 2007, Jeff Garzik wrote: > > Adrian Bunk wrote: > > Subject : AT keyboard only works with pci=noacpi > > References : http://lkml.org/lkml/2007/3/3/68 > > Submitter : Ash Milsted <thatistosayiseenem@gawab.com> > > Status : unknown > > sounds like a BIOS bug, even though it appears to be a regression? So? BIOS bugs are not bugs we can fix, they are things we have to work around. They are like hardware bugs in a network chip: a "driver" that doesn't work around a BIOS bug is simply a *buggy* driver, exactly the same way a network driver has to work around errata in the hardware. So it doesn't really matter if something is a BIOS bug or not. It's not reasonable to expect people to upgrade their BIOS'es - even if such an upgrade were to exist (which is fairly rare in itself). If it used to work, that just makes it (a) doubly important to try to fix it, since regressions are BAD BAD BAD and (b) a fair amount *easier* to fix, since we can hopefully get an idea of what broke it. Ash, can you try to use "git bisect" to figure where it started? But perhaps just try -rc3 first to see if it's been fixed? The working setups (whether with irqfixup or with pci=noacpi seem to both have a nice input: AT Translated Set 2 keyboard as /class/input/inputX but the nonworking one does not. But even the nonworking one actually *found* the controller: PNP: PS/2 Controller [PNP0303:PS2K] at 0x60,0x64 irq 1 PNP: PS/2 controller doesn't have AUX irq; using default 12 serio: i8042 KBD port at 0x60,0x64 irq 1 serio: i8042 AUX port at 0x60,0x64 irq 12 mice: PS/2 mouse device common for all mice which probably means that the "atkbd_probe()" function fails (probably because it never gets a reply, which in turn is probably because the interrupt routing is broken). If you use "atkbd.reset=1" on the kernel command line, you would probably get a message like "keyboard reset failed".. Now, the most likely cause is obviously just the ACPI changes that mess up irq routing somehow, but it would be important to figure out *what* makes it happen, which is why "git bisect" would be wonderful to try. So Ash, if you get the git tree, just start with git bisect start git bisect good v2.6.20 git bisect bad v2.6.21-rc1 and off you go.. git isn't really that hard to use any more. Linus ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-07 11:09 ` Jeff Garzik 2007-03-07 16:10 ` Linus Torvalds @ 2007-03-08 12:03 ` Ash Milsted 1 sibling, 0 replies; 45+ messages in thread From: Ash Milsted @ 2007-03-08 12:03 UTC (permalink / raw) To: Jeff Garzik Cc: Adrian Bunk, Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, Michael S. Tsirkin, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, greg, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, dmitry.torokhov, linux-input On Wed, 07 Mar 2007 06:09:06 -0500 Jeff Garzik <jeff@garzik.org> wrote: > Adrian Bunk wrote: > > Subject : AT keyboard only works with pci=noacpi > > References : http://lkml.org/lkml/2007/3/3/68 > > Submitter : Ash Milsted <thatistosayiseenem@gawab.com> > > Status : unknown > > > sounds like a BIOS bug, even though it appears to be a regression? > Resolved here: http://marc.theaimsgroup.com/?l=linux-kernel&m=117332735404395&w=2 Was a bug in the i8042 driver it seems. Ash ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-05 1:50 ` [2/6] " Adrian Bunk 2007-03-07 11:09 ` Jeff Garzik @ 2007-03-08 12:31 ` Michael S. Tsirkin 2007-03-08 15:11 ` Jeff Chua 2007-03-08 18:01 ` Linus Torvalds 1 sibling, 2 replies; 45+ messages in thread From: Michael S. Tsirkin @ 2007-03-08 12:31 UTC (permalink / raw) To: Adrian Bunk Cc: Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, greg, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input [-- Attachment #1: Type: text/plain, Size: 996 bytes --] > Quoting Adrian Bunk <bunk@stusta.de>: > Subject: [2/6] 2.6.21-rc2: known regressions > > Subject : ThinkPad T60: no screen after suspend to RAM > References : http://lkml.org/lkml/2007/2/22/391 > Submitter : Michael S. Tsirkin <mst@mellanox.co.il> > Status : unknown Here's the status with -rc3: better, but still does not work as well as 2.6.20. 1. Switching to console VT, and running s2ram, I am able to resume to console (I have not tested with echo mem > /sys/power/state yet, both work OK with 2.6.20) This seems to take about as long as with 2.6.20. During this time, I see some messages on console (see attached log) 2. First disk access after resume takes a couple of minutes (seemed instant with 2.6.20) during this time no new messages show on console 3. When I switch to X (CTRL-ALT-F7), X hangs after drawing a couple of windows after waiting for some 10 min, I rebooted. no new messages showed up in /var/log/messages log attached -- MST [-- Attachment #2: newlog --] [-- Type: text/plain, Size: 20275 bytes --] Mar 8 08:47:20 localhost kernel: Inspecting /boot/System.map-2.6.21-rc3-work Mar 8 08:47:20 localhost kernel: Loaded 31265 symbols from /boot/System.map-2.6.21-rc3-work. Mar 8 08:47:20 localhost kernel: Symbols match kernel version 2.6.21. Mar 8 08:47:20 localhost kernel: No module symbols loaded - kernel modules not enabled. Mar 8 08:47:20 localhost kernel: 1 Mar 8 08:47:20 localhost kernel: [ 15.317823] ata2: SATA link down (SStatus 0 SControl 0) Mar 8 08:47:20 localhost kernel: [ 15.317897] scsi2 : ahci Mar 8 08:47:20 localhost kernel: [ 15.379836] ata3: SATA link down (SStatus 0 SControl 0) Mar 8 08:47:20 localhost kernel: [ 15.379909] scsi3 : ahci Mar 8 08:47:20 localhost kernel: [ 15.441854] ata4: SATA link down (SStatus 0 SControl 0) Mar 8 08:47:20 localhost kernel: [ 15.441999] scsi 0:0:0:0: Direct-Access ATA HTS541080G9SA00 MB4I PQ: 0 ANSI: 5 Mar 8 08:47:20 localhost kernel: [ 15.442216] SCSI device sda: 156301488 512-byte hdwr sectors (80026 MB) Mar 8 08:47:20 localhost kernel: [ 15.442292] sda: Write Protect is off Mar 8 08:47:20 localhost kernel: [ 15.443138] SCSI device sda: write cache: enabled, read cache: enabled, doesn't support DPO or FUA Mar 8 08:47:20 localhost kernel: [ 15.443256] SCSI device sda: 156301488 512-byte hdwr sectors (80026 MB) Mar 8 08:47:20 localhost kernel: [ 15.443330] sda: Write Protect is off Mar 8 08:47:20 localhost kernel: [ 15.443406] SCSI device sda: write cache: enabled, read cache: enabled, doesn't support DPO or FUA Mar 8 08:47:20 localhost kernel: [ 15.443490] sda: sda1 sda2 sda3 sda4 < sda5 sda6 > Mar 8 08:47:20 localhost kernel: [ 15.845145] sd 0:0:0:0: Attached scsi disk sda Mar 8 08:47:20 localhost kernel: [ 15.845290] sd 0:0:0:0: Attached scsi generic sg0 type 0 Mar 8 08:47:20 localhost kernel: [ 15.845797] ACPI: PCI Interrupt 0000:00:1d.7[D] -> GSI 19 (level, low) -> IRQ 21 Mar 8 08:47:20 localhost kernel: [ 15.845946] ehci_hcd 0000:00:1d.7: EHCI Host Controller Mar 8 08:47:20 localhost kernel: [ 15.846106] ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 1 Mar 8 08:47:20 localhost kernel: [ 15.846229] ehci_hcd 0000:00:1d.7: debug port 1 Mar 8 08:47:20 localhost kernel: [ 15.846315] ehci_hcd 0000:00:1d.7: irq 21, io mem 0xee444000 Mar 8 08:47:20 localhost kernel: [ 15.850269] ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004 Mar 8 08:47:20 localhost kernel: [ 15.850508] usb usb1: configuration #1 chosen from 1 choice Mar 8 08:47:20 localhost kernel: [ 15.850630] hub 1-0:1.0: USB hub found Mar 8 08:47:20 localhost kernel: [ 15.850697] hub 1-0:1.0: 8 ports detected Mar 8 08:47:20 localhost kernel: [ 15.862005] USB Universal Host Controller Interface driver v3.0 Mar 8 08:47:20 localhost kernel: [ 15.862135] ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:47:20 localhost kernel: [ 15.862268] uhci_hcd 0000:00:1d.0: UHCI Host Controller Mar 8 08:47:20 localhost kernel: [ 15.862381] uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 2 Mar 8 08:47:20 localhost kernel: [ 15.862484] uhci_hcd 0000:00:1d.0: irq 20, io base 0x00001820 Mar 8 08:47:20 localhost kernel: [ 15.862702] usb usb2: configuration #1 chosen from 1 choice Mar 8 08:47:20 localhost kernel: [ 15.862821] hub 2-0:1.0: USB hub found Mar 8 08:47:20 localhost kernel: [ 15.862887] hub 2-0:1.0: 2 ports detected Mar 8 08:47:20 localhost kernel: [ 15.892543] ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 17 (level, low) -> IRQ 22 Mar 8 08:47:20 localhost kernel: [ 15.892676] uhci_hcd 0000:00:1d.1: UHCI Host Controller Mar 8 08:47:20 localhost kernel: [ 15.892789] uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 3 Mar 8 08:47:20 localhost kernel: [ 15.892894] uhci_hcd 0000:00:1d.1: irq 22, io base 0x00001840 Mar 8 08:47:20 localhost kernel: [ 15.893113] usb usb3: configuration #1 chosen from 1 choice Mar 8 08:47:20 localhost kernel: [ 15.893229] hub 3-0:1.0: USB hub found Mar 8 08:47:20 localhost kernel: [ 15.893296] hub 3-0:1.0: 2 ports detected Mar 8 08:47:20 localhost kernel: [ 15.933182] ACPI: PCI Interrupt 0000:00:1d.2[C] -> GSI 18 (level, low) -> IRQ 23 Mar 8 08:47:20 localhost kernel: [ 15.933314] uhci_hcd 0000:00:1d.2: UHCI Host Controller Mar 8 08:47:20 localhost kernel: [ 15.933427] uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 4 Mar 8 08:47:20 localhost kernel: [ 15.933577] uhci_hcd 0000:00:1d.2: irq 23, io base 0x00001860 Mar 8 08:47:20 localhost kernel: [ 15.933797] usb usb4: configuration #1 chosen from 1 choice Mar 8 08:47:20 localhost kernel: [ 15.933915] hub 4-0:1.0: USB hub found Mar 8 08:47:20 localhost kernel: [ 15.933981] hub 4-0:1.0: 2 ports detected Mar 8 08:47:20 localhost kernel: [ 15.976494] ACPI: PCI Interrupt 0000:00:1d.3[D] -> GSI 19 (level, low) -> IRQ 21 Mar 8 08:47:20 localhost kernel: [ 15.976627] uhci_hcd 0000:00:1d.3: UHCI Host Controller Mar 8 08:47:20 localhost kernel: [ 15.976740] uhci_hcd 0000:00:1d.3: new USB bus registered, assigned bus number 5 Mar 8 08:47:20 localhost kernel: [ 15.976843] uhci_hcd 0000:00:1d.3: irq 21, io base 0x00001880 Mar 8 08:47:20 localhost kernel: [ 15.977060] usb usb5: configuration #1 chosen from 1 choice Mar 8 08:47:20 localhost kernel: [ 15.977196] hub 5-0:1.0: USB hub found Mar 8 08:47:20 localhost kernel: [ 15.977263] hub 5-0:1.0: 2 ports detected Mar 8 08:47:20 localhost kernel: [ 16.040824] Initializing USB Mass Storage driver... Mar 8 08:47:20 localhost kernel: [ 16.157808] usb 5-2: new full speed USB device using uhci_hcd and address 2 Mar 8 08:47:20 localhost kernel: [ 16.221329] usb 5-2: configuration #1 chosen from 1 choice Mar 8 08:47:20 localhost kernel: [ 16.224625] usbcore: registered new interface driver usb-storage Mar 8 08:47:20 localhost kernel: [ 16.224692] USB Mass Storage support registered. Mar 8 08:47:20 localhost kernel: [ 16.224886] PNP: PS/2 Controller [PNP0303:KBD,PNP0f13:MOU] at 0x60,0x64 irq 1,12 Mar 8 08:47:20 localhost kernel: [ 16.233142] serio: i8042 KBD port at 0x60,0x64 irq 1 Mar 8 08:47:20 localhost kernel: [ 16.233211] serio: i8042 AUX port at 0x60,0x64 irq 12 Mar 8 08:47:20 localhost kernel: [ 16.233366] mice: PS/2 mouse device common for all mice Mar 8 08:47:20 localhost kernel: [ 16.233716] Advanced Linux Sound Architecture Driver Version 1.0.14rc3 (Tue Mar 06 13:10:00 2007 UTC). Mar 8 08:47:20 localhost kernel: [ 16.234221] ACPI: PCI Interrupt 0000:00:1b.0[B] -> GSI 17 (level, low) -> IRQ 22 Mar 8 08:47:20 localhost kernel: [ 16.236937] input: AT Translated Set 2 keyboard as /class/input/input3 Mar 8 08:47:20 localhost kernel: [ 16.444361] ALSA device list: Mar 8 08:47:20 localhost kernel: [ 16.444428] #0: HDA Intel at 0xee240000 irq 22 Mar 8 08:47:20 localhost kernel: [ 16.444525] nf_conntrack version 0.5.0 (7168 buckets, 57344 max) Mar 8 08:47:20 localhost kernel: [ 16.444736] ip_tables: (C) 2000-2006 Netfilter Core Team Mar 8 08:47:20 localhost kernel: [ 16.444809] TCP cubic registered Mar 8 08:47:20 localhost kernel: [ 16.444887] NET: Registered protocol family 1 Mar 8 08:47:20 localhost kernel: [ 16.444953] NET: Registered protocol family 17 Mar 8 08:47:20 localhost kernel: [ 16.445019] ieee80211: 802.11 data/management/control stack, git-1.1.13 Mar 8 08:47:20 localhost kernel: [ 16.445087] ieee80211: Copyright (C) 2004-2005 Intel Corporation <jketreno@linux.intel.com> Mar 8 08:47:20 localhost kernel: [ 16.445185] Starting balanced_irq Mar 8 08:47:20 localhost kernel: [ 16.445254] Using IPI No-Shortcut mode Mar 8 08:47:20 localhost kernel: [ 16.472663] kjournald starting. Commit interval 5 seconds Mar 8 08:47:20 localhost kernel: [ 16.472735] EXT3-fs: mounted filesystem with ordered data mode. Mar 8 08:47:20 localhost kernel: [ 16.472864] VFS: Mounted root (ext3 filesystem) readonly. Mar 8 08:47:20 localhost kernel: [ 16.473029] Freeing unused kernel memory: 184k freed Mar 8 08:47:20 localhost kernel: [ 16.473122] Write protecting the kernel read-only data: 902k Mar 8 08:47:20 localhost kernel: [ 24.658491] Intel(R) PRO/1000 Network Driver - version 7.3.20-k2 Mar 8 08:47:20 localhost kernel: [ 24.658576] Copyright (c) 1999-2006 Intel Corporation. Mar 8 08:47:20 localhost kernel: [ 24.658761] ACPI: PCI Interrupt 0000:02:00.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:47:20 localhost kernel: [ 24.733801] e1000: 0000:02:00.0: e1000_probe: (PCI Express:2.5Gb/s:Width x1) 00:16:41:54:6c:47 Mar 8 08:47:20 localhost kernel: [ 24.815441] e1000: eth0: e1000_probe: Intel(R) PRO/1000 Network Connection Mar 8 08:47:20 localhost kernel: [ 25.282292] Adding 1951824k swap on /dev/sda6. Priority:-1 extents:1 across:1951824k Mar 8 08:47:20 localhost kernel: [ 25.343370] EXT3 FS on sda3, internal journal Mar 8 08:47:20 localhost kernel: [ 25.697608] Synaptics Touchpad, model: 1, fw: 6.2, id: 0x81a0b1, caps: 0xa04793/0x300000 Mar 8 08:47:20 localhost kernel: [ 25.697710] serio: Synaptics pass-through port at isa0060/serio1/input0 Mar 8 08:47:20 localhost kernel: [ 25.742946] input: SynPS/2 Synaptics TouchPad as /class/input/input4 Mar 8 08:47:20 localhost kernel: [ 30.419349] NTFS volume version 3.1. Mar 8 08:47:20 localhost kernel: [ 31.156204] IBM TrackPoint firmware: 0x0e, buttons: 3/3 Mar 8 08:47:20 localhost kernel: [ 31.426442] input: TPPS/2 IBM TrackPoint as /class/input/input5 Mar 8 08:47:22 localhost hpiod: 0.9.7 accepting connections at 50899... Mar 8 08:47:26 localhost kernel: [ 39.389386] [drm] Initialized i915 1.6.0 20060119 on minor 0 Mar 8 08:48:07 localhost kernel: <1] pcieport-driver 0000:00:1c.3: LATE suspend Mar 8 08:48:07 localhost kernel: [ 2.183443] Intel machine check architecture supported. Mar 8 08:48:07 localhost kernel: [ 2.183451] Intel machine check reporting enabled on CPU#0. Mar 8 08:48:07 localhost kernel: [ 2.183982] Enabling non-boot CPUs ... Mar 8 08:48:07 localhost kernel: [ 2.321035] SMP alternatives: switching to SMP code Mar 8 08:48:07 localhost kernel: [ 2.321241] Booting processor 1/1 eip 3000 Mar 8 08:48:07 localhost kernel: [ 2.332199] Initializing CPU#1 Mar 8 08:48:07 localhost kernel: [ 3.143727] Calibrating delay using timer specific routine.. 20089.15 BogoMIPS (lpj=100445793) Mar 8 08:48:07 localhost kernel: [ 3.143746] monitor/mwait feature present. Mar 8 08:48:07 localhost kernel: [ 3.143751] CPU: L1 I cache: 32K, L1 D cache: 32K Mar 8 08:48:07 localhost kernel: [ 3.143755] CPU: L2 cache: 2048K Mar 8 08:48:07 localhost kernel: [ 3.143759] CPU: Physical Processor ID: 0 Mar 8 08:48:07 localhost kernel: [ 3.143761] CPU: Processor Core ID: 1 Mar 8 08:48:07 localhost kernel: [ 3.143774] Intel machine check architecture supported. Mar 8 08:48:07 localhost kernel: [ 3.143783] Intel machine check reporting enabled on CPU#1. Mar 8 08:48:07 localhost kernel: [ 3.144104] CPU1: Intel Genuine Intel(R) CPU T2400 @ 1.83GHz stepping 08 Mar 8 08:48:07 localhost kernel: [ 3.144892] speedstep-centrino with X86_SPEEDSTEP_CENTRINO_ACPI config is deprecated. Mar 8 08:48:07 localhost kernel: [ 3.144896] Use X86_ACPI_CPUFREQ (acpi-cpufreq) instead. Mar 8 08:48:07 localhost kernel: [ 3.144936] CPU1 is up Mar 8 08:48:07 localhost kernel: [ 3.440273] ACPI: PCI Interrupt 0000:00:02.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:07 localhost kernel: [ 3.527272] ACPI: PCI Interrupt 0000:00:1b.0[B] -> GSI 17 (level, low) -> IRQ 22 Mar 8 08:48:07 localhost kernel: [ 3.912917] ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:07 localhost kernel: [ 3.912966] usb usb2: root hub lost power or was reset Mar 8 08:48:07 localhost kernel: [ 3.912988] ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 17 (level, low) -> IRQ 22 Mar 8 08:48:07 localhost kernel: [ 3.913036] usb usb3: root hub lost power or was reset Mar 8 08:48:07 localhost kernel: [ 3.913055] ACPI: PCI Interrupt 0000:00:1d.2[C] -> GSI 18 (level, low) -> IRQ 23 Mar 8 08:48:07 localhost kernel: [ 3.913102] usb usb4: root hub lost power or was reset Mar 8 08:48:07 localhost kernel: [ 3.913120] ACPI: PCI Interrupt 0000:00:1d.3[D] -> GSI 19 (level, low) -> IRQ 21 Mar 8 08:48:07 localhost kernel: [ 3.913168] usb usb5: root hub lost power or was reset Mar 8 08:48:07 localhost kernel: [ 3.914015] ACPI: PCI Interrupt 0000:00:1d.7[D] -> GSI 19 (level, low) -> IRQ 21 Mar 8 08:48:07 localhost kernel: [ 3.914205] ACPI: PCI Interrupt 0000:00:1f.1[C] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:07 localhost kernel: [ 3.914407] ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:07 localhost kernel: [ 3.922880] ACPI: PCI Interrupt 0000:02:00.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:07 localhost kernel: [ 3.928614] ACPI: PCI Interrupt 0000:15:00.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:07 localhost kernel: [ 3.928638] pnp: Device 00:08 does not support activation. Mar 8 08:48:07 localhost kernel: [ 3.928641] pnp: Device 00:09 does not support activation. Mar 8 08:48:07 localhost kernel: [ 4.209165] ata2: SATA link down (SStatus 0 SControl 0) Mar 8 08:48:07 localhost kernel: [ 4.209179] ata3: SATA link down (SStatus 0 SControl 0) Mar 8 08:48:07 localhost kernel: [ 4.209191] ata4: SATA link down (SStatus 0 SControl 0) Mar 8 08:48:07 localhost kernel: [ 4.849894] atkbd.c: Unknown key released (translated set 2, code 0xe0 on isa0060/serio0). Mar 8 08:48:07 localhost kernel: [ 4.849896] atkbd.c: Use 'setkeycodes e060 <keycode>' to make it known. Mar 8 08:48:07 localhost kernel: [ 4.850297] atkbd.c: Unknown key pressed (translated set 2, code 0x63 on isa0060/serio0). Mar 8 08:48:07 localhost kernel: [ 4.850300] atkbd.c: Use 'setkeycodes 63 <keycode>' to make it known. Mar 8 08:48:07 localhost kernel: [ 5.826771] Restarting tasks ... <6>usb 5-2: USB disconnect, address 2 Mar 8 08:48:07 localhost kernel: [ 5.839206] done. Mar 8 08:48:07 localhost kernel: [ 6.702776] ata1: waiting for device to spin up (7 secs) Mar 8 08:48:07 localhost kernel: [ 6.775788] usb 5-2: new full speed USB device using uhci_hcd and address 3 Mar 8 08:48:07 localhost kernel: [ 6.866806] usb 5-2: configuration #1 chosen from 1 choice Mar 8 08:48:07 localhost kernel: [ 11.334773] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300) Mar 8 08:48:07 localhost kernel: [ 11.673734] ata1.00: configured for UDMA/100 Mar 8 08:48:07 localhost kernel: [ 11.696311] SCSI device sda: 156301488 512-byte hdwr sectors (80026 MB) Mar 8 08:48:07 localhost kernel: [ 11.697691] sda: Write Protect is off Mar 8 08:48:07 localhost kernel: [ 11.708467] SCSI device sda: write cache: enabled, read cache: enabled, doesn't support DPO or FUA Mar 8 08:48:16 localhost kernel: <>[ 19.841599] button button_power:00: suspend Mar 8 08:48:16 localhost kernel: [ 19.841607] Disabling non-boot CPUs ... Mar 8 08:48:16 localhost kernel: [ 19.847811] Breaking affinity for irq 12 Mar 8 08:48:16 localhost kernel: [ 20.444974] CPU 1 is now offline Mar 8 08:48:16 localhost kernel: [ 20.444980] SMP alternatives: switching to UP code Mar 8 08:48:16 localhost kernel: [ 20.445823] CPU1 is down Mar 8 08:48:16 localhost kernel: [ 2.423903] Intel machine check architecture supported. Mar 8 08:48:16 localhost kernel: [ 2.423911] Intel machine check reporting enabled on CPU#0. Mar 8 08:48:16 localhost kernel: [ 2.424444] Enabling non-boot CPUs ... Mar 8 08:48:16 localhost kernel: [ 2.534642] SMP alternatives: switching to SMP code Mar 8 08:48:16 localhost kernel: [ 2.534847] Booting processor 1/1 eip 3000 Mar 8 08:48:16 localhost kernel: [ 2.545805] Initializing CPU#1 Mar 8 08:48:16 localhost kernel: [ 3.357337] Calibrating delay using timer specific routine.. 20089.13 BogoMIPS (lpj=100445659) Mar 8 08:48:16 localhost kernel: [ 3.357357] monitor/mwait feature present. Mar 8 08:48:16 localhost kernel: [ 3.357362] CPU: L1 I cache: 32K, L1 D cache: 32K Mar 8 08:48:16 localhost kernel: [ 3.357365] CPU: L2 cache: 2048K Mar 8 08:48:16 localhost kernel: [ 3.357369] CPU: Physical Processor ID: 0 Mar 8 08:48:16 localhost kernel: [ 3.357372] CPU: Processor Core ID: 1 Mar 8 08:48:16 localhost kernel: [ 3.357384] Intel machine check architecture supported. Mar 8 08:48:16 localhost kernel: [ 3.357394] Intel machine check reporting enabled on CPU#1. Mar 8 08:48:16 localhost kernel: [ 3.357711] CPU1: Intel Genuine Intel(R) CPU T2400 @ 1.83GHz stepping 08 Mar 8 08:48:16 localhost kernel: [ 3.358503] speedstep-centrino with X86_SPEEDSTEP_CENTRINO_ACPI config is deprecated. Mar 8 08:48:16 localhost kernel: [ 3.358507] Use X86_ACPI_CPUFREQ (acpi-cpufreq) instead. Mar 8 08:48:16 localhost kernel: [ 3.358548] CPU1 is up Mar 8 08:48:16 localhost kernel: [ 3.653795] ACPI: PCI Interrupt 0000:00:02.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:16 localhost kernel: [ 3.740900] ACPI: PCI Interrupt 0000:00:1b.0[B] -> GSI 17 (level, low) -> IRQ 22 Mar 8 08:48:16 localhost kernel: [ 4.126569] ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:16 localhost kernel: [ 4.126619] usb usb2: root hub lost power or was reset Mar 8 08:48:16 localhost kernel: [ 4.126639] ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 17 (level, low) -> IRQ 22 Mar 8 08:48:16 localhost kernel: [ 4.126686] usb usb3: root hub lost power or was reset Mar 8 08:48:16 localhost kernel: [ 4.126705] ACPI: PCI Interrupt 0000:00:1d.2[C] -> GSI 18 (level, low) -> IRQ 23 Mar 8 08:48:16 localhost kernel: [ 4.126752] usb usb4: root hub lost power or was reset Mar 8 08:48:16 localhost kernel: [ 4.126772] ACPI: PCI Interrupt 0000:00:1d.3[D] -> GSI 19 (level, low) -> IRQ 21 Mar 8 08:48:16 localhost kernel: [ 4.126819] usb usb5: root hub lost power or was reset Mar 8 08:48:16 localhost kernel: [ 4.127659] ACPI: PCI Interrupt 0000:00:1d.7[D] -> GSI 19 (level, low) -> IRQ 21 Mar 8 08:48:16 localhost kernel: [ 4.127848] ACPI: PCI Interrupt 0000:00:1f.1[C] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:16 localhost kernel: [ 4.128049] ACPI: PCI Interrupt 0000:00:1f.2[B] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:16 localhost kernel: [ 4.136510] ACPI: PCI Interrupt 0000:02:00.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:16 localhost kernel: [ 4.142247] ACPI: PCI Interrupt 0000:15:00.0[A] -> GSI 16 (level, low) -> IRQ 20 Mar 8 08:48:16 localhost kernel: [ 4.142271] pnp: Device 00:08 does not support activation. Mar 8 08:48:16 localhost kernel: [ 4.142274] pnp: Device 00:09 does not support activation. Mar 8 08:48:16 localhost kernel: [ 4.421975] ata2: SATA link down (SStatus 0 SControl 0) Mar 8 08:48:16 localhost kernel: [ 4.421989] ata3: SATA link down (SStatus 0 SControl 0) Mar 8 08:48:16 localhost kernel: [ 4.422001] ata4: SATA link down (SStatus 0 SControl 0) Mar 8 08:48:16 localhost kernel: [ 5.971047] Restarting tasks ... done. Mar 8 08:48:16 localhost kernel: [ 6.012513] usb 5-2: USB disconnect, address 3 Mar 8 08:48:16 localhost kernel: [ 6.891677] ata1: waiting for device to spin up (7 secs) Mar 8 08:48:16 localhost kernel: [ 6.958587] usb 5-2: new full speed USB device using uhci_hcd and address 4 Mar 8 08:48:16 localhost kernel: [ 7.049506] usb 5-2: configuration #1 chosen from 1 choice Mar 8 08:48:16 localhost kernel: [ 11.547844] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300) Mar 8 08:48:16 localhost kernel: [ 11.885459] ata1.00: configured for UDMA/100 Mar 8 08:48:16 localhost kernel: [ 11.913910] SCSI device sda: 156301488 512-byte hdwr sectors (80026 MB) Mar 8 08:48:16 localhost kernel: [ 11.914575] sda: Write Protect is off Mar 8 08:48:16 localhost kernel: [ 11.914868] SCSI device sda: write cache: enabled, read cache: enabled, doesn't support DPO or FUA Mar 8 08:48:16 localhost shutdown[4092]: shutting down for system halt ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 12:31 ` Michael S. Tsirkin @ 2007-03-08 15:11 ` Jeff Chua 2007-03-08 18:01 ` Linus Torvalds 1 sibling, 0 replies; 45+ messages in thread From: Jeff Chua @ 2007-03-08 15:11 UTC (permalink / raw) To: Michael S. Tsirkin Cc: Adrian Bunk, Linus Torvalds, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, greg, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input On 3/8/07, Michael S. Tsirkin <mst@mellanox.co.il> wrote: > > Quoting Adrian Bunk <bunk@stusta.de>: > > Subject: [2/6] 2.6.21-rc2: known regressions > > > > Subject : ThinkPad T60: no screen after suspend to RAM > > References : http://lkml.org/lkml/2007/2/22/391 > > Submitter : Michael S. Tsirkin <mst@mellanox.co.il> > > Status : unknown > > Here's the status with -rc3: better, but still does not work as well as 2.6.20. > 3. When I switch to X (CTRL-ALT-F7), X hangs after drawing a couple of windows > after waiting for some 10 min, I rebooted. > no new messages showed up in /var/log/messages In my case, I can "suspend" and "resume", but seems the clock died after resume. "date" always returns the same time! I had to disable CONFIG_NO_HZ and CONFIG_SYSFS_DEPRECATED in order to suspend. Try to execute "date" and see if it changes after resume. Thanks, Jeff. ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 12:31 ` Michael S. Tsirkin 2007-03-08 15:11 ` Jeff Chua @ 2007-03-08 18:01 ` Linus Torvalds 2007-03-08 19:06 ` Ingo Molnar ` (3 more replies) 1 sibling, 4 replies; 45+ messages in thread From: Linus Torvalds @ 2007-03-08 18:01 UTC (permalink / raw) To: Michael S. Tsirkin Cc: Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman, Ingo Molnar [ Eric, Ingo, can you double-check the timer initialization after resume? We appear to have several reports of date not advancing, and while this could be some SATA issue, it could easily be a timer tick issue too ] On Thu, 8 Mar 2007, Michael S. Tsirkin wrote: > > Here's the status with -rc3: better, but still does not work as well as 2.6.20. Ok. I think we mostly solved the irq-related stuff, but you might want to check whether you have CONFIG_NOHZ on or off and whether that makes a difference. > 2. First disk access after resume takes a couple of minutes > (seemed instant with 2.6.20) during this time no new messages show on console Yeah, there is some problem with SATA resume. It would be beautiful if the people who actually see this could narrow it down with bisection. "It works for me" is clearly the case for many people, but not all. But before blaming SATA, check if you have NO_HZ enabled and whether disabling that makes it work ok. If timeouts don't work right (or are *extremely* slow) things that should be instant won't be. > 3. When I switch to X (CTRL-ALT-F7), X hangs after drawing a couple of windows > after waiting for some 10 min, I rebooted. > no new messages showed up in /var/log/messages I think this is likely just more of the disk being buggered, but it could again be related to NO_HZ (people report time not advancing, and that would make any X timeout taking forever, and you'd see exactly your behaviour). Linus ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 18:01 ` Linus Torvalds @ 2007-03-08 19:06 ` Ingo Molnar 2007-03-08 19:10 ` Ingo Molnar 2007-03-08 19:47 ` Michael S. Tsirkin 2007-03-08 19:25 ` Ingo Molnar ` (2 subsequent siblings) 3 siblings, 2 replies; 45+ messages in thread From: Ingo Molnar @ 2007-03-08 19:06 UTC (permalink / raw) To: Linus Torvalds Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Linus Torvalds <torvalds@linux-foundation.org> wrote: > > 3. When I switch to X (CTRL-ALT-F7), X hangs after drawing a couple of windows > > after waiting for some 10 min, I rebooted. no new messages showed > > up in /var/log/messages > > I think this is likely just more of the disk being buggered, but it > could again be related to NO_HZ (people report time not advancing, and > that would make any X timeout taking forever, and you'd see exactly > your behaviour). Michael - does your 'date' output advance after resume? If not then i'd say it's a NO_HZ related problem. If yes then i'd guess it's the SATA problem. Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 19:06 ` Ingo Molnar @ 2007-03-08 19:10 ` Ingo Molnar 2007-03-08 19:47 ` Michael S. Tsirkin 1 sibling, 0 replies; 45+ messages in thread From: Ingo Molnar @ 2007-03-08 19:10 UTC (permalink / raw) To: Linus Torvalds Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Ingo Molnar <mingo@elte.hu> wrote: > Michael - does your 'date' output advance after resume? If not then > i'd say it's a NO_HZ related problem. [...] in that case please do this on such a 'frozen date' system: echo q > /proc/sysrq-trigger and then send us the hw-timers info. Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 19:06 ` Ingo Molnar 2007-03-08 19:10 ` Ingo Molnar @ 2007-03-08 19:47 ` Michael S. Tsirkin 2007-03-08 20:10 ` Ingo Molnar 1 sibling, 1 reply; 45+ messages in thread From: Michael S. Tsirkin @ 2007-03-08 19:47 UTC (permalink / raw) To: Ingo Molnar Cc: Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, Jens Axboe, linux-input, Alexey Starikovskiy, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, linux-pm, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Andrew Morton, Linus Torvalds > Quoting Ingo Molnar <mingo@elte.hu>: > Subject: Re: [2/6] 2.6.21-rc2: known regressions > > > * Linus Torvalds <torvalds@linux-foundation.org> wrote: > > > > 3. When I switch to X (CTRL-ALT-F7), X hangs after drawing a couple of windows > > > after waiting for some 10 min, I rebooted. no new messages showed > > > up in /var/log/messages > > > > I think this is likely just more of the disk being buggered, but it > > could again be related to NO_HZ (people report time not advancing, and > > that would make any X timeout taking forever, and you'd see exactly > > your behaviour). > > Michael - does your 'date' output advance after resume? If not then i'd > say it's a NO_HZ related problem. If yes then i'd guess it's the SATA > problem. I'll test, but I have NO_HZ off for now. -- MST ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 19:47 ` Michael S. Tsirkin @ 2007-03-08 20:10 ` Ingo Molnar 0 siblings, 0 replies; 45+ messages in thread From: Ingo Molnar @ 2007-03-08 20:10 UTC (permalink / raw) To: Michael S. Tsirkin Cc: Linus Torvalds, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Michael S. Tsirkin <mst@mellanox.co.il> wrote: > > Michael - does your 'date' output advance after resume? If not then > > i'd say it's a NO_HZ related problem. If yes then i'd guess it's the > > SATA problem. > > I'll test, but I have NO_HZ off for now. there can still be effects of it (the regression we fixed in -rc3 was such an effect too), so please test it. Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 18:01 ` Linus Torvalds 2007-03-08 19:06 ` Ingo Molnar @ 2007-03-08 19:25 ` Ingo Molnar 2007-03-08 23:07 ` Ingo Molnar 2007-03-08 19:46 ` [2/6] 2.6.21-rc2: known regressions Michael S. Tsirkin 2007-03-08 19:57 ` Michael S. Tsirkin 3 siblings, 1 reply; 45+ messages in thread From: Ingo Molnar @ 2007-03-08 19:25 UTC (permalink / raw) To: Linus Torvalds Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Linus Torvalds <torvalds@linux-foundation.org> wrote: > > 2. First disk access after resume takes a couple of minutes > > (seemed instant with 2.6.20) during this time no new messages > > show on console > > Yeah, there is some problem with SATA resume. It would be beautiful if > the people who actually see this could narrow it down with bisection. > "It works for me" is clearly the case for many people, but not all. Thomas found a new twist to this today: applying the patch below (which turns on ATA_DEBUG) made the SATA problem go away on his laptop. Michael, could you try this patch, does it change the behavior of your laptop in any way? Ingo --- include/linux/libata.h | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) Index: linux/include/linux/libata.h =================================================================== --- linux.orig/include/linux/libata.h +++ linux/include/linux/libata.h @@ -51,7 +51,7 @@ * compile-time options: to be removed as soon as all the drivers are * converted to the new debugging mechanism */ -#undef ATA_DEBUG /* debugging output */ +#define ATA_DEBUG 1 /* debugging output */ #undef ATA_VERBOSE_DEBUG /* yet more debugging output */ #undef ATA_IRQ_TRAP /* define to ack screaming irqs */ #undef ATA_NDEBUG /* define to disable quick runtime checks */ ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 19:25 ` Ingo Molnar @ 2007-03-08 23:07 ` Ingo Molnar 2007-03-08 23:12 ` Ingo Molnar 2007-03-08 23:49 ` Linus Torvalds 0 siblings, 2 replies; 45+ messages in thread From: Ingo Molnar @ 2007-03-08 23:07 UTC (permalink / raw) To: Linus Torvalds Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Ingo Molnar <mingo@elte.hu> wrote: > Thomas found a new twist to this today: applying the patch below > (which turns on ATA_DEBUG) made the SATA problem go away on his > laptop. Michael, could you try this patch, does it change the behavior > of your laptop in any way? Here's another suspend/resume artifact: one of my boxes wouldnt resume, it hangs at: [ 1.456633] pci 0000:00:18.2: resuming [ 1.456641] pci 0000:00:18.3: resuming [ 1.456648] 8139too 0000:05:07.0: resuming [ 1.456667] radeonfb 0000:01:00.0: resuming the box is pingable but otherwise no console and it's hung as well. [I got the log above via netconsole] disabling the following radeonfb options in the .config made resume work again: CONFIG_FB_RADEON=y CONFIG_FB_RADEON_I2C=y CONFIG_FB_RADEON_BACKLIGHT=y Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 23:07 ` Ingo Molnar @ 2007-03-08 23:12 ` Ingo Molnar 2007-03-08 23:28 ` Ingo Molnar 2007-03-08 23:49 ` Linus Torvalds 1 sibling, 1 reply; 45+ messages in thread From: Ingo Molnar @ 2007-03-08 23:12 UTC (permalink / raw) To: Linus Torvalds Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Ingo Molnar <mingo@elte.hu> wrote: > Here's another suspend/resume artifact: one of my boxes wouldnt > resume, it hangs at: > > [ 1.456633] pci 0000:00:18.2: resuming > [ 1.456641] pci 0000:00:18.3: resuming > [ 1.456648] 8139too 0000:05:07.0: resuming > [ 1.456667] radeonfb 0000:01:00.0: resuming > > the box is pingable but otherwise no console and it's hung as well. [I > got the log above via netconsole] unfortunately there's no way to get any backtrace out of the system at this point - the timer interrupts are not active yet here, so i cannot use the softlockup_tick patch to sample the lockup. Since it's pingable i'll try to hack a dump_stack() into icmp_reply() ;-) Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 23:12 ` Ingo Molnar @ 2007-03-08 23:28 ` Ingo Molnar 0 siblings, 0 replies; 45+ messages in thread From: Ingo Molnar @ 2007-03-08 23:28 UTC (permalink / raw) To: Linus Torvalds Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Ingo Molnar <mingo@elte.hu> wrote: > > Here's another suspend/resume artifact: one of my boxes wouldnt > > resume, it hangs at: > > > > [ 1.456633] pci 0000:00:18.2: resuming > > [ 1.456641] pci 0000:00:18.3: resuming > > [ 1.456648] 8139too 0000:05:07.0: resuming > > [ 1.456667] radeonfb 0000:01:00.0: resuming > > > > the box is pingable but otherwise no console and it's hung as well. [I > > got the log above via netconsole] > > unfortunately there's no way to get any backtrace out of the system at > this point - the timer interrupts are not active yet here, so i cannot > use the softlockup_tick patch to sample the lockup. Since it's > pingable i'll try to hack a dump_stack() into icmp_reply() ;-) didnt succeed at that - no netconsole output during the hang. But the RADEON_FB .config options definitely make the difference. The video card in question is: 01:00.0 VGA compatible controller: ATI Technologies Inc RV370 5B60 [Radeon X300 (PCIE)] 01:00.1 Display controller: ATI Technologies Inc RV370 [Radeon X300SE] Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 23:07 ` Ingo Molnar 2007-03-08 23:12 ` Ingo Molnar @ 2007-03-08 23:49 ` Linus Torvalds 2007-03-09 10:56 ` Ingo Molnar ` (2 more replies) 1 sibling, 3 replies; 45+ messages in thread From: Linus Torvalds @ 2007-03-08 23:49 UTC (permalink / raw) To: Ingo Molnar Cc: Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, Jens Axboe, linux-input, Alexey Starikovskiy, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, linux-pm, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Michael S. Tsirkin, Andrew Morton On Fri, 9 Mar 2007, Ingo Molnar wrote: > > disabling the following radeonfb options in the .config made resume work > again: In general, don't even *try* to use radeonfb for suspend/resume. I don't think it has ever worked, except on some very rare laptops (largely PPC Macs) where people had enough information to set up the PLL's. I don't think the other framebuffer drivers are much better. You're better off using the VGA console, and lettign X re-initialize the graphics device. That generally at least has a reasonably good chance of working. Re-initializing graphics modes really is very hard. You can try with the BIOS video hack (I forget the kernel command line to turn it on), but we really do end up depending on X doing it better. Some day we may have modesetting support in the kernel for some graphics hw, right now it's pretty damn spotty. Linus ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 23:49 ` Linus Torvalds @ 2007-03-09 10:56 ` Ingo Molnar 2007-03-09 18:00 ` Linus Torvalds 2007-03-09 11:19 ` Pavel Machek 2007-03-09 17:48 ` Johannes Stezenbach 2 siblings, 1 reply; 45+ messages in thread From: Ingo Molnar @ 2007-03-09 10:56 UTC (permalink / raw) To: Linus Torvalds Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman * Linus Torvalds <torvalds@linux-foundation.org> wrote: > > disabling the following radeonfb options in the .config made resume > > work again: > > In general, don't even *try* to use radeonfb for suspend/resume. > > I don't think it has ever worked, except on some very rare laptops > (largely PPC Macs) where people had enough information to set up the > PLL's. > > I don't think the other framebuffer drivers are much better. > > You're better off using the VGA console, and lettign X re-initialize > the graphics device. That generally at least has a reasonably good > chance of working. > > Re-initializing graphics modes really is very hard. You can try with > the BIOS video hack (I forget the kernel command line to turn it on), > but we really do end up depending on X doing it better. it's the s3_sleep boot option and /proc/sys/kernel/acpi_video_mode, but that didnt make a difference. > Some day we may have modesetting support in the kernel for some > graphics hw, right now it's pretty damn spotty. having no video is what i'd have expected - but getting a /hang/ is not what i'd have expected. Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-09 10:56 ` Ingo Molnar @ 2007-03-09 18:00 ` Linus Torvalds 0 siblings, 0 replies; 45+ messages in thread From: Linus Torvalds @ 2007-03-09 18:00 UTC (permalink / raw) To: Ingo Molnar Cc: Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman On Fri, 9 Mar 2007, Ingo Molnar wrote: > > Some day we may have modesetting support in the kernel for some > > graphics hw, right now it's pretty damn spotty. > > having no video is what i'd have expected - but getting a /hang/ is not > what i'd have expected. I debugged a case exactly like this (with the TRACE_RESUME() thing) back last November on my laptop. The problem is that radeonfb actually *tries* to re-initialize the graphics chips, but because it doesn't know all the details, it does it wrong, and hangs the whole chip. I forget exactly where it happened, and it might well be different for you. But this is exactly what TRACE_RESUME can be used to pinpoint if you want to try to debug it. Linus ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 23:49 ` Linus Torvalds 2007-03-09 10:56 ` Ingo Molnar @ 2007-03-09 11:19 ` Pavel Machek 2007-03-18 16:07 ` Ingo Molnar 2007-03-09 17:48 ` Johannes Stezenbach 2 siblings, 1 reply; 45+ messages in thread From: Pavel Machek @ 2007-03-09 11:19 UTC (permalink / raw) To: Linus Torvalds Cc: Ingo Molnar, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman Hi! > > disabling the following radeonfb options in the .config made resume work > > again: > > In general, don't even *try* to use radeonfb for suspend/resume. > > I don't think it has ever worked, except on some very rare laptops > (largely PPC Macs) where people had enough information to set up the > PLL's. It worked ok on thinkpad x32. BIOS did the setup in resume case (with acpi_sleep=..., anyway), and radeonfb could pick the card up from there. > I don't think the other framebuffer drivers are much better. > > You're better off using the VGA console, and lettign X re-initialize the > graphics device. That generally at least has a reasonably good chance of > working. suspend.sf.net, s2ram there has a long list of tricks. If you invent new one, please add it there. > Re-initializing graphics modes really is very hard. You can try with the > BIOS video hack (I forget the kernel command line to turn it on), but we > really do end up depending on X doing it better. ...or you can try vbetool; it is similar hack to acpi_sleep=... , but it works for more people. > Some day we may have modesetting support in the kernel for some graphics > hw, right now it's pretty damn spotty. Yep, that's the way to go. Pavel -- (english) http://www.livejournal.com/~pavelmachek (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-09 11:19 ` Pavel Machek @ 2007-03-18 16:07 ` Ingo Molnar 2007-03-18 16:40 ` [linux-pm] " Jim Gettys 0 siblings, 1 reply; 45+ messages in thread From: Ingo Molnar @ 2007-03-18 16:07 UTC (permalink / raw) To: Pavel Machek Cc: Linus Torvalds, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, "Eric W. Biederman" <ebie> * Pavel Machek <pavel@ucw.cz> wrote: > > Some day we may have modesetting support in the kernel for some > > graphics hw, right now it's pretty damn spotty. > > Yep, that's the way to go. hey, i wildly supported this approach ever since 1996, when GGI came up :-/ Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [linux-pm] [2/6] 2.6.21-rc2: known regressions 2007-03-18 16:07 ` Ingo Molnar @ 2007-03-18 16:40 ` Jim Gettys 2007-03-19 20:33 ` Bill Davidsen 0 siblings, 1 reply; 45+ messages in thread From: Jim Gettys @ 2007-03-18 16:40 UTC (permalink / raw) To: Ingo Molnar Cc: Pavel Machek, Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, linux-input, Alexey Starikovskiy, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Michael S. Tsirkin, Andrew Morton, Linus Torvalds On Sun, 2007-03-18 at 17:07 +0100, Ingo Molnar wrote: > * Pavel Machek <pavel@ucw.cz> wrote: > > > > Some day we may have modesetting support in the kernel for some > > > graphics hw, right now it's pretty damn spotty. > > > > Yep, that's the way to go. > > hey, i wildly supported this approach ever since 1996, when GGI came up > :-/ > So wildly you wrote tons of code.... ;-). More seriously, at the time, XFree86 would have spat in your face for any such thing. Thankfully, times are changing. Also more seriously, a somewhat hybrid approach is in order for mode setting: simple mode setting isn't much code and is required for sane behavior on crash (it is nice to get oopses onto a screen); but the full blown mode setting/configuration problem is so large that on some hardware, it is likely left best left to a helper process (not the X server). Also key to get sane behavior out of the scheduler is to get the X server to yield (sleep in the kernel) rather than busy waiting when the GPU is busy; a standardized interface for this for both fbdev and dri is in order. Right now, X is a misbehaving compute bound process rather than the properly interactive process it can/should/will be, releasing the CPU whenever the hardware is busy. Needless to say, this wastes cycles and hurts interactivity with just about any scheduler you can devise. It isn't as if this is hard; on UNIX systems we did it in 1984 or thereabouts. Of course, in 1996, XFree86 would have ignored any such interfaces, in its insane quest for operating system independent user space drivers requiring no standard kernel interfaces.... (it is the second part of this where the true insanity lay). - Jim -- Jim Gettys One Laptop Per Child ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [linux-pm] [2/6] 2.6.21-rc2: known regressions 2007-03-18 16:40 ` [linux-pm] " Jim Gettys @ 2007-03-19 20:33 ` Bill Davidsen 2007-03-19 22:08 ` Jim Gettys 0 siblings, 1 reply; 45+ messages in thread From: Bill Davidsen @ 2007-03-19 20:33 UTC (permalink / raw) To: jg Cc: Ingo Molnar, Pavel Machek, Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, linux-input, Alexey Starikovskiy, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Michael S. Tsirkin, Andrew Morton, Linus Torvalds Jim Gettys wrote: > On Sun, 2007-03-18 at 17:07 +0100, Ingo Molnar wrote: >> * Pavel Machek <pavel@ucw.cz> wrote: >> >>>> Some day we may have modesetting support in the kernel for some >>>> graphics hw, right now it's pretty damn spotty. >>> Yep, that's the way to go. >> hey, i wildly supported this approach ever since 1996, when GGI came up >> :-/ >> > > So wildly you wrote tons of code.... ;-). > > More seriously, at the time, XFree86 would have spat in your face for > any such thing. Thankfully, times are changing. > > Also more seriously, a somewhat hybrid approach is in order for mode > setting: simple mode setting isn't much code and is required for sane > behavior on crash (it is nice to get oopses onto a screen); but the full > blown mode setting/configuration problem is so large that on some > hardware, it is likely left best left to a helper process (not the X > server). > > Also key to get sane behavior out of the scheduler is to get the X > server to yield (sleep in the kernel) rather than busy waiting when the > GPU is busy; a standardized interface for this for both fbdev and dri is > in order. Right now, X is a misbehaving compute bound process rather > than the properly interactive process it can/should/will be, releasing > the CPU whenever the hardware is busy. Needless to say, this wastes > cycles and hurts interactivity with just about any scheduler you can > devise. It isn't as if this is hard; on UNIX systems we did it in 1984 > or thereabouts. What you say sounds good, assuming that the cost of a sleep is less than the cost of the busy wait. But this may be hardware, the waits may be very small and frequent, and if it's hitting a small hardware window like retrace, delays in response will cause the time period to be missed completely. This probably less critical with very smart cards, many of us don't run them. > > Of course, in 1996, XFree86 would have ignored any such interfaces, in > its insane quest for operating system independent user space drivers > requiring no standard kernel interfaces.... (it is the second part of > this where the true insanity lay). > - Jim > -- Bill Davidsen <davidsen@tmr.com> "We have more to fear from the bungling of the incompetent than from the machinations of the wicked." - from Slashdot ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [linux-pm] [2/6] 2.6.21-rc2: known regressions 2007-03-19 20:33 ` Bill Davidsen @ 2007-03-19 22:08 ` Jim Gettys 2007-03-20 14:44 ` Bill Davidsen 0 siblings, 1 reply; 45+ messages in thread From: Jim Gettys @ 2007-03-19 22:08 UTC (permalink / raw) To: Bill Davidsen Cc: Ingo Molnar, Pavel Machek, Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, linux-input, Alexey Starikovskiy, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Michael S. Tsirkin, Andrew Morton, Linus Torvalds On Mon, 2007-03-19 at 16:33 -0400, Bill Davidsen wrote: > > What you say sounds good, assuming that the cost of a sleep is less than > the cost of the busy wait. But this may be hardware, the waits may be > very small and frequent, and if it's hitting a small hardware window > like retrace, delays in response will cause the time period to be missed > completely. This probably less critical with very smart cards, many of > us don't run them. > > Actually, various strategies involving short busy waiting, or looking at DMA address registers before sleeping were commonplace. But a syscall/sleep/wakeup is/was pretty fast. If you have an operation blitting the screen (e.g. scrolling), it takes a bit of time for the GPU to execute the command. I see this right now on OLPC, where a wonderful music application needs to scroll (most of) the screen left), periodically, and we're losing samples sometimes at those operation. Remember also, that being nice to everyone else by sleeping, there are more cycles to go around, and the scheduler can nicely boost the X server's priority as it will for "interactive" processes that are being cooperative. - Jim -- Jim Gettys One Laptop Per Child ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [linux-pm] [2/6] 2.6.21-rc2: known regressions 2007-03-19 22:08 ` Jim Gettys @ 2007-03-20 14:44 ` Bill Davidsen 0 siblings, 0 replies; 45+ messages in thread From: Bill Davidsen @ 2007-03-20 14:44 UTC (permalink / raw) To: jg Cc: Ingo Molnar, Pavel Machek, Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, linux-input, Alexey Starikovskiy, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Michael S. Tsirkin, Andrew Morton, Linus Torvalds Jim Gettys wrote: > On Mon, 2007-03-19 at 16:33 -0400, Bill Davidsen wrote: > > >> What you say sounds good, assuming that the cost of a sleep is less than >> the cost of the busy wait. But this may be hardware, the waits may be >> very small and frequent, and if it's hitting a small hardware window >> like retrace, delays in response will cause the time period to be missed >> completely. This probably less critical with very smart cards, many of >> us don't run them. >> > > Actually, various strategies involving short busy waiting, or looking at > DMA address registers before sleeping were commonplace. But a > syscall/sleep/wakeup is/was pretty fast. If you have an operation > blitting the screen (e.g. scrolling), it takes a bit of time for the GPU > to execute the command. I see this right now on OLPC, where a wonderful > music application needs to scroll (most of) the screen left), > periodically, and we're losing samples sometimes at those operation. > None of that conflicts with what I said, but what works on an LCD may not be appropriate for a CRT. With even moderate 1024x768@70 timing the horizontal retrace happens ~50k/sec, and that's not an appropriate syscall rate. I'm just pointing out that some things a video interface does with simple hardware involve lots of very small windows. Don't read that as "don't do it," just "be careful HOW you do it." > Remember also, that being nice to everyone else by sleeping, there are > more cycles to go around, and the scheduler can nicely boost the X > server's priority as it will for "interactive" processes that are being > cooperative. I'm going to cautiously guess that the problem might be not "how much" but "how soon." That is, latency might be more important than giving the server a lot of CPU. -- bill davidsen <davidsen@tmr.com> CTO TMR Associates, Inc Doing interesting things with small computers since 1979 ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 23:49 ` Linus Torvalds 2007-03-09 10:56 ` Ingo Molnar 2007-03-09 11:19 ` Pavel Machek @ 2007-03-09 17:48 ` Johannes Stezenbach 2007-03-09 23:35 ` Pavel Machek 2 siblings, 1 reply; 45+ messages in thread From: Johannes Stezenbach @ 2007-03-09 17:48 UTC (permalink / raw) To: Linus Torvalds Cc: Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, linux-input, Alexey Starikovskiy, Andrew Morton, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Michael S. Tsirkin, Ingo Molnar On Thu, Mar 08, 2007, Linus Torvalds wrote: > On Fri, 9 Mar 2007, Ingo Molnar wrote: > > > > disabling the following radeonfb options in the .config made resume work > > again: > > In general, don't even *try* to use radeonfb for suspend/resume. > > I don't think it has ever worked, except on some very rare laptops > (largely PPC Macs) where people had enough information to set up the > PLL's. > > I don't think the other framebuffer drivers are much better. > > You're better off using the VGA console, and lettign X re-initialize the > graphics device. That generally at least has a reasonably good chance of > working. > > Re-initializing graphics modes really is very hard. You can try with the > BIOS video hack (I forget the kernel command line to turn it on), but we > really do end up depending on X doing it better. acpi_sleep=s3_bios has always worked for me on my ThinkPad T42p. Even if one doesn't use the fb console at all, radeonfb apparently is still required on some ThinkPad models to work around BIOS bugs: http://www.thinkwiki.org/wiki/Problem_with_high_power_drain_in_ACPI_sleep#Radeon_GPU_not_powered_off Johannes ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-09 17:48 ` Johannes Stezenbach @ 2007-03-09 23:35 ` Pavel Machek 2007-03-10 9:01 ` Ingo Molnar 0 siblings, 1 reply; 45+ messages in thread From: Pavel Machek @ 2007-03-09 23:35 UTC (permalink / raw) To: Johannes Stezenbach Cc: Linus Torvalds, Ingo Molnar, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input Hi! > > You're better off using the VGA console, and lettign X re-initialize the > > graphics device. That generally at least has a reasonably good chance of > > working. > > > > Re-initializing graphics modes really is very hard. You can try with the > > BIOS video hack (I forget the kernel command line to turn it on), but we > > really do end up depending on X doing it better. > > acpi_sleep=s3_bios has always worked for me on my ThinkPad T42p. > > Even if one doesn't use the fb console at all, radeonfb apparently > is still required on some ThinkPad models to work around BIOS bugs: > > http://www.thinkwiki.org/wiki/Problem_with_high_power_drain_in_ACPI_sleep#Radeon_GPU_not_powered_off s2ram should be able to work around this, it has parts from radeontool. (suspend.sf.net). -- (english) http://www.livejournal.com/~pavelmachek (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-09 23:35 ` Pavel Machek @ 2007-03-10 9:01 ` Ingo Molnar 2007-03-10 11:43 ` Stefan Seyfried 2007-03-10 22:04 ` s2ram (was Re: [2/6] 2.6.21-rc2: known regressions) Pavel Machek 0 siblings, 2 replies; 45+ messages in thread From: Ingo Molnar @ 2007-03-10 9:01 UTC (permalink / raw) To: Pavel Machek Cc: Johannes Stezenbach, Linus Torvalds, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input * Pavel Machek <pavel@suse.cz> wrote: > > Even if one doesn't use the fb console at all, radeonfb apparently > > is still required on some ThinkPad models to work around BIOS bugs: > > > > http://www.thinkwiki.org/wiki/Problem_with_high_power_drain_in_ACPI_sleep#Radeon_GPU_not_powered_off > > > s2ram should be able to work around this, it has parts from > radeontool. (suspend.sf.net). i'm wondering, do you have any idea how Windows handles the suspend/resume quirks problem area? Do they "curse BIOS vendors and maintain a large DB of DMI-driven exceptions", or do they perhaps have some fundamentally better approach than us? If it's the former, then we might as well try to bring more automatism (and more of your database) into the kernel itself. btw., the s2ram database seems quite a bit spotty: $ ./s2ram -n Machine is unknown. This machine can be identified by: sys_vendor = "System manufacturer" sys_product = "System Product Name" sys_version = "System Version" bios_version = "ASUS A8N-E ACPI BIOS Revision 1008" $ ./s2ram -n Machine is unknown. This machine can be identified by: sys_vendor = "Hewlett-Packard " sys_product = "compaq nx9030 (PG630ET#ABD) " sys_version = "Rev 1 " bios_version = "F.15 " See http://en.opensuse.org/S2ram for details. even at the link above i didnt find any clear algorithm about how to extend the quirks-list and the white-list - while i expect that most people experience what i did: that s2ram doesnt know their boxes. (otherwise they would not visit that URL at all i suspect) Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-10 9:01 ` Ingo Molnar @ 2007-03-10 11:43 ` Stefan Seyfried 2007-03-10 13:53 ` Johannes Stezenbach 2007-03-10 15:18 ` Ingo Molnar 2007-03-10 22:04 ` s2ram (was Re: [2/6] 2.6.21-rc2: known regressions) Pavel Machek 1 sibling, 2 replies; 45+ messages in thread From: Stefan Seyfried @ 2007-03-10 11:43 UTC (permalink / raw) To: Ingo Molnar Cc: Pavel Machek, Johannes Stezenbach, Linus Torvalds, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov On Sat, Mar 10, 2007 at 10:01:21AM +0100, Ingo Molnar wrote: > > * Pavel Machek <pavel@suse.cz> wrote: > > s2ram should be able to work around this, it has parts from > > radeontool. (suspend.sf.net). > > i'm wondering, do you have any idea how Windows handles the > suspend/resume quirks problem area? Do they "curse BIOS vendors and > maintain a large DB of DMI-driven exceptions", or do they perhaps have > some fundamentally better approach than us? If it's the former, then we > might as well try to bring more automatism (and more of your database) > into the kernel itself. I think that in windows, you simply install the "HP nx$FOOBAR intel graphics driver" on a nx$FOOBAR machine, and the "ASUS $FOOBAR ATI graphics driver" on an ASUS $FOOBAR machine etc. Those drivers are mostly stock intel/ATI/ whoever drivers, but with the little bit of extra knowledge on how to wake up the graphics chip. And there is no console mode in Windows. If you restrict yourself to "it works from X", many machines do not need workarounds. > btw., the s2ram database seems quite a bit spotty: > > $ ./s2ram -n > Machine is unknown. > This machine can be identified by: > sys_vendor = "System manufacturer" > sys_product = "System Product Name" > sys_version = "System Version" > bios_version = "ASUS A8N-E ACPI BIOS Revision 1008" > > $ ./s2ram -n > Machine is unknown. > This machine can be identified by: > sys_vendor = "Hewlett-Packard " > sys_product = "compaq nx9030 (PG630ET#ABD) " > sys_version = "Rev 1 " > bios_version = "F.15 " > See http://en.opensuse.org/S2ram for details. > > even at the link above i didnt find any clear algorithm about how to > extend the quirks-list and the white-list - while i expect that most > people experience what i did: that s2ram doesnt know their boxes. > (otherwise they would not visit that URL at all i suspect) Ok. To be honest, you are the first reporter that seems to have read the documentation above, but not understood what to do. So i'll put it into even simpler steps: - get the current version of s2ram - boot your system with init=/bin/bash, mount /proc and /sys - try the s2ram options in the combinations listed on the above page - as soon as you find an option that works, please retry it from a full booted system (including X) - if it still works, please report the options used and the output of "s2ram -n" to the suspend-devel mailinglist. i might consider reworking the documentation if there are more reports about problems with the procedure. -- Stefan Seyfried "Any ideas, John?" "Well, surrounding them's out." ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-10 11:43 ` Stefan Seyfried @ 2007-03-10 13:53 ` Johannes Stezenbach 2007-03-10 15:18 ` Ingo Molnar 1 sibling, 0 replies; 45+ messages in thread From: Johannes Stezenbach @ 2007-03-10 13:53 UTC (permalink / raw) To: Stefan Seyfried Cc: Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, Alexey Starikovskiy, Andrew Morton, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Thomas Meyer, Michael S. Tsirkin, Ingo Molnar, Linus Torvalds On Sat, Mar 10, 2007 at 12:43:01PM +0100, Stefan Seyfried wrote: > On Sat, Mar 10, 2007 at 10:01:21AM +0100, Ingo Molnar wrote: > > > > i'm wondering, do you have any idea how Windows handles the > > suspend/resume quirks problem area? Do they "curse BIOS vendors and > > maintain a large DB of DMI-driven exceptions", or do they perhaps have > > some fundamentally better approach than us? If it's the former, then we > > might as well try to bring more automatism (and more of your database) > > into the kernel itself. > > I think that in windows, you simply install the "HP nx$FOOBAR intel graphics > driver" on a nx$FOOBAR machine, and the "ASUS $FOOBAR ATI graphics driver" > on an ASUS $FOOBAR machine etc. Those drivers are mostly stock intel/ATI/ > whoever drivers, but with the little bit of extra knowledge on how to wake > up the graphics chip. That's also what the thinkwiki page suggests. It says: "Affected Operating Systems: * Linux, all flavours. * Windows, for some models as well (only when using non-IBM drivers). * FreeBSD (on the A22M)" Johannes ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-10 11:43 ` Stefan Seyfried 2007-03-10 13:53 ` Johannes Stezenbach @ 2007-03-10 15:18 ` Ingo Molnar 2007-03-10 22:08 ` Pavel Machek 1 sibling, 1 reply; 45+ messages in thread From: Ingo Molnar @ 2007-03-10 15:18 UTC (permalink / raw) To: Stefan Seyfried Cc: Ash Milsted, Johannes Stezenbach, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, Alexey Starikovskiy, dmitry.torokhov, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Thomas Meyer, Michael S. Tsirkin, Andrew Morton, Linus Torvalds * Stefan Seyfried <seife@suse.de> wrote: > > btw., the s2ram database seems quite a bit spotty: > > > > $ ./s2ram -n > > Machine is unknown. > > This machine can be identified by: > > sys_vendor = "System manufacturer" > > sys_product = "System Product Name" > > sys_version = "System Version" > > bios_version = "ASUS A8N-E ACPI BIOS Revision 1008" > > > > $ ./s2ram -n > > Machine is unknown. > > This machine can be identified by: > > sys_vendor = "Hewlett-Packard " > > sys_product = "compaq nx9030 (PG630ET#ABD) " > > sys_version = "Rev 1 " > > bios_version = "F.15 " > > See http://en.opensuse.org/S2ram for details. > > > > even at the link above i didnt find any clear algorithm about how to > > extend the quirks-list and the white-list - while i expect that most > > people experience what i did: that s2ram doesnt know their boxes. > > (otherwise they would not visit that URL at all i suspect) > > Ok. To be honest, you are the first reporter that seems to have read > the documentation above, but not understood what to do. thanks for the compliment ;-) _I_ very much know what to do (i mailed the right person after all ;), but i dont really count and on the 6 systems i tried s2ram said on 5 that it's 'unknown', i assumed it could possibly be due to the visible lack of clear instructions on that webpage ;-) > i might consider reworking the documentation if there are more reports > about problems with the procedure. Probably tweaking the webpage doesnt help because people dont get there - as the results plainly show it. Maybe some more automation would be useful too, a tool that detects failed resume and tries all those options that makes sense on that box or something? It's not like that people dont _try_ suspend/resume on Linux, it's just that they find it doesnt work and there is no clear mortal-usable (end-user-) way of fixing it and submitting the result of that fixing. Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-10 15:18 ` Ingo Molnar @ 2007-03-10 22:08 ` Pavel Machek 2007-03-11 8:20 ` Ingo Molnar 0 siblings, 1 reply; 45+ messages in thread From: Pavel Machek @ 2007-03-10 22:08 UTC (permalink / raw) To: Ingo Molnar Cc: Stefan Seyfried, Johannes Stezenbach, Linus Torvalds, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov Hi! > > Ok. To be honest, you are the first reporter that seems to have read > > the documentation above, but not understood what to do. > > thanks for the compliment ;-) _I_ very much know what to do (i mailed > the right person after all ;), but i dont really count and on the 6 (Can we get the whitelist entries?) > > i might consider reworking the documentation if there are more reports > > about problems with the procedure. > > Probably tweaking the webpage doesnt help because people dont get there > - as the results plainly show it. Maybe some more automation would be > useful too, a tool that detects failed resume and tries all those > options that makes sense on that box or something? It's not like > that Unfortunately, these tend to crash the box when you pass wrong options, and I do not see easy way to test "can user see whats on display" automatically. Feel free to send patches, but I do not think it is easy/possible. Pavel -- (english) http://www.livejournal.com/~pavelmachek (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-10 22:08 ` Pavel Machek @ 2007-03-11 8:20 ` Ingo Molnar 2007-03-12 6:34 ` Stefan Seyfried 0 siblings, 1 reply; 45+ messages in thread From: Ingo Molnar @ 2007-03-11 8:20 UTC (permalink / raw) To: Pavel Machek Cc: Ash Milsted, Johannes Stezenbach, Arkadiusz Miskiewicz, linux-pm, Jens Axboe, Alexey Starikovskiy, dmitry.torokhov, Stefan Seyfried, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Thomas Meyer, Michael S. Tsirkin, Andrew Morton, Linus Torvalds * Pavel Machek <pavel@suse.cz> wrote: > > Probably tweaking the webpage doesnt help because people dont get > > there - as the results plainly show it. Maybe some more automation > > would be useful too, a tool that detects failed resume and tries all > > those options that makes sense on that box or something? It's not > > like that > > Unfortunately, these tend to crash the box when you pass wrong > options, and I do not see easy way to test "can user see whats on > display" automatically. you could perhaps try what X's modesetting utility does: display a dialog box that times out if it does not get clicked on, and reboot if it did not get clicked on. Likewise, detect upon the next bootup that a suspend-test was in progress (and didnt get back via normal resume), via some temporary file. That way both the 'did not resume and i had to power-cycle' and the 'resume did not restore my X' problems can be handled. Finally, when the correct options have been established (worse-case with a small number of reboots and "yes, indeed the resume did not work fine" clicks done upon bootup by the user), automatically fill in a webform in firefox and ask the user to do a single click to submit that form. techniques like that have more chance i think to get Linux suspend/resume anywhere near to working. The current 'rely on the developer' technique apparently does not work. Ingo ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-11 8:20 ` Ingo Molnar @ 2007-03-12 6:34 ` Stefan Seyfried 0 siblings, 0 replies; 45+ messages in thread From: Stefan Seyfried @ 2007-03-12 6:34 UTC (permalink / raw) To: Ingo Molnar Cc: Pavel Machek, Johannes Stezenbach, Linus Torvalds, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov On Sun, Mar 11, 2007 at 09:20:02AM +0100, Ingo Molnar wrote: > > * Pavel Machek <pavel@suse.cz> wrote: > > Unfortunately, these tend to crash the box when you pass wrong > > options, and I do not see easy way to test "can user see whats on > > display" automatically. > > you could perhaps try what X's modesetting utility does: display a > dialog box that times out if it does not get clicked on, and reboot if > it did not get clicked on. Likewise, detect upon the next bootup that a > suspend-test was in progress (and didnt get back via normal resume), via > some temporary file. That way both the 'did not resume and i had to > power-cycle' and the 'resume did not restore my X' problems can be > handled. > > Finally, when the correct options have been established (worse-case with > a small number of reboots and "yes, indeed the resume did not work fine" > clicks done upon bootup by the user), automatically fill in a webform in > firefox and ask the user to do a single click to submit that form. There is ongoing work to make something like this happen, but it will take some time (very probably the whitelist will end up in a HAL fdi-file which gives the additional advantage that interested vendors can supply a linux-"driver" for their notebooks which makes suspend "just work" (the "driver" simply being a .fdi-file that lists the workaround for this particular machine) > techniques like that have more chance i think to get Linux > suspend/resume anywhere near to working. The current 'rely on the > developer' technique apparently does not work. It is not too bad actually, i seem to be getting lots of whitelist entries from people that are first time shell users, and i usually walk them through the process in just one or two mails. But yes, it does not scale well ;-) I was already thinking about some more sophisticated "wildcard" matching, something like "if it is a HP and it has an ATI gfx chip, and we do not have an exact match, then do VBE_POST|VBE_MODE" or similar, which would probably get many machines working fine (it turns out that almost all new machines from a vendor with similar hardware need similar workarounds, but for example the hp's with ATI need different quirks than the hp's with intel. Only thinkpads seem to almost universally work with s3_bios,s3_mode, with some x86_64 models being the exception). Something like that should be pretty easy once the whitelist is kept in HAL. -- Stefan Seyfried "Any ideas, John?" "Well, surrounding them's out." ^ permalink raw reply [flat|nested] 45+ messages in thread
* s2ram (was Re: [2/6] 2.6.21-rc2: known regressions) 2007-03-10 9:01 ` Ingo Molnar 2007-03-10 11:43 ` Stefan Seyfried @ 2007-03-10 22:04 ` Pavel Machek 1 sibling, 0 replies; 45+ messages in thread From: Pavel Machek @ 2007-03-10 22:04 UTC (permalink / raw) To: Ingo Molnar Cc: Johannes Stezenbach, Linus Torvalds, Michael S. Tsirkin, Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input Hi! > > > Even if one doesn't use the fb console at all, radeonfb apparently > > > is still required on some ThinkPad models to work around BIOS bugs: > > > > > > http://www.thinkwiki.org/wiki/Problem_with_high_power_drain_in_ACPI_sleep#Radeon_GPU_not_powered_off > > > > > > s2ram should be able to work around this, it has parts from > > radeontool. (suspend.sf.net). > > i'm wondering, do you have any idea how Windows handles the > suspend/resume quirks problem area? Do they "curse BIOS vendors and Windows actually have (kernel) graphics drivers that know how to resume the video. If you boot save mode, they go w/o graphics drivers, and have similar problems to us. > btw., the s2ram database seems quite a bit spotty: > > $ ./s2ram -n > Machine is unknown. > This machine can be identified by: > sys_vendor = "System manufacturer" > sys_product = "System Product Name" > sys_version = "System Version" > bios_version = "ASUS A8N-E ACPI BIOS Revision 1008" > > $ ./s2ram -n > Machine is unknown. > This machine can be identified by: > sys_vendor = "Hewlett-Packard " > sys_product = "compaq nx9030 (PG630ET#ABD) " > sys_version = "Rev 1 " > bios_version = "F.15 " > See http://en.opensuse.org/S2ram for details. Desktops are the problem; but that nx9030 should be reasonably easy to add. > even at the link above i didnt find any clear algorithm about how to > extend the quirks-list and the white-list - while i expect that most > people experience what i did: that s2ram doesnt know their boxes. > (otherwise they would not visit that URL at all i suspect) Did you get options for s2ram that work on your systems? Mail them to seife@suse.de, and he'll extend the whitelist ;-). In the meantime, just use -f <whatever> Pavel -- (english) http://www.livejournal.com/~pavelmachek (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 18:01 ` Linus Torvalds 2007-03-08 19:06 ` Ingo Molnar 2007-03-08 19:25 ` Ingo Molnar @ 2007-03-08 19:46 ` Michael S. Tsirkin 2007-03-08 19:57 ` Michael S. Tsirkin 3 siblings, 0 replies; 45+ messages in thread From: Michael S. Tsirkin @ 2007-03-08 19:46 UTC (permalink / raw) To: Linus Torvalds Cc: Adrian Bunk, Andrew Morton, Linux Kernel Mailing List, Jens Axboe, Jeff Chua, pavel, linux-pm, lenb, linux-acpi, luming.yu, Arkadiusz Miskiewicz, Konstantin Karasyov, Greg KH, linux-usb-devel, Thomas Meyer, Meelis Roos, Alexey Starikovskiy, Janosch Machowinski, vladimir.p.lebedev, Ash Milsted, dmitry.torokhov, linux-input, Eric W. Biederman, Ingo Molnar > Quoting Linus Torvalds <torvalds@linux-foundation.org>: > > > > Here's the status with -rc3: better, but still does not work as well as 2.6.20. > > Ok. I think we mostly solved the irq-related stuff, but you might want to > check whether you have CONFIG_NOHZ on or off and whether that makes a > difference. This testing was done without NO_HZ. -- MST ^ permalink raw reply [flat|nested] 45+ messages in thread
* Re: [2/6] 2.6.21-rc2: known regressions 2007-03-08 18:01 ` Linus Torvalds ` (2 preceding siblings ...) 2007-03-08 19:46 ` [2/6] 2.6.21-rc2: known regressions Michael S. Tsirkin @ 2007-03-08 19:57 ` Michael S. Tsirkin 3 siblings, 0 replies; 45+ messages in thread From: Michael S. Tsirkin @ 2007-03-08 19:57 UTC (permalink / raw) To: Linus Torvalds Cc: Ash Milsted, dmitry.torokhov, Arkadiusz Miskiewicz, Jens Axboe, linux-input, Alexey Starikovskiy, Ingo Molnar, linux-usb-devel, Jeff Chua, Meelis Roos, Janosch Machowinski, linux-pm, Linux Kernel Mailing List, Adrian Bunk, linux-acpi, Eric W. Biederman, Thomas Meyer, Andrew Morton > > 2. First disk access after resume takes a couple of minutes > > (seemed instant with 2.6.20) during this time no new messages show on console > > Yeah, there is some problem with SATA resume. It would be beautiful if the > people who actually see this could narrow it down with bisection. "It > works for me" is clearly the case for many people, but not all. Problem is, there seem to be multiple problems some of which got fixed between rc2 and rc3. With rc2 I didn't get as far as getting to console. -- MST ^ permalink raw reply [flat|nested] 45+ messages in thread
end of thread, other threads:[~2007-03-20 14:44 UTC | newest]
Thread overview: 45+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <Pine.LNX.4.64.0702272105220.12485@woody.linux-foundation.org>
2007-03-05 1:50 ` [1/6] 2.6.21-rc2: known regressions Adrian Bunk
2007-03-05 2:26 ` Andrew Morton
2007-03-05 3:35 ` Greg KH
2007-03-06 0:55 ` Johannes Berg
2007-03-05 4:01 ` Mark Lord
2007-03-05 4:34 ` Greg KH
2007-03-05 12:42 ` Marcel Holtmann
2007-03-07 11:06 ` Jeff Garzik
2007-03-07 22:17 ` Albert Hopkins
2007-03-05 1:50 ` [2/6] " Adrian Bunk
2007-03-07 11:09 ` Jeff Garzik
2007-03-07 16:10 ` Linus Torvalds
2007-03-08 12:03 ` Ash Milsted
2007-03-08 12:31 ` Michael S. Tsirkin
2007-03-08 15:11 ` Jeff Chua
2007-03-08 18:01 ` Linus Torvalds
2007-03-08 19:06 ` Ingo Molnar
2007-03-08 19:10 ` Ingo Molnar
2007-03-08 19:47 ` Michael S. Tsirkin
2007-03-08 20:10 ` Ingo Molnar
2007-03-08 19:25 ` Ingo Molnar
2007-03-08 23:07 ` Ingo Molnar
2007-03-08 23:12 ` Ingo Molnar
2007-03-08 23:28 ` Ingo Molnar
2007-03-08 23:49 ` Linus Torvalds
2007-03-09 10:56 ` Ingo Molnar
2007-03-09 18:00 ` Linus Torvalds
2007-03-09 11:19 ` Pavel Machek
2007-03-18 16:07 ` Ingo Molnar
2007-03-18 16:40 ` [linux-pm] " Jim Gettys
2007-03-19 20:33 ` Bill Davidsen
2007-03-19 22:08 ` Jim Gettys
2007-03-20 14:44 ` Bill Davidsen
2007-03-09 17:48 ` Johannes Stezenbach
2007-03-09 23:35 ` Pavel Machek
2007-03-10 9:01 ` Ingo Molnar
2007-03-10 11:43 ` Stefan Seyfried
2007-03-10 13:53 ` Johannes Stezenbach
2007-03-10 15:18 ` Ingo Molnar
2007-03-10 22:08 ` Pavel Machek
2007-03-11 8:20 ` Ingo Molnar
2007-03-12 6:34 ` Stefan Seyfried
2007-03-10 22:04 ` s2ram (was Re: [2/6] 2.6.21-rc2: known regressions) Pavel Machek
2007-03-08 19:46 ` [2/6] 2.6.21-rc2: known regressions Michael S. Tsirkin
2007-03-08 19:57 ` Michael S. Tsirkin
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox