* 2.6.39-rc3-git7: Reported regressions from 2.6.38 @ 2011-04-17 12:52 Rafael J. Wysocki 2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki ` (10 more replies) 0 siblings, 11 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:52 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Maciej Rutecki, Florian Mickler, Andrew Morton, Linus Torvalds, Kernel Testers List, Network Development, Linux ACPI, Linux PM List, Linux SCSI List, Linux Wireless List, DRI This message contains a list of some regressions from 2.6.38, for which there are no fixes in the mainline known to the tracking team. If any of them have been fixed already, please let us know. If you know of any other unresolved regressions from 2.6.38, please let us know either and we'll add them to the list. Also, please let us know if any of the entries below are invalid. Each entry from the list will be sent additionally in an automatic reply to this message with CCs to the people involved in reporting and handling the issue. Listed regressions statistics: Date Total Pending Unresolved ---------------------------------------- 2011-04-17 17 11 10 Unresolved regressions ---------------------- Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33342 Subject : [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive. Submitter : Shaun Ruffell <sruffell@digium.com> Date : 2011-04-08 20:15 (10 days old) Message-ID : <20110408201513.GA3040@digium.com> References : http://marc.info/?l=linux-kernel&m=130229371907209&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33272 Subject : drm related hard-hang Submitter : Peter Teoh <htmldeveloper@gmail.com> Date : 2011-04-14 01:29 (4 days old) Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33242 Subject : Lockdep splat in autofs with 2.6.39-rc2 Submitter : Nick Bowler <nbowler@elliptictech.com> Date : 2011-04-07 19:44 (11 days old) Message-ID : <20110407194403.GA29404@elliptictech.com> References : http://marc.info/?l=linux-kernel&m=130220545614682&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33142 Subject : 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Submitter : Keith Packard <keithp@keithp.com> Date : 2011-04-06 7:44 (12 days old) Message-ID : <yun1v1fj024.fsf@aiko.keithp.com> References : http://marc.info/?l=linux-kernel&m=130207593728273&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33102 Subject : File's copied from client->linux server only copy 1st 64K data;rest is lost Submitter : Linda Walsh <lkml@tlinx.org> Date : 2011-04-11 22:12 (7 days old) Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33092 Subject : [regression] 2.6.39-rc1 - Beagleboard usbnet broken Submitter : Mark Jackson <mpfj-list@mimc.co.uk> Date : 2011-04-04 9:22 (14 days old) First-Bad-Commit: http://git.kernel.org/linus/087809fce28f50098d9c3ef1a6865c722f23afd2 Message-ID : <4D998DC9.3040109@mimc.co.uk> References : http://marc.info/?l=linux-kernel&m=130191386508831&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 Subject : Kernel locks up a few minutes after boot Submitter : Bart Van Assche <bart.vanassche@gmail.com> Date : 2011-04-10 19:55 (8 days old) Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902 Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Submitter : Alex Romosan <romosan@sycorax.lbl.gov> Date : 2011-04-03 19:41 (15 days old) Message-ID : <87k4fbnmw8.fsf@sycorax.lbl.gov> References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32892 Subject : 2.6.39-rc1 data corruption with rtorrent Submitter : Jindrich Makovicka <makovick@gmail.com> Date : 2011-04-02 20:21 (16 days old) Message-ID : <20110402222118.3b5c2fa8@holly> References : http://marc.info/?l=linux-kernel&m=130177570309226&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32262 Subject : 2.6.38-git15+ IDE hangs boot Submitter : Pete Clements <clem@clem.clem-digital.net> Date : 2011-03-25 15:38 (24 days old) Message-ID : <201103251538.p2PFc11i001674@clem.clem-digital.net> References : http://marc.info/?l=linux-kernel&m=130106749313695&w=2 Regressions with patches ------------------------ Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252 Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Submitter : Shaun Ruffell <sruffell@digium.com> Date : 2011-04-06 22:30 (12 days old) First-Bad-Commit: http://git.kernel.org/linus/242214f9c1eeaae40eca11e3b4d37bfce960a7cd Message-ID : <20110406223036.GA15721@digium.com> References : http://marc.info/?l=linux-kernel&m=130212907032580&w=2 Handled-By : Don Zickus <dzickus@redhat.com> Patch : http://cache.gmane.org//gmane/linux/kernel/1125621-001.bin For details, please visit the bug entries and follow the links given in references. As you can see, there is a Bugzilla entry for each of the listed regressions. There also is a Bugzilla entry used for tracking the regressions from 2.6.38, unresolved as well as resolved, at: http://bugzilla.kernel.org/show_bug.cgi?id=32012 Please let the tracking team know if there are any Bugzilla entries that should be added to the list in there. Thanks! ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #32262] 2.6.38-git15+ IDE hangs boot 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki @ 2011-04-17 12:52 ` Rafael J. Wysocki 2011-04-17 13:16 ` Pete Clements 2011-04-17 12:57 ` [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Rafael J. Wysocki ` (9 subsequent siblings) 10 siblings, 1 reply; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:52 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Pete Clements This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32262 Subject : 2.6.38-git15+ IDE hangs boot Submitter : Pete Clements <clem-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org> Date : 2011-03-25 15:38 (24 days old) Message-ID : <201103251538.p2PFc11i001674-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130106749313695&w=2 ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32262] 2.6.38-git15+ IDE hangs boot 2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki @ 2011-04-17 13:16 ` Pete Clements [not found] ` <201104171316.p3HDGI8Y018237-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Pete Clements @ 2011-04-17 13:16 UTC (permalink / raw) To: Rafael J. Wysocki Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Pete Clements, Linux Kernel Mailing List > > The following bug entry is on the current list of known regressions > from 2.6.38. Please verify if it still should be listed and let the tracking team > know (either way). > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=3D32262 > Subject : 2.6.38-git15+ IDE hangs boot > Submitter : Pete Clements <clem@clem.clem-digital.net> > Date : 2011-03-25 15:38 (24 days old) > Message-ID : <201103251538.p2PFc11i001674@clem.clem-digital.net> > References : http://marc.info/?l=3Dlinux-kernel&m=3D130106749313695&w=3D2 > > I no longer experience the problem. Don't recall when the fix was integrated (post git19?). (Currently at 39-rc3-git6.) -- Pete Clements ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <201104171316.p3HDGI8Y018237-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org>]
* Re: [Bug #32262] 2.6.38-git15+ IDE hangs boot [not found] ` <201104171316.p3HDGI8Y018237-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org> @ 2011-04-17 13:28 ` Rafael J. Wysocki 0 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 13:28 UTC (permalink / raw) To: Pete Clements Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Linux Kernel Mailing List On Sunday, April 17, 2011, Pete Clements wrote: > > > > The following bug entry is on the current list of known regressions > > from 2.6.38. Please verify if it still should be listed and let the tracking team > > know (either way). > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=3D32262 > > Subject : 2.6.38-git15+ IDE hangs boot > > Submitter : Pete Clements <clem-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org> > > Date : 2011-03-25 15:38 (24 days old) > > Message-ID : <201103251538.p2PFc11i001674-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org> > > References : http://marc.info/?l=3Dlinux-kernel&m=3D130106749313695&w=3D2 > > > > > > I no longer experience the problem. Don't recall when the fix was integrated > (post git19?). (Currently at 39-rc3-git6.) Thanks, closing. Rafael ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki 2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-18 15:34 ` Alex Romosan 2011-04-17 12:57 ` [Bug #32892] 2.6.39-rc1 data corruption with rtorrent Rafael J. Wysocki ` (8 subsequent siblings) 10 siblings, 1 reply; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Alex Romosan This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902 Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Submitter : Alex Romosan <romosan-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org> Date : 2011-04-03 19:41 (15 days old) Message-ID : <87k4fbnmw8.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2 ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 2011-04-17 12:57 ` [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Rafael J. Wysocki @ 2011-04-18 15:34 ` Alex Romosan [not found] ` <87ipub7evl.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Alex Romosan @ 2011-04-18 15:34 UTC (permalink / raw) To: Rafael J. Wysocki Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler "Rafael J. Wysocki" <rjw@sisk.pl> writes: > This message has been generated automatically as a part of a summary report > of recent regressions. > > The following bug entry is on the current list of known regressions > from 2.6.38. Please verify if it still should be listed and let the > tracking team > know (either way). > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902 > Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 > Submitter : Alex Romosan <romosan@sycorax.lbl.gov> > Date : 2011-04-03 19:41 (15 days old) > Message-ID : <87k4fbnmw8.fsf@sycorax.lbl.gov> > References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2 > my laptop works again with 2.6.39-rc3 (except the boot process hangs at waiting for /dev to be populated.... if i hit Ctrl-C then the booting proceeds normally. this doesn't happen with 2.6.38) so probably this bug can be closed. --alex-- -- | I believe the moment is at hand when, by a paranoiac and active | | advance of the mind, it will be possible (simultaneously with | | automatism and other passive states) to systematize confusion | | and thus to help to discredit completely the world of reality. | ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <87ipub7evl.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org>]
* Re: [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 [not found] ` <87ipub7evl.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org> @ 2011-04-18 21:22 ` Rafael J. Wysocki 0 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-18 21:22 UTC (permalink / raw) To: Alex Romosan Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler On Monday, April 18, 2011, Alex Romosan wrote: > "Rafael J. Wysocki" <rjw-KKrjLPT3xs0@public.gmane.org> writes: > > > This message has been generated automatically as a part of a summary report > > of recent regressions. > > > > The following bug entry is on the current list of known regressions > > from 2.6.38. Please verify if it still should be listed and let the > > tracking team > > know (either way). > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902 > > Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 > > Submitter : Alex Romosan <romosan-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org> > > Date : 2011-04-03 19:41 (15 days old) > > Message-ID : <87k4fbnmw8.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org> > > References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2 > > > > my laptop works again with 2.6.39-rc3 (except the boot process hangs at > waiting for /dev to be populated.... if i hit Ctrl-C then the booting > proceeds normally. this doesn't happen with 2.6.38) so probably this bug > can be closed. Thanks, closing. Rafael ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #32892] 2.6.39-rc1 data corruption with rtorrent 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki 2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33092] [regression] 2.6.39-rc1 - Beagleboard usbnet broken Rafael J. Wysocki ` (7 subsequent siblings) 10 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Jindrich Makovicka This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32892 Subject : 2.6.39-rc1 data corruption with rtorrent Submitter : Jindrich Makovicka <makovick@gmail.com> Date : 2011-04-02 20:21 (16 days old) Message-ID : <20110402222118.3b5c2fa8@holly> References : http://marc.info/?l=linux-kernel&m=130177570309226&w=2 ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #33092] [regression] 2.6.39-rc1 - Beagleboard usbnet broken 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (2 preceding siblings ...) 2011-04-17 12:57 ` [Bug #32892] 2.6.39-rc1 data corruption with rtorrent Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki ` (6 subsequent siblings) 10 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Dan Williams, Jamie Iles, Jamie Iles, Mark Jackson This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33092 Subject : [regression] 2.6.39-rc1 - Beagleboard usbnet broken Submitter : Mark Jackson <mpfj-list-kZtEnBLzDKq1Qrn1Bg8BZw@public.gmane.org> Date : 2011-04-04 9:22 (14 days old) First-Bad-Commit: http://git.kernel.org/linus/087809fce28f50098d9c3ef1a6865c722f23afd2 Message-ID : <4D998DC9.3040109-kZtEnBLzDKq1Qrn1Bg8BZw@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130191386508831&w=2 ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #32982] Kernel locks up a few minutes after boot 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (3 preceding siblings ...) 2011-04-17 12:57 ` [Bug #33092] [regression] 2.6.39-rc1 - Beagleboard usbnet broken Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-17 17:03 ` Linus Torvalds 2011-04-17 12:57 ` [Bug #33242] Lockdep splat in autofs with 2.6.39-rc2 Rafael J. Wysocki ` (5 subsequent siblings) 10 siblings, 1 reply; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Bart Van Assche, Linus Torvalds This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 Subject : Kernel locks up a few minutes after boot Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-10 19:55 (8 days old) ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot 2011-04-17 12:57 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki @ 2011-04-17 17:03 ` Linus Torvalds [not found] ` <BANLkTimeaHwBiZoz=isCM=qtBxoxaBqDUw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Linus Torvalds @ 2011-04-17 17:03 UTC (permalink / raw) To: Rafael J. Wysocki Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Bart Van Assche, Neil Brown, Jens Axboe Is this machine running a RAID5 setup or something like that? There is a known interaction with the new block layer plugging code and MD. The "hung task" report in that bugzilla looks very much like that issue. And you do have "root=/dev/md0", so clearly there's some md thing going on. And bisecting might not work all that well for it, because I suspect it ends up being very much a matter of IO patterns how it triggers. Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens? Linus On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote: > This message has been generated automatically as a part of a summary report > of recent regressions. > > The following bug entry is on the current list of known regressions > from 2.6.38. Please verify if it still should be listed and let the tracking team > know (either way). > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 > Subject : Kernel locks up a few minutes after boot > Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> > Date : 2011-04-10 19:55 (8 days old) > > > ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTimeaHwBiZoz=isCM=qtBxoxaBqDUw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTimeaHwBiZoz=isCM=qtBxoxaBqDUw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-04-17 18:37 ` Bart Van Assche [not found] ` <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Bart Van Assche @ 2011-04-17 18:37 UTC (permalink / raw) To: Linus Torvalds Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown, Jens Axboe On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds <torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote: > On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote: > > This message has been generated automatically as a part of a summary report > > of recent regressions. > > > > The following bug entry is on the current list of known regressions > > from 2.6.38. Please verify if it still should be listed and let the tracking team > > know (either way). > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 > > Subject : Kernel locks up a few minutes after boot > > Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> > > Date : 2011-04-10 19:55 (8 days old) > > Is this machine running a RAID5 setup or something like that? > > There is a known interaction with the new block layer plugging code > and MD. The "hung task" report in that bugzilla looks very much like > that issue. And you do have "root=/dev/md0", so clearly there's some > md thing going on. > > And bisecting might not work all that well for it, because I suspect > it ends up being very much a matter of IO patterns how it triggers. > > Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens? (converted top-posting into bottom-posting) Hello Linus, On the system on which bug #32982 has been triggered md0, md1 and md2 have been configured as two-disk RAID1 (mirroring). I've done my best to trigger enough I/O in order to obtain reliable bisect results. A difficulty I encountered during bisecting though was that I encountered unbootable kernels (all skipped revisions). Bart. ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-04-17 21:07 ` NeilBrown [not found] ` <20110418070711.12dd5cc1-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org> 2011-04-18 11:44 ` Jens Axboe 1 sibling, 1 reply; 41+ messages in thread From: NeilBrown @ 2011-04-17 21:07 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Jens Axboe On Sun, 17 Apr 2011 20:37:39 +0200 Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote: > On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds > <torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote: > > On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote: > > > This message has been generated automatically as a part of a summary report > > > of recent regressions. > > > > > > The following bug entry is on the current list of known regressions > > > from 2.6.38. Please verify if it still should be listed and let the tracking team > > > know (either way). > > > > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 > > > Subject : Kernel locks up a few minutes after boot > > > Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> > > > Date : 2011-04-10 19:55 (8 days old) > > > > Is this machine running a RAID5 setup or something like that? > > > > There is a known interaction with the new block layer plugging code > > and MD. The "hung task" report in that bugzilla looks very much like > > that issue. And you do have "root=/dev/md0", so clearly there's some > > md thing going on. > > > > And bisecting might not work all that well for it, because I suspect > > it ends up being very much a matter of IO patterns how it triggers. > > > > Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens? > > (converted top-posting into bottom-posting) > > Hello Linus, > > On the system on which bug #32982 has been triggered md0, md1 and md2 > have been configured as two-disk RAID1 (mirroring). If any of those have write-intent bitmaps then I definitely know what the problem is and I'll be posting patches later today (probably not much later). If not .. then I'm less sure but it would certainly be worth testing after applying the promised fixes. NeilBrown > > I've done my best to trigger enough I/O in order to obtain reliable > bisect results. A difficulty I encountered during bisecting though was > that I encountered unbootable kernels (all skipped revisions). > > Bart. ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <20110418070711.12dd5cc1-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <20110418070711.12dd5cc1-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org> @ 2011-04-17 22:20 ` NeilBrown 0 siblings, 0 replies; 41+ messages in thread From: NeilBrown @ 2011-04-17 22:20 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Jens Axboe On Mon, 18 Apr 2011 07:07:11 +1000 NeilBrown <neilb-l3A5Bk7waGM@public.gmane.org> wrote: > On Sun, 17 Apr 2011 20:37:39 +0200 Bart Van Assche <bvanassche-y/PYEvSCHaw@public.gmane.orgg> wrote: > > > On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds > > <torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote: > > > On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote: > > > > This message has been generated automatically as a part of a summary report > > > > of recent regressions. > > > > > > > > The following bug entry is on the current list of known regressions > > > > from 2.6.38. Please verify if it still should be listed and let the tracking team > > > > know (either way). > > > > > > > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 > > > > Subject : Kernel locks up a few minutes after boot > > > > Submitter : Bart Van Assche <bart.vanassche@gmail.com> > > > > Date : 2011-04-10 19:55 (8 days old) > > > > > > Is this machine running a RAID5 setup or something like that? > > > > > > There is a known interaction with the new block layer plugging code > > > and MD. The "hung task" report in that bugzilla looks very much like > > > that issue. And you do have "root=/dev/md0", so clearly there's some > > > md thing going on. > > > > > > And bisecting might not work all that well for it, because I suspect > > > it ends up being very much a matter of IO patterns how it triggers. > > > > > > Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens? > > > > (converted top-posting into bottom-posting) > > > > Hello Linus, > > > > On the system on which bug #32982 has been triggered md0, md1 and md2 > > have been configured as two-disk RAID1 (mirroring). > > If any of those have write-intent bitmaps then I definitely know what the > problem is and I'll be posting patches later today (probably not much later). > Actually it won't be today. The new block device plugging is still unusable for MD - so I won't be able to fix this until that gets sorted out. NeilBrown ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-17 21:07 ` NeilBrown @ 2011-04-18 11:44 ` Jens Axboe [not found] ` <4DAC2429.5000105-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 1 sibling, 1 reply; 41+ messages in thread From: Jens Axboe @ 2011-04-18 11:44 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On 2011-04-17 20:37, Bart Van Assche wrote: > On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds > <torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote: >> On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote: >>> This message has been generated automatically as a part of a summary report >>> of recent regressions. >>> >>> The following bug entry is on the current list of known regressions >>> from 2.6.38. Please verify if it still should be listed and let the tracking team >>> know (either way). >>> >>> >>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 >>> Subject : Kernel locks up a few minutes after boot >>> Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> >>> Date : 2011-04-10 19:55 (8 days old) >> >> Is this machine running a RAID5 setup or something like that? >> >> There is a known interaction with the new block layer plugging code >> and MD. The "hung task" report in that bugzilla looks very much like >> that issue. And you do have "root=/dev/md0", so clearly there's some >> md thing going on. >> >> And bisecting might not work all that well for it, because I suspect >> it ends up being very much a matter of IO patterns how it triggers. >> >> Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens? > > (converted top-posting into bottom-posting) > > Hello Linus, > > On the system on which bug #32982 has been triggered md0, md1 and md2 > have been configured as two-disk RAID1 (mirroring). > > I've done my best to trigger enough I/O in order to obtain reliable > bisect results. A difficulty I encountered during bisecting though was > that I encountered unbootable kernels (all skipped revisions). Bart, can you try and pull: git://git.kernel.dk/linux-2.6-block.git for-linus into Linus' tree and see if that works? This has, among other things, Neils fixes for MD. -- Jens Axboe ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <4DAC2429.5000105-5c4llco8/ftWk0Htik3J/w@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <4DAC2429.5000105-5c4llco8/ftWk0Htik3J/w@public.gmane.org> @ 2011-04-18 18:21 ` Bart Van Assche [not found] ` <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Bart Van Assche @ 2011-04-18 18:21 UTC (permalink / raw) To: Jens Axboe Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: > Bart, can you try and pull: > > git://git.kernel.dk/linux-2.6-block.git for-linus > > into Linus' tree and see if that works? This has, among other things, > Neils fixes for MD. md seems to work stable with the resulting tree, but it looks there is a performance regression in the block layer not related to the md issue. If I run a small block IOPS test on a block device created by ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). Bart. ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-04-18 18:28 ` Jens Axboe [not found] ` <4DAC82E6.3020809-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 2011-04-19 3:32 ` David Dillow 1 sibling, 1 reply; 41+ messages in thread From: Jens Axboe @ 2011-04-18 18:28 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On 2011-04-18 20:21, Bart Van Assche wrote: > On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: >> Bart, can you try and pull: >> >> git://git.kernel.dk/linux-2.6-block.git for-linus >> >> into Linus' tree and see if that works? This has, among other things, >> Neils fixes for MD. > > md seems to work stable with the resulting tree, but it looks there is OK, that's the most important bit. > a performance regression in the block layer not related to the md > issue. If I run a small block IOPS test on a block device created by > ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 > (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). That's not good. What's the test case? -- Jens Axboe ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <4DAC82E6.3020809-5c4llco8/ftWk0Htik3J/w@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <4DAC82E6.3020809-5c4llco8/ftWk0Htik3J/w@public.gmane.org> @ 2011-04-18 18:32 ` Bart Van Assche [not found] ` <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Bart Van Assche @ 2011-04-18 18:32 UTC (permalink / raw) To: Jens Axboe Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: > On 2011-04-18 20:21, Bart Van Assche wrote: >> a performance regression in the block layer not related to the md >> issue. If I run a small block IOPS test on a block device created by >> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 >> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). > > That's not good. What's the test case? Nothing more than a fio IOPS test: fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1 Bart. ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-04-18 18:38 ` Jens Axboe 2011-04-19 9:09 ` Jens Axboe 1 sibling, 0 replies; 41+ messages in thread From: Jens Axboe @ 2011-04-18 18:38 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On 2011-04-18 20:32, Bart Van Assche wrote: > On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: >> On 2011-04-18 20:21, Bart Van Assche wrote: >>> a performance regression in the block layer not related to the md >>> issue. If I run a small block IOPS test on a block device created by >>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 >>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). >> >> That's not good. What's the test case? > > Nothing more than a fio IOPS test: > > fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread > --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G > --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1 Interesting, I'll have to check if we regressed with all these recent changes. Comparing your .38 to .39-rc3+, are you using more/less CPU, more/less sys%, etc? A quick perf record -fg / perf report -g for both kernels would be nice to see. -- Jens Axboe ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-18 18:38 ` Jens Axboe @ 2011-04-19 9:09 ` Jens Axboe [not found] ` <4DAD5156.2050300-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 1 sibling, 1 reply; 41+ messages in thread From: Jens Axboe @ 2011-04-19 9:09 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On 2011-04-18 20:32, Bart Van Assche wrote: > On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: >> On 2011-04-18 20:21, Bart Van Assche wrote: >>> a performance regression in the block layer not related to the md >>> issue. If I run a small block IOPS test on a block device created by >>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 >>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). >> >> That's not good. What's the test case? > > Nothing more than a fio IOPS test: > > fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread > --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G > --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1 Bart, can you try the below: diff --git a/block/blk-core.c b/block/blk-core.c index 5fa3dd2..9b41da1 100644 --- a/block/blk-core.c +++ b/block/blk-core.c @@ -307,11 +307,7 @@ void __blk_run_queue(struct request_queue *q) * Only recurse once to avoid overrunning the stack, let the unplug * handling reinvoke the handler shortly if we already got there. */ - if (!queue_flag_test_and_set(QUEUE_FLAG_REENTER, q)) { - q->request_fn(q); - queue_flag_clear(QUEUE_FLAG_REENTER, q); - } else - queue_delayed_work(kblockd_workqueue, &q->delay_work, 0); + q->request_fn(q); } EXPORT_SYMBOL(__blk_run_queue); -- Jens Axboe ^ permalink raw reply related [flat|nested] 41+ messages in thread
[parent not found: <4DAD5156.2050300-5c4llco8/ftWk0Htik3J/w@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <4DAD5156.2050300-5c4llco8/ftWk0Htik3J/w@public.gmane.org> @ 2011-04-19 11:16 ` Jens Axboe 2011-04-19 16:13 ` Bart Van Assche [not found] ` <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 0 siblings, 2 replies; 41+ messages in thread From: Jens Axboe @ 2011-04-19 11:16 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On 2011-04-19 11:09, Jens Axboe wrote: > On 2011-04-18 20:32, Bart Van Assche wrote: >> On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: >>> On 2011-04-18 20:21, Bart Van Assche wrote: >>>> a performance regression in the block layer not related to the md >>>> issue. If I run a small block IOPS test on a block device created by >>>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 >>>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). >>> >>> That's not good. What's the test case? >> >> Nothing more than a fio IOPS test: >> >> fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread >> --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G >> --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1 > > Bart, can you try the below: Here's a more complete variant. James, lets get rid of this REENTER crap. It's completely bogus and triggers falsely for a variety of reasons. The below will work, but there may be room for improvement on the SCSI side. diff --git a/block/blk-core.c b/block/blk-core.c index 5fa3dd2..4e49665 100644 --- a/block/blk-core.c +++ b/block/blk-core.c @@ -303,15 +303,7 @@ void __blk_run_queue(struct request_queue *q) if (unlikely(blk_queue_stopped(q))) return; - /* - * Only recurse once to avoid overrunning the stack, let the unplug - * handling reinvoke the handler shortly if we already got there. - */ - if (!queue_flag_test_and_set(QUEUE_FLAG_REENTER, q)) { - q->request_fn(q); - queue_flag_clear(QUEUE_FLAG_REENTER, q); - } else - queue_delayed_work(kblockd_workqueue, &q->delay_work, 0); + q->request_fn(q); } EXPORT_SYMBOL(__blk_run_queue); @@ -328,6 +320,7 @@ void blk_run_queue_async(struct request_queue *q) if (likely(!blk_queue_stopped(q))) queue_delayed_work(kblockd_workqueue, &q->delay_work, 0); } +EXPORT_SYMBOL(blk_run_queue_async); /** * blk_run_queue - run a single device queue diff --git a/block/blk.h b/block/blk.h index c9df8fc..6126346 100644 --- a/block/blk.h +++ b/block/blk.h @@ -22,7 +22,6 @@ void blk_rq_timed_out_timer(unsigned long data); void blk_delete_timer(struct request *); void blk_add_timer(struct request *); void __generic_unplug_device(struct request_queue *); -void blk_run_queue_async(struct request_queue *q); /* * Internal atomic flags for request handling diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c index ab55c2f..e9901b8 100644 --- a/drivers/scsi/scsi_lib.c +++ b/drivers/scsi/scsi_lib.c @@ -411,8 +411,6 @@ static void scsi_run_queue(struct request_queue *q) list_splice_init(&shost->starved_list, &starved_list); while (!list_empty(&starved_list)) { - int flagset; - /* * As long as shost is accepting commands and we have * starved queues, call blk_run_queue. scsi_request_fn @@ -435,20 +433,7 @@ static void scsi_run_queue(struct request_queue *q) continue; } - spin_unlock(shost->host_lock); - - spin_lock(sdev->request_queue->queue_lock); - flagset = test_bit(QUEUE_FLAG_REENTER, &q->queue_flags) && - !test_bit(QUEUE_FLAG_REENTER, - &sdev->request_queue->queue_flags); - if (flagset) - queue_flag_set(QUEUE_FLAG_REENTER, sdev->request_queue); - __blk_run_queue(sdev->request_queue); - if (flagset) - queue_flag_clear(QUEUE_FLAG_REENTER, sdev->request_queue); - spin_unlock(sdev->request_queue->queue_lock); - - spin_lock(shost->host_lock); + blk_run_queue_async(sdev->request_queue); } /* put any unprocessed entries back */ list_splice(&starved_list, &shost->starved_list); diff --git a/drivers/scsi/scsi_transport_fc.c b/drivers/scsi/scsi_transport_fc.c index 28c3350..815069d 100644 --- a/drivers/scsi/scsi_transport_fc.c +++ b/drivers/scsi/scsi_transport_fc.c @@ -3816,28 +3816,17 @@ fail_host_msg: static void fc_bsg_goose_queue(struct fc_rport *rport) { - int flagset; - unsigned long flags; - if (!rport->rqst_q) return; + /* + * This get/put dance makes no sense + */ get_device(&rport->dev); - - spin_lock_irqsave(rport->rqst_q->queue_lock, flags); - flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) && - !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags); - if (flagset) - queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q); - __blk_run_queue(rport->rqst_q); - if (flagset) - queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q); - spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags); - + blk_run_queue_async(rport->rqst_q); put_device(&rport->dev); } - /** * fc_bsg_rport_dispatch - process rport bsg requests and dispatch to LLDD * @q: rport request queue diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h index cbbfd98..2ad95fa 100644 --- a/include/linux/blkdev.h +++ b/include/linux/blkdev.h @@ -388,20 +388,19 @@ struct request_queue #define QUEUE_FLAG_SYNCFULL 3 /* read queue has been filled */ #define QUEUE_FLAG_ASYNCFULL 4 /* write queue has been filled */ #define QUEUE_FLAG_DEAD 5 /* queue being torn down */ -#define QUEUE_FLAG_REENTER 6 /* Re-entrancy avoidance */ -#define QUEUE_FLAG_ELVSWITCH 7 /* don't use elevator, just do FIFO */ -#define QUEUE_FLAG_BIDI 8 /* queue supports bidi requests */ -#define QUEUE_FLAG_NOMERGES 9 /* disable merge attempts */ -#define QUEUE_FLAG_SAME_COMP 10 /* force complete on same CPU */ -#define QUEUE_FLAG_FAIL_IO 11 /* fake timeout */ -#define QUEUE_FLAG_STACKABLE 12 /* supports request stacking */ -#define QUEUE_FLAG_NONROT 13 /* non-rotational device (SSD) */ +#define QUEUE_FLAG_ELVSWITCH 6 /* don't use elevator, just do FIFO */ +#define QUEUE_FLAG_BIDI 7 /* queue supports bidi requests */ +#define QUEUE_FLAG_NOMERGES 8 /* disable merge attempts */ +#define QUEUE_FLAG_SAME_COMP 9 /* force complete on same CPU */ +#define QUEUE_FLAG_FAIL_IO 10 /* fake timeout */ +#define QUEUE_FLAG_STACKABLE 11 /* supports request stacking */ +#define QUEUE_FLAG_NONROT 12 /* non-rotational device (SSD) */ #define QUEUE_FLAG_VIRT QUEUE_FLAG_NONROT /* paravirt device */ -#define QUEUE_FLAG_IO_STAT 15 /* do IO stats */ -#define QUEUE_FLAG_DISCARD 16 /* supports DISCARD */ -#define QUEUE_FLAG_NOXMERGES 17 /* No extended merges */ -#define QUEUE_FLAG_ADD_RANDOM 18 /* Contributes to random pool */ -#define QUEUE_FLAG_SECDISCARD 19 /* supports SECDISCARD */ +#define QUEUE_FLAG_IO_STAT 13 /* do IO stats */ +#define QUEUE_FLAG_DISCARD 14 /* supports DISCARD */ +#define QUEUE_FLAG_NOXMERGES 15 /* No extended merges */ +#define QUEUE_FLAG_ADD_RANDOM 16 /* Contributes to random pool */ +#define QUEUE_FLAG_SECDISCARD 17 /* supports SECDISCARD */ #define QUEUE_FLAG_DEFAULT ((1 << QUEUE_FLAG_IO_STAT) | \ (1 << QUEUE_FLAG_STACKABLE) | \ @@ -699,6 +698,7 @@ extern void blk_sync_queue(struct request_queue *q); extern void __blk_stop_queue(struct request_queue *q); extern void __blk_run_queue(struct request_queue *q); extern void blk_run_queue(struct request_queue *); +extern void blk_run_queue_async(struct request_queue *q); extern int blk_rq_map_user(struct request_queue *, struct request *, struct rq_map_data *, void __user *, unsigned long, gfp_t); -- Jens Axboe ^ permalink raw reply related [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot 2011-04-19 11:16 ` Jens Axboe @ 2011-04-19 16:13 ` Bart Van Assche [not found] ` <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> [not found] ` <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 1 sibling, 1 reply; 41+ messages in thread From: Bart Van Assche @ 2011-04-19 16:13 UTC (permalink / raw) To: Jens Axboe Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown, David Dillow On Tue, Apr 19, 2011 at 1:16 PM, Jens Axboe <jaxboe@fusionio.com> wrote: > On 2011-04-19 11:09, Jens Axboe wrote: > > On 2011-04-18 20:32, Bart Van Assche wrote: > >> On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe@fusionio.com> wrote: > >>> On 2011-04-18 20:21, Bart Van Assche wrote: > >>>> a performance regression in the block layer not related to the md > >>>> issue. If I run a small block IOPS test on a block device created by > >>>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 > >>>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). > >>> > >>> That's not good. What's the test case? > >> > >> Nothing more than a fio IOPS test: > >> > >> fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread > >> --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G > >> --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1 > > > > Bart, can you try the below: > > Here's a more complete variant. James, lets get rid of this REENTER > crap. It's completely bogus and triggers falsely for a variety of > reasons. The below will work, but there may be room for improvement on > the SCSI side. > > diff --git a/block/blk-core.c b/block/blk-core.c > index 5fa3dd2..4e49665 100644 > --- a/block/blk-core.c > +++ b/block/blk-core.c > @@ -303,15 +303,7 @@ void __blk_run_queue(struct request_queue *q) > if (unlikely(blk_queue_stopped(q))) > return; > > - /* > - * Only recurse once to avoid overrunning the stack, let the unplug > - * handling reinvoke the handler shortly if we already got there. > - */ > - if (!queue_flag_test_and_set(QUEUE_FLAG_REENTER, q)) { > - q->request_fn(q); > - queue_flag_clear(QUEUE_FLAG_REENTER, q); > - } else > - queue_delayed_work(kblockd_workqueue, &q->delay_work, 0); > + q->request_fn(q); > } > EXPORT_SYMBOL(__blk_run_queue); > > @@ -328,6 +320,7 @@ void blk_run_queue_async(struct request_queue *q) > if (likely(!blk_queue_stopped(q))) > queue_delayed_work(kblockd_workqueue, &q->delay_work, 0); > } > +EXPORT_SYMBOL(blk_run_queue_async); > > /** > * blk_run_queue - run a single device queue > diff --git a/block/blk.h b/block/blk.h > index c9df8fc..6126346 100644 > --- a/block/blk.h > +++ b/block/blk.h > @@ -22,7 +22,6 @@ void blk_rq_timed_out_timer(unsigned long data); > void blk_delete_timer(struct request *); > void blk_add_timer(struct request *); > void __generic_unplug_device(struct request_queue *); > -void blk_run_queue_async(struct request_queue *q); > > /* > * Internal atomic flags for request handling > diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c > index ab55c2f..e9901b8 100644 > --- a/drivers/scsi/scsi_lib.c > +++ b/drivers/scsi/scsi_lib.c > @@ -411,8 +411,6 @@ static void scsi_run_queue(struct request_queue *q) > list_splice_init(&shost->starved_list, &starved_list); > > while (!list_empty(&starved_list)) { > - int flagset; > - > /* > * As long as shost is accepting commands and we have > * starved queues, call blk_run_queue. scsi_request_fn > @@ -435,20 +433,7 @@ static void scsi_run_queue(struct request_queue *q) > continue; > } > > - spin_unlock(shost->host_lock); > - > - spin_lock(sdev->request_queue->queue_lock); > - flagset = test_bit(QUEUE_FLAG_REENTER, &q->queue_flags) && > - !test_bit(QUEUE_FLAG_REENTER, > - &sdev->request_queue->queue_flags); > - if (flagset) > - queue_flag_set(QUEUE_FLAG_REENTER, sdev->request_queue); > - __blk_run_queue(sdev->request_queue); > - if (flagset) > - queue_flag_clear(QUEUE_FLAG_REENTER, sdev->request_queue); > - spin_unlock(sdev->request_queue->queue_lock); > - > - spin_lock(shost->host_lock); > + blk_run_queue_async(sdev->request_queue); > } > /* put any unprocessed entries back */ > list_splice(&starved_list, &shost->starved_list); > diff --git a/drivers/scsi/scsi_transport_fc.c b/drivers/scsi/scsi_transport_fc.c > index 28c3350..815069d 100644 > --- a/drivers/scsi/scsi_transport_fc.c > +++ b/drivers/scsi/scsi_transport_fc.c > @@ -3816,28 +3816,17 @@ fail_host_msg: > static void > fc_bsg_goose_queue(struct fc_rport *rport) > { > - int flagset; > - unsigned long flags; > - > if (!rport->rqst_q) > return; > > + /* > + * This get/put dance makes no sense > + */ > get_device(&rport->dev); > - > - spin_lock_irqsave(rport->rqst_q->queue_lock, flags); > - flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) && > - !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags); > - if (flagset) > - queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q); > - __blk_run_queue(rport->rqst_q); > - if (flagset) > - queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q); > - spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags); > - > + blk_run_queue_async(rport->rqst_q); > put_device(&rport->dev); > } > > - > /** > * fc_bsg_rport_dispatch - process rport bsg requests and dispatch to LLDD > * @q: rport request queue > diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h > index cbbfd98..2ad95fa 100644 > --- a/include/linux/blkdev.h > +++ b/include/linux/blkdev.h > @@ -388,20 +388,19 @@ struct request_queue > #define QUEUE_FLAG_SYNCFULL 3 /* read queue has been filled */ > #define QUEUE_FLAG_ASYNCFULL 4 /* write queue has been filled */ > #define QUEUE_FLAG_DEAD 5 /* queue being torn down */ > -#define QUEUE_FLAG_REENTER 6 /* Re-entrancy avoidance */ > -#define QUEUE_FLAG_ELVSWITCH 7 /* don't use elevator, just do FIFO */ > -#define QUEUE_FLAG_BIDI 8 /* queue supports bidi requests */ > -#define QUEUE_FLAG_NOMERGES 9 /* disable merge attempts */ > -#define QUEUE_FLAG_SAME_COMP 10 /* force complete on same CPU */ > -#define QUEUE_FLAG_FAIL_IO 11 /* fake timeout */ > -#define QUEUE_FLAG_STACKABLE 12 /* supports request stacking */ > -#define QUEUE_FLAG_NONROT 13 /* non-rotational device (SSD) */ > +#define QUEUE_FLAG_ELVSWITCH 6 /* don't use elevator, just do FIFO */ > +#define QUEUE_FLAG_BIDI 7 /* queue supports bidi requests */ > +#define QUEUE_FLAG_NOMERGES 8 /* disable merge attempts */ > +#define QUEUE_FLAG_SAME_COMP 9 /* force complete on same CPU */ > +#define QUEUE_FLAG_FAIL_IO 10 /* fake timeout */ > +#define QUEUE_FLAG_STACKABLE 11 /* supports request stacking */ > +#define QUEUE_FLAG_NONROT 12 /* non-rotational device (SSD) */ > #define QUEUE_FLAG_VIRT QUEUE_FLAG_NONROT /* paravirt device */ > -#define QUEUE_FLAG_IO_STAT 15 /* do IO stats */ > -#define QUEUE_FLAG_DISCARD 16 /* supports DISCARD */ > -#define QUEUE_FLAG_NOXMERGES 17 /* No extended merges */ > -#define QUEUE_FLAG_ADD_RANDOM 18 /* Contributes to random pool */ > -#define QUEUE_FLAG_SECDISCARD 19 /* supports SECDISCARD */ > +#define QUEUE_FLAG_IO_STAT 13 /* do IO stats */ > +#define QUEUE_FLAG_DISCARD 14 /* supports DISCARD */ > +#define QUEUE_FLAG_NOXMERGES 15 /* No extended merges */ > +#define QUEUE_FLAG_ADD_RANDOM 16 /* Contributes to random pool */ > +#define QUEUE_FLAG_SECDISCARD 17 /* supports SECDISCARD */ > > #define QUEUE_FLAG_DEFAULT ((1 << QUEUE_FLAG_IO_STAT) | \ > (1 << QUEUE_FLAG_STACKABLE) | \ > @@ -699,6 +698,7 @@ extern void blk_sync_queue(struct request_queue *q); > extern void __blk_stop_queue(struct request_queue *q); > extern void __blk_run_queue(struct request_queue *q); > extern void blk_run_queue(struct request_queue *); > +extern void blk_run_queue_async(struct request_queue *q); > extern int blk_rq_map_user(struct request_queue *, struct request *, > struct rq_map_data *, void __user *, unsigned long, > gfp_t); Hello Jens, The same test with an initiator running 2.6.39-rc4 + git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch yields about 155.000 IOPS on my test setup, or the same performance as with 2.6.38.3. I'm running the above patch through an I/O stress test now. Bart. ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-04-19 16:32 ` Linus Torvalds [not found] ` <BANLkTikzRE7sA8_NNgVUpOqR_zGtw+YUvQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-19 17:43 ` Jens Axboe 1 sibling, 1 reply; 41+ messages in thread From: Linus Torvalds @ 2011-04-19 16:32 UTC (permalink / raw) To: Bart Van Assche Cc: Jens Axboe, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown, David Dillow On Tue, Apr 19, 2011 at 9:13 AM, Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote: > > The same test with an initiator running 2.6.39-rc4 + > git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch > yields about 155.000 IOPS on my test setup, or the same performance as > with 2.6.38.3. I'm running the above patch through an I/O stress test > now. Goodie. So not only does that patch get back the 11%, it removes the crazy QUEUE_FLAG_REENTER flag that was broken to begin with. AND it removes a number of complicated lines. Halleluja. Linus ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTikzRE7sA8_NNgVUpOqR_zGtw+YUvQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTikzRE7sA8_NNgVUpOqR_zGtw+YUvQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-04-19 17:43 ` Jens Axboe 0 siblings, 0 replies; 41+ messages in thread From: Jens Axboe @ 2011-04-19 17:43 UTC (permalink / raw) To: Linus Torvalds Cc: Bart Van Assche, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown, David Dillow On 2011-04-19 18:32, Linus Torvalds wrote: > On Tue, Apr 19, 2011 at 9:13 AM, Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote: >> >> The same test with an initiator running 2.6.39-rc4 + >> git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch >> yields about 155.000 IOPS on my test setup, or the same performance as >> with 2.6.38.3. I'm running the above patch through an I/O stress test >> now. > > Goodie. So not only does that patch get back the 11%, it removes the > crazy QUEUE_FLAG_REENTER flag that was broken to begin with. AND it > removes a number of complicated lines. > > Halleluja. Indeed, coming your way soonish. -- Jens Axboe ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-19 16:32 ` Linus Torvalds @ 2011-04-19 17:43 ` Jens Axboe 1 sibling, 0 replies; 41+ messages in thread From: Jens Axboe @ 2011-04-19 17:43 UTC (permalink / raw) To: Bart Van Assche Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown, David Dillow On 2011-04-19 18:13, Bart Van Assche wrote: > The same test with an initiator running 2.6.39-rc4 + > git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch > yields about 155.000 IOPS on my test setup, or the same performance as > with 2.6.38.3. I'm running the above patch through an I/O stress test > now. OK, so parity, that's good. With the above patch, I can take a single device from ~400K IOPS on 2.6.38 to ~440K IOPS on 2.6.39-rc4+patches. -- Jens Axboe ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org> @ 2011-04-19 16:48 ` Christoph Hellwig [not found] ` <20110419164815.GA30616-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Christoph Hellwig @ 2011-04-19 16:48 UTC (permalink / raw) To: Jens Axboe Cc: Bart Van Assche, Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown > + blk_run_queue_async(sdev->request_queue); This doesn't even have to be async except when scsi drivers call cmd->scsi_done directly. It seems like if this always went through the softirq (or kblockd) we could still run it in context for the others. > + /* > + * This get/put dance makes no sense > + */ > get_device(&rport->dev); > - > - spin_lock_irqsave(rport->rqst_q->queue_lock, flags); > - flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) && > - !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags); > - if (flagset) > - queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q); > - __blk_run_queue(rport->rqst_q); > - if (flagset) > - queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q); > - spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags); > - > + blk_run_queue_async(rport->rqst_q); And the QUEUE_FLAG_REENTER mess here never made sense either as it tested for a bit beeing set and not set at the same time. So this one actually should be able to be replaced by a plain blk_run_queue. ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <20110419164815.GA30616-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <20110419164815.GA30616-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org> @ 2011-04-19 17:06 ` Jens Axboe 0 siblings, 0 replies; 41+ messages in thread From: Jens Axboe @ 2011-04-19 17:06 UTC (permalink / raw) To: Christoph Hellwig Cc: Bart Van Assche, Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On 2011-04-19 18:48, Christoph Hellwig wrote: >> + blk_run_queue_async(sdev->request_queue); > > This doesn't even have to be async except when scsi drivers call > cmd->scsi_done directly. It seems like if this always went through the > softirq (or kblockd) we could still run it in context for the others. Exactly. I'll pass an 'optimize' patch past James. >> + /* >> + * This get/put dance makes no sense >> + */ >> get_device(&rport->dev); >> - >> - spin_lock_irqsave(rport->rqst_q->queue_lock, flags); >> - flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) && >> - !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags); >> - if (flagset) >> - queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q); >> - __blk_run_queue(rport->rqst_q); >> - if (flagset) >> - queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q); >> - spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags); >> - >> + blk_run_queue_async(rport->rqst_q); > > And the QUEUE_FLAG_REENTER mess here never made sense either as it > tested for a bit beeing set and not set at the same time. So this one > actually should be able to be replaced by a plain blk_run_queue. Yep, it's completely broken as-is. -- Jens Axboe ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-18 18:28 ` Jens Axboe @ 2011-04-19 3:32 ` David Dillow [not found] ` <1303183934.2585.12.camel-1q1vX8mYZiGLUyTwlgNVppKKF0rrzTr+@public.gmane.org> 1 sibling, 1 reply; 41+ messages in thread From: David Dillow @ 2011-04-19 3:32 UTC (permalink / raw) To: Bart Van Assche Cc: Jens Axboe, Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On Mon, 2011-04-18 at 20:21 +0200, Bart Van Assche wrote: > On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: > > Bart, can you try and pull: > > > > git://git.kernel.dk/linux-2.6-block.git for-linus > > > > into Linus' tree and see if that works? This has, among other things, > > Neils fixes for MD. > > md seems to work stable with the resulting tree, but it looks there is > a performance regression in the block layer not related to the md > issue. If I run a small block IOPS test on a block device created by > ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 > (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). The mapping code for ib_srp changed in 2.6.39-rc1, but it showed improved IOPS for a similar setup in my testing so I'd be surprised if it is the culprit. Still, it wouldn't hurt to check. Do you have time to try the new ib_srp code with 2.6.38.3 to eliminate it from the equation? Thanks, Dave ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <1303183934.2585.12.camel-1q1vX8mYZiGLUyTwlgNVppKKF0rrzTr+@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <1303183934.2585.12.camel-1q1vX8mYZiGLUyTwlgNVppKKF0rrzTr+@public.gmane.org> @ 2011-04-19 16:39 ` Bart Van Assche 0 siblings, 0 replies; 41+ messages in thread From: Bart Van Assche @ 2011-04-19 16:39 UTC (permalink / raw) To: David Dillow Cc: Jens Axboe, Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown On Tue, Apr 19, 2011 at 5:32 AM, David Dillow <dave-i1Mk8JYDVaaSihdK6806/g@public.gmane.org> wrote: > > On Mon, 2011-04-18 at 20:21 +0200, Bart Van Assche wrote: > > On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote: > > > Bart, can you try and pull: > > > > > > git://git.kernel.dk/linux-2.6-block.git for-linus > > > > > > into Linus' tree and see if that works? This has, among other things, > > > Neils fixes for MD. > > > > md seems to work stable with the resulting tree, but it looks there is > > a performance regression in the block layer not related to the md > > issue. If I run a small block IOPS test on a block device created by > > ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3 > > (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+). > > The mapping code for ib_srp changed in 2.6.39-rc1, but it showed > improved IOPS for a similar setup in my testing so I'd be surprised if > it is the culprit. Still, it wouldn't hurt to check. Do you have time to > try the new ib_srp code with 2.6.38.3 to eliminate it from the equation? Hello Dave, I just ran a test with the most important 2.6.39-specific ib_srp commits reverted but that didn't yield a measurable performance difference for this specific test: $ git show --format=format:%s 7f9e5c48c1078507747434d4c182ab10925bf98a be8b981453a4904399cb090c1660618e250092d8 c07d424d6118d528ef71b22b7424bfc359c307a5 8f26c9ff9cd0317ad867bce972f69e0c6c2cbe3c 961e0be89a5120a1409ebc525cca6f603615a8a8 8c4037b501acd2ec3abc7925e66af8af40a2da9d | grep '^IB' IB: Increase DMA max_segment_size on Mellanox hardware IB/srp: try to use larger FMR sizes to cover our mappings IB/srp: add support for indirect tables that don't fit in SRP_CMD IB/srp: rework mapping engine to use multiple FMR entries IB/srp: move IB CM setup completion into its own function IB/srp: always avoid non-zero offsets into an FMR Bart. ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #33242] Lockdep splat in autofs with 2.6.39-rc2 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (4 preceding siblings ...) 2011-04-17 12:57 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33102] File's copied from client->linux server only copy 1st 64K data;rest is lost Rafael J. Wysocki ` (4 subsequent siblings) 10 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Nick Bowler This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33242 Subject : Lockdep splat in autofs with 2.6.39-rc2 Submitter : Nick Bowler <nbowler-7BP4RkwGw0uXmMXjJBpWqg@public.gmane.org> Date : 2011-04-07 19:44 (11 days old) Message-ID : <20110407194403.GA29404-7BP4RkwGw0uXmMXjJBpWqg@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130220545614682&w=2 ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #33102] File's copied from client->linux server only copy 1st 64K data;rest is lost 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (5 preceding siblings ...) 2011-04-17 12:57 ` [Bug #33242] Lockdep splat in autofs with 2.6.39-rc2 Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33142] 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Rafael J. Wysocki ` (3 subsequent siblings) 10 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Greg Kroah-Hartman, H. Peter Anvin, Konrad Rzeszutek Wilk, Linda Walsh, Stefano Stabellini This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33102 Subject : File's copied from client->linux server only copy 1st 64K data;rest is lost Submitter : Linda Walsh <lkml@tlinx.org> Date : 2011-04-11 22:12 (7 days old) ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #33142] 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (6 preceding siblings ...) 2011-04-17 12:57 ` [Bug #33102] File's copied from client->linux server only copy 1st 64K data;rest is lost Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33272] drm related hard-hang Rafael J. Wysocki ` (2 subsequent siblings) 10 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Keith Packard This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33142 Subject : 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Submitter : Keith Packard <keithp@keithp.com> Date : 2011-04-06 7:44 (12 days old) Message-ID : <yun1v1fj024.fsf@aiko.keithp.com> References : http://marc.info/?l=linux-kernel&m=130207593728273&w=2 ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #33272] drm related hard-hang 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (7 preceding siblings ...) 2011-04-17 12:57 ` [Bug #33142] 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki [not found] ` <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ@mail.gmail.com> 2011-04-17 12:57 ` [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33342] [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive Rafael J. Wysocki 10 siblings, 1 reply; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Peter Teoh This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33272 Subject : drm related hard-hang Submitter : Peter Teoh <htmldeveloper-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-14 01:29 (4 days old) ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ@mail.gmail.com>]
[parent not found: <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #33272] drm related hard-hang [not found] ` <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-04-17 18:22 ` Rafael J. Wysocki 0 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 18:22 UTC (permalink / raw) To: Peter Teoh; +Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler On Sunday, April 17, 2011, Peter Teoh wrote: > oh yes, as shown in the bug report, the version at linus-git tree is not > working as of 14 Apr 2011, Thanks for the update. Rafael > On Sun, Apr 17, 2011 at 8:57 PM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote: > > > This message has been generated automatically as a part of a summary report > > of recent regressions. > > > > The following bug entry is on the current list of known regressions > > from 2.6.38. Please verify if it still should be listed and let the > > tracking team > > know (either way). > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33272 > > Subject : drm related hard-hang > > Submitter : Peter Teoh <htmldeveloper-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> > > Date : 2011-04-14 01:29 (4 days old) > > > > > > > > > ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (8 preceding siblings ...) 2011-04-17 12:57 ` [Bug #33272] drm related hard-hang Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 2011-04-17 13:05 ` Cyrill Gorcunov 2011-04-17 12:57 ` [Bug #33342] [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive Rafael J. Wysocki 10 siblings, 1 reply; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Cyrill Gorcunov, Don Zickus, Ingo Molnar, Shaun Ruffell This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252 Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org> Date : 2011-04-06 22:30 (12 days old) First-Bad-Commit: http://git.kernel.org/linus/242214f9c1eeaae40eca11e3b4d37bfce960a7cd Message-ID : <20110406223036.GA15721-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130212907032580&w=2 Handled-By : Don Zickus <dzickus-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> Patch : http://cache.gmane.org//gmane/linux/kernel/1125621-001.bin ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs 2011-04-17 12:57 ` [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Rafael J. Wysocki @ 2011-04-17 13:05 ` Cyrill Gorcunov [not found] ` <4DAAE5AA.1090302-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> 0 siblings, 1 reply; 41+ messages in thread From: Cyrill Gorcunov @ 2011-04-17 13:05 UTC (permalink / raw) To: Rafael J. Wysocki Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Don Zickus, Ingo Molnar, Shaun Ruffell On 04/17/2011 04:57 PM, Rafael J. Wysocki wrote: > This message has been generated automatically as a part of a summary report > of recent regressions. > > The following bug entry is on the current list of known regressions > from 2.6.38. Please verify if it still should be listed and let the tracking team > know (either way). > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252 > Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs > Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org> > Date : 2011-04-06 22:30 (12 days old) > First-Bad-Commit: http://git.kernel.org/linus/242214f9c1eeaae40eca11e3b4d37bfce960a7cd > Message-ID : <20110406223036.GA15721-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org> > References : http://marc.info/?l=linux-kernel&m=130212907032580&w=2 > Handled-By : Don Zickus <dzickus-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> > Patch : http://cache.gmane.org//gmane/linux/kernel/1125621-001.bin > > We're working on it, patch is almost done. I guess it'll be published next week. -- Cyrill ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <4DAAE5AA.1090302-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>]
* Re: [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs [not found] ` <4DAAE5AA.1090302-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> @ 2011-04-17 13:31 ` Rafael J. Wysocki 0 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 13:31 UTC (permalink / raw) To: Cyrill Gorcunov Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Don Zickus, Ingo Molnar, Shaun Ruffell On Sunday, April 17, 2011, Cyrill Gorcunov wrote: > On 04/17/2011 04:57 PM, Rafael J. Wysocki wrote: > > This message has been generated automatically as a part of a summary report > > of recent regressions. > > > > The following bug entry is on the current list of known regressions > > from 2.6.38. Please verify if it still should be listed and let the tracking team > > know (either way). > > > > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252 > > Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs > > Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org> > > Date : 2011-04-06 22:30 (12 days old) > > > > > > We're working on it, patch is almost done. I guess it'll be published next week. Great, thanks! Please let me know when the patch makes it to the Linus' tree. Rafael ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #33342] [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive. 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki ` (9 preceding siblings ...) 2011-04-17 12:57 ` [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Rafael J. Wysocki @ 2011-04-17 12:57 ` Rafael J. Wysocki 10 siblings, 0 replies; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Shaun Ruffell This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33342 Subject : [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive. Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org> Date : 2011-04-08 20:15 (10 days old) Message-ID : <20110408201513.GA3040-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130229371907209&w=2 ^ permalink raw reply [flat|nested] 41+ messages in thread
* 2.6.39-rc5-git4: Reported regressions from 2.6.38 @ 2011-04-30 19:42 Rafael J. Wysocki 2011-04-30 19:42 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki 0 siblings, 1 reply; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-30 19:42 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Maciej Rutecki, Florian Mickler, Andrew Morton, Linus Torvalds, Kernel Testers List, Network Development, Linux ACPI, Linux PM List, Linux SCSI List, Linux Wireless List, DRI [NOTE: I have one request for whoever works on fixing the listed regressions. While your work is highly appreciated anyway, if you have a patch fixing a listed regression or you know of a commit fixing a listed regression, please drop a notice into the corresponding Bugzilla entry. This will help us a lot.] This message contains a list of some regressions from 2.6.38, for which there are no fixes in the mainline known to the tracking team. If any of them have been fixed already, please let us know. If you know of any other unresolved regressions from 2.6.38, please let us know either and we'll add them to the list. Also, please let us know if any of the entries below are invalid. Each entry from the list will be sent additionally in an automatic reply to this message with CCs to the people involved in reporting and handling the issue. Listed regressions statistics: Date Total Pending Unresolved ---------------------------------------- 2011-04-30 38 17 16 2011-04-17 17 11 10 Unresolved regressions ---------------------- Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=34012 Subject : 2.6.39-rc4+: oom-killer busy killing tasks Submitter : Christian Kujau <lists-AanptEQQ3TL9uQeqpI+JUg@public.gmane.org> Date : 2011-04-22 1:57 (9 days old) Message-ID : <alpine.DEB.2.01.1104211841510.18728-yMVncJ7Z0T9hhup4ARC/jw@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130343744622331&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=34002 Subject : [REGRESSION] [2.6.39-rc3] Wrong resolution in framebuffer and X Window Submitter : Maciej Rutecki <maciej.rutecki-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-17 16:04 (14 days old) Message-ID : <201104171804.04664.maciej.rutecki-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> References : http://marc.info/?l=linux-fbdev&m=130305625114863&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33842 Subject : NULL pointer dereference in ip_fragment Submitter : Tomas Carnecky <tom-ix/z9KG6G0Jl57MIdRCFDg@public.gmane.org> Date : 2011-04-23 07:51 (8 days old) Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33812 Subject : radeon with modeset option causes oops as secondary adapter (vgaswitcheroo) Submitter : Johannes Engel <jcnengel-gM/Ye1E23mwN+BqQ9rBEUg@public.gmane.org> Date : 2011-04-21 23:14 (10 days old) Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33792 Subject : lockdep trace when unplugging usb audio (.39rc4) Submitter : Dave Jones <davej-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> Date : 2011-04-19 18:07 (12 days old) Message-ID : <20110419180745.GA438-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130323648920431&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33492 Subject : [bug] 2.6.39-rc3 fails to boot on Alpha Submitter : Mikael Pettersson <mikpe-1zs4UD6AkMk@public.gmane.org> Date : 2011-04-17 15:13 (14 days old) Message-ID : <19883.912.266127.538215-tgku4HJDRZih8lFjZTKsyTAV6s6igYVG@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130305321212360&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33482 Subject : [Regression] Bluetooth pairing does not work anymore in 2.6.39-rc3 (works in 2.6.38.3) Submitter : Gottfried Haider <gottfried.haider-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-16 16:38 (15 days old) Message-ID : <BANLkTi=LvyZ+7BHfVL849pztfvsYaVM4SQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130297197128328&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33432 Subject : WARNING at libata-core.c:5015 in 2.6.39-rc3-wl+, then lockup. Submitter : Ben Greear <greearb-my8/4N5VtI7c+919tysfdA@public.gmane.org> Date : 2011-04-13 16:29 (18 days old) Message-ID : <4DA5CF81.4070908-my8/4N5VtI7c+919tysfdA@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130271220809443&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33422 Subject : oops in radeon_ddc_get_modes on first boot of rc3 Submitter : Steve French <smfrench-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-12 17:54 (19 days old) Message-ID : <BANLkTikTVuV9eH4GiLDs0DewGJXo_3daeg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130263086322702&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33402 Subject : [regression] 2.6.39-rc[1-3] fail to boot on G5 PowerMac Submitter : Mikael Pettersson <mikpe-1zs4UD6AkMk@public.gmane.org> Date : 2011-04-12 17:30 (19 days old) Message-ID : <19876.35918.322649.256455-tgku4HJDRZih8lFjZTKsyTAV6s6igYVG@public.gmane.org> References : http://marc.info/?l=linuxppc-embedded&m=130263128523237&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33302 Subject : ACPI reboot broken on HP Compaq nx7300. Submitter : Alexey Zaytsev <alexey.zaytsev-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-15 14:44 (16 days old) First-Bad-Commit: http://git.kernel.org/linus/3d35ac346e981162eeba391e496faceed4753e7b Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33272 Subject : drm related hard-hang Submitter : Peter Teoh <htmldeveloper-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-14 01:29 (17 days old) Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33242 Subject : Lockdep splat in autofs with 2.6.39-rc2 Submitter : Nick Bowler <nbowler-7BP4RkwGw0uXmMXjJBpWqg@public.gmane.org> Date : 2011-04-07 19:44 (24 days old) Message-ID : <20110407194403.GA29404-7BP4RkwGw0uXmMXjJBpWqg@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130220545614682&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33142 Subject : 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Submitter : Keith Packard <keithp-aN4HjG94KOLQT0dZR+AlfA@public.gmane.org> Date : 2011-04-06 7:44 (25 days old) Message-ID : <yun1v1fj024.fsf-XEHByDMaNQOz9DMzp4kqnw@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130207593728273&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33092 Subject : [regression] 2.6.39-rc1 - Beagleboard usbnet broken Submitter : Mark Jackson <mpfj-list-kZtEnBLzDKq1Qrn1Bg8BZw@public.gmane.org> Date : 2011-04-04 9:22 (27 days old) First-Bad-Commit: http://git.kernel.org/linus/087809fce28f50098d9c3ef1a6865c722f23afd2 Message-ID : <4D998DC9.3040109-kZtEnBLzDKq1Qrn1Bg8BZw@public.gmane.org> References : http://marc.info/?l=linux-kernel&m=130191386508831&w=2 Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 Subject : Kernel locks up a few minutes after boot Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-10 19:55 (21 days old) Regressions with patches ------------------------ Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33802 Subject : list_del corruption in sd driver since 2.6.39-rc4 Submitter : Christian Casteyde <casteyde.christian-GANU6spQydw@public.gmane.org> Date : 2011-04-21 21:10 (10 days old) Handled-By : James Bottomley <James.Bottomley-l3A5Bk7waGM@public.gmane.org> Patch : http://marc.info/?l=linux-kernel&m=130271409412095 For details, please visit the bug entries and follow the links given in references. As you can see, there is a Bugzilla entry for each of the listed regressions. There also is a Bugzilla entry used for tracking the regressions from 2.6.38, unresolved as well as resolved, at: http://bugzilla.kernel.org/show_bug.cgi?id=32012 Please let the tracking team know if there are any Bugzilla entries that should be added to the list in there. Thanks! ^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #32982] Kernel locks up a few minutes after boot 2011-04-30 19:42 2.6.39-rc5-git4: Reported regressions from 2.6.38 Rafael J. Wysocki @ 2011-04-30 19:42 ` Rafael J. Wysocki 2011-04-30 19:51 ` Linus Torvalds 0 siblings, 1 reply; 41+ messages in thread From: Rafael J. Wysocki @ 2011-04-30 19:42 UTC (permalink / raw) To: Linux Kernel Mailing List Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Bart Van Assche, Linus Torvalds This message has been generated automatically as a part of a summary report of recent regressions. The following bug entry is on the current list of known regressions from 2.6.38. Please verify if it still should be listed and let the tracking team know (either way). Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 Subject : Kernel locks up a few minutes after boot Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> Date : 2011-04-10 19:55 (21 days old) ^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot 2011-04-30 19:42 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki @ 2011-04-30 19:51 ` Linus Torvalds [not found] ` <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA@mail.gmail.com> 0 siblings, 1 reply; 41+ messages in thread From: Linus Torvalds @ 2011-04-30 19:51 UTC (permalink / raw) To: Rafael J. Wysocki Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler, Bart Van Assche I think we had all assumed that this was the MD problem that should have been fixed in rc4 (the symptoms matched), but I don't think we got any confirmation from Bart on that. Bart? Does the problem still persist in current -git? Linus On Sat, Apr 30, 2011 at 12:42 PM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote: > This message has been generated automatically as a part of a summary report > of recent regressions. > > The following bug entry is on the current list of known regressions > from 2.6.38. Please verify if it still should be listed and let the tracking team > know (either way). > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982 > Subject : Kernel locks up a few minutes after boot > Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> > Date : 2011-04-10 19:55 (21 days old) > > > ^ permalink raw reply [flat|nested] 41+ messages in thread
[parent not found: <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA@mail.gmail.com>]
[parent not found: <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>]
* Re: [Bug #32982] Kernel locks up a few minutes after boot [not found] ` <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> @ 2011-05-01 17:01 ` Linus Torvalds 0 siblings, 0 replies; 41+ messages in thread From: Linus Torvalds @ 2011-05-01 17:01 UTC (permalink / raw) To: Bart Van Assche Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki, Florian Mickler On Sun, May 1, 2011 at 2:55 AM, Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote: > > There is something else and completely unrelated that is puzzling me though: > on two almost identical systems one always recognizes all internal PCIe > cards but the other system not. This is something that seldom happened with > 2.6.34 but happens frequently with 2.6.38 and 2.6.39-rcx. What I see is that > during boot either both InfiniBand PCIe cards are recognized or that one > specific card is not recognized and even doesn't show up in the lspci > output. A BIOS upgrade didn't help. Any idea where I should start looking to > find the cause of this issue ? So it has happened sporadically before, but happens much more commonly now? That very much implies some timing issue in PCI probing. It could be, for example, that the card has a very slow reset sequence, and doesn't respond to PCI config cycles until it has internally booted fully. If so, a faster boot by the kernel might just cause the Linux PCI enumeration to be done before the card is ready. (That's a really unlikely scenario - I'm not seriously suggesting that the card would be quite <i>that</i> stupid and slow. But there might be similar issues at a much lower level, ie if the Linux pcie port driver might be resetting the port and then trying to read the card too quickly afterwards, and you'd want some added delay there). Have you tried it "pcie_ports=compat" (or "native") makes any difference? But you should probably contact Jesse Barnes and the linux-pci mailing list and see if anybody has any smarter ideas. Linus ^ permalink raw reply [flat|nested] 41+ messages in thread
end of thread, other threads:[~2011-05-01 17:01 UTC | newest] Thread overview: 41+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki 2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki 2011-04-17 13:16 ` Pete Clements [not found] ` <201104171316.p3HDGI8Y018237-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org> 2011-04-17 13:28 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Rafael J. Wysocki 2011-04-18 15:34 ` Alex Romosan [not found] ` <87ipub7evl.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org> 2011-04-18 21:22 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #32892] 2.6.39-rc1 data corruption with rtorrent Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33092] [regression] 2.6.39-rc1 - Beagleboard usbnet broken Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki 2011-04-17 17:03 ` Linus Torvalds [not found] ` <BANLkTimeaHwBiZoz=isCM=qtBxoxaBqDUw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-17 18:37 ` Bart Van Assche [not found] ` <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-17 21:07 ` NeilBrown [not found] ` <20110418070711.12dd5cc1-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org> 2011-04-17 22:20 ` NeilBrown 2011-04-18 11:44 ` Jens Axboe [not found] ` <4DAC2429.5000105-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 2011-04-18 18:21 ` Bart Van Assche [not found] ` <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-18 18:28 ` Jens Axboe [not found] ` <4DAC82E6.3020809-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 2011-04-18 18:32 ` Bart Van Assche [not found] ` <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-18 18:38 ` Jens Axboe 2011-04-19 9:09 ` Jens Axboe [not found] ` <4DAD5156.2050300-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 2011-04-19 11:16 ` Jens Axboe 2011-04-19 16:13 ` Bart Van Assche [not found] ` <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-19 16:32 ` Linus Torvalds [not found] ` <BANLkTikzRE7sA8_NNgVUpOqR_zGtw+YUvQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-19 17:43 ` Jens Axboe 2011-04-19 17:43 ` Jens Axboe [not found] ` <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org> 2011-04-19 16:48 ` Christoph Hellwig [not found] ` <20110419164815.GA30616-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org> 2011-04-19 17:06 ` Jens Axboe 2011-04-19 3:32 ` David Dillow [not found] ` <1303183934.2585.12.camel-1q1vX8mYZiGLUyTwlgNVppKKF0rrzTr+@public.gmane.org> 2011-04-19 16:39 ` Bart Van Assche 2011-04-17 12:57 ` [Bug #33242] Lockdep splat in autofs with 2.6.39-rc2 Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33102] File's copied from client->linux server only copy 1st 64K data;rest is lost Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33142] 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33272] drm related hard-hang Rafael J. Wysocki [not found] ` <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ@mail.gmail.com> [not found] ` <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-04-17 18:22 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Rafael J. Wysocki 2011-04-17 13:05 ` Cyrill Gorcunov [not found] ` <4DAAE5AA.1090302-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> 2011-04-17 13:31 ` Rafael J. Wysocki 2011-04-17 12:57 ` [Bug #33342] [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive Rafael J. Wysocki -- strict thread matches above, loose matches on Subject: below -- 2011-04-30 19:42 2.6.39-rc5-git4: Reported regressions from 2.6.38 Rafael J. Wysocki 2011-04-30 19:42 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki 2011-04-30 19:51 ` Linus Torvalds [not found] ` <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA@mail.gmail.com> [not found] ` <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2011-05-01 17:01 ` Linus Torvalds
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).