* 2.6.39-rc3-git7: Reported regressions from 2.6.38
@ 2011-04-17 12:52 Rafael J. Wysocki
2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki
` (10 more replies)
0 siblings, 11 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:52 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Maciej Rutecki, Florian Mickler, Andrew Morton, Linus Torvalds,
Kernel Testers List, Network Development, Linux ACPI,
Linux PM List, Linux SCSI List, Linux Wireless List, DRI
This message contains a list of some regressions from 2.6.38,
for which there are no fixes in the mainline known to the tracking team.
If any of them have been fixed already, please let us know.
If you know of any other unresolved regressions from 2.6.38, please let us
know as well and we'll add them to the list. Also, please let us know
if any of the entries below are invalid.
Each entry from the list will be sent additionally in an automatic reply
to this message with CCs to the people involved in reporting and handling
the issue.
Listed regressions statistics:
  Date          Total  Pending  Unresolved
  ----------------------------------------
  2011-04-17       17       11          10
Unresolved regressions
----------------------
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33342
Subject : [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive.
Submitter : Shaun Ruffell <sruffell@digium.com>
Date : 2011-04-08 20:15 (10 days old)
Message-ID : <20110408201513.GA3040@digium.com>
References : http://marc.info/?l=linux-kernel&m=130229371907209&w=2
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33272
Subject : drm related hard-hang
Submitter : Peter Teoh <htmldeveloper@gmail.com>
Date : 2011-04-14 01:29 (4 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33242
Subject : Lockdep splat in autofs with 2.6.39-rc2
Submitter : Nick Bowler <nbowler@elliptictech.com>
Date : 2011-04-07 19:44 (11 days old)
Message-ID : <20110407194403.GA29404@elliptictech.com>
References : http://marc.info/?l=linux-kernel&m=130220545614682&w=2
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33142
Subject : 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716
Submitter : Keith Packard <keithp@keithp.com>
Date : 2011-04-06 7:44 (12 days old)
Message-ID : <yun1v1fj024.fsf@aiko.keithp.com>
References : http://marc.info/?l=linux-kernel&m=130207593728273&w=2
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33102
Subject : File's copied from client->linux server only copy 1st 64K data;rest is lost
Submitter : Linda Walsh <lkml@tlinx.org>
Date : 2011-04-11 22:12 (7 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33092
Subject : [regression] 2.6.39-rc1 - Beagleboard usbnet broken
Submitter : Mark Jackson <mpfj-list@mimc.co.uk>
Date : 2011-04-04 9:22 (14 days old)
First-Bad-Commit: http://git.kernel.org/linus/087809fce28f50098d9c3ef1a6865c722f23afd2
Message-ID : <4D998DC9.3040109@mimc.co.uk>
References : http://marc.info/?l=linux-kernel&m=130191386508831&w=2
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
Subject : Kernel locks up a few minutes after boot
Submitter : Bart Van Assche <bart.vanassche@gmail.com>
Date : 2011-04-10 19:55 (8 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902
Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64
Submitter : Alex Romosan <romosan@sycorax.lbl.gov>
Date : 2011-04-03 19:41 (15 days old)
Message-ID : <87k4fbnmw8.fsf@sycorax.lbl.gov>
References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32892
Subject : 2.6.39-rc1 data corruption with rtorrent
Submitter : Jindrich Makovicka <makovick@gmail.com>
Date : 2011-04-02 20:21 (16 days old)
Message-ID : <20110402222118.3b5c2fa8@holly>
References : http://marc.info/?l=linux-kernel&m=130177570309226&w=2
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32262
Subject : 2.6.38-git15+ IDE hangs boot
Submitter : Pete Clements <clem@clem.clem-digital.net>
Date : 2011-03-25 15:38 (24 days old)
Message-ID : <201103251538.p2PFc11i001674@clem.clem-digital.net>
References : http://marc.info/?l=linux-kernel&m=130106749313695&w=2
Regressions with patches
------------------------
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252
Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs
Submitter : Shaun Ruffell <sruffell@digium.com>
Date : 2011-04-06 22:30 (12 days old)
First-Bad-Commit: http://git.kernel.org/linus/242214f9c1eeaae40eca11e3b4d37bfce960a7cd
Message-ID : <20110406223036.GA15721@digium.com>
References : http://marc.info/?l=linux-kernel&m=130212907032580&w=2
Handled-By : Don Zickus <dzickus@redhat.com>
Patch : http://cache.gmane.org//gmane/linux/kernel/1125621-001.bin
For details, please visit the bug entries and follow the links given in
references.
As you can see, there is a Bugzilla entry for each of the listed regressions.
There also is a Bugzilla entry used for tracking the regressions from 2.6.38,
unresolved as well as resolved, at:
http://bugzilla.kernel.org/show_bug.cgi?id=32012
Please let the tracking team know if there are any Bugzilla entries that
should be added to the list in there.
Thanks!
^ permalink raw reply [flat|nested] 41+ messages in thread
* [Bug #32262] 2.6.38-git15+ IDE hangs boot
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
@ 2011-04-17 12:52 ` Rafael J. Wysocki
2011-04-17 13:16 ` Pete Clements
2011-04-17 12:57 ` [Bug #33092] [regression] 2.6.39-rc1 - Beagleboard usbnet broken Rafael J. Wysocki
` (9 subsequent siblings)
10 siblings, 1 reply; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:52 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Pete Clements
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32262
Subject : 2.6.38-git15+ IDE hangs boot
Submitter : Pete Clements <clem-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org>
Date : 2011-03-25 15:38 (24 days old)
Message-ID : <201103251538.p2PFc11i001674-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org>
References : http://marc.info/?l=linux-kernel&m=130106749313695&w=2
* [Bug #32982] Kernel locks up a few minutes after boot
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki
2011-04-17 12:57 ` [Bug #33092] [regression] 2.6.39-rc1 - Beagleboard usbnet broken Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-17 17:03 ` Linus Torvalds
2011-04-17 12:57 ` [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Rafael J. Wysocki
` (7 subsequent siblings)
10 siblings, 1 reply; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Bart Van Assche, Linus Torvalds
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
Subject : Kernel locks up a few minutes after boot
Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2011-04-10 19:55 (8 days old)
* [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (2 preceding siblings ...)
2011-04-17 12:57 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-18 15:34 ` Alex Romosan
2011-04-17 12:57 ` [Bug #32892] 2.6.39-rc1 data corruption with rtorrent Rafael J. Wysocki
` (6 subsequent siblings)
10 siblings, 1 reply; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Alex Romosan
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902
Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64
Submitter : Alex Romosan <romosan-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org>
Date : 2011-04-03 19:41 (15 days old)
Message-ID : <87k4fbnmw8.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org>
References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2
* [Bug #33092] [regression] 2.6.39-rc1 - Beagleboard usbnet broken
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-17 12:57 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki
` (8 subsequent siblings)
10 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Dan Williams, Jamie Iles, Jamie Iles, Mark Jackson
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33092
Subject : [regression] 2.6.39-rc1 - Beagleboard usbnet broken
Submitter : Mark Jackson <mpfj-list-kZtEnBLzDKq1Qrn1Bg8BZw@public.gmane.org>
Date : 2011-04-04 9:22 (14 days old)
First-Bad-Commit: http://git.kernel.org/linus/087809fce28f50098d9c3ef1a6865c722f23afd2
Message-ID : <4D998DC9.3040109-kZtEnBLzDKq1Qrn1Bg8BZw@public.gmane.org>
References : http://marc.info/?l=linux-kernel&m=130191386508831&w=2
* [Bug #32892] 2.6.39-rc1 data corruption with rtorrent
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (3 preceding siblings ...)
2011-04-17 12:57 ` [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-17 12:57 ` [Bug #33142] 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Rafael J. Wysocki
` (5 subsequent siblings)
10 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Jindrich Makovicka
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32892
Subject : 2.6.39-rc1 data corruption with rtorrent
Submitter : Jindrich Makovicka <makovick@gmail.com>
Date : 2011-04-02 20:21 (16 days old)
Message-ID : <20110402222118.3b5c2fa8@holly>
References : http://marc.info/?l=linux-kernel&m=130177570309226&w=2
* [Bug #33272] drm related hard-hang
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (5 preceding siblings ...)
2011-04-17 12:57 ` [Bug #33142] 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716 Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
[not found] ` <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ@mail.gmail.com>
2011-04-17 12:57 ` [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Rafael J. Wysocki
` (3 subsequent siblings)
10 siblings, 1 reply; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Peter Teoh
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33272
Subject : drm related hard-hang
Submitter : Peter Teoh <htmldeveloper-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2011-04-14 01:29 (4 days old)
* [Bug #33102] File's copied from client->linux server only copy 1st 64K data;rest is lost
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (7 preceding siblings ...)
2011-04-17 12:57 ` [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-17 12:57 ` [Bug #33242] Lockdep splat in autofs with 2.6.39-rc2 Rafael J. Wysocki
2011-04-17 12:57 ` [Bug #33342] [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive Rafael J. Wysocki
10 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Greg Kroah-Hartman, H. Peter Anvin, Konrad Rzeszutek Wilk,
Linda Walsh, Stefano Stabellini
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33102
Subject : File's copied from client->linux server only copy 1st 64K data;rest is lost
Submitter : Linda Walsh <lkml@tlinx.org>
Date : 2011-04-11 22:12 (7 days old)
* [Bug #33142] 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (4 preceding siblings ...)
2011-04-17 12:57 ` [Bug #32892] 2.6.39-rc1 data corruption with rtorrent Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-17 12:57 ` [Bug #33272] drm related hard-hang Rafael J. Wysocki
` (4 subsequent siblings)
10 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Keith Packard
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33142
Subject : 2.6.39-rc2 regression: X201s fails to resume b77dcf8460ae57d4eb9fd3633eb4f97b8fb20716
Submitter : Keith Packard <keithp@keithp.com>
Date : 2011-04-06 7:44 (12 days old)
Message-ID : <yun1v1fj024.fsf@aiko.keithp.com>
References : http://marc.info/?l=linux-kernel&m=130207593728273&w=2
* [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (6 preceding siblings ...)
2011-04-17 12:57 ` [Bug #33272] drm related hard-hang Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-17 13:05 ` Cyrill Gorcunov
2011-04-17 12:57 ` [Bug #33102] File's copied from client->linux server only copy 1st 64K data;rest is lost Rafael J. Wysocki
` (2 subsequent siblings)
10 siblings, 1 reply; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Cyrill Gorcunov, Don Zickus, Ingo Molnar, Shaun Ruffell
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252
Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs
Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org>
Date : 2011-04-06 22:30 (12 days old)
First-Bad-Commit: http://git.kernel.org/linus/242214f9c1eeaae40eca11e3b4d37bfce960a7cd
Message-ID : <20110406223036.GA15721-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org>
References : http://marc.info/?l=linux-kernel&m=130212907032580&w=2
Handled-By : Don Zickus <dzickus-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
Patch : http://cache.gmane.org//gmane/linux/kernel/1125621-001.bin
* [Bug #33242] Lockdep splat in autofs with 2.6.39-rc2
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (8 preceding siblings ...)
2011-04-17 12:57 ` [Bug #33102] File's copied from client->linux server only copy 1st 64K data;rest is lost Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
2011-04-17 12:57 ` [Bug #33342] [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive Rafael J. Wysocki
10 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler, Nick Bowler
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33242
Subject : Lockdep splat in autofs with 2.6.39-rc2
Submitter : Nick Bowler <nbowler-7BP4RkwGw0uXmMXjJBpWqg@public.gmane.org>
Date : 2011-04-07 19:44 (11 days old)
Message-ID : <20110407194403.GA29404-7BP4RkwGw0uXmMXjJBpWqg@public.gmane.org>
References : http://marc.info/?l=linux-kernel&m=130220545614682&w=2
* [Bug #33342] [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive.
2011-04-17 12:52 2.6.39-rc3-git7: Reported regressions from 2.6.38 Rafael J. Wysocki
` (9 preceding siblings ...)
2011-04-17 12:57 ` [Bug #33242] Lockdep splat in autofs with 2.6.39-rc2 Rafael J. Wysocki
@ 2011-04-17 12:57 ` Rafael J. Wysocki
10 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 12:57 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Shaun Ruffell
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33342
Subject : [2.6.39-rc2][bisected] Constant DISK_MEDIA_CHANGE_EVENTS from CDROM drive.
Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org>
Date : 2011-04-08 20:15 (10 days old)
Message-ID : <20110408201513.GA3040-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org>
References : http://marc.info/?l=linux-kernel&m=130229371907209&w=2
* Re: [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs
2011-04-17 12:57 ` [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs Rafael J. Wysocki
@ 2011-04-17 13:05 ` Cyrill Gorcunov
[not found] ` <4DAAE5AA.1090302-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Cyrill Gorcunov @ 2011-04-17 13:05 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Don Zickus, Ingo Molnar, Shaun Ruffell
On 04/17/2011 04:57 PM, Rafael J. Wysocki wrote:
> This message has been generated automatically as a part of a summary report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.38. Please verify if it still should be listed and let the tracking team
> know (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252
> Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs
> Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org>
> Date : 2011-04-06 22:30 (12 days old)
> First-Bad-Commit: http://git.kernel.org/linus/242214f9c1eeaae40eca11e3b4d37bfce960a7cd
> Message-ID : <20110406223036.GA15721-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org>
> References : http://marc.info/?l=linux-kernel&m=130212907032580&w=2
> Handled-By : Don Zickus <dzickus-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
> Patch : http://cache.gmane.org//gmane/linux/kernel/1125621-001.bin
>
>
We're working on it, patch is almost done. I guess it'll be published next week.
--
Cyrill
* Re: [Bug #32262] 2.6.38-git15+ IDE hangs boot
2011-04-17 12:52 ` [Bug #32262] 2.6.38-git15+ IDE hangs boot Rafael J. Wysocki
@ 2011-04-17 13:16 ` Pete Clements
[not found] ` <201104171316.p3HDGI8Y018237-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Pete Clements @ 2011-04-17 13:16 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Pete Clements, Linux Kernel Mailing List
>
> The following bug entry is on the current list of known regressions
> from 2.6.38. Please verify if it still should be listed and let the tracking team
> know (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32262
> Subject : 2.6.38-git15+ IDE hangs boot
> Submitter : Pete Clements <clem@clem.clem-digital.net>
> Date : 2011-03-25 15:38 (24 days old)
> Message-ID : <201103251538.p2PFc11i001674@clem.clem-digital.net>
> References : http://marc.info/?l=linux-kernel&m=130106749313695&w=2
>
>
I no longer experience the problem. Don't recall when the fix was integrated
(post git19?). (Currently at 39-rc3-git6.)
--
Pete Clements
* Re: [Bug #32262] 2.6.38-git15+ IDE hangs boot
[not found] ` <201104171316.p3HDGI8Y018237-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org>
@ 2011-04-17 13:28 ` Rafael J. Wysocki
0 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 13:28 UTC (permalink / raw)
To: Pete Clements
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Linux Kernel Mailing List
On Sunday, April 17, 2011, Pete Clements wrote:
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.38. Please verify if it still should be listed and let the tracking team
> > know (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32262
> > Subject : 2.6.38-git15+ IDE hangs boot
> > Submitter : Pete Clements <clem-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org>
> > Date : 2011-03-25 15:38 (24 days old)
> > Message-ID : <201103251538.p2PFc11i001674-HsB6010ddUOt6YA6xNGIRwkbCANdLtlA@public.gmane.org>
> > References : http://marc.info/?l=linux-kernel&m=130106749313695&w=2
> >
> >
>
> I no longer experience the problem. Don't recall when the fix was integrated
> (post git19?). (Currently at 39-rc3-git6.)
Thanks, closing.
Rafael
* Re: [Bug #33252] [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs
[not found] ` <4DAAE5AA.1090302-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2011-04-17 13:31 ` Rafael J. Wysocki
0 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 13:31 UTC (permalink / raw)
To: Cyrill Gorcunov
Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Don Zickus, Ingo Molnar, Shaun Ruffell
On Sunday, April 17, 2011, Cyrill Gorcunov wrote:
> On 04/17/2011 04:57 PM, Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a summary report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.38. Please verify if it still should be listed and let the tracking team
> > know (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33252
> > Subject : [regression 2.6.39-rc2][bisected] "perf, x86: P4 PMU - Read proper MSR register to catch" and NMIs
> > Submitter : Shaun Ruffell <sruffell-mYLhy7Md7uHQT0dZR+AlfA@public.gmane.org>
> > Date : 2011-04-06 22:30 (12 days old)
> >
> >
>
> We're working on it, patch is almost done. I guess it'll be published next week.
Great, thanks! Please let me know when the patch makes it into Linus' tree.
Rafael
* Re: [Bug #32982] Kernel locks up a few minutes after boot
2011-04-17 12:57 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki
@ 2011-04-17 17:03 ` Linus Torvalds
[not found] ` <BANLkTimeaHwBiZoz=isCM=qtBxoxaBqDUw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Linus Torvalds @ 2011-04-17 17:03 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Bart Van Assche, Neil Brown, Jens Axboe
Is this machine running a RAID5 setup or something like that?
There is a known interaction with the new block layer plugging code
and MD. The "hung task" report in that bugzilla looks very much like
that issue. And you do have "root=/dev/md0", so clearly there's some
md thing going on.
And bisecting might not work all that well for it, because I suspect
it ends up being very much a matter of IO patterns how it triggers.
Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens?
Linus
On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
> This message has been generated automatically as a part of a summary report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.38. Please verify if it still should be listed and let the tracking team
> know (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
> Subject : Kernel locks up a few minutes after boot
> Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> Date : 2011-04-10 19:55 (8 days old)
>
>
>
* Re: [Bug #33272] drm related hard-hang
[not found] ` <BANLkTik3sciXX58NtRk_03cPc=nHjdPtAQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-04-17 18:22 ` Rafael J. Wysocki
0 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-17 18:22 UTC (permalink / raw)
To: Peter Teoh; +Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler
On Sunday, April 17, 2011, Peter Teoh wrote:
> Oh yes, as shown in the bug report, the version in Linus' git tree is not
> working as of 14 Apr 2011.
Thanks for the update.
Rafael
> On Sun, Apr 17, 2011 at 8:57 PM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
>
> > This message has been generated automatically as a part of a summary report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.38. Please verify if it still should be listed and let the
> > tracking team
> > know (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=33272
> > Subject : drm related hard-hang
> > Submitter : Peter Teoh <htmldeveloper-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > Date : 2011-04-14 01:29 (4 days old)
> >
> >
> >
>
>
>
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTimeaHwBiZoz=isCM=qtBxoxaBqDUw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-04-17 18:37 ` Bart Van Assche
[not found] ` <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Bart Van Assche @ 2011-04-17 18:37 UTC (permalink / raw)
To: Linus Torvalds
Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
Maciej Rutecki, Florian Mickler, Neil Brown, Jens Axboe
On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds
<torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote:
> On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
> > This message has been generated automatically as a part of a summary report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.38. Please verify if it still should be listed and let the tracking team
> > know (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
> > Subject : Kernel locks up a few minutes after boot
> > Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > Date : 2011-04-10 19:55 (8 days old)
>
> Is this machine running a RAID5 setup or something like that?
>
> There is a known interaction with the new block layer plugging code
> and MD. The "hung task" report in that bugzilla looks very much like
> that issue. And you do have "root=/dev/md0", so clearly there's some
> md thing going on.
>
> And bisecting might not work all that well for it, because I suspect
> it ends up being very much a matter of IO patterns how it triggers.
>
> Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens?
(converted top-posting into bottom-posting)
Hello Linus,
On the system on which bug #32982 has been triggered md0, md1 and md2
have been configured as two-disk RAID1 (mirroring).
I've done my best to trigger enough I/O in order to obtain reliable
bisect results. A difficulty I encountered during bisecting though was
that I encountered unbootable kernels (all skipped revisions).
Bart.
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-04-17 21:07 ` NeilBrown
[not found] ` <20110418070711.12dd5cc1-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org>
2011-04-18 11:44 ` Jens Axboe
1 sibling, 1 reply; 41+ messages in thread
From: NeilBrown @ 2011-04-17 21:07 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Jens Axboe
On Sun, 17 Apr 2011 20:37:39 +0200 Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote:
> On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds
> <torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote:
> > On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
> > > This message has been generated automatically as a part of a summary report
> > > of recent regressions.
> > >
> > > The following bug entry is on the current list of known regressions
> > > from 2.6.38. Please verify if it still should be listed and let the tracking team
> > > know (either way).
> > >
> > >
> > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
> > > Subject : Kernel locks up a few minutes after boot
> > > Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > > Date : 2011-04-10 19:55 (8 days old)
> >
> > Is this machine running a RAID5 setup or something like that?
> >
> > There is a known interaction with the new block layer plugging code
> > and MD. The "hung task" report in that bugzilla looks very much like
> > that issue. And you do have "root=/dev/md0", so clearly there's some
> > md thing going on.
> >
> > And bisecting might not work all that well for it, because I suspect
> > it ends up being very much a matter of IO patterns how it triggers.
> >
> > Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens?
>
> (converted top-posting into bottom-posting)
>
> Hello Linus,
>
> On the system on which bug #32982 has been triggered md0, md1 and md2
> have been configured as two-disk RAID1 (mirroring).
If any of those have write-intent bitmaps then I definitely know what the
problem is and I'll be posting patches later today (probably not much later).
If not .. then I'm less sure but it would certainly be worth testing after
applying the promised fixes.
NeilBrown
>
> I've done my best to trigger enough I/O in order to obtain reliable
> bisect results. A difficulty I encountered during bisecting though was
> that I encountered unbootable kernels (all skipped revisions).
>
> Bart.
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <20110418070711.12dd5cc1-wvvUuzkyo1EYVZTmpyfIwg@public.gmane.org>
@ 2011-04-17 22:20 ` NeilBrown
0 siblings, 0 replies; 41+ messages in thread
From: NeilBrown @ 2011-04-17 22:20 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Jens Axboe
On Mon, 18 Apr 2011 07:07:11 +1000 NeilBrown <neilb-l3A5Bk7waGM@public.gmane.org> wrote:
> On Sun, 17 Apr 2011 20:37:39 +0200 Bart Van Assche <bvanassche-y/PYEvSCHaw@public.gmane.org> wrote:
>
> > On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds
> > <torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote:
> > > On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
> > > > This message has been generated automatically as a part of a summary report
> > > > of recent regressions.
> > > >
> > > > The following bug entry is on the current list of known regressions
> > > > from 2.6.38. Please verify if it still should be listed and let the tracking team
> > > > know (either way).
> > > >
> > > >
> > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
> > > > Subject : Kernel locks up a few minutes after boot
> > > > Submitter : Bart Van Assche <bart.vanassche@gmail.com>
> > > > Date : 2011-04-10 19:55 (8 days old)
> > >
> > > Is this machine running a RAID5 setup or something like that?
> > >
> > > There is a known interaction with the new block layer plugging code
> > > and MD. The "hung task" report in that bugzilla looks very much like
> > > that issue. And you do have "root=/dev/md0", so clearly there's some
> > > md thing going on.
> > >
> > > And bisecting might not work all that well for it, because I suspect
> > > it ends up being very much a matter of IO patterns how it triggers.
> > >
> > > Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens?
> >
> > (converted top-posting into bottom-posting)
> >
> > Hello Linus,
> >
> > On the system on which bug #32982 has been triggered md0, md1 and md2
> > have been configured as two-disk RAID1 (mirroring).
>
> If any of those have write-intent bitmaps then I definitely know what the
> problem is and I'll be posting patches later today (probably not much later).
>
Actually it won't be today. The new block device plugging is still unusable
for MD - so I won't be able to fix this until that gets sorted out.
NeilBrown
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTi=1jO73DXDTD50bVpU82qphVKmRCw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2011-04-17 21:07 ` NeilBrown
@ 2011-04-18 11:44 ` Jens Axboe
[not found] ` <4DAC2429.5000105-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
1 sibling, 1 reply; 41+ messages in thread
From: Jens Axboe @ 2011-04-18 11:44 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown
On 2011-04-17 20:37, Bart Van Assche wrote:
> On Sun, Apr 17, 2011 at 7:03 PM, Linus Torvalds
> <torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org> wrote:
>> On Sun, Apr 17, 2011 at 5:57 AM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
>>> This message has been generated automatically as a part of a summary report
>>> of recent regressions.
>>>
>>> The following bug entry is on the current list of known regressions
>>> from 2.6.38. Please verify if it still should be listed and let the tracking team
>>> know (either way).
>>>
>>>
>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
>>> Subject : Kernel locks up a few minutes after boot
>>> Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
>>> Date : 2011-04-10 19:55 (8 days old)
>>
>> Is this machine running a RAID5 setup or something like that?
>>
>> There is a known interaction with the new block layer plugging code
>> and MD. The "hung task" report in that bugzilla looks very much like
>> that issue. And you do have "root=/dev/md0", so clearly there's some
>> md thing going on.
>>
>> And bisecting might not work all that well for it, because I suspect
>> it ends up being very much a matter of IO patterns how it triggers.
>>
>> Neil supposedly has a patch for it, but I haven't seen it yet. Neil, Jens?
>
> (converted top-posting into bottom-posting)
>
> Hello Linus,
>
> On the system on which bug #32982 has been triggered md0, md1 and md2
> have been configured as two-disk RAID1 (mirroring).
>
> I've done my best to trigger enough I/O in order to obtain reliable
> bisect results. A difficulty I encountered during bisecting though was
> that I encountered unbootable kernels (all skipped revisions).
Bart, can you try and pull:
git://git.kernel.dk/linux-2.6-block.git for-linus
into Linus' tree and see if that works? This has, among other things,
Neil's fixes for MD.
--
Jens Axboe
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64
2011-04-17 12:57 ` [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64 Rafael J. Wysocki
@ 2011-04-18 15:34 ` Alex Romosan
[not found] ` <87ipub7evl.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Alex Romosan @ 2011-04-18 15:34 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler
"Rafael J. Wysocki" <rjw@sisk.pl> writes:
> This message has been generated automatically as a part of a summary report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.38. Please verify if it still should be listed and let the
> tracking team
> know (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902
> Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64
> Submitter : Alex Romosan <romosan@sycorax.lbl.gov>
> Date : 2011-04-03 19:41 (15 days old)
> Message-ID : <87k4fbnmw8.fsf@sycorax.lbl.gov>
> References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2
>
my laptop works again with 2.6.39-rc3 (except the boot process hangs at
waiting for /dev to be populated.... if i hit Ctrl-C then the booting
proceeds normally. this doesn't happen with 2.6.38) so probably this bug
can be closed.
--alex--
--
| I believe the moment is at hand when, by a paranoiac and active |
| advance of the mind, it will be possible (simultaneously with |
| automatism and other passive states) to systematize confusion |
| and thus to help to discredit completely the world of reality. |
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <4DAC2429.5000105-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
@ 2011-04-18 18:21 ` Bart Van Assche
[not found] ` <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Bart Van Assche @ 2011-04-18 18:21 UTC (permalink / raw)
To: Jens Axboe
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown
On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
> Bart, can you try and pull:
>
> git://git.kernel.dk/linux-2.6-block.git for-linus
>
> into Linus' tree and see if that works? This has, among other things,
> Neils fixes for MD.
md seems to be stable with the resulting tree, but it looks like there
is a performance regression in the block layer that is not related to
the md issue. If I run a small-block IOPS test on a block device created
by ib_srp (NOOP scheduler) I see about 11% fewer IOPS than with 2.6.38.3
(155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
Bart.
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-04-18 18:28 ` Jens Axboe
[not found] ` <4DAC82E6.3020809-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
2011-04-19 3:32 ` David Dillow
1 sibling, 1 reply; 41+ messages in thread
From: Jens Axboe @ 2011-04-18 18:28 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown
On 2011-04-18 20:21, Bart Van Assche wrote:
> On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
>> Bart, can you try and pull:
>>
>> git://git.kernel.dk/linux-2.6-block.git for-linus
>>
>> into Linus' tree and see if that works? This has, among other things,
>> Neils fixes for MD.
>
> md seems to work stable with the resulting tree, but it looks there is
OK, that's the most important bit.
> a performance regression in the block layer not related to the md
> issue. If I run a small block IOPS test on a block device created by
> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
That's not good. What's the test case?
--
Jens Axboe
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <4DAC82E6.3020809-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
@ 2011-04-18 18:32 ` Bart Van Assche
[not found] ` <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Bart Van Assche @ 2011-04-18 18:32 UTC (permalink / raw)
To: Jens Axboe
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown
On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
> On 2011-04-18 20:21, Bart Van Assche wrote:
>> a performance regression in the block layer not related to the md
>> issue. If I run a small block IOPS test on a block device created by
>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
>
> That's not good. What's the test case?
Nothing more than a fio IOPS test:
fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread
--iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G
--gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1
Bart.
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-04-18 18:38 ` Jens Axboe
2011-04-19 9:09 ` Jens Axboe
1 sibling, 0 replies; 41+ messages in thread
From: Jens Axboe @ 2011-04-18 18:38 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown
On 2011-04-18 20:32, Bart Van Assche wrote:
> On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
>> On 2011-04-18 20:21, Bart Van Assche wrote:
>>> a performance regression in the block layer not related to the md
>>> issue. If I run a small block IOPS test on a block device created by
>>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
>>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
>>
>> That's not good. What's the test case?
>
> Nothing more than a fio IOPS test:
>
> fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread
> --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G
> --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1
Interesting, I'll have to check if we regressed with all these recent
changes. Comparing your .38 to .39-rc3+, are you using more/less CPU,
more/less sys%, etc?
A quick perf record -fg / perf report -g for both kernels would be nice
to see.
--
Jens Axboe
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32902] 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64
[not found] ` <87ipub7evl.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org>
@ 2011-04-18 21:22 ` Rafael J. Wysocki
0 siblings, 0 replies; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-18 21:22 UTC (permalink / raw)
To: Alex Romosan
Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler
On Monday, April 18, 2011, Alex Romosan wrote:
> "Rafael J. Wysocki" <rjw-KKrjLPT3xs0@public.gmane.org> writes:
>
> > This message has been generated automatically as a part of a summary report
> > of recent regressions.
> >
> > The following bug entry is on the current list of known regressions
> > from 2.6.38. Please verify if it still should be listed and let the
> > tracking team
> > know (either way).
> >
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32902
> > Subject : 2.6.39-rc1 doesn't boot on thinkpad t61p x86_64
> > Submitter : Alex Romosan <romosan-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org>
> > Date : 2011-04-03 19:41 (15 days old)
> > Message-ID : <87k4fbnmw8.fsf-z2gvhuI27Wv/eO6KEJJvLQ@public.gmane.org>
> > References : http://marc.info/?l=linux-kernel&m=130186054431678&w=2
> >
>
> my laptop works again with 2.6.39-rc3 (except the boot process hangs at
> waiting for /dev to be populated.... if i hit Ctrl-C then the booting
> proceeds normally. this doesn't happen with 2.6.38) so probably this bug
> can be closed.
Thanks, closing.
Rafael
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTinCWWYqnksxevho9TN3u6ZC_Rjzwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2011-04-18 18:28 ` Jens Axboe
@ 2011-04-19 3:32 ` David Dillow
[not found] ` <1303183934.2585.12.camel-1q1vX8mYZiGLUyTwlgNVppKKF0rrzTr+@public.gmane.org>
1 sibling, 1 reply; 41+ messages in thread
From: David Dillow @ 2011-04-19 3:32 UTC (permalink / raw)
To: Bart Van Assche
Cc: Jens Axboe, Linus Torvalds, Rafael J. Wysocki,
Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Neil Brown
On Mon, 2011-04-18 at 20:21 +0200, Bart Van Assche wrote:
> On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
> > Bart, can you try and pull:
> >
> > git://git.kernel.dk/linux-2.6-block.git for-linus
> >
> > into Linus' tree and see if that works? This has, among other things,
> > Neils fixes for MD.
>
> md seems to work stable with the resulting tree, but it looks there is
> a performance regression in the block layer not related to the md
> issue. If I run a small block IOPS test on a block device created by
> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
The mapping code for ib_srp changed in 2.6.39-rc1, but it showed
improved IOPS for a similar setup in my testing so I'd be surprised if
it is the culprit. Still, it wouldn't hurt to check. Do you have time to
try the new ib_srp code with 2.6.38.3 to eliminate it from the equation?
Thanks,
Dave
^ permalink raw reply [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTimaekpwmK+z=ePE+XSP6GUteidwfw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2011-04-18 18:38 ` Jens Axboe
@ 2011-04-19 9:09 ` Jens Axboe
[not found] ` <4DAD5156.2050300-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
1 sibling, 1 reply; 41+ messages in thread
From: Jens Axboe @ 2011-04-19 9:09 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown
On 2011-04-18 20:32, Bart Van Assche wrote:
> On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
>> On 2011-04-18 20:21, Bart Van Assche wrote:
>>> a performance regression in the block layer not related to the md
>>> issue. If I run a small block IOPS test on a block device created by
>>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
>>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
>>
>> That's not good. What's the test case?
>
> Nothing more than a fio IOPS test:
>
> fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread
> --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G
> --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1
Bart, can you try the below:
diff --git a/block/blk-core.c b/block/blk-core.c
index 5fa3dd2..9b41da1 100644
--- a/block/blk-core.c
+++ b/block/blk-core.c
@@ -307,11 +307,7 @@ void __blk_run_queue(struct request_queue *q)
* Only recurse once to avoid overrunning the stack, let the unplug
* handling reinvoke the handler shortly if we already got there.
*/
- if (!queue_flag_test_and_set(QUEUE_FLAG_REENTER, q)) {
- q->request_fn(q);
- queue_flag_clear(QUEUE_FLAG_REENTER, q);
- } else
- queue_delayed_work(kblockd_workqueue, &q->delay_work, 0);
+ q->request_fn(q);
}
EXPORT_SYMBOL(__blk_run_queue);
--
Jens Axboe
^ permalink raw reply related [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <4DAD5156.2050300-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
@ 2011-04-19 11:16 ` Jens Axboe
2011-04-19 16:13 ` Bart Van Assche
[not found] ` <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
0 siblings, 2 replies; 41+ messages in thread
From: Jens Axboe @ 2011-04-19 11:16 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown
On 2011-04-19 11:09, Jens Axboe wrote:
> On 2011-04-18 20:32, Bart Van Assche wrote:
>> On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
>>> On 2011-04-18 20:21, Bart Van Assche wrote:
>>>> a performance regression in the block layer not related to the md
>>>> issue. If I run a small block IOPS test on a block device created by
>>>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
>>>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
>>>
>>> That's not good. What's the test case?
>>
>> Nothing more than a fio IOPS test:
>>
>> fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread
>> --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G
>> --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1
>
> Bart, can you try the below:
Here's a more complete variant. James, let's get rid of this REENTER
crap. It's completely bogus and triggers falsely for a variety of
reasons. The below will work, but there may be room for improvement on
the SCSI side.
diff --git a/block/blk-core.c b/block/blk-core.c
index 5fa3dd2..4e49665 100644
--- a/block/blk-core.c
+++ b/block/blk-core.c
@@ -303,15 +303,7 @@ void __blk_run_queue(struct request_queue *q)
if (unlikely(blk_queue_stopped(q)))
return;
- /*
- * Only recurse once to avoid overrunning the stack, let the unplug
- * handling reinvoke the handler shortly if we already got there.
- */
- if (!queue_flag_test_and_set(QUEUE_FLAG_REENTER, q)) {
- q->request_fn(q);
- queue_flag_clear(QUEUE_FLAG_REENTER, q);
- } else
- queue_delayed_work(kblockd_workqueue, &q->delay_work, 0);
+ q->request_fn(q);
}
EXPORT_SYMBOL(__blk_run_queue);
@@ -328,6 +320,7 @@ void blk_run_queue_async(struct request_queue *q)
if (likely(!blk_queue_stopped(q)))
queue_delayed_work(kblockd_workqueue, &q->delay_work, 0);
}
+EXPORT_SYMBOL(blk_run_queue_async);
/**
* blk_run_queue - run a single device queue
diff --git a/block/blk.h b/block/blk.h
index c9df8fc..6126346 100644
--- a/block/blk.h
+++ b/block/blk.h
@@ -22,7 +22,6 @@ void blk_rq_timed_out_timer(unsigned long data);
void blk_delete_timer(struct request *);
void blk_add_timer(struct request *);
void __generic_unplug_device(struct request_queue *);
-void blk_run_queue_async(struct request_queue *q);
/*
* Internal atomic flags for request handling
diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
index ab55c2f..e9901b8 100644
--- a/drivers/scsi/scsi_lib.c
+++ b/drivers/scsi/scsi_lib.c
@@ -411,8 +411,6 @@ static void scsi_run_queue(struct request_queue *q)
list_splice_init(&shost->starved_list, &starved_list);
while (!list_empty(&starved_list)) {
- int flagset;
-
/*
* As long as shost is accepting commands and we have
* starved queues, call blk_run_queue. scsi_request_fn
@@ -435,20 +433,7 @@ static void scsi_run_queue(struct request_queue *q)
continue;
}
- spin_unlock(shost->host_lock);
-
- spin_lock(sdev->request_queue->queue_lock);
- flagset = test_bit(QUEUE_FLAG_REENTER, &q->queue_flags) &&
- !test_bit(QUEUE_FLAG_REENTER,
- &sdev->request_queue->queue_flags);
- if (flagset)
- queue_flag_set(QUEUE_FLAG_REENTER, sdev->request_queue);
- __blk_run_queue(sdev->request_queue);
- if (flagset)
- queue_flag_clear(QUEUE_FLAG_REENTER, sdev->request_queue);
- spin_unlock(sdev->request_queue->queue_lock);
-
- spin_lock(shost->host_lock);
+ blk_run_queue_async(sdev->request_queue);
}
/* put any unprocessed entries back */
list_splice(&starved_list, &shost->starved_list);
diff --git a/drivers/scsi/scsi_transport_fc.c b/drivers/scsi/scsi_transport_fc.c
index 28c3350..815069d 100644
--- a/drivers/scsi/scsi_transport_fc.c
+++ b/drivers/scsi/scsi_transport_fc.c
@@ -3816,28 +3816,17 @@ fail_host_msg:
static void
fc_bsg_goose_queue(struct fc_rport *rport)
{
- int flagset;
- unsigned long flags;
-
if (!rport->rqst_q)
return;
+ /*
+ * This get/put dance makes no sense
+ */
get_device(&rport->dev);
-
- spin_lock_irqsave(rport->rqst_q->queue_lock, flags);
- flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) &&
- !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags);
- if (flagset)
- queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q);
- __blk_run_queue(rport->rqst_q);
- if (flagset)
- queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q);
- spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags);
-
+ blk_run_queue_async(rport->rqst_q);
put_device(&rport->dev);
}
-
/**
* fc_bsg_rport_dispatch - process rport bsg requests and dispatch to LLDD
* @q: rport request queue
diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h
index cbbfd98..2ad95fa 100644
--- a/include/linux/blkdev.h
+++ b/include/linux/blkdev.h
@@ -388,20 +388,19 @@ struct request_queue
#define QUEUE_FLAG_SYNCFULL 3 /* read queue has been filled */
#define QUEUE_FLAG_ASYNCFULL 4 /* write queue has been filled */
#define QUEUE_FLAG_DEAD 5 /* queue being torn down */
-#define QUEUE_FLAG_REENTER 6 /* Re-entrancy avoidance */
-#define QUEUE_FLAG_ELVSWITCH 7 /* don't use elevator, just do FIFO */
-#define QUEUE_FLAG_BIDI 8 /* queue supports bidi requests */
-#define QUEUE_FLAG_NOMERGES 9 /* disable merge attempts */
-#define QUEUE_FLAG_SAME_COMP 10 /* force complete on same CPU */
-#define QUEUE_FLAG_FAIL_IO 11 /* fake timeout */
-#define QUEUE_FLAG_STACKABLE 12 /* supports request stacking */
-#define QUEUE_FLAG_NONROT 13 /* non-rotational device (SSD) */
+#define QUEUE_FLAG_ELVSWITCH 6 /* don't use elevator, just do FIFO */
+#define QUEUE_FLAG_BIDI 7 /* queue supports bidi requests */
+#define QUEUE_FLAG_NOMERGES 8 /* disable merge attempts */
+#define QUEUE_FLAG_SAME_COMP 9 /* force complete on same CPU */
+#define QUEUE_FLAG_FAIL_IO 10 /* fake timeout */
+#define QUEUE_FLAG_STACKABLE 11 /* supports request stacking */
+#define QUEUE_FLAG_NONROT 12 /* non-rotational device (SSD) */
#define QUEUE_FLAG_VIRT QUEUE_FLAG_NONROT /* paravirt device */
-#define QUEUE_FLAG_IO_STAT 15 /* do IO stats */
-#define QUEUE_FLAG_DISCARD 16 /* supports DISCARD */
-#define QUEUE_FLAG_NOXMERGES 17 /* No extended merges */
-#define QUEUE_FLAG_ADD_RANDOM 18 /* Contributes to random pool */
-#define QUEUE_FLAG_SECDISCARD 19 /* supports SECDISCARD */
+#define QUEUE_FLAG_IO_STAT 13 /* do IO stats */
+#define QUEUE_FLAG_DISCARD 14 /* supports DISCARD */
+#define QUEUE_FLAG_NOXMERGES 15 /* No extended merges */
+#define QUEUE_FLAG_ADD_RANDOM 16 /* Contributes to random pool */
+#define QUEUE_FLAG_SECDISCARD 17 /* supports SECDISCARD */
#define QUEUE_FLAG_DEFAULT ((1 << QUEUE_FLAG_IO_STAT) | \
(1 << QUEUE_FLAG_STACKABLE) | \
@@ -699,6 +698,7 @@ extern void blk_sync_queue(struct request_queue *q);
extern void __blk_stop_queue(struct request_queue *q);
extern void __blk_run_queue(struct request_queue *q);
extern void blk_run_queue(struct request_queue *);
+extern void blk_run_queue_async(struct request_queue *q);
extern int blk_rq_map_user(struct request_queue *, struct request *,
struct rq_map_data *, void __user *, unsigned long,
gfp_t);
--
Jens Axboe
^ permalink raw reply related [flat|nested] 41+ messages in thread
* Re: [Bug #32982] Kernel locks up a few minutes after boot
2011-04-19 11:16 ` Jens Axboe
@ 2011-04-19 16:13 ` Bart Van Assche
[not found] ` <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
[not found] ` <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
1 sibling, 1 reply; 41+ messages in thread
From: Bart Van Assche @ 2011-04-19 16:13 UTC (permalink / raw)
To: Jens Axboe
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown,
David Dillow
On Tue, Apr 19, 2011 at 1:16 PM, Jens Axboe <jaxboe@fusionio.com> wrote:
> On 2011-04-19 11:09, Jens Axboe wrote:
> > On 2011-04-18 20:32, Bart Van Assche wrote:
> >> On Mon, Apr 18, 2011 at 8:28 PM, Jens Axboe <jaxboe@fusionio.com> wrote:
> >>> On 2011-04-18 20:21, Bart Van Assche wrote:
> >>>> a performance regression in the block layer not related to the md
> >>>> issue. If I run a small block IOPS test on a block device created by
> >>>> ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
> >>>> (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
> >>>
> >>> That's not good. What's the test case?
> >>
> >> Nothing more than a fio IOPS test:
> >>
> >> fio --bs=512 --ioengine=libaio --buffered=0 --rw=read --thread
> >> --iodepth=64 --numjobs=2 --loops=10000 --group_reporting --size=1G
> >> --gtod_reduce=1 --name=iops-test --filename=/dev/${dev} --invalidate=1
> >
> > Bart, can you try the below:
>
> Here's a more complete variant. James, lets get rid of this REENTER
> crap. It's completely bogus and triggers falsely for a variety of
> reasons. The below will work, but there may be room for improvement on
> the SCSI side.
>
> diff --git a/block/blk-core.c b/block/blk-core.c
> index 5fa3dd2..4e49665 100644
> --- a/block/blk-core.c
> +++ b/block/blk-core.c
> @@ -303,15 +303,7 @@ void __blk_run_queue(struct request_queue *q)
> if (unlikely(blk_queue_stopped(q)))
> return;
>
> - /*
> - * Only recurse once to avoid overrunning the stack, let the unplug
> - * handling reinvoke the handler shortly if we already got there.
> - */
> - if (!queue_flag_test_and_set(QUEUE_FLAG_REENTER, q)) {
> - q->request_fn(q);
> - queue_flag_clear(QUEUE_FLAG_REENTER, q);
> - } else
> - queue_delayed_work(kblockd_workqueue, &q->delay_work, 0);
> + q->request_fn(q);
> }
> EXPORT_SYMBOL(__blk_run_queue);
>
> @@ -328,6 +320,7 @@ void blk_run_queue_async(struct request_queue *q)
> if (likely(!blk_queue_stopped(q)))
> queue_delayed_work(kblockd_workqueue, &q->delay_work, 0);
> }
> +EXPORT_SYMBOL(blk_run_queue_async);
>
> /**
> * blk_run_queue - run a single device queue
> diff --git a/block/blk.h b/block/blk.h
> index c9df8fc..6126346 100644
> --- a/block/blk.h
> +++ b/block/blk.h
> @@ -22,7 +22,6 @@ void blk_rq_timed_out_timer(unsigned long data);
> void blk_delete_timer(struct request *);
> void blk_add_timer(struct request *);
> void __generic_unplug_device(struct request_queue *);
> -void blk_run_queue_async(struct request_queue *q);
>
> /*
> * Internal atomic flags for request handling
> diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c
> index ab55c2f..e9901b8 100644
> --- a/drivers/scsi/scsi_lib.c
> +++ b/drivers/scsi/scsi_lib.c
> @@ -411,8 +411,6 @@ static void scsi_run_queue(struct request_queue *q)
> list_splice_init(&shost->starved_list, &starved_list);
>
> while (!list_empty(&starved_list)) {
> - int flagset;
> -
> /*
> * As long as shost is accepting commands and we have
> * starved queues, call blk_run_queue. scsi_request_fn
> @@ -435,20 +433,7 @@ static void scsi_run_queue(struct request_queue *q)
> continue;
> }
>
> - spin_unlock(shost->host_lock);
> -
> - spin_lock(sdev->request_queue->queue_lock);
> - flagset = test_bit(QUEUE_FLAG_REENTER, &q->queue_flags) &&
> - !test_bit(QUEUE_FLAG_REENTER,
> - &sdev->request_queue->queue_flags);
> - if (flagset)
> - queue_flag_set(QUEUE_FLAG_REENTER, sdev->request_queue);
> - __blk_run_queue(sdev->request_queue);
> - if (flagset)
> - queue_flag_clear(QUEUE_FLAG_REENTER, sdev->request_queue);
> - spin_unlock(sdev->request_queue->queue_lock);
> -
> - spin_lock(shost->host_lock);
> + blk_run_queue_async(sdev->request_queue);
> }
> /* put any unprocessed entries back */
> list_splice(&starved_list, &shost->starved_list);
> diff --git a/drivers/scsi/scsi_transport_fc.c b/drivers/scsi/scsi_transport_fc.c
> index 28c3350..815069d 100644
> --- a/drivers/scsi/scsi_transport_fc.c
> +++ b/drivers/scsi/scsi_transport_fc.c
> @@ -3816,28 +3816,17 @@ fail_host_msg:
> static void
> fc_bsg_goose_queue(struct fc_rport *rport)
> {
> - int flagset;
> - unsigned long flags;
> -
> if (!rport->rqst_q)
> return;
>
> + /*
> + * This get/put dance makes no sense
> + */
> get_device(&rport->dev);
> -
> - spin_lock_irqsave(rport->rqst_q->queue_lock, flags);
> - flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) &&
> - !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags);
> - if (flagset)
> - queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q);
> - __blk_run_queue(rport->rqst_q);
> - if (flagset)
> - queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q);
> - spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags);
> -
> + blk_run_queue_async(rport->rqst_q);
> put_device(&rport->dev);
> }
>
> -
> /**
> * fc_bsg_rport_dispatch - process rport bsg requests and dispatch to LLDD
> * @q: rport request queue
> diff --git a/include/linux/blkdev.h b/include/linux/blkdev.h
> index cbbfd98..2ad95fa 100644
> --- a/include/linux/blkdev.h
> +++ b/include/linux/blkdev.h
> @@ -388,20 +388,19 @@ struct request_queue
> #define QUEUE_FLAG_SYNCFULL 3 /* read queue has been filled */
> #define QUEUE_FLAG_ASYNCFULL 4 /* write queue has been filled */
> #define QUEUE_FLAG_DEAD 5 /* queue being torn down */
> -#define QUEUE_FLAG_REENTER 6 /* Re-entrancy avoidance */
> -#define QUEUE_FLAG_ELVSWITCH 7 /* don't use elevator, just do FIFO */
> -#define QUEUE_FLAG_BIDI 8 /* queue supports bidi requests */
> -#define QUEUE_FLAG_NOMERGES 9 /* disable merge attempts */
> -#define QUEUE_FLAG_SAME_COMP 10 /* force complete on same CPU */
> -#define QUEUE_FLAG_FAIL_IO 11 /* fake timeout */
> -#define QUEUE_FLAG_STACKABLE 12 /* supports request stacking */
> -#define QUEUE_FLAG_NONROT 13 /* non-rotational device (SSD) */
> +#define QUEUE_FLAG_ELVSWITCH 6 /* don't use elevator, just do FIFO */
> +#define QUEUE_FLAG_BIDI 7 /* queue supports bidi requests */
> +#define QUEUE_FLAG_NOMERGES 8 /* disable merge attempts */
> +#define QUEUE_FLAG_SAME_COMP 9 /* force complete on same CPU */
> +#define QUEUE_FLAG_FAIL_IO 10 /* fake timeout */
> +#define QUEUE_FLAG_STACKABLE 11 /* supports request stacking */
> +#define QUEUE_FLAG_NONROT 12 /* non-rotational device (SSD) */
> #define QUEUE_FLAG_VIRT QUEUE_FLAG_NONROT /* paravirt device */
> -#define QUEUE_FLAG_IO_STAT 15 /* do IO stats */
> -#define QUEUE_FLAG_DISCARD 16 /* supports DISCARD */
> -#define QUEUE_FLAG_NOXMERGES 17 /* No extended merges */
> -#define QUEUE_FLAG_ADD_RANDOM 18 /* Contributes to random pool */
> -#define QUEUE_FLAG_SECDISCARD 19 /* supports SECDISCARD */
> +#define QUEUE_FLAG_IO_STAT 13 /* do IO stats */
> +#define QUEUE_FLAG_DISCARD 14 /* supports DISCARD */
> +#define QUEUE_FLAG_NOXMERGES 15 /* No extended merges */
> +#define QUEUE_FLAG_ADD_RANDOM 16 /* Contributes to random pool */
> +#define QUEUE_FLAG_SECDISCARD 17 /* supports SECDISCARD */
>
> #define QUEUE_FLAG_DEFAULT ((1 << QUEUE_FLAG_IO_STAT) | \
> (1 << QUEUE_FLAG_STACKABLE) | \
> @@ -699,6 +698,7 @@ extern void blk_sync_queue(struct request_queue *q);
> extern void __blk_stop_queue(struct request_queue *q);
> extern void __blk_run_queue(struct request_queue *q);
> extern void blk_run_queue(struct request_queue *);
> +extern void blk_run_queue_async(struct request_queue *q);
> extern int blk_rq_map_user(struct request_queue *, struct request *,
> struct rq_map_data *, void __user *, unsigned long,
> gfp_t);
Hello Jens,
The same test with an initiator running 2.6.39-rc4 +
git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch
yields about 155.000 IOPS on my test setup, or the same performance as
with 2.6.38.3. I'm running the above patch through an I/O stress test
now.
Bart.
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-04-19 16:32 ` Linus Torvalds
[not found] ` <BANLkTikzRE7sA8_NNgVUpOqR_zGtw+YUvQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2011-04-19 17:43 ` Jens Axboe
1 sibling, 1 reply; 41+ messages in thread
From: Linus Torvalds @ 2011-04-19 16:32 UTC (permalink / raw)
To: Bart Van Assche
Cc: Jens Axboe, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown,
David Dillow
On Tue, Apr 19, 2011 at 9:13 AM, Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote:
>
> The same test with an initiator running 2.6.39-rc4 +
> git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch
> yields about 155.000 IOPS on my test setup, or the same performance as
> with 2.6.38.3. I'm running the above patch through an I/O stress test
> now.
Goodie. So not only does that patch get back the 11%, it removes the
crazy QUEUE_FLAG_REENTER flag that was broken to begin with. AND it
removes a number of complicated lines.
Halleluja.
Linus
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <1303183934.2585.12.camel-1q1vX8mYZiGLUyTwlgNVppKKF0rrzTr+@public.gmane.org>
@ 2011-04-19 16:39 ` Bart Van Assche
0 siblings, 0 replies; 41+ messages in thread
From: Bart Van Assche @ 2011-04-19 16:39 UTC (permalink / raw)
To: David Dillow
Cc: Jens Axboe, Linus Torvalds, Rafael J. Wysocki,
Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Neil Brown
On Tue, Apr 19, 2011 at 5:32 AM, David Dillow <dave-i1Mk8JYDVaaSihdK6806/g@public.gmane.org> wrote:
>
> On Mon, 2011-04-18 at 20:21 +0200, Bart Van Assche wrote:
> > On Mon, Apr 18, 2011 at 1:44 PM, Jens Axboe <jaxboe-5c4llco8/ftWk0Htik3J/w@public.gmane.org> wrote:
> > > Bart, can you try and pull:
> > >
> > > git://git.kernel.dk/linux-2.6-block.git for-linus
> > >
> > > into Linus' tree and see if that works? This has, among other things,
> > > Neil's fixes for MD.
> >
> > md seems to be stable with the resulting tree, but it looks like there is
> > a performance regression in the block layer not related to the md
> > issue. If I run a small block IOPS test on a block device created by
> > ib_srp (NOOP scheduler) I see about 11% less IOPS than with 2.6.38.3
> > (155.000 IOPS with 2.6.38.3 and 140.000 IOPS with 2.6.39-rc3+).
>
> The mapping code for ib_srp changed in 2.6.39-rc1, but it showed
> improved IOPS for a similar setup in my testing so I'd be surprised if
> it is the culprit. Still, it wouldn't hurt to check. Do you have time to
> try the new ib_srp code with 2.6.38.3 to eliminate it from the equation?
Hello Dave,
I just ran a test with the most important 2.6.39-specific ib_srp
commits reverted but that didn't yield a measurable performance
difference for this specific test:
$ git show --format=format:%s 7f9e5c48c1078507747434d4c182ab10925bf98a
be8b981453a4904399cb090c1660618e250092d8
c07d424d6118d528ef71b22b7424bfc359c307a5
8f26c9ff9cd0317ad867bce972f69e0c6c2cbe3c
961e0be89a5120a1409ebc525cca6f603615a8a8
8c4037b501acd2ec3abc7925e66af8af40a2da9d | grep '^IB'
IB: Increase DMA max_segment_size on Mellanox hardware
IB/srp: try to use larger FMR sizes to cover our mappings
IB/srp: add support for indirect tables that don't fit in SRP_CMD
IB/srp: rework mapping engine to use multiple FMR entries
IB/srp: move IB CM setup completion into its own function
IB/srp: always avoid non-zero offsets into an FMR
Bart.
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <4DAD6EF2.5070405-5c4llco8/ftWk0Htik3J/w@public.gmane.org>
@ 2011-04-19 16:48 ` Christoph Hellwig
[not found] ` <20110419164815.GA30616-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>
0 siblings, 1 reply; 41+ messages in thread
From: Christoph Hellwig @ 2011-04-19 16:48 UTC (permalink / raw)
To: Jens Axboe
Cc: Bart Van Assche, Linus Torvalds, Rafael J. Wysocki,
Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Neil Brown
> + blk_run_queue_async(sdev->request_queue);
This doesn't even have to be async except when scsi drivers call
cmd->scsi_done directly. It seems like if this always went through the
softirq (or kblockd) we could still run it in context for the others.
> + /*
> + * This get/put dance makes no sense
> + */
> get_device(&rport->dev);
> -
> - spin_lock_irqsave(rport->rqst_q->queue_lock, flags);
> - flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) &&
> - !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags);
> - if (flagset)
> - queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q);
> - __blk_run_queue(rport->rqst_q);
> - if (flagset)
> - queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q);
> - spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags);
> -
> + blk_run_queue_async(rport->rqst_q);
And the QUEUE_FLAG_REENTER mess here never made sense either as it
tested for a bit being set and not set at the same time. So this one
actually should be able to be replaced by a plain blk_run_queue.
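
To illustrate the point above, here is a minimal user-space sketch, not kernel code. `FLAG_REENTER`, `test_flag()` and `old_flagset()` are hypothetical stand-ins for QUEUE_FLAG_REENTER, test_bit() and the condition removed from fc_bsg_goose_queue(). Because both tests look at the same queue's flags, the expression can never be true, whatever the flag word contains:

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for QUEUE_FLAG_REENTER (bit 6 before the patch). */
#define FLAG_REENTER 6

/* User-space approximation of test_bit() on a single flag word. */
static bool test_flag(int nr, unsigned long flags)
{
	return (flags >> nr) & 1UL;
}

/*
 * Mirrors the removed condition: "bit set AND bit not set" on the
 * same flag word, which is a contradiction.
 */
static bool old_flagset(unsigned long queue_flags)
{
	return test_flag(FLAG_REENTER, queue_flags) &&
	       !test_flag(FLAG_REENTER, queue_flags);
}

/* Checks both possible states of the bit; neither makes the test true. */
static void check_old_flagset(void)
{
	assert(!old_flagset(0UL));
	assert(!old_flagset(1UL << FLAG_REENTER));
}
```

Since the condition is always false, the queue_flag_set()/queue_flag_clear() pair in the removed code was dead, which is consistent with the suggestion that a plain blk_run_queue would do here.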
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <20110419164815.GA30616-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>
@ 2011-04-19 17:06 ` Jens Axboe
0 siblings, 0 replies; 41+ messages in thread
From: Jens Axboe @ 2011-04-19 17:06 UTC (permalink / raw)
To: Christoph Hellwig
Cc: Bart Van Assche, Linus Torvalds, Rafael J. Wysocki,
Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Neil Brown
On 2011-04-19 18:48, Christoph Hellwig wrote:
>> + blk_run_queue_async(sdev->request_queue);
>
> This doesn't even have to be async except when scsi drivers call
> cmd->scsi_done directly. It seems like if this always went through the
> softirq (or kblockd) we could still run it in context for the others.
Exactly. I'll pass an 'optimize' patch past James.
>> + /*
>> + * This get/put dance makes no sense
>> + */
>> get_device(&rport->dev);
>> -
>> - spin_lock_irqsave(rport->rqst_q->queue_lock, flags);
>> - flagset = test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags) &&
>> - !test_bit(QUEUE_FLAG_REENTER, &rport->rqst_q->queue_flags);
>> - if (flagset)
>> - queue_flag_set(QUEUE_FLAG_REENTER, rport->rqst_q);
>> - __blk_run_queue(rport->rqst_q);
>> - if (flagset)
>> - queue_flag_clear(QUEUE_FLAG_REENTER, rport->rqst_q);
>> - spin_unlock_irqrestore(rport->rqst_q->queue_lock, flags);
>> -
>> + blk_run_queue_async(rport->rqst_q);
>
> And the QUEUE_FLAG_REENTER mess here never made sense either as it
> tested for a bit being set and not set at the same time. So this one
> actually should be able to be replaced by a plain blk_run_queue.
Yep, it's completely broken as-is.
--
Jens Axboe
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTimLOgY1opmAt+rxmYtLa-6adL98wg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2011-04-19 16:32 ` Linus Torvalds
@ 2011-04-19 17:43 ` Jens Axboe
1 sibling, 0 replies; 41+ messages in thread
From: Jens Axboe @ 2011-04-19 17:43 UTC (permalink / raw)
To: Bart Van Assche
Cc: Linus Torvalds, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown,
David Dillow
On 2011-04-19 18:13, Bart Van Assche wrote:
> The same test with an initiator running 2.6.39-rc4 +
> git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch
> yields about 155.000 IOPS on my test setup, or the same performance as
> with 2.6.38.3. I'm running the above patch through an I/O stress test
> now.
OK, so parity, that's good. With the above patch, I can take a single
device from ~400K IOPS on 2.6.38 to ~440K IOPS on 2.6.39-rc4+patches.
--
Jens Axboe
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTikzRE7sA8_NNgVUpOqR_zGtw+YUvQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-04-19 17:43 ` Jens Axboe
0 siblings, 0 replies; 41+ messages in thread
From: Jens Axboe @ 2011-04-19 17:43 UTC (permalink / raw)
To: Linus Torvalds
Cc: Bart Van Assche, Rafael J. Wysocki, Linux Kernel Mailing List,
Kernel Testers List, Maciej Rutecki, Florian Mickler, Neil Brown,
David Dillow
On 2011-04-19 18:32, Linus Torvalds wrote:
> On Tue, Apr 19, 2011 at 9:13 AM, Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote:
>>
>> The same test with an initiator running 2.6.39-rc4 +
>> git://git.kernel.dk/linux-2.6-block.git for-linus + the above patch
>> yields about 155.000 IOPS on my test setup, or the same performance as
>> with 2.6.38.3. I'm running the above patch through an I/O stress test
>> now.
>
> Goodie. So not only does that patch get back the 11%, it removes the
> crazy QUEUE_FLAG_REENTER flag that was broken to begin with. AND it
> removes a number of complicated lines.
>
> Halleluja.
Indeed, coming your way soonish.
--
Jens Axboe
* [Bug #32982] Kernel locks up a few minutes after boot
2011-04-30 19:42 2.6.39-rc5-git4: Reported regressions from 2.6.38 Rafael J. Wysocki
@ 2011-04-30 19:42 ` Rafael J. Wysocki
2011-04-30 19:51 ` Linus Torvalds
0 siblings, 1 reply; 41+ messages in thread
From: Rafael J. Wysocki @ 2011-04-30 19:42 UTC (permalink / raw)
To: Linux Kernel Mailing List
Cc: Kernel Testers List, Maciej Rutecki, Florian Mickler,
Bart Van Assche, Linus Torvalds
This message has been generated automatically as a part of a summary report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.38. Please verify if it still should be listed and let the tracking team
know (either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
Subject : Kernel locks up a few minutes after boot
Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2011-04-10 19:55 (21 days old)
* Re: [Bug #32982] Kernel locks up a few minutes after boot
2011-04-30 19:42 ` [Bug #32982] Kernel locks up a few minutes after boot Rafael J. Wysocki
@ 2011-04-30 19:51 ` Linus Torvalds
[not found] ` <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA@mail.gmail.com>
0 siblings, 1 reply; 41+ messages in thread
From: Linus Torvalds @ 2011-04-30 19:51 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Linux Kernel Mailing List, Kernel Testers List, Maciej Rutecki,
Florian Mickler, Bart Van Assche
I think we had all assumed that this was the MD problem that should
have been fixed in rc4 (the symptoms matched), but I don't think we
got any confirmation from Bart on that.
Bart? Does the problem still persist in current -git?
Linus
On Sat, Apr 30, 2011 at 12:42 PM, Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
> This message has been generated automatically as a part of a summary report
> of recent regressions.
>
> The following bug entry is on the current list of known regressions
> from 2.6.38. Please verify if it still should be listed and let the tracking team
> know (either way).
>
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=32982
> Subject : Kernel locks up a few minutes after boot
> Submitter : Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> Date : 2011-04-10 19:55 (21 days old)
>
>
>
* Re: [Bug #32982] Kernel locks up a few minutes after boot
[not found] ` <BANLkTik_aeVn9Jf_cWnoY0fNUm+tjMnixA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2011-05-01 17:01 ` Linus Torvalds
0 siblings, 0 replies; 41+ messages in thread
From: Linus Torvalds @ 2011-05-01 17:01 UTC (permalink / raw)
To: Bart Van Assche
Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
Maciej Rutecki, Florian Mickler
On Sun, May 1, 2011 at 2:55 AM, Bart Van Assche <bvanassche-HInyCGIudOg@public.gmane.org> wrote:
>
> There is something else and completely unrelated that is puzzling me though:
> on two almost identical systems one always recognizes all internal PCIe
> cards but the other does not. This seldom happened with
> 2.6.34 but happens frequently with 2.6.38 and 2.6.39-rcx. What I see is that
> during boot either both InfiniBand PCIe cards are recognized or one
> specific card is not recognized and doesn't even show up in the lspci
> output. A BIOS upgrade didn't help. Any idea where I should start looking to
> find the cause of this issue?
So it has happened sporadically before, but happens much more commonly
now? That very much implies some timing issue in PCI probing.
It could be, for example, that the card has a very slow reset
sequence, and doesn't respond to PCI config cycles until it has
internally booted fully. If so, a faster boot by the kernel might just
cause the Linux PCI enumeration to be done before the card is ready.
(That's a really unlikely scenario - I'm not seriously suggesting that
the card would be quite *that* stupid and slow. But there might
be similar issues at a much lower level, ie if the Linux pcie port
driver might be resetting the port and then trying to read the card
too quickly afterwards, and you'd want some added delay there).
Have you tried whether "pcie_ports=compat" (or "native") makes any difference?
But you should probably contact Jesse Barnes and the linux-pci mailing
list and see if anybody has any smarter ideas.
Linus
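
The timing hypothesis in the message above can be sketched in user space as follows. This is only an illustration under stated assumptions, not the kernel's PCI enumeration code: `struct fake_card`, `fake_read_vendor()` and `probe_with_retries()` are hypothetical names, and the vendor ID 0x1234 is made up. The one real-world detail relied on is that a config read which nobody answers returns all ones, so a vendor ID of 0xFFFF means "no device seen".

```c
#include <assert.h>
#include <stdint.h>

/* All-ones vendor ID: no device answered the config read. */
#define VENDOR_ID_NONE 0xFFFFu

/* Hypothetical simulated card that only answers after `ready_after` reads. */
struct fake_card {
	unsigned int reads;       /* config reads issued so far */
	unsigned int ready_after; /* reads that fail before the card is up */
	uint16_t vendor_id;       /* value returned once the card is ready */
};

/* Simulated vendor-ID config read against the fake card. */
static uint16_t fake_read_vendor(struct fake_card *c)
{
	if (c->reads++ < c->ready_after)
		return VENDOR_ID_NONE;
	return c->vendor_id;
}

/*
 * Poll the vendor ID up to max_tries times. Returns the number of
 * tries needed to see the card, or -1 if it never answered.
 */
static int probe_with_retries(struct fake_card *c, int max_tries)
{
	for (int i = 1; i <= max_tries; i++) {
		if (fake_read_vendor(c) != VENDOR_ID_NONE)
			return i;
		/* A real implementation would delay here before retrying. */
	}
	return -1;
}
```

With a card that needs three failed reads before it responds, a single probe attempt misses it while a probe allowed ten attempts finds it on the fourth, which is the kind of difference an added delay or retry would make if the hypothesis holds.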