* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <alpine.LFD.2.00.0904061443000.7443@localhost.localdomain>
@ 2009-04-06 22:34 ` Rafael J. Wysocki
2009-04-07 3:56 ` Trenton D. Adams
` (4 subsequent siblings)
5 siblings, 0 replies; 20+ messages in thread
From: Rafael J. Wysocki @ 2009-04-06 22:34 UTC (permalink / raw)
To: Linus Torvalds
Cc: Adrian Bunk, Trenton Adams, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linux PM List
On Monday 06 April 2009, Linus Torvalds wrote:
>
> On Mon, 6 Apr 2009, Rafael J. Wysocki wrote:
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13019
> > Subject : /proc/<pid>/maps offset output broken in 2.6.29
> > Submitter : "Chris Friesen" <cfriesen@nortel.com>
> > Date : 2009-04-01 23:18 (6 days old)
> > References : http://marc.info/?l=linuxppc-embedded&m=123862902916059&w=4
> > Handled-By : Hugh Dickins <hugh@veritas.com>
>
> I don't think that's a regression, nor really a bug. It looks cosmetic,
> and likely to be fixed, but not really worth worrying about.
OK, I've dropped it from the list.
Thanks,
Rafael
^ permalink raw reply [flat|nested] 20+ messages in thread* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <alpine.LFD.2.00.0904061443000.7443@localhost.localdomain>
2009-04-06 22:34 ` Rafael J. Wysocki
@ 2009-04-07 3:56 ` Trenton D. Adams
[not found] ` <9b1675090904062056v235af58ehc99cce8ff97fe501@mail.gmail.com>
` (3 subsequent siblings)
5 siblings, 0 replies; 20+ messages in thread
From: Trenton D. Adams @ 2009-04-07 3:56 UTC (permalink / raw)
To: Linus Torvalds
Cc: Adrian Bunk, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linux PM List
On Mon, Apr 6, 2009 at 3:53 PM, Linus Torvalds
<torvalds@linux-foundation.org> wrote:
>
>
> On Mon, 6 Apr 2009, Rafael J. Wysocki wrote:
>>
>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13018
>> Subject : 2.6.29 on MacBook 2,1 fails to reboot
>> Submitter : Trenton Adams <trenton.d.adams@gmail.com>
>> Date : 2009-03-30 2:04 (8 days old)
>> References : http://marc.info/?l=linux-kernel&m=123837870307249&w=4
>> Handled-By : "Morten P.D. Stevens" <mstevens@win-professional.com>
>
> This went through bisection, but looking at the email log, I tend to
> suspect that maybe Trenton marked some versions good even though they
> weren't (because they got versions numbers from v2.6.27), and didn't
> realize that that messes up bisection in a big way.
Is it appropriate for me to respond to these things?
I was wondering about that. Someone had mentioned that I should trust
the bisect, even when it takes me into "other versions", and it was
taking me through 2.6.27, which I thought was just really weird.
Would you like me to try the bisect again with a little more
diligence, or do you think it can be found with the info given? It
may take a week or so, due to being a bit busy.
Thanks
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <9b1675090904062056v235af58ehc99cce8ff97fe501@mail.gmail.com>]
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <9b1675090904062056v235af58ehc99cce8ff97fe501@mail.gmail.com>
@ 2009-04-07 4:07 ` Linus Torvalds
2009-04-07 4:23 ` Trenton D. Adams
0 siblings, 1 reply; 20+ messages in thread
From: Linus Torvalds @ 2009-04-07 4:07 UTC (permalink / raw)
To: Trenton D. Adams
Cc: Adrian Bunk, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linux PM List
On Mon, 6 Apr 2009, Trenton D. Adams wrote:
> >
> > This went through bisection, but looking at the email log, I tend to
> > suspect that maybe Trenton marked some versions good even though they
> > weren't (because they got versions numbers from v2.6.27), and didn't
> > realize that that messes up bisection in a big way.
>
> Is it appropriate for me to respond to these things?
Yes. I added you to the cc exactly because it was hard for me to judge
from the email discussion that is linked to in the regression list whether
you actually _did_ mark some versions good because of confusion about the
version numbering.
That would certainly explain why bisection didn't seem to work.
But it's not the _only_ reason bisection doesn't work. Sometimes you can
be as careful as possible, but if it's a bug that is even _slightly_ flaky
(timing-dependencies etc), and the bisection marked something good that
shouldn't have been (or vice versa, but that's unusual), then the
bisection end result won't be right.
So you may well have done everything right, and I'm not trying to blame
you. I just was hoping that maybe that confusion would explain why the
bisection didn't seem to pinpoint anything sane..
> I was wondering about that. Someone had mentioned that I should trust
> the bisect, even when it takes me into "other versions", and it was
> taking me through 2.6.27, which I thought was just really weird.
> Would you like me to try the bisect again with a little more
> diligence, or do you think it can be found with the info given? It
> may take a week or so, due to being a bit busy.
It would be good, especially if this bug doesn't end up being solved some
other way... And slow results are better than no results at all ;)
Linus
^ permalink raw reply [flat|nested] 20+ messages in thread* Re: 2.6.29-git13: Reported regressions from 2.6.28
2009-04-07 4:07 ` Linus Torvalds
@ 2009-04-07 4:23 ` Trenton D. Adams
0 siblings, 0 replies; 20+ messages in thread
From: Trenton D. Adams @ 2009-04-07 4:23 UTC (permalink / raw)
To: Linus Torvalds
Cc: Adrian Bunk, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linux PM List
On Mon, Apr 6, 2009 at 10:07 PM, Linus Torvalds
<torvalds@linux-foundation.org> wrote:
>
>
> On Mon, 6 Apr 2009, Trenton D. Adams wrote:
>> >
>> > This went through bisection, but looking at the email log, I tend to
>> > suspect that maybe Trenton marked some versions good even though they
>> > weren't (because they got versions numbers from v2.6.27), and didn't
>> > realize that that messes up bisection in a big way.
>>
>> Is it appropriate for me to respond to these things?
>
> Yes. I added you to the cc exactly because it was hard for me to judge
> from the email discussion that is linked to in the regression list whether
> you actually _did_ mark some versions good because of confusion about the
> version numbering.
Oh, didn't know you CC'd me. I have just been searching for my name
to try and keep up to date on threads I participated in, because the
list volume is just too much.
> So you may well have done everything right, and I'm not trying to blame
> you. I just was hoping that maybe that confusion would explain why the
> bisection didn't seem to pinpoint anything sane..
This gave me a bit of a chuckle. ROFL. I didn't think you were
_blaming_ me. And even if you were, I'd just say "I'M HUMAN MAN". ;)
It is very well possible that I simply booted the wrong kernel, seeing
I had like 20 to choose from at the time, after doing all those
bisects. This time, I will make the new kernels the default so that
it is impossible to happen again, assuming that is what happened. I
will also stop and start alsa 2 or 3 times, in case it is a deadlock
timing issue as you mentioned.
I'm glad to know that I shouldn't be switching kernel versions though.
I thought that was really crazy.
>
>
> It would be good, especially if this bug doesn't end up being solved some
> other way... And slow results are better than no results at all ;)
>
> Linus
>
Okay, I will try again. Hopefully in a few days I'll have something
for the bug.
^ permalink raw reply [flat|nested] 20+ messages in thread
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <alpine.LFD.2.00.0904061443000.7443@localhost.localdomain>
` (2 preceding siblings ...)
[not found] ` <9b1675090904062056v235af58ehc99cce8ff97fe501@mail.gmail.com>
@ 2009-04-07 16:16 ` Stefan Richter
[not found] ` <49DB7C77.1000702@s5r6.in-berlin.de>
[not found] ` <200904070035.00784.rjw@sisk.pl>
5 siblings, 0 replies; 20+ messages in thread
From: Stefan Richter @ 2009-04-07 16:16 UTC (permalink / raw)
To: Trenton Adams
Cc: Adrian Bunk, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
Linus Torvalds wrote:
> On Mon, 6 Apr 2009, Rafael J. Wysocki wrote:
>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13018
>> Subject : 2.6.29 on MacBook 2,1 fails to reboot
>> Submitter : Trenton Adams <trenton.d.adams@gmail.com>
>> Date : 2009-03-30 2:04 (8 days old)
>> References : http://marc.info/?l=linux-kernel&m=123837870307249&w=4
>> Handled-By : "Morten P.D. Stevens" <mstevens@win-professional.com>
>
> This went through bisection, but looking at the email log, I tend to
> suspect that maybe Trenton marked some versions good even though they
> weren't (because they got versions numbers from v2.6.27), and didn't
> realize that that messes up bisection in a big way.
>
> The bug _sounds_ like some deadlock due to lock problems - the shutdown
> path often triggers locks that no other path really cares about. And we
> had some lock problems in the sound subsystem that got fixed post-2.6.28,
> for example.
>
> And it looks like the problem is somewhere in sound shutdown:
>
> 12181 delete_module("snd_hda_codec", O_RDONLY|O_EXCL) = -1 EAGAIN (Resource temporarily unavailable) <0.000011>
>
> So commits like 91054598f794fb5d8a0b1e747ff8e2e8fc2115b3 ("ALSA: pcm_oss,
> fix locking typo") might explain it.
Trenton,
could it be the same as this one?
http://bugzilla.kernel.org/show_bug.cgi?id=12321
"System hangs when unloading alsa modules"
http://bugs.gentoo.org/show_bug.cgi?id=253535
"System hangs when unloading alsa modules on Kernel >2.6.28"
--
Stefan Richter
-=====-==--= -=-- --===
http://arcgraph.de/sr/
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <49DB7C77.1000702@s5r6.in-berlin.de>]
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <49DB7C77.1000702@s5r6.in-berlin.de>
@ 2009-04-07 16:44 ` Trenton D. Adams
[not found] ` <9b1675090904070944m798ed608i1d9194ebd1ed3961@mail.gmail.com>
1 sibling, 0 replies; 20+ messages in thread
From: Trenton D. Adams @ 2009-04-07 16:44 UTC (permalink / raw)
To: Stefan Richter
Cc: Adrian Bunk, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
On Tue, Apr 7, 2009 at 10:16 AM, Stefan Richter
<stefanr@s5r6.in-berlin.de> wrote:
> Linus Torvalds wrote:
>> On Mon, 6 Apr 2009, Rafael J. Wysocki wrote:
>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13018
>>> Subject : 2.6.29 on MacBook 2,1 fails to reboot
>>> Submitter : Trenton Adams <trenton.d.adams@gmail.com>
>>> Date : 2009-03-30 2:04 (8 days old)
>>> References : http://marc.info/?l=linux-kernel&m=123837870307249&w=4
>>> Handled-By : "Morten P.D. Stevens" <mstevens@win-professional.com>
>>
>
> Trenton,
> could it be the same as this one?
> http://bugzilla.kernel.org/show_bug.cgi?id=12321
> "System hangs when unloading alsa modules"
> http://bugs.gentoo.org/show_bug.cgi?id=253535
> "System hangs when unloading alsa modules on Kernel >2.6.28"
> --
> Stefan Richter
> -=====-==--= -=-- --===
> http://arcgraph.de/sr/
>
The first one looks similar, if not identical. The second one
doesn't, because my problem happens on 2.6.29 only, not 2.6.28.
Either way, the unload problem with the module didn't happen in
2.6.28.
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <9b1675090904070944m798ed608i1d9194ebd1ed3961@mail.gmail.com>]
* 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28)
[not found] ` <9b1675090904070944m798ed608i1d9194ebd1ed3961@mail.gmail.com>
@ 2009-04-07 17:10 ` Stefan Richter
2009-04-07 17:20 ` Justin Mattock
` (2 more replies)
0 siblings, 3 replies; 20+ messages in thread
From: Stefan Richter @ 2009-04-07 17:10 UTC (permalink / raw)
To: Trenton D. Adams
Cc: Adrian Bunk, Linux SCSI List, Takashi Iwai, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
Trenton D. Adams wrote:
> On Tue, Apr 7, 2009 at 10:16 AM, Stefan Richter
> <stefanr@s5r6.in-berlin.de> wrote:
>> Linus Torvalds wrote:
>>> On Mon, 6 Apr 2009, Rafael J. Wysocki wrote:
>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13018
>>>> Subject : 2.6.29 on MacBook 2,1 fails to reboot
...
>>> The bug _sounds_ like some deadlock due to lock problems - the shutdown
>>> path often triggers locks that no other path really cares about. And we
>>> had some lock problems in the sound subsystem that got fixed post-2.6.28,
>>> for example.
>>>
>>> And it looks like the problem is somewhere in sound shutdown:
>>>
>>> 12181 delete_module("snd_hda_codec", O_RDONLY|O_EXCL) = -1 EAGAIN (Resource temporarily unavailable) <0.000011>
>>>
>>> So commits like 91054598f794fb5d8a0b1e747ff8e2e8fc2115b3 ("ALSA: pcm_oss,
>>> fix locking typo") might explain it.
...
>> could it be the same as this one?
>> http://bugzilla.kernel.org/show_bug.cgi?id=12321
>> "System hangs when unloading alsa modules"
>> http://bugs.gentoo.org/show_bug.cgi?id=253535
>> "System hangs when unloading alsa modules on Kernel >2.6.28"
...
> The first one looks similar, if not identical. The second one
> doesn't, because my problem happens on 2.6.29 only, not 2.6.28.
The Gentoo bug entry too is about a regression _after_ 2.6.28. :-)
I.e. 2.6.28.y. are unaffected. It's actually just the downstream
duplicate of the kernel.org bug entry.
> Either way, the unload problem with the module didn't happen in
> 2.6.28.
Interdependencies between ALSA modules have changed. The Gentoo init
scripts attempted to unload them in an order which deadlocked modprobe
due to dependencies. The fix for Gentoo is to just not unload the
modules on system shutdown. My Gentoo/amd64 Mac mini was affected by
this too; fixed by userland update.
(Added Cc to tiwai@suse.de)
--
Stefan Richter
-=====-=-=== -=-= -==-=
http://arcgraph.de/sr/
^ permalink raw reply [flat|nested] 20+ messages in thread* Re: 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28)
2009-04-07 17:10 ` 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28) Stefan Richter
@ 2009-04-07 17:20 ` Justin Mattock
2009-04-07 18:22 ` Trenton D. Adams
[not found] ` <9b1675090904071122k6a53295fwfffc336011edee8e@mail.gmail.com>
2 siblings, 0 replies; 20+ messages in thread
From: Justin Mattock @ 2009-04-07 17:20 UTC (permalink / raw)
To: Stefan Richter
Cc: Adrian Bunk, Trenton D. Adams, Linux SCSI List, Takashi Iwai,
Network Development, Linux Kernel Mailing List,
Natalie Protasevich, Linux ACPI, Andrew Morton,
Kernel Testers List, Linus Torvalds, Linux PM List
On Tue, Apr 7, 2009 at 10:10 AM, Stefan Richter
<stefanr@s5r6.in-berlin.de> wrote:
> Trenton D. Adams wrote:
>> On Tue, Apr 7, 2009 at 10:16 AM, Stefan Richter
>> <stefanr@s5r6.in-berlin.de> wrote:
>>> Linus Torvalds wrote:
>>>> On Mon, 6 Apr 2009, Rafael J. Wysocki wrote:
>>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13018
>>>>> Subject : 2.6.29 on MacBook 2,1 fails to reboot
> ...
>>>> The bug _sounds_ like some deadlock due to lock problems - the shutdown
>>>> path often triggers locks that no other path really cares about. And we
>>>> had some lock problems in the sound subsystem that got fixed post-2.6.28,
>>>> for example.
>>>>
>>>> And it looks like the problem is somewhere in sound shutdown:
>>>>
>>>> 12181 delete_module("snd_hda_codec", O_RDONLY|O_EXCL) = -1 EAGAIN (Resource temporarily unavailable) <0.000011>
>>>>
>>>> So commits like 91054598f794fb5d8a0b1e747ff8e2e8fc2115b3 ("ALSA: pcm_oss,
>>>> fix locking typo") might explain it.
> ...
>>> could it be the same as this one?
>>> http://bugzilla.kernel.org/show_bug.cgi?id=12321
>>> "System hangs when unloading alsa modules"
>>> http://bugs.gentoo.org/show_bug.cgi?id=253535
>>> "System hangs when unloading alsa modules on Kernel >2.6.28"
> ...
>> The first one looks similar, if not identical. The second one
>> doesn't, because my problem happens on 2.6.29 only, not 2.6.28.
>
> The Gentoo bug entry too is about a regression _after_ 2.6.28. :-)
> I.e. 2.6.28.y. are unaffected. It's actually just the downstream
> duplicate of the kernel.org bug entry.
>
>> Either way, the unload problem with the module didn't happen in
>> 2.6.28.
>
> Interdependencies between ALSA modules have changed. The Gentoo init
> scripts attempted to unload them in an order which deadlocked modprobe
> due to dependencies. The fix for Gentoo is to just not unload the
> modules on system shutdown. My Gentoo/amd64 Mac mini was affected by
> this too; fixed by userland update.
>
> (Added Cc to tiwai@suse.de)
> --
> Stefan Richter
> -=====-=-=== -=-= -==-=
> http://arcgraph.de/sr/
> --
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
>
With the imac(kernel 2.6.29)
/sbin/shutdown -h now (works)
but
/sbin/reboot
hangs
--
Justin P. Mattock
^ permalink raw reply [flat|nested] 20+ messages in thread* Re: 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28)
2009-04-07 17:10 ` 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28) Stefan Richter
2009-04-07 17:20 ` Justin Mattock
@ 2009-04-07 18:22 ` Trenton D. Adams
[not found] ` <9b1675090904071122k6a53295fwfffc336011edee8e@mail.gmail.com>
2 siblings, 0 replies; 20+ messages in thread
From: Trenton D. Adams @ 2009-04-07 18:22 UTC (permalink / raw)
To: Stefan Richter
Cc: Adrian Bunk, Linux SCSI List, Takashi Iwai, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
On Tue, Apr 7, 2009 at 11:10 AM, Stefan Richter
<stefanr@s5r6.in-berlin.de> wrote:
> Trenton D. Adams wrote:
> Interdependencies between ALSA modules have changed. The Gentoo init
> scripts attempted to unload them in an order which deadlocked modprobe
> due to dependencies. The fix for Gentoo is to just not unload the
> modules on system shutdown. My Gentoo/amd64 Mac mini was affected by
> this too; fixed by userland update.
>
While that is interesting, I am not seeing that problem on my Gentoo
box (the macbook), which is completely up-to-date. 2.6.28 works, and
2.6.29 doesn't. Same init scripts, different kernels.
And sure, I could put a comment on the rmmod, in the init script, but
IMO that would be a hack around a _bug_. Which is fine for me. But,
is it worth leaving the issue in the kernel?
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <9b1675090904071122k6a53295fwfffc336011edee8e@mail.gmail.com>]
* Re: 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28)
[not found] ` <9b1675090904071122k6a53295fwfffc336011edee8e@mail.gmail.com>
@ 2009-04-07 19:23 ` Stefan Richter
[not found] ` <49DBA821.1070408@s5r6.in-berlin.de>
1 sibling, 0 replies; 20+ messages in thread
From: Stefan Richter @ 2009-04-07 19:23 UTC (permalink / raw)
To: Trenton D. Adams
Cc: Adrian Bunk, Linux SCSI List, Takashi Iwai, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Stefan Richter, Andrew Morton, Kernel Testers List,
Linus Torvalds, Linux PM List
Trenton D. Adams wrote:
> On Tue, Apr 7, 2009 at 11:10 AM, Stefan Richter
> <stefanr@s5r6.in-berlin.de> wrote:
>>>> http://bugs.gentoo.org/show_bug.cgi?id=253535
>> Interdependencies between ALSA modules have changed. The Gentoo init
>> scripts attempted to unload them in an order which deadlocked modprobe
>> due to dependencies. The fix for Gentoo is to just not unload the
>> modules on system shutdown. My Gentoo/amd64 Mac mini was affected by
>> this too; fixed by userland update.
>>
>
> While that is interesting, I am not seeing that problem on my Gentoo
> box (the macbook), which is completely up-to-date. 2.6.28 works, and
> 2.6.29 doesn't. Same init scripts, different kernels.
Note that the respective update changed /etc/conf.d/alsasound (a local
configuration file) to include
UNLOAD_ON_STOP="no"
KILLPROC_ON_STOP="no"
This change by update is not activated by a mere emerge; one needs to
incorporate that change with dispatch-conf or an equivalent method.
(Or simply edit the file to have these variables set to "no".)
> And sure, I could put a comment on the rmmod, in the init script, but
> IMO that would be a hack around a _bug_. Which is fine for me. But,
> is it worth leaving the issue in the kernel?
Is it a kernel issue if a script attempts to unload a busy module, then
fails to proceed? I wouldn't think so.
But more importantly, is this init scripts related bug really what's
happening at your system? Or do you actually experience an entirely
different bug?
--
Stefan Richter
-=====-=-=== -=-= -==-=
http://arcgraph.de/sr/
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <49DBA821.1070408@s5r6.in-berlin.de>]
* Re: 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28)
[not found] ` <49DBA821.1070408@s5r6.in-berlin.de>
@ 2009-04-07 20:52 ` Trenton D. Adams
[not found] ` <9b1675090904071352x513510ecm8b2574859b088275@mail.gmail.com>
1 sibling, 0 replies; 20+ messages in thread
From: Trenton D. Adams @ 2009-04-07 20:52 UTC (permalink / raw)
To: Stefan Richter
Cc: Adrian Bunk, Linux SCSI List, Takashi Iwai, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
On Tue, Apr 7, 2009 at 1:23 PM, Stefan Richter
<stefanr@s5r6.in-berlin.de> wrote:
> Trenton D. Adams wrote:
>> On Tue, Apr 7, 2009 at 11:10 AM, Stefan Richter
>> <stefanr@s5r6.in-berlin.de> wrote:
>>>>> http://bugs.gentoo.org/show_bug.cgi?id=253535
>>> Interdependencies between ALSA modules have changed. The Gentoo init
>>> scripts attempted to unload them in an order which deadlocked modprobe
>>> due to dependencies. The fix for Gentoo is to just not unload the
>>> modules on system shutdown. My Gentoo/amd64 Mac mini was affected by
>>> this too; fixed by userland update.
>>>
>>
>> While that is interesting, I am not seeing that problem on my Gentoo
>> box (the macbook), which is completely up-to-date. 2.6.28 works, and
>> 2.6.29 doesn't. Same init scripts, different kernels.
>
> Note that the respective update changed /etc/conf.d/alsasound (a local
> configuration file) to include
> UNLOAD_ON_STOP="no"
> KILLPROC_ON_STOP="no"
> This change by update is not activated by a mere emerge; one needs to
> incorporate that change with dispatch-conf or an equivalent method.
> (Or simply edit the file to have these variables set to "no".)
I run dispatch-conf every time I update. It did not set it to no by
default. I do see the option though.
>
>> And sure, I could put a comment on the rmmod, in the init script, but
>> IMO that would be a hack around a _bug_. Which is fine for me. But,
>> is it worth leaving the issue in the kernel?
>
> Is it a kernel issue if a script attempts to unload a busy module, then
> fails to proceed? I wouldn't think so.
I don't know really. It worked before, now it doesn't. But, now I'm
recalling something you said earlier. They are being done in the
wrong order. The gentoo bug mentions the correct order. I hadn't
realized that.
>
> But more importantly, is this init scripts related bug really what's
> happening at your system? Or do you actually experience an entirely
> different bug?
It could be. I will try it out when I get home from work, and get
back to you. That would be cool if it was a simple init script
problem. Cause then I don't have to do anymore git bisects. ;)
> --
> Stefan Richter
> -=====-=-=== -=-= -==-=
> http://arcgraph.de/sr/
>
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <9b1675090904071352x513510ecm8b2574859b088275@mail.gmail.com>]
* Re: 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28)
[not found] ` <9b1675090904071352x513510ecm8b2574859b088275@mail.gmail.com>
@ 2009-04-08 1:16 ` Trenton D. Adams
[not found] ` <9b1675090904071816k59080f1fn39737767db4dc0c@mail.gmail.com>
1 sibling, 0 replies; 20+ messages in thread
From: Trenton D. Adams @ 2009-04-08 1:16 UTC (permalink / raw)
To: Stefan Richter
Cc: Adrian Bunk, Linux SCSI List, Takashi Iwai, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
On Tue, Apr 7, 2009 at 2:52 PM, Trenton D. Adams
<trenton.d.adams@gmail.com> wrote:
> On Tue, Apr 7, 2009 at 1:23 PM, Stefan Richter
> <stefanr@s5r6.in-berlin.de> wrote:
>>
>> But more importantly, is this init scripts related bug really what's
>> happening at your system? Or do you actually experience an entirely
>> different bug?
>
> It could be. I will try it out when I get home from work, and get
> back to you. That would be cool if it was a simple init script
> problem. Cause then I don't have to do anymore git bisects. ;)
Yes, the order is what causes this, you can close the bug.
Sorry for wasting everyone's time. I thought it was a kernel bug
because I had not needed to update my gentoo system to make it happen.
Thanks.
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <9b1675090904071816k59080f1fn39737767db4dc0c@mail.gmail.com>]
* Re: 2.6.29 on MacBook 2, 1 fails to reboot (was Re: 2.6.29-git13: Reported regressions from 2.6.28)
[not found] ` <9b1675090904071816k59080f1fn39737767db4dc0c@mail.gmail.com>
@ 2009-04-08 8:00 ` Rafael J. Wysocki
0 siblings, 0 replies; 20+ messages in thread
From: Rafael J. Wysocki @ 2009-04-08 8:00 UTC (permalink / raw)
To: Trenton D. Adams
Cc: Adrian Bunk, Linux SCSI List, Takashi Iwai, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Stefan Richter, Andrew Morton, Kernel Testers List,
Linus Torvalds, Linux PM List
On Wednesday 08 April 2009, Trenton D. Adams wrote:
> On Tue, Apr 7, 2009 at 2:52 PM, Trenton D. Adams
> <trenton.d.adams@gmail.com> wrote:
> > On Tue, Apr 7, 2009 at 1:23 PM, Stefan Richter
> > <stefanr@s5r6.in-berlin.de> wrote:
> >>
> >> But more importantly, is this init scripts related bug really what's
> >> happening at your system? Or do you actually experience an entirely
> >> different bug?
> >
> > It could be. I will try it out when I get home from work, and get
> > back to you. That would be cool if it was a simple init script
> > problem. Cause then I don't have to do anymore git bisects. ;)
>
> Yes, the order is what causes this, you can close the bug.
Closed.
Thanks,
Rafael
^ permalink raw reply [flat|nested] 20+ messages in thread
[parent not found: <200904070035.00784.rjw@sisk.pl>]
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <200904070035.00784.rjw@sisk.pl>
@ 2009-04-16 21:08 ` Chris Friesen
[not found] ` <49E79E58.9000606@nortel.com>
1 sibling, 0 replies; 20+ messages in thread
From: Chris Friesen @ 2009-04-16 21:08 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Adrian Bunk, Trenton Adams, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
Rafael J. Wysocki wrote:
> On Monday 06 April 2009, Linus Torvalds wrote:
>> On Mon, 6 Apr 2009, Rafael J. Wysocki wrote:
>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13019
>>> Subject : /proc/<pid>/maps offset output broken in 2.6.29
>>> Submitter : "Chris Friesen" <cfriesen@nortel.com>
>>> Date : 2009-04-01 23:18 (6 days old)
>>> References : http://marc.info/?l=linuxppc-embedded&m=123862902916059&w=4
>>> Handled-By : Hugh Dickins <hugh@veritas.com>
>> I don't think that's a regression, nor really a bug. It looks cosmetic,
>> and likely to be fixed, but not really worth worrying about.
>
> OK, I've dropped it from the list.
I'm okay with that. The problem causes some backwards compatibility
problems with existing apps that get confused by the large "offset"
number. The fix is going to cause problems too, but in a different way.
We'll work around it.
Chris
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <49E79E58.9000606@nortel.com>]
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <49E79E58.9000606@nortel.com>
@ 2009-04-16 21:35 ` Linus Torvalds
[not found] ` <alpine.LFD.2.00.0904161432000.4042@localhost.localdomain>
1 sibling, 0 replies; 20+ messages in thread
From: Linus Torvalds @ 2009-04-16 21:35 UTC (permalink / raw)
To: Chris Friesen
Cc: Adrian Bunk, Trenton Adams, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linux PM List
On Thu, 16 Apr 2009, Chris Friesen wrote:
>
> I'm okay with that. The problem causes some backwards compatibility problems
> with existing apps that get confused by the large "offset" number. The fix is
> going to cause problems too, but in a different way.
>
> We'll work around it.
If you have actual apps that care, that's a different issue.
We do try to bend over backwards on ABI issues if it really is noticeable
for applications. Now, in this case, if you can just fix your app to not
care (because it really was badly written in the first place to even
notice), then that is the _much_ superior solution.
But if you actually have binary-only commercial apps that break, we'll do
a compatibility thing rather than the 0 that already got merged.
Although I don't really even see what we can sanely do except for the 0
case. We could put the virtual address in there instead of zero (I forget
what old kernels used to do - whatever magic value the anonymous mappings
got, it wasn't really designed as an important value in its own right, it
was designed to trigger the "we can merge these vma's" logic.
Linus
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <alpine.LFD.2.00.0904161432000.4042@localhost.localdomain>]
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <alpine.LFD.2.00.0904161432000.4042@localhost.localdomain>
@ 2009-04-20 21:22 ` Chris Friesen
[not found] ` <49ECE783.5050704@nortel.com>
1 sibling, 0 replies; 20+ messages in thread
From: Chris Friesen @ 2009-04-20 21:22 UTC (permalink / raw)
To: Linus Torvalds
Cc: Adrian Bunk, Trenton Adams, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linux PM List
Linus Torvalds wrote:
>
> On Thu, 16 Apr 2009, Chris Friesen wrote:
>> I'm okay with that. The problem causes some backwards compatibility problems
>> with existing apps that get confused by the large "offset" number. The fix is
>> going to cause problems too, but in a different way.
>>
>> We'll work around it.
>
> If you have actual apps that care, that's a different issue.
>
> We do try to bend over backwards on ABI issues if it really is noticeable
> for applications. Now, in this case, if you can just fix your app to not
> care (because it really was badly written in the first place to even
> notice), then that is the _much_ superior solution.
Yep, we can fix the app to ignore that field for anonymous mappings.
> Although I don't really even see what we can sanely do except for the 0
> case. We could put the virtual address in there instead of zero (I forget
> what old kernels used to do - whatever magic value the anonymous mappings
> got, it wasn't really designed as an important value in its own right, it
> was designed to trigger the "we can merge these vma's" logic.
For anonymous mappings, the older kernels put the starting address of
the VMA (from the point of view of the app) as the offset. Until the
recent change, new kernels still did this for most VMAs, but the stack
offset was a 64-bit value with no obvious relation to the VMA start address.
Chris
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <49ECE783.5050704@nortel.com>]
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <49ECE783.5050704@nortel.com>
@ 2009-04-20 23:08 ` Hugh Dickins
[not found] ` <Pine.LNX.4.64.0904202352520.2924@blonde.anvils>
1 sibling, 0 replies; 20+ messages in thread
From: Hugh Dickins @ 2009-04-20 23:08 UTC (permalink / raw)
To: Chris Friesen
Cc: Adrian Bunk, Trenton Adams, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
On Mon, 20 Apr 2009, Chris Friesen wrote:
> Linus Torvalds wrote:
> > On Thu, 16 Apr 2009, Chris Friesen wrote:
> > > I'm okay with that. The problem causes some backwards compatibility
> > > problems
> > > with existing apps that get confused by the large "offset" number. The
> > > fix is
> > > going to cause problems too, but in a different way.
> > >
> > > We'll work around it.
> >
> > If you have actual apps that care, that's a different issue.
Yes, that's what I told Chris too.
But asked for more info, suspecting his app was already broken.
> >
> > We do try to bend over backwards on ABI issues if it really is noticeable
> > for applications. Now, in this case, if you can just fix your app to not
> > care (because it really was badly written in the first place to even
> > notice), then that is the _much_ superior solution.
>
> Yep, we can fix the app to ignore that field for anonymous mappings.
>
> > Although I don't really even see what we can sanely do except for the 0
> > case. We could put the virtual address in there instead of zero (I forget
> > what old kernels used to do - whatever magic value the anonymous mappings
> > got, it wasn't really designed as an important value in its own right, it
> > was designed to trigger the "we can merge these vma's" logic.
>
> For anonymous mappings, the older kernels put the starting address of the VMA
> (from the point of view of the app) as the offset. Until the recent change,
> new kernels still did this for most VMAs, but the stack offset was a 64-bit
> value with no obvious relation to the VMA start address.
No, what they put there was something that in most cases matched the
starting address of the VMA; but try moving that VMA with mremap (and
an old /proc/<pid>/maps!) and you'll see that the "offset" remained
unchanged even when the starting address of the VMA was changed.
(The offset remaining constant so that rmap can locate the VMA's pages
and unmap them, despite their being mapped at different virtual
addresses in parent and child after a move in one of them.)
... so I think your app was indeed already broken, wasn't it?
It's also unclear why you'd want to use the offset field for the
starting address of the VMA, when /proc/<pid>/maps already shows
the starting address of the VMA. I think you've more to tell us!
Hugh
^ permalink raw reply [flat|nested] 20+ messages in thread[parent not found: <Pine.LNX.4.64.0904202352520.2924@blonde.anvils>]
* Re: 2.6.29-git13: Reported regressions from 2.6.28
[not found] ` <Pine.LNX.4.64.0904202352520.2924@blonde.anvils>
@ 2009-04-22 19:32 ` Chris Friesen
0 siblings, 0 replies; 20+ messages in thread
From: Chris Friesen @ 2009-04-22 19:32 UTC (permalink / raw)
To: Hugh Dickins
Cc: Adrian Bunk, Trenton Adams, Linux SCSI List, Network Development,
Linux Kernel Mailing List, Natalie Protasevich, Linux ACPI,
Andrew Morton, Kernel Testers List, Linus Torvalds, Linux PM List
Hugh Dickins wrote:
> On Mon, 20 Apr 2009, Chris Friesen wrote:
>> For anonymous mappings, the older kernels put the starting address of the VMA
>> (from the point of view of the app) as the offset. Until the recent change,
>> new kernels still did this for most VMAs, but the stack offset was a 64-bit
>> value with no obvious relation to the VMA start address.
>
> No, what they put there was something that in most cases matched the
> starting address of the VMA; but try moving that VMA with mremap (and
> an old /proc/<pid>/maps!) and you'll see that the "offset" remained
> unchanged even when the starting address of the VMA was changed.
>
> (The offset remaining constant so that rmap can locate the VMA's pages
> and unmap them, despite their being mapped at different virtual
> addresses in parent and child after a move in one of them.)
>
> ... so I think your app was indeed already broken, wasn't it?
>
> It's also unclear why you'd want to use the offset field for the
> starting address of the VMA, when /proc/<pid>/maps already shows
> the starting address of the VMA. I think you've more to tell us!
Yeah, given the above the app was broken. We just didn't run into any
cases where the assumption caused any problems.
Also, it's not so much that we were relying on the offset value for
anything, so much as we were parsing the file and had made some
assumptions about valid offsets for anonymous memory.
Anyways, we'll fix it going forward to simply ignore the offset for
anonymous memory.
Chris
^ permalink raw reply [flat|nested] 20+ messages in thread