* 2.6.30-rc6: Reported regressions from 2.6.29
@ 2009-05-16 19:14 Rafael J. Wysocki
2009-05-16 19:20 ` [Bug #13328] b44: eth0: BUG! Timeout waiting for bit 00000002 of register 42c to clear Rafael J. Wysocki
` (2 more replies)
0 siblings, 3 replies; 13+ messages in thread
From: Rafael J. Wysocki @ 2009-05-16 19:14 UTC
To: Linux Kernel Mailing List
Cc: Adrian Bunk, Andrew Morton, Linus Torvalds, Natalie Protasevich,
Kernel Testers List, Network Development, Linux ACPI,
Linux PM List, Linux SCSI List, Linux Wireless List, DRI
This message contains a list of some regressions from 2.6.29, for which there
are no fixes in the mainline I know of. If any of them have been fixed already,
please let me know.
If you know of any other unresolved regressions from 2.6.29, please let me know
as well and I'll add them to the list. Also, please let me know if any of the
entries below are invalid.
Each entry from the list will be sent additionally in an automatic reply to
this message with CCs to the people involved in reporting and handling the
issue.
Listed regressions statistics:
  Date          Total  Pending  Unresolved
  ----------------------------------------
  2009-05-16       81       36          33
  2009-04-25       55       36          26
  2009-04-17       37       35          28
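The relationship between the three columns can be checked with a short Python sketch. It assumes (the message does not say so explicitly) that "Pending" counts the unresolved regressions plus those with patches available, so that Total minus Pending is the number already closed. The names STATS and summarize are illustrative only.

```python
# Rows copied from the statistics table: (date, total, pending, unresolved).
STATS = [
    ("2009-05-16", 81, 36, 33),
    ("2009-04-25", 55, 36, 26),
    ("2009-04-17", 37, 35, 28),
]


def summarize(rows):
    """Derive closed and with-patch counts per row (assumed column semantics)."""
    out = []
    for date, total, pending, unresolved in rows:
        out.append({
            "date": date,
            "closed": total - pending,             # no longer pending
            "with_patches": pending - unresolved,  # pending, but a patch exists
        })
    return out


for row in summarize(STATS):
    print(row["date"], "closed:", row["closed"],
          "with patches:", row["with_patches"])
```

Under that assumption the 2009-05-16 row gives three regressions with patches, which agrees with the three entries listed in the "Regressions with patches" section of this report.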
Unresolved regressions
----------------------
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13329
Subject : cifs_close: NULL pointer dereference
Submitter : Luca Tettamanti <kronos.it-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-05-16 16:28 (1 days old)
References : http://marc.info/?l=linux-kernel&m=124249133701702&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13328
Subject : b44: eth0: BUG! Timeout waiting for bit 00000002 of register 42c to clear.
Submitter : Francis Moreau <francis.moro-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-05-03 16:22 (14 days old)
References : http://marc.info/?l=linux-kernel&m=124136778012280&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13327
Subject : Regression: 2.6.30-rc5 and rt2x00 / rt2500pci
Submitter : Ken Lewis <kennylewis-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-05-15 14:40 (2 days old)
References : http://marc.info/?l=linux-kernel&m=124239988223614&w=4
Handled-By : John W. Linville <linville-2XuSBdqkA4R54TAoqtyWWQ@public.gmane.org>
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13326
Subject : Null pointer dereference in rtc-cmos driver
Submitter : Ozan Çağlayan <ozan-caicS1wCkhO6A22drWdTBw@public.gmane.org>
Date : 2009-05-14 16:16 (3 days old)
References : http://marc.info/?l=linux-kernel&m=124231783704696&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13325
Subject : 2.6.30-rc kills my box hard - and lockdep chains
Submitter : Jonathan Corbet <corbet-T1hC0tSOHrs@public.gmane.org>
Date : 2009-05-14 15:49 (3 days old)
References : http://marc.info/?l=linux-kernel&m=124231630701394&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13324
Subject : panic when loading oprofile
Submitter : Brandeburg, Jesse <jesse.brandeburg-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
Date : 2009-05-13 22:30 (4 days old)
References : http://marc.info/?l=linux-kernel&m=124225384311631&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13323
Subject : 2.6.30-rc deadline scheduler performance regression for iozone over NFS
Submitter : Jeff Moyer <jmoyer-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
Date : 2009-04-23 14:01 (24 days old)
References : http://marc.info/?l=linux-kernel&m=124049547915450&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13321
Subject : kernel crash with NULL pointer when boot
Submitter : Martin Bammer <mrb74-RbZlAiThDcE@public.gmane.org>
Date : 2009-05-16 12:37 (1 days old)
References : http://lkml.org/lkml/2009/5/16/100
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13319
Subject : Page allocation failures with b43 and p54usb
Submitter : Larry Finger <Larry.Finger-tQ5ms3gMjBLk1uMJSBkQmQ@public.gmane.org>
Date : 2009-04-29 21:01 (18 days old)
References : http://marc.info/?l=linux-kernel&m=124103897101088&w=4
Handled-By : Johannes Berg <johannes-cdvu00un1VgdHxzADdlk8Q@public.gmane.org>
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13318
Subject : AGP doesn't work anymore on nforce2
Submitter : Karsten Mehrhoff <kawime-Mmb7MZpHnFY@public.gmane.org>
Date : 2009-04-30 8:51 (17 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=59de2bebabc5027f93df999d59cc65df591c3e6e
References : http://marc.info/?l=linux-kernel&m=124108156417560&w=4
Handled-By : Shaohua Li <shaohua.li-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13306
Subject : hibernate slow on _second_ run
Submitter : Johannes Berg <johannes-cdvu00un1VgdHxzADdlk8Q@public.gmane.org>
Date : 2009-05-14 09:34 (3 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13298
Subject : modprobe ipmi_si hangs under 2.6.30-rc5
Submitter : Ferenc Wagner <wferi-eEbw3PyuezQ@public.gmane.org>
Date : 2009-05-12 21:28 (5 days old)
References : http://marc.info/?l=linux-kernel&m=124216379407177&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13297
Subject : kernel panic - not syncing : fatel exception in interupt
Submitter : rob <rob1-z2Ut7v/pTR/vt0rt8C/ehw@public.gmane.org>
Date : 2009-05-12 19:34 (5 days old)
References : http://marc.info/?l=linux-kernel&m=124216126903309&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13296
Subject : Lockdep violation at cleanup_workqueue_thread during suspend
Submitter : Zdenek Kabelac <zdenek.kabelac-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-05-12 7:59 (5 days old)
References : http://marc.info/?l=linux-kernel&m=124211522525625&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13294
Subject : i915: drm: xorg leaks drm objects massively
Submitter : Sergei Trofimovich <slyich-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-05-10 19:56 (7 days old)
References : http://marc.info/?l=linux-kernel&m=124198547027903&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13293
Subject : Kernel BUG under network load with gianfar
Submitter : Michael Guntsche <mike-Z92qn3yYq0hWk0Htik3J/w@public.gmane.org>
Date : 2009-05-03 13:36 (14 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=0fd56bb5be6455d0d42241e65aed057244665e5e
References : http://marc.info/?l=linux-kernel&m=124135824600924&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13285
Subject : INTELFB: Colors display incorrectly
Submitter : Dean Menezes <samanddeanus-/E1597aS9LQAvxtiuMwx3w@public.gmane.org>
Date : 2009-05-12 01:40 (5 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13250
Subject : Side channel of Intel HDA chip doesn't work anymore, did work with 2.6.29
Submitter : Andreas Juch <kernel-bt-Sk8biX5uX1I@public.gmane.org>
Date : 2009-05-05 10:14 (12 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13245
Subject : possible circular locking dependency detected
Submitter : Miles Lane <miles.lane-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-05-04 16:56 (13 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13188
Subject : horizontal strips of the screen frozen
Submitter : Justin Madru <jdm64-u1xxEuL7cY4AvxtiuMwx3w@public.gmane.org>
Date : 2009-04-24 20:59 (23 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=280b713b5b0fd84cf2469098aee88acbb5de859c
References : http://marc.info/?l=linux-kernel&m=124060685315937&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13180
Subject : 2.6.30-rc2: WARNING at i915_gem.c for i915_gem_idle
Submitter : Niel Lambrechts <niel.lambrechts-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-04-21 21:35 (26 days old)
References : http://marc.info/?l=linux-kernel&m=124034980819102&w=4
             http://lkml.org/lkml/2009/4/27/290
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13179
Subject : CD-R: wodim intermittent failures
Submitter : Andy Isaacson <adi-3HqRAUrWAWyGglJvpFV4uA@public.gmane.org>
Date : 2009-04-21 1:52 (26 days old)
References : http://marc.info/?l=linux-kernel&m=124027879214231&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13177
Subject : 2.6.30-rc2-git7 build problem
Submitter : Martin Knoblauch <spamtrap-Ys4E+72pFW0hFhg+JK9F0w@public.gmane.org>
Date : 2009-04-21 13:39 (26 days old)
References : http://marc.info/?l=linux-kernel&m=124032163602132&w=4
             http://lkml.org/lkml/2009/4/27/56
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13171
Subject : 2.6.30-rc2 + xorg-intel-2.7.0 + DRM_I915_KMS = corruption
Submitter : Alex Bennee <kernel-hacker-jriKODhXDevQT0dZR+AlfA@public.gmane.org>
Date : 2009-04-19 6:27 (28 days old)
References : http://marc.info/?l=linux-kernel&m=124022460014812&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13156
Subject : keyboard backlight brightness up/down keys doesn't work
Submitter : Thomas Meyer <thomas-VsYtu1Qij5c@public.gmane.org>
Date : 2009-04-23 20:46 (24 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13148
Subject : resume after suspend-to-ram broken on Sony Vaio VGN-SR19VN when sony-laptop driver present
Submitter : fanderay <fanderay4-gM/Ye1E23mwN+BqQ9rBEUg@public.gmane.org>
Date : 2009-04-22 14:39 (25 days old)
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13126
Subject : BUG: MAX_LOCKDEP_ENTRIES too low! when mounting rootfs
Submitter : Alexander Beregalov <a.beregalov-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-04-15 12:43 (32 days old)
References : http://marc.info/?l=linux-kernel&m=123979949820538&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13119
Subject : Trouble with make-install from a NFS mount
Submitter : Gregory Haskins <ghaskins-Et1tbQHTxzrQT0dZR+AlfA@public.gmane.org>
Date : 2009-04-14 21:32 (33 days old)
References : http://marc.info/?l=linux-kernel&m=123974482327044&w=4
Handled-By : H. Peter Anvin <hpa-YMNOUZJC4hwAvxtiuMwx3w@public.gmane.org>
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13118
Subject : iptables very slow after commit 784544739a25c30637397ace5489eeb6e15d7d49
Submitter : Jeff Chua <jeff.chua.linux-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-04-10 16:05 (37 days old)
References : http://lkml.org/lkml/2009/4/10/111
http://lkml.org/lkml/2009/4/25/83
Handled-By : Eric Dumazet <dada1-fPLkHRcR87vqlBn2x/YWAg@public.gmane.org>
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13116
Subject : Can't boot with nosmp
Submitter : Stephen Hemminger <shemminger-ZtmgI6mnKB3QT0dZR+AlfA@public.gmane.org>
Date : 2009-04-15 4:18 (32 days old)
References : http://marc.info/?l=linux-kernel&m=123976917817920&w=4
Handled-By : Dan Williams <dan.j.williams-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
Submitter : Kumar Gala <galak-XVmvHMARGAS8U2dJNN8I7kB+6BGkLq7r@public.gmane.org>
Date : 2009-04-09 15:43 (38 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
http://lkml.org/lkml/2009/4/10/193
Handled-By : Sukadev Bhattiprolu <sukadev-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13069
Subject : regression in 2.6.29-git3 on SH/Dreamcast
Submitter : Adrian McMenamin <adrian-TSF8l6Tg6afpT6hvJLqO3U8SxdOydiOw@public.gmane.org>
Date : 2009-03-29 19:04 (49 days old)
References : http://marc.info/?l=linux-kernel&m=123835353115372&w=4
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13068
Subject : Lockdep warining in inotify_dev_queue_event
Submitter : Sachin Sant <sachinp-xthvdsQ13ZrQT0dZR+AlfA@public.gmane.org>
Date : 2009-04-05 12:37 (42 days old)
References : http://marc.info/?l=linux-kernel&m=123893439229272&w=4
Regressions with patches
------------------------
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13125
Subject : active uvcvideo breaks over suspend
Submitter : Alan Jenkins <alan-jenkins-cCz0Lq7MMjm9FHfhHBbuYA@public.gmane.org>
Date : 2009-04-15 10:12 (32 days old)
References : http://marc.info/?l=linux-kernel&m=123979009508840&w=4
Handled-By : Ming Lei <tom.leiming-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Patch : http://lkml.org/lkml/2009/4/18/5
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13122
Subject : reiserfs_delete_xattrs: Couldn't delete all xattrs (-13)
Submitter : Alexander Beregalov <a.beregalov-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-04-16 19:23 (31 days old)
First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=d984561b326cd0fe0d1183d11b9b4fa1d011d21d
References : http://marc.info/?l=linux-kernel&m=123990989515105&w=4
Handled-By : Jeff Mahoney <jeffm-IBi9RG/b67k@public.gmane.org>
Patch : http://lkml.org/lkml/2009/5/10/91
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13109
Subject : High latency on /sys/class/thermal
Submitter : Tiago Simões Batista <tiagosbatista-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date : 2009-04-11 14:56 (36 days old)
References : http://marc.info/?l=linux-kernel&m=123946182301248&w=4
Handled-By : Zhang Rui <rui.zhang-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
             Alexey Starikovskiy <astarikovskiy-l3A5Bk7waGM@public.gmane.org>
Patch : http://bugzilla.kernel.org/attachment.cgi?id=21061
        http://bugzilla.kernel.org/attachment.cgi?id=21282
For details, please visit the bug entries and follow the links given in
references.
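For anyone who wants to process the list mechanically, the entry layout used above (a "Bug-Entry" line starting each record, "Key : value" fields, and bare continuation lines for extra references, handlers or patches) can be parsed with a sketch like the following. The field whitelist and the names parse_entries and bug_id are assumptions made for illustration; they are not part of any tool mentioned in this message, and the input is expected to contain only the entries, not the surrounding prose.

```python
import re

# Field names observed in the report; anything else is treated as a
# continuation of the previous field (e.g. a second References URL).
KNOWN_FIELDS = {"Bug-Entry", "Subject", "Submitter", "Date",
                "First-Bad-Commit", "References", "Handled-By", "Patch"}
FIELD_RE = re.compile(r"^([A-Za-z-]+)\s*:\s*(.*)$")


def parse_entries(text):
    """Split the regression list into dicts mapping field name -> list of values."""
    entries, current, last_key = [], None, None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        m = FIELD_RE.match(line)
        key = m.group(1) if m and m.group(1) in KNOWN_FIELDS else None
        if key == "Bug-Entry":
            current = {}
            entries.append(current)
        if current is None:
            continue  # skip anything before the first entry
        if key:
            current.setdefault(key, []).append(m.group(2))
            last_key = key
        else:
            current[last_key].append(line)  # continuation line
    return entries


def bug_id(entry):
    """Extract the numeric Bugzilla id from the Bug-Entry URL."""
    m = re.search(r"[?&]id=(\d+)", entry["Bug-Entry"][0])
    return int(m.group(1)) if m else None


SAMPLE = """\
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13180
Subject : 2.6.30-rc2: WARNING at i915_gem.c for i915_gem_idle
Date : 2009-04-21 21:35 (26 days old)
References : http://marc.info/?l=linux-kernel&m=124034980819102&w=4
http://lkml.org/lkml/2009/4/27/290
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13179
Subject : CD-R: wodim intermittent failures
"""

for entry in parse_entries(SAMPLE):
    print(bug_id(entry), entry["Subject"][0])
```

The whitelist matters because a continuation line such as a bare "http://..." URL would otherwise look like a field named "http".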
As you can see, there is a Bugzilla entry for each of the listed regressions.
There also is a Bugzilla entry used for tracking the regressions from 2.6.29,
unresolved as well as resolved, at:
http://bugzilla.kernel.org/show_bug.cgi?id=13070
Please let me know if there are any Bugzilla entries that should be added to
the list in there.
Thanks,
Rafael
* [Bug #13328] b44: eth0: BUG! Timeout waiting for bit 00000002 of register 42c to clear.
2009-05-16 19:14 2.6.30-rc6: Reported regressions from 2.6.29 Rafael J. Wysocki
@ 2009-05-16 19:20 ` Rafael J. Wysocki
2009-05-17 7:33 ` 2.6.30-rc6: Reported regressions from 2.6.29 Ingo Molnar
2009-05-22 16:40 ` Linus Torvalds
2 siblings, 0 replies; 13+ messages in thread
From: Rafael J. Wysocki @ 2009-05-16 19:20 UTC
To: Linux Kernel Mailing List; +Cc: Kernel Testers List, Francis Moreau, netdev
This message has been generated automatically as a part of a report
of recent regressions.
The following bug entry is on the current list of known regressions
from 2.6.29. Please verify if it still should be listed and let me know
(either way).
Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13328
Subject : b44: eth0: BUG! Timeout waiting for bit 00000002 of register 42c to clear.
Submitter : Francis Moreau <francis.moro@gmail.com>
Date : 2009-05-03 16:22 (14 days old)
References : http://marc.info/?l=linux-kernel&m=124136778012280&w=4
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-16 19:14 2.6.30-rc6: Reported regressions from 2.6.29 Rafael J. Wysocki
2009-05-16 19:20 ` [Bug #13328] b44: eth0: BUG! Timeout waiting for bit 00000002 of register 42c to clear Rafael J. Wysocki
@ 2009-05-17 7:33 ` Ingo Molnar
[not found] ` <20090517073327.GC1583-X9Un+BFzKDI@public.gmane.org>
2009-05-18 14:57 ` Oleg Nesterov
2009-05-22 16:40 ` Linus Torvalds
2 siblings, 2 replies; 13+ messages in thread
From: Ingo Molnar @ 2009-05-17 7:33 UTC
To: Rafael J. Wysocki, Jonathan Corbet, Martin Bammer, Jeff Garzik,
Oleg Nesterov, Kumar Gala <galak
Cc: Linux Kernel Mailing List, Adrian Bunk, Andrew Morton,
Linus Torvalds, Natalie Protasevich, Kernel Testers List,
Network Development, Linux ACPI, Linux PM List, Linux SCSI List,
Linux Wireless List, DRI
* Rafael J. Wysocki <rjw@sisk.pl> wrote:
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13325
> Subject : 2.6.30-rc kills my box hard - and lockdep chains
> Submitter : Jonathan Corbet <corbet@lwn.net>
> Date : 2009-05-14 15:49 (3 days old)
> References : http://marc.info/?l=linux-kernel&m=124231630701394&w=4
Jonathan, there's a side issue reported there: we are running out of
lockdep space. Could you try this commit from -tip:
d80c19d: lockdep: increase MAX_LOCKDEP_ENTRIES and MAX_LOCKDEP_CHAINS
(which I'll get to Linus in the next ~24 hours.) Maybe that allows
lockdep to report the reason for the deadlock.
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13321
> Subject : kernel crash with NULL pointer when boot
> Submitter : Martin Bammer <mrb74@gmx.at>
> Date : 2009-05-16 12:37 (1 days old)
> References : http://lkml.org/lkml/2009/5/16/100
That crash is in reiserfs_for_each_xattr(), during sys_unlink()'s
xattr teardown.
There have been a good many reiserfs changes in this cycle, and some
touch the xattr code as well. Some of them landed fairly late in the
cycle, in the last two weeks:
earth4:~/tip> gll v2.6.29..linus --since=two-weeks-ago fs/reiserfs/
2a32ceb: Fix races around the access to ->s_options
677c9b2: reiserfs: remove privroot hiding in lookup
b82bb72: reiserfs: dont associate security.* with xattr files
ab17c4f: reiserfs: fixup xattr_root caching
edcc37a: Always lookup priv_root on reiserfs mount and keep it
5a6059c: reiserfs: Expand i_mutex to enclose lookup_one_len
Martin, you could try a blind revert of, say, ab17c4f, which looks
the most suspect and which is also a rather large commit.
And/or you could try a bisect, perhaps accelerated via:
git bisect start fs/reiserfs/
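Why limiting the bisection to a path speeds it up can be illustrated with a toy Python model. This is only a sketch under simplifying assumptions: the history is linear (real git bisect works on a commit graph), the culprit is assumed to touch fs/reiserfs/, and the commit names c01..c32 and the other path are hypothetical.

```python
def first_bad(commits, is_bad):
    """Binary search for the first bad commit in an oldest-to-newest list.

    Assumes every commit before the culprit is good and every commit from
    the culprit onward is bad. Returns (commit, number_of_tests).
    """
    lo, hi = 0, len(commits) - 1
    tests = 0
    while lo < hi:
        mid = (lo + hi) // 2
        tests += 1
        if is_bad(commits[mid]):
            hi = mid        # culprit is at mid or earlier
        else:
            lo = mid + 1    # culprit is after mid
    return commits[lo], tests


# Hypothetical linear history: every fourth commit touches fs/reiserfs/.
HISTORY = [
    (f"c{i:02d}", "fs/reiserfs/xattr.c" if i % 4 == 0 else "drivers/net/foo.c")
    for i in range(1, 33)
]


def is_bad(commit):
    """Pretend the regression was introduced by c20 (illustrative only)."""
    return int(commit[1:]) >= 20


full = [c for c, _ in HISTORY]
limited = [c for c, path in HISTORY if path.startswith("fs/reiserfs/")]

print(first_bad(full, is_bad))     # searches all 32 commits
print(first_bad(limited, is_bad))  # searches only the 8 reiserfs commits
```

Both searches land on the same commit, but the path-limited one needs fewer build-and-boot cycles, which is the point of the suggestion above. The shortcut is only valid if the culprit really is among the commits touching that path.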
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13297
> Subject : kernel panic - not syncing : fatel exception in interupt
> Submitter : rob <rob1@housetosell.net>
> Date : 2009-05-12 19:34 (5 days old)
> References : http://marc.info/?l=linux-kernel&m=124216126903309&w=4
Tainted crash, but probably legit. It does show some badness in an
old-IDE legacy codepath:
[<c0371865>] error_code+0x65/0x6c
[<c0110155>] do_page_fault+0x0/0x1e0
[<c027dafc>] ide_complete_rq+0xf/0x3b
[<c02870a0>] cdrom_newpc_intr+0x64d/0x6cd
[<c0286a53>] cdrom_newpc_intr+0x0/0x6cd
[<c027dcc2>] ide_intr+0x109/0x161
[<c0132298>] handle_IRQ_event+0x54/0xc7
[<c013354a>] handle_level_irq+0x4f/0x85
[<c0103df7>] handle_irq+0x17/0x20
[<c0103da5>] do_IRQ+0x2b/0x66
[<c0102be9>] common_interupt+0x29/0x30
[<c0480000>] cmd40x_init+0x2ac/0x38d
[<c0106db3>] default_idle+0x25/0x38
[<c01019be>] cpu_idle+0x19/0x2d
[<c0468907>] start_kernel+0x23f/0x242
The report's subject line is too unspecific; it should be changed to
something like:
legacy IDE cmd40x related bootup crash
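The trace lines quoted above follow the usual "[<address>] symbol+0xoffset/0xsize" layout, optionally followed by a module name in brackets. A small Python sketch for pulling them apart is shown here; the regular expression and the name parse_frame are illustrative assumptions, not taken from any kernel tool.

```python
import re

# [<address>] symbol+0xoffset/0xsize, optionally followed by " [module]".
FRAME_RE = re.compile(
    r"\[<([0-9a-f]+)>\]\s+([\w.]+)\+0x([0-9a-f]+)/0x([0-9a-f]+)"
    r"(?:\s+\[(\w+)\])?"
)


def parse_frame(line):
    """Return the parts of one backtrace line, or None if it does not match."""
    m = FRAME_RE.search(line)
    if not m:
        return None
    addr, symbol, offset, size, module = m.groups()
    return {
        "address": int(addr, 16),
        "symbol": symbol,
        "offset": int(offset, 16),   # bytes into the function
        "size": int(size, 16),       # total size of the function
        "module": module,            # None for built-in code
    }


frame = parse_frame("[<c02870a0>] cdrom_newpc_intr+0x64d/0x6cd")
# Subtracting the offset from the address gives the function's start.
print(frame["symbol"], hex(frame["address"] - frame["offset"]))
```

For the cdrom_newpc_intr frame, address minus offset gives c0286a53, which is the address shown on the following "cdrom_newpc_intr+0x0/0x6cd" line of the same trace.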
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13296
> Subject : Lockdep violation at cleanup_workqueue_thread during suspend
> Submitter : Zdenek Kabelac <zdenek.kabelac@gmail.com>
> Date : 2009-05-12 7:59 (5 days old)
> References : http://marc.info/?l=linux-kernel&m=124211522525625&w=4
Looks wireless-related; the dependency that connects the
locks in the wrong way appears to be:
-> #2 (cfg80211_mutex){+.+.+.}:
[<ffffffff80271a64>] __lock_acquire+0xc64/0x10a0
[<ffffffff80271f38>] lock_acquire+0x98/0x140
[<ffffffff8054e78c>] __mutex_lock_common+0x4c/0x3b0
[<ffffffff8054ebf6>] mutex_lock_nested+0x46/0x60
[<ffffffffa007e66a>] reg_todo+0x19a/0x590 [cfg80211]
[<ffffffff80258f18>] worker_thread+0x1e8/0x3a0
[<ffffffff8025dc3a>] kthread+0x5a/0xa0
[<ffffffff8020d23a>] child_rip+0xa/0x20
(haven't checked deeper)
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13245
> Subject : possible circular locking dependency detected
> Submitter : Miles Lane <miles.lane@gmail.com>
> Date : 2009-05-04 16:56 (13 days old)
Same as #13296 above. (The one above should be merged into this one,
I guess.)
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13126
> Subject : BUG: MAX_LOCKDEP_ENTRIES too low! when mounting rootfs
> Submitter : Alexander Beregalov <a.beregalov@gmail.com>
> Date : 2009-04-15 12:43 (32 days old)
> References : http://marc.info/?l=linux-kernel&m=123979949820538&w=4
Should be resolved via the lockdep space extension fix.
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13118
> Subject : iptables very slow after commit 784544739a25c30637397ace5489eeb6e15d7d49
> Submitter : Jeff Chua <jeff.chua.linux@gmail.com>
> Date : 2009-04-10 16:05 (37 days old)
> References : http://lkml.org/lkml/2009/4/10/111
> http://lkml.org/lkml/2009/4/25/83
> Handled-By : Eric Dumazet <dada1@cosmosbay.com>
solved by:
commit 942e4a2bd680c606af0211e64eb216be2e19bf61
Author: Stephen Hemminger <shemminger@vyatta.com>
Date: Tue Apr 28 22:36:33 2009 -0700
netfilter: revised locking for x_tables
The commit log does not credit reporters and testers and does not
mention the Bugzilla ID.
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13116
> Subject : Can't boot with nosmp
> Submitter : Stephen Hemminger <shemminger@vyatta.com>
> Date : 2009-04-15 4:18 (32 days old)
> References : http://marc.info/?l=linux-kernel&m=123976917817920&w=4
> Handled-By : Dan Williams <dan.j.williams@intel.com>
I think this might be fixed by:
d6de2c8: async: Fix module loading async-work regression
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
> Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
> Submitter : Kumar Gala <galak@kernel.crashing.org>
> Date : 2009-04-09 15:43 (38 days old)
> First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
> References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
> http://lkml.org/lkml/2009/4/10/193
> Handled-By : Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Oleg says in that thread that it's as designed, and follow-up
questions have not been replied to (yet).
But ... a relevant-seeming commit has been bisected to, so this
shouldn't be dismissed that easily.
Andrew, you merged the commit that was bisected to:
From b3bfa0cba867f23365b81658b47efd906830879b Mon Sep 17 00:00:00 2001
From: Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
Date: Thu, 2 Apr 2009 16:58:08 -0700
Subject: [PATCH] signals: protect cinit from blocked fatal signals
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13068
> Subject : Lockdep warining in inotify_dev_queue_event
> Submitter : Sachin Sant <sachinp@in.ibm.com>
> Date : 2009-04-05 12:37 (42 days old)
> References : http://marc.info/?l=linux-kernel&m=123893439229272&w=4
Should be fixed by:
381a80e: inotify: use GFP_NOFS in kernel_event() to work around a lockdep false-positive
Ingo
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
[not found] ` <20090517073327.GC1583-X9Un+BFzKDI@public.gmane.org>
@ 2009-05-17 10:34 ` Rafael J. Wysocki
[not found] ` <200905171234.23436.rjw-KKrjLPT3xs0@public.gmane.org>
0 siblings, 1 reply; 13+ messages in thread
From: Rafael J. Wysocki @ 2009-05-17 10:34 UTC
To: Ingo Molnar
Cc: Jonathan Corbet, Martin Bammer, Jeff Garzik, Oleg Nesterov,
Kumar Gala, Andrew Morton, Linux Kernel Mailing List, Adrian Bunk,
Linus Torvalds, Natalie Protasevich, Kernel Testers List,
Network Development, Linux ACPI, Linux PM List, Linux SCSI List,
Linux Wireless List, DRI, Jeff Mahoney, Stephen Hemminger
On Sunday 17 May 2009, Ingo Molnar wrote:
>
> * Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13325
> > Subject : 2.6.30-rc kills my box hard - and lockdep chains
> > Submitter : Jonathan Corbet <corbet-T1hC0tSOHrs@public.gmane.org>
> > Date : 2009-05-14 15:49 (3 days old)
> > References : http://marc.info/?l=linux-kernel&m=124231630701394&w=4
>
> Jonathan, there's a side-issue reported there, us running out of
> lockdep space. Could you try this commit from -tip:
>
> d80c19d: lockdep: increase MAX_LOCKDEP_ENTRIES and MAX_LOCKDEP_CHAINS
>
> (which i'll get to Linus in the next ~24 hours.) Maybe that allows
> lockdep to report the reason for the deadlock.
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13321
> > Subject : kernel crash with NULL pointer when boot
> > Submitter : Martin Bammer <mrb74-RbZlAiThDcE@public.gmane.org>
> > Date : 2009-05-16 12:37 (1 days old)
> > References : http://lkml.org/lkml/2009/5/16/100
>
> that crash is in reiserfs_for_each_xattr(), during sys_unlink()'s
> xattr teardown.
>
> There's been a good deal of reiserfs changes in this cycle - some
> touch the xattr code as well. Some of them fairly late in the cycle,
> in the last two weeks:
>
> earth4:~/tip> gll v2.6.29..linus --since=two-weeks-ago fs/reiserfs/
> 2a32ceb: Fix races around the access to ->s_options
> 677c9b2: reiserfs: remove privroot hiding in lookup
> b82bb72: reiserfs: dont associate security.* with xattr files
> ab17c4f: reiserfs: fixup xattr_root caching
> edcc37a: Always lookup priv_root on reiserfs mount and keep it
> 5a6059c: reiserfs: Expand i_mutex to enclose lookup_one_len
>
> Martin, you could try a blind revert of say ... ab17c4f, which looks
> the most suspect and which is also a rather large commit.
>
> Or/and you could try a bisect - perhaps accelerated via:
>
> git bisect start fs/reiserfs/
I think there's a fix already for this in the works, but it hasn't been merged
yet. It looks like Jeff has just posted some reiserfs fixes. :-)
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13297
> > Subject : kernel panic - not syncing : fatel exception in interupt
> > Submitter : rob <rob1-z2Ut7v/pTR/vt0rt8C/ehw@public.gmane.org>
> > Date : 2009-05-12 19:34 (5 days old)
> > References : http://marc.info/?l=linux-kernel&m=124216126903309&w=4
>
> tainted crash, but probably legit. It does show some badness in an
> old-IDE legacy codepath:
>
> [<c0371865>] error_code+0x65/0x6c
> [<c0110155>] do_page_fault+0x0/0x1e0
> [<c027dafc>] ide_complete_rq+0xf/0x3b
> [<c02870a0>] cdrom_newpc_intr+0x64d/0x6cd
> [<c0286a53>] cdrom_newpc_intr+0x0/0x6cd
> [<c027dcc2>] ide_intr+0x109/0x161
> [<c0132298>] handle_IRQ_event+0x54/0xc7
> [<c013354a>] handle_level_irq+0x4f/0x85
> [<c0103df7>] handle_irq+0x17/0x20
> [<c0103da5>] do_IRQ+0x2b/0x66
> [<c0102be9>] common_interupt+0x29/0x30
> [<c0480000>] cmd40x_init+0x2ac/0x38d
> [<c0106db3>] default_idle+0x25/0x38
> [<c01019be>] cpu_idle+0x19/0x2d
> [<c0468907>] start_kernel+0x23f/0x242
>
> report subject line is too unspecific, it should be changed to
> something like:
>
> legacy IDE cmd40x related bootup crash
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13296
> > Subject : Lockdep violation at cleanup_workqueue_thread during suspend
> > Submitter : Zdenek Kabelac <zdenek.kabelac-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > Date : 2009-05-12 7:59 (5 days old)
> > References : http://marc.info/?l=linux-kernel&m=124211522525625&w=4
>
> looks like wireless related - the dependency that connects the
> locks in a wrong way appears to be:
>
> -> #2 (cfg80211_mutex){+.+.+.}:
> [<ffffffff80271a64>] __lock_acquire+0xc64/0x10a0
> [<ffffffff80271f38>] lock_acquire+0x98/0x140
> [<ffffffff8054e78c>] __mutex_lock_common+0x4c/0x3b0
> [<ffffffff8054ebf6>] mutex_lock_nested+0x46/0x60
> [<ffffffffa007e66a>] reg_todo+0x19a/0x590 [cfg80211]
> [<ffffffff80258f18>] worker_thread+0x1e8/0x3a0
> [<ffffffff8025dc3a>] kthread+0x5a/0xa0
> [<ffffffff8020d23a>] child_rip+0xa/0x20
>
> (havent checked deeper)
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13245
> > Subject : possible circular locking dependency detected
> > Submitter : Miles Lane <miles.lane-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > Date : 2009-05-04 16:56 (13 days old)
>
> same as #13296 above. (The one above should be merged into this one
> i guess)
Yes, I've merged the two.
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13126
> > Subject : BUG: MAX_LOCKDEP_ENTRIES too low! when mounting rootfs
> > Submitter : Alexander Beregalov <a.beregalov-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > Date : 2009-04-15 12:43 (32 days old)
> > References : http://marc.info/?l=linux-kernel&m=123979949820538&w=4
>
> should be resolved via the lockdep space extension fix.
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13118
> > Subject : iptables very slow after commit 784544739a25c30637397ace5489eeb6e15d7d49
> > Submitter : Jeff Chua <jeff.chua.linux-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > Date : 2009-04-10 16:05 (37 days old)
> > References : http://lkml.org/lkml/2009/4/10/111
> > http://lkml.org/lkml/2009/4/25/83
> > Handled-By : Eric Dumazet <dada1-fPLkHRcR87vqlBn2x/YWAg@public.gmane.org>
>
> solved by:
>
> commit 942e4a2bd680c606af0211e64eb216be2e19bf61
> Author: Stephen Hemminger <shemminger-ZtmgI6mnKB3QT0dZR+AlfA@public.gmane.org>
> Date: Tue Apr 28 22:36:33 2009 -0700
>
> netfilter: revised locking for x_tables
Already closed.
> commit log does not credit reporters and testers and does not
> mention bugzilla id.
Sigh.
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13116
> > Subject : Can't boot with nosmp
> > Submitter : Stephen Hemminger <shemminger-ZtmgI6mnKB3QT0dZR+AlfA@public.gmane.org>
> > Date : 2009-04-15 4:18 (32 days old)
> > References : http://marc.info/?l=linux-kernel&m=123976917817920&w=4
> > Handled-By : Dan Williams <dan.j.williams-ral2JQCrhuEAvxtiuMwx3w@public.gmane.org>
>
> I think this might be fixed by:
>
> d6de2c8: async: Fix module loading async-work regression
That would be good to verify. Stephen?
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
> > Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
> > Submitter : Kumar Gala <galak-XVmvHMARGAS8U2dJNN8I7kB+6BGkLq7r@public.gmane.org>
> > Date : 2009-04-09 15:43 (38 days old)
> > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
> > References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
> > http://lkml.org/lkml/2009/4/10/193
> > Handled-By : Sukadev Bhattiprolu <sukadev-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
>
> Oleg says in that thread that it's as-designed, and followup
> questions were not replied to (yet).
>
> But ... a relevant seeming commit has been bisected to so this
> shouldnt be ignored that easily.
>
> Andrew, you merged the commit that was bisected to:
>
> From b3bfa0cba867f23365b81658b47efd906830879b Mon Sep 17 00:00:00 2001
> From: Sukadev Bhattiprolu <sukadev-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
> Date: Thu, 2 Apr 2009 16:58:08 -0700
> Subject: [PATCH] signals: protect cinit from blocked fatal signals
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13068
> > Subject : Lockdep warining in inotify_dev_queue_event
> > Submitter : Sachin Sant <sachinp-xthvdsQ13ZrQT0dZR+AlfA@public.gmane.org>
> > Date : 2009-04-05 12:37 (42 days old)
> > References : http://marc.info/?l=linux-kernel&m=123893439229272&w=4
>
> should be fixed by:
>
> 381a80e: inotify: use GFP_NOFS in kernel_event() to work around a lockdep false-positive
Closed.
Thanks a lot,
Rafael
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
[not found] ` <200905171234.23436.rjw-KKrjLPT3xs0@public.gmane.org>
@ 2009-05-17 12:56 ` Stefan Richter
2009-05-17 13:59 ` Jeff Mahoney
0 siblings, 1 reply; 13+ messages in thread
From: Stefan Richter @ 2009-05-17 12:56 UTC
To: Rafael J. Wysocki
Cc: Ingo Molnar, Martin Bammer, Andrew Morton,
Linux Kernel Mailing List, Adrian Bunk, Linus Torvalds,
Natalie Protasevich, Kernel Testers List, Network Development,
Jeff Mahoney
Rafael J. Wysocki wrote:
> On Sunday 17 May 2009, Ingo Molnar wrote:
>> * Rafael J. Wysocki <rjw-KKrjLPT3xs0@public.gmane.org> wrote:
>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13321
>>> Subject : kernel crash with NULL pointer when boot
>>> Submitter : Martin Bammer <mrb74-RbZlAiThDcE@public.gmane.org>
>>> Date : 2009-05-16 12:37 (1 days old)
>>> References : http://lkml.org/lkml/2009/5/16/100
>> that crash is in reiserfs_for_each_xattr(), during sys_unlink()'s
>> xattr teardown.
>>
>> There's been a good deal of reiserfs changes in this cycle - some
>> touch the xattr code as well. Some of them fairly late in the cycle,
(Also, the bug is reportedly a post -rc5 regression.)
>> in the last two weeks:
>>
>> earth4:~/tip> gll v2.6.29..linus --since=two-weeks-ago fs/reiserfs/
>> 2a32ceb: Fix races around the access to ->s_options
>> 677c9b2: reiserfs: remove privroot hiding in lookup
>> b82bb72: reiserfs: dont associate security.* with xattr files
>> ab17c4f: reiserfs: fixup xattr_root caching
>> edcc37a: Always lookup priv_root on reiserfs mount and keep it
>> 5a6059c: reiserfs: Expand i_mutex to enclose lookup_one_len
>>
>> Martin, you could try a blind revert of say ... ab17c4f, which looks
>> the most suspect and which is also a rather large commit.
>>
>> Or/and you could try a bisect - perhaps accelerated via:
>>
>> git bisect start fs/reiserfs/
>
> I think there's a fix already for this in the works, but it hasn't been merged
> yet. It looks like Jeff has just posted some reiserfs fixes. :-)
I'd say Martin should try Jeff's pending patches before a revert or
before starting a bisection. Either apply them from
http://lkml.org/lkml/2009/5/17/6, or pull from
git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs-2.6.git for-next
or wait for them to appear in Linus' tree...
I for one didn't test the post v2.6.30-rc5 changes to reiserfs, but I
did try them _together_ with the four pending patches on a system with
reiserfs root filesystem and extended attributes disabled in the config.
Everything is fine here after almost a week uptime.
--
Stefan Richter
-=====-=-=== -=-= -==-=
http://arcgraph.de/sr/
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-17 12:56 ` Stefan Richter
@ 2009-05-17 13:59 ` Jeff Mahoney
0 siblings, 0 replies; 13+ messages in thread
From: Jeff Mahoney @ 2009-05-17 13:59 UTC (permalink / raw)
To: Stefan Richter
Cc: Rafael J. Wysocki, Ingo Molnar, Martin Bammer, Andrew Morton,
Linux Kernel Mailing List, Adrian Bunk, Linus Torvalds,
Natalie Protasevich, Kernel Testers List, Network Development
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Stefan Richter wrote:
> Rafael J. Wysocki wrote:
>> On Sunday 17 May 2009, Ingo Molnar wrote:
>>> * Rafael J. Wysocki <rjw@sisk.pl> wrote:
>>>> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13321
>>>> Subject : kernel crash with NULL pointer when boot
>>>> Submitter : Martin Bammer <mrb74@gmx.at>
>>>> Date : 2009-05-16 12:37 (1 days old)
>>>> References : http://lkml.org/lkml/2009/5/16/100
>>> that crash is in reiserfs_for_each_xattr(), during sys_unlink()'s
>>> xattr teardown.
>>>
>>> There's been a good deal of reiserfs changes in this cycle - some
>>> touch the xattr code as well. Some of them fairly late in the cycle,
>
> (Also, the bug is reportedly a post -rc5 regression.)
>
>>> in the last two weeks:
>>>
>>> earth4:~/tip> gll v2.6.29..linus --since=two-weeks-ago fs/reiserfs/
>>> 2a32ceb: Fix races around the access to ->s_options
>>> 677c9b2: reiserfs: remove privroot hiding in lookup
>>> b82bb72: reiserfs: dont associate security.* with xattr files
>>> ab17c4f: reiserfs: fixup xattr_root caching
>>> edcc37a: Always lookup priv_root on reiserfs mount and keep it
>>> 5a6059c: reiserfs: Expand i_mutex to enclose lookup_one_len
>>>
>>> Martin, you could try a blind revert of say ... ab17c4f, which looks
>>> the most suspect and which is also a rather large commit.
>>>
>>> Or/and you could try a bisect - perhaps accelerated via:
>>>
>>> git bisect start fs/reiserfs/
>>
>> I think there's a fix already for this in the works, but it hasn't
>> been merged
>> yet. It looks like Jeff has just posted some reiserfs fixes. :-)
>
> I'd say Martin should try Jeff's pending patches before a revert or
> before starting a bisection. Either apply them from
> http://lkml.org/lkml/2009/5/17/6, or pull from
> git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs-2.6.git for-next
> or wait for them to appear in Linus' tree...
>
> I for one didn't test the post v2.6.30-rc5 changes to reiserfs, but I
> did try them _together_ with the four pending patches on a system with
> reiserfs root filesystem and extended attributes disabled in the config.
> Everything is fine here after almost a week uptime.
Yes, this bug is known and is fixed with "[patch 2/4] [PATCH] reiserfs:
deal with NULL xattr root w/ xattrs disabled"
- -Jeff
- --
Jeff Mahoney
SUSE Labs
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.9 (GNU/Linux)
Comment: Using GnuPG with SUSE - http://enigmail.mozdev.org
iEYEARECAAYFAkoQGEIACgkQLPWxlyuTD7K8kwCfUEq36jJvY+FzzuvWbAyOYX+r
BPwAniRe6a7Cj8n5+vC2Zmwfp1hRRUb1
=fh19
-----END PGP SIGNATURE-----
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-17 7:33 ` 2.6.30-rc6: Reported regressions from 2.6.29 Ingo Molnar
[not found] ` <20090517073327.GC1583-X9Un+BFzKDI@public.gmane.org>
@ 2009-05-18 14:57 ` Oleg Nesterov
2009-05-18 15:11 ` Ingo Molnar
1 sibling, 1 reply; 13+ messages in thread
From: Oleg Nesterov @ 2009-05-18 14:57 UTC (permalink / raw)
To: Ingo Molnar
Cc: Adrian Bunk, DRI, Linux ACPI, Linux SCSI List, Jonathan Corbet,
Linux Kernel Mailing List, Linux Wireless List, Kumar Gala,
Natalie Protasevich, Martin Bammer, Linus Torvalds,
Network Development, Andrew Morton, Kernel Testers List,
Jeff Garzik, Linux PM List
On 05/17, Ingo Molnar wrote:
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
> > Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
> > Submitter : Kumar Gala <galak@kernel.crashing.org>
> > Date : 2009-04-09 15:43 (38 days old)
> > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
> > References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
> > http://lkml.org/lkml/2009/4/10/193
> > Handled-By : Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
>
> Oleg says in that thread that it's as-designed, and followup
> questions were not replied to (yet).
Yes, I think this is false alarm. Perhaps I missed something, and
I am waiting for more info from Kumar, but it looks like ltp was
already changed to skip the "{ PTRACE_ATTACH, 1, EPERM }" test
on kernels after 2.6.25
Oleg.
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-18 14:57 ` Oleg Nesterov
@ 2009-05-18 15:11 ` Ingo Molnar
2009-05-18 15:31 ` Oleg Nesterov
2009-05-18 15:32 ` Linus Torvalds
0 siblings, 2 replies; 13+ messages in thread
From: Ingo Molnar @ 2009-05-18 15:11 UTC (permalink / raw)
To: Oleg Nesterov
Cc: Adrian Bunk, DRI, Linux ACPI, Linux SCSI List, Jonathan Corbet,
Linux Kernel Mailing List, Linux Wireless List, Kumar Gala,
Natalie Protasevich, Martin Bammer, Linus Torvalds,
Network Development, Andrew Morton, Kernel Testers List,
Jeff Garzik, Linux PM List
* Oleg Nesterov <oleg@redhat.com> wrote:
> On 05/17, Ingo Molnar wrote:
> >
> > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
> > > Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
> > > Submitter : Kumar Gala <galak@kernel.crashing.org>
> > > Date : 2009-04-09 15:43 (38 days old)
> > > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
> > > References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
> > > http://lkml.org/lkml/2009/4/10/193
> > > Handled-By : Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
> >
> > Oleg says in that thread that it's as-designed, and followup
> > questions were not replied to (yet).
>
> Yes, I think this is false alarm. Perhaps I missed something, and
> I am waiting for more info from Kumar, but it looks like ltp was
> already changed to skip the "{ PTRACE_ATTACH, 1, EPERM }" test on
> kernels after 2.6.25
Btw., why did the patch (and the revert) make any difference to the
test? Timing differences look improbable.
Ingo
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-18 15:11 ` Ingo Molnar
@ 2009-05-18 15:31 ` Oleg Nesterov
2009-05-18 15:32 ` Linus Torvalds
1 sibling, 0 replies; 13+ messages in thread
From: Oleg Nesterov @ 2009-05-18 15:31 UTC (permalink / raw)
To: Ingo Molnar
Cc: Adrian Bunk, DRI, Linux ACPI, Linux SCSI List, Jonathan Corbet,
Linux Kernel Mailing List, Linux Wireless List, Kumar Gala,
Natalie Protasevich, Martin Bammer, Linus Torvalds,
Network Development, Andrew Morton, Kernel Testers List,
Jeff Garzik, Linux PM List
On 05/18, Ingo Molnar wrote:
>
> * Oleg Nesterov <oleg@redhat.com> wrote:
>
> > On 05/17, Ingo Molnar wrote:
> > >
> > > > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
> > > > Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
> > > > Submitter : Kumar Gala <galak@kernel.crashing.org>
> > > > Date : 2009-04-09 15:43 (38 days old)
> > > > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
> > > > References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
> > > > http://lkml.org/lkml/2009/4/10/193
> > > > Handled-By : Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
> > >
> > > Oleg says in that thread that it's as-designed, and followup
> > > questions were not replied to (yet).
> >
> > Yes, I think this is false alarm. Perhaps I missed something, and
> > I am waiting for more info from Kumar, but it looks like ltp was
> > already changed to skip the "{ PTRACE_ATTACH, 1, EPERM }" test on
> > kernels after 2.6.25
>
> Btw., why did the patch (and the revert) make any difference to the
> test? Timing differences look improbable.
Because before this series /sbin/init always ignored SIGSTOP. Now,
ptracer does PTRACE_ATTACH and exits without clearing the pending
SIGSTOP. init dequeues this signal and becomes TASK_STOPPED.
Consider:
$ sleep 1000 &
[1] 875
$ grep State: /proc/`pidof sleep`/status
State: S (sleeping)
$ perl -e 'syscall 101, 16, 0+shift, 0,0' `pidof sleep` # just PTRACE_ATTACH + exit
[1]+ Stopped sleep 1000
$ grep State: /proc/`pidof sleep`/status
State: T (stopped)
This is the "normal" behaviour, afaics the same happens with init now.
Oleg.
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-18 15:11 ` Ingo Molnar
2009-05-18 15:31 ` Oleg Nesterov
@ 2009-05-18 15:32 ` Linus Torvalds
2009-05-18 15:45 ` Oleg Nesterov
1 sibling, 1 reply; 13+ messages in thread
From: Linus Torvalds @ 2009-05-18 15:32 UTC (permalink / raw)
To: Ingo Molnar
Cc: Adrian Bunk, DRI, Linux SCSI List, Jonathan Corbet,
Linux Kernel Mailing List, Linux Wireless List, Kumar Gala,
Natalie Protasevich, Martin Bammer, Linux ACPI,
Network Development, Oleg Nesterov, Andrew Morton,
Kernel Testers List, Jeff Garzik, Linux PM List
On Mon, 18 May 2009, Ingo Molnar wrote:
>
> Btw., why did the patch (and the revert) make any difference to the
> test? Timing differences look improbable.
It's the change from
!signal_group_exit(signal)
to
!sig_kernel_only(signr)
and quite frankly, I still don't see the point.
The comment seems to be wrong too:
If SIGSTOP/SIGKILL originate from a descendant of container-init they are
never queued (i.e dropped in sig_ignored() in an earler patch).
If SIGSTOP/SIGKILL originate from parent namespace, the signal is queued
and container-init processes the signal.
since the bug was that the SIGSTOP (from within the same container) was
_not_ ignored like the comment says.
Linus
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-18 15:32 ` Linus Torvalds
@ 2009-05-18 15:45 ` Oleg Nesterov
0 siblings, 0 replies; 13+ messages in thread
From: Oleg Nesterov @ 2009-05-18 15:45 UTC (permalink / raw)
To: Linus Torvalds
Cc: Ingo Molnar, Rafael J. Wysocki, Jonathan Corbet, Martin Bammer,
Jeff Garzik, Kumar Gala, Andrew Morton, Linux Kernel Mailing List,
Adrian Bunk, Natalie Protasevich, Kernel Testers List,
Network Development, Linux ACPI, Linux PM List, Linux SCSI List,
Linux Wireless List, DRI
On 05/18, Linus Torvalds wrote:
>
> On Mon, 18 May 2009, Ingo Molnar wrote:
> >
> > Btw., why did the patch (and the revert) make any difference to the
> > test? Timing differences look improbable.
>
> It's the change from
>
> !signal_group_exit(signal)
>
> to
>
> !sig_kernel_only(signr)
>
> and quite frankly, I still don't see the point.
Previously,
!signal_group_exit(signal)
meant: we do not know what we should do, so let's ignore this signal.
Unless the multithreaded init does exec, in this case we should
respect SIGKILL.
With the recent changes, sig_kernel_only() means: we already checked
we should handle SIGKILL/SIGSTOP when this signal was queued.
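The difference between the two checks can be sketched as a small userspace
model. This is only an illustration of the reasoning above, not the kernel
code: the toy_* names are hypothetical, and the model assumes the check is
reached only when a dequeued signal is about to get its default action.

```c
#include <assert.h>
#include <signal.h>
#include <stdbool.h>

/* Hypothetical stand-in for the kernel's sig_kernel_only(): true only for
 * SIGKILL and SIGSTOP, the two signals that cannot be caught or ignored. */
static bool toy_sig_kernel_only(int signr)
{
    assert(signr > 0);
    return signr == SIGKILL || signr == SIGSTOP;
}

/* Old rule (assumed shape): an unkillable init skips the signal unless a
 * group exit is already in progress, e.g. a multithreaded init doing exec. */
static bool toy_init_skips_old(bool unkillable, bool group_exit_in_progress)
{
    return unkillable && !group_exit_in_progress;
}

/* New rule (assumed shape): an unkillable init skips the signal unless it
 * is SIGKILL or SIGSTOP, trusting that the queueing path already decided
 * the signal should be handled. */
static bool toy_init_skips_new(bool unkillable, int signr)
{
    return unkillable && !toy_sig_kernel_only(signr);
}
```

Under this model a SIGSTOP that reaches init's queue is skipped by the old
rule but acted on by the new one, while a signal such as SIGTERM is skipped
by both.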
> The comment seems to be wrong too:
>
> If SIGSTOP/SIGKILL originate from a descendant of container-init they are
> never queued (i.e dropped in sig_ignored() in an earler patch).
>
> If SIGSTOP/SIGKILL originate from parent namespace, the signal is queued
> and container-init processes the signal.
>
> since the bug was that the SIGSTOP (from within the same container) was
> _not_ ignored like the comment says.
Yes, the changelog could be better because it ignores ptrace issues. But
this was discussed,
From http://marc.info/?t=123222433100001
Yes we should handle SIGSTOP fine if it sent from the parent namespace.
Also. Currently it is possible to ptrace the global init, but even
ptracer can't stop it (but ptrace_stop() works). With these patches
ptracer can stop init.
I forgot to mention this behaviour change, imho this side-effect
is good.
So, at least this change is not "by accident".
Oleg.
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
2009-05-16 19:14 2.6.30-rc6: Reported regressions from 2.6.29 Rafael J. Wysocki
2009-05-16 19:20 ` [Bug #13328] b44: eth0: BUG! Timeout waiting for bit 00000002 of register 42c to clear Rafael J. Wysocki
2009-05-17 7:33 ` 2.6.30-rc6: Reported regressions from 2.6.29 Ingo Molnar
@ 2009-05-22 16:40 ` Linus Torvalds
[not found] ` <alpine.LFD.2.01.0905220926150.3555-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
2 siblings, 1 reply; 13+ messages in thread
From: Linus Torvalds @ 2009-05-22 16:40 UTC (permalink / raw)
To: Rafael J. Wysocki
Cc: Adrian Bunk, DRI, Linux SCSI List, Network Development,
Linux Wireless List, Linux Kernel Mailing List,
Natalie Protasevich, Linux ACPI, Andrew Morton,
Kernel Testers List, Linux PM List
On Sat, 16 May 2009, Rafael J. Wysocki wrote:
>
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13329
> Subject : cifs_close: NULL pointer dereference
> Submitter : Luca Tettamanti <kronos.it@gmail.com>
> Date : 2009-05-16 16:28 (1 days old)
> References : http://marc.info/?l=linux-kernel&m=124249133701702&w=4
Fixed by 703a3b8e5c01cf6fb33c6d8dc99905f889a4e992.
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13298
> Subject : modprobe ipmi_si hangs under 2.6.30-rc5
> Submitter : Ferenc Wagner <wferi@niif.hu>
> Date : 2009-05-12 21:28 (5 days old)
> References : http://marc.info/?l=linux-kernel&m=124216379407177&w=4
Fixed by 9a2845c453d170e4e9b1437fa671dbf39b0e7bd8.
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13297
> Subject : kernel panic - not syncing : fatel exception in interupt
> Submitter : rob <rob1@housetosell.net>
> Date : 2009-05-12 19:34 (5 days old)
> References : http://marc.info/?l=linux-kernel&m=124216126903309&w=4
This should be fixed by e3b29f05124b07303088795396ff858811d2acb8.
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13177
> Subject : 2.6.30-rc2-git7 build problem
> Submitter : Martin Knoblauch <spamtrap@knobisoft.de>
> Date : 2009-04-21 13:39 (26 days old)
> References : http://marc.info/?l=linux-kernel&m=124032163602132&w=4
> http://lkml.org/lkml/2009/4/27/56
This one really looks like a compiler bug (the undefined symbol is an
internal gcc one: ".L192"). And the driver that gets it (kl5kusb105) has
not actually had any changes since 2.6.29 - although header files that it
includes obviously do have some, so it's entirely possible that some
infrastructure change causes it to trigger.
Martin hit another bug with his compiler (an ICE in gcc, fixed by commit
26a9a418237c0b06528941bca693c49c8d97edbe), which is why I'm so willing to
blame that.
> Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
> Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
> Submitter : Kumar Gala <galak@kernel.crashing.org>
> Date : 2009-04-09 15:43 (38 days old)
> First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
> References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
> http://lkml.org/lkml/2009/4/10/193
> Handled-By : Sukadev Bhattiprolu <sukadev@linux.vnet.ibm.com>
This one seems to have become "not-a-bug". Yes, LTP used to check that you
cannot send SIGSTOP to init, but people do want to send signals to init
(especially when they are inside pid-namespace containers), and as long as
it's only a LTP test that already got modified (and not some actual
real-life usage), I'm considering this a "test failed because it was
testing for behavior that we want to change" at least for now.
Linus
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: 2.6.30-rc6: Reported regressions from 2.6.29
[not found] ` <alpine.LFD.2.01.0905220926150.3555-bi+AKbBUZKY6gyzm1THtWbp2dZbC/Bob@public.gmane.org>
@ 2009-05-23 0:10 ` Rafael J. Wysocki
0 siblings, 0 replies; 13+ messages in thread
From: Rafael J. Wysocki @ 2009-05-23 0:10 UTC (permalink / raw)
To: Linus Torvalds
Cc: Linux Kernel Mailing List, Adrian Bunk, Andrew Morton,
Natalie Protasevich, Kernel Testers List, Network Development,
Linux ACPI, Linux PM List, Linux SCSI List, Linux Wireless List,
DRI
On Friday 22 May 2009, Linus Torvalds wrote:
>
> On Sat, 16 May 2009, Rafael J. Wysocki wrote:
> >
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13329
> > Subject : cifs_close: NULL pointer dereference
> > Submitter : Luca Tettamanti <kronos.it-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> > Date : 2009-05-16 16:28 (1 days old)
> > References : http://marc.info/?l=linux-kernel&m=124249133701702&w=4
>
> Fixed by 703a3b8e5c01cf6fb33c6d8dc99905f889a4e992.
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13298
> > Subject : modprobe ipmi_si hangs under 2.6.30-rc5
> > Submitter : Ferenc Wagner <wferi-eEbw3PyuezQ@public.gmane.org>
> > Date : 2009-05-12 21:28 (5 days old)
> > References : http://marc.info/?l=linux-kernel&m=124216379407177&w=4
>
> Fixed by 9a2845c453d170e4e9b1437fa671dbf39b0e7bd8.
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13297
> > Subject : kernel panic - not syncing : fatel exception in interupt
> > Submitter : rob <rob1-z2Ut7v/pTR/vt0rt8C/ehw@public.gmane.org>
> > Date : 2009-05-12 19:34 (5 days old)
> > References : http://marc.info/?l=linux-kernel&m=124216126903309&w=4
>
> This should be fixed by e3b29f05124b07303088795396ff858811d2acb8.
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13177
> > Subject : 2.6.30-rc2-git7 build problem
> > Submitter : Martin Knoblauch <spamtrap-Ys4E+72pFW0hFhg+JK9F0w@public.gmane.org>
> > Date : 2009-04-21 13:39 (26 days old)
> > References : http://marc.info/?l=linux-kernel&m=124032163602132&w=4
> > http://lkml.org/lkml/2009/4/27/56
>
> This one really looks like a compiler bug (the undefined symbol is an
> internal gcc one: ".L192"). And the driver that gets it (kl5kusb105) has
> not actually had any changes since 2.6.29 - although header files that it
> includes obviously do have some, so it's entirely possible that some
> infrastructure change causes it to trigger.
>
> Martin hit another bug with his compiler (an ICE in gcc, fixed by commit
> 26a9a418237c0b06528941bca693c49c8d97edbe), which is why I'm so willing to
> blame that.
>
> > Bug-Entry : http://bugzilla.kernel.org/show_bug.cgi?id=13107
> > Subject : LTP 20080131 causes defunct processes w/2.6.30-rc1
> > Submitter : Kumar Gala <galak-XVmvHMARGAS8U2dJNN8I7kB+6BGkLq7r@public.gmane.org>
> > Date : 2009-04-09 15:43 (38 days old)
> > First-Bad-Commit: http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=b3bfa0cba867f23365b81658b47efd906830879b
> > References : http://marc.info/?l=linux-kernel&m=123929187208953&w=4
> > http://lkml.org/lkml/2009/4/10/193
> > Handled-By : Sukadev Bhattiprolu <sukadev-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
>
> This one seems to have become "not-a-bug". Yes, LTP used to check that you
> cannot send SIGSTOP to init, but people do want to send signals to init
> (especially when they are inside pid-namespace containers), and as long as
> it's only a LTP test that already got modified (and not some actual
> real-life usage), I'm considering this a "test failed because it was
> testing for behavior that we want to change" at least for now.
Thanks a lot, this really helps! :-)
I've closed all of the bugs above.
Best,
Rafael
^ permalink raw reply [flat|nested] 13+ messages in thread