* 2.6.16.1 & D state processes
@ 2006-04-17 23:49 Srihari Vijayaraghavan
2006-04-18 4:03 ` Andrew Morton
0 siblings, 1 reply; 12+ messages in thread
From: Srihari Vijayaraghavan @ 2006-04-17 23:49 UTC (permalink / raw)
To: linux-kernel
SysRq+P:
Pid: 0, comm: swapper
EIP: 0060:[<c0101c9e>] CPU: 0
EIP is at default_idle+0x2b/0x53
EFLAGS: 00200246 Not tainted (2.6.16.1 #2)
EAX: 00000000 EBX: 00010809 ECX: 00000000 EDX:
c0395000
ESI: 00099100 EDI: c038a800 EBP: 00444007 DS: 007b ES:
007b
CR0: 8005003b CR2: 00ad11c5 CR3: 1926c000 CR4:
000006d0
[<c0101d00>] cpu_idle+0x3a/0x4f
[<c0396610>] start_kernel+0x28d/0x28f
SysRq+T (of only D state processes - 6 of them):
kjournald D C1605158 2912 338 1
404 330 (L-TLB)
c1611e74 c6747cd4 0000000a c1605158 c1605030 c7847f00
003d7b1c 00000000
c7847f00 003d7b1c 003d0900 00000000 c1611ecc
00000000 c1611ed4 c1611e7c
c02d1742 c13f1930 c01520d6 c02d1f34 c01520ab
c1611ecc c43f5ed8 c1611ec8
Call Trace:
[<c02d1742>] io_schedule+0xe/0x16
[<c01520d6>] sync_buffer+0x2b/0x2e
[<c02d1f34>] __wait_on_bit+0x33/0x58
[<c01520ab>] sync_buffer+0x0/0x2e
[<c02d1fce>] out_of_line_wait_on_bit+0x75/0x7d
[<c01520ab>] sync_buffer+0x0/0x2e
[<c012956c>] wake_bit_function+0x0/0x3c
[<c015203c>] __wait_on_buffer+0x1c/0x1f
[<e00337e2>] journal_commit_transaction+0x809/0xeab
[jbd]
[<c02d281d>] _spin_unlock_irq+0x5/0x7
[<c02d15fd>] schedule+0x49f/0x4fd
[<e0036e80>] kjournald+0xbd/0x20e [jbd]
[<c011743c>] schedule_tail+0x34/0x90
[<e0036769>] commit_timeout+0x0/0x5 [jbd]
[<c012953f>] autoremove_wake_function+0x0/0x2d
[<e0036dc3>] kjournald+0x0/0x20e [jbd]
[<c0101005>] kernel_thread_helper+0x5/0xb
kjournald D C1572BD8 2944 1204 1
1206 1202 (L-TLB)
dead0e74 c6747a14 0000000a c1572bd8 c1572ab0 c7847f00
003d7b1c 00000000
c7847f00 003d7b1c 003d0900 00000000 dead0ecc
00000000 dead0ed4 dead0e7c
c02d1742 c13ef320 c01520d6 c02d1f34 c01520ab
dead0ecc c43f5e70 dead0ec8
Call Trace:
[<c02d1742>] io_schedule+0xe/0x16
[<c01520d6>] sync_buffer+0x2b/0x2e
[<c02d1f34>] __wait_on_bit+0x33/0x58
[<c01520ab>] sync_buffer+0x0/0x2e
[<c02d1fce>] out_of_line_wait_on_bit+0x75/0x7d
[<c01520ab>] sync_buffer+0x0/0x2e
[<c012956c>] wake_bit_function+0x0/0x3c
[<c015203c>] __wait_on_buffer+0x1c/0x1f
[<e00337e2>] journal_commit_transaction+0x809/0xeab
[jbd]
[<c02d281d>] _spin_unlock_irq+0x5/0x7
[<c02d15fd>] schedule+0x49f/0x4fd
[<e0036e80>] kjournald+0xbd/0x20e [jbd]
[<c011743c>] schedule_tail+0x34/0x90
[<e0036769>] commit_timeout+0x0/0x5 [jbd]
[<c012953f>] autoremove_wake_function+0x0/0x2d
[<e0036dc3>] kjournald+0x0/0x20e [jbd]
[<c0101005>] kernel_thread_helper+0x5/0xb
kjournald D C1555698 2912 1206 1
1208 1204 (L-TLB)
de991e60 c02d281d 0000000a c1555698 c1555570 c7847f00
003d7b1c 00000000
c7847f00 003d7b1c 003d0900 00000000 de991eb4
de991eb4 c13f1378 de991e68
c02d1742 c01520ab c01520d6 c02d1e5d de991eb4
d74708f4 de991eb0 00000002
Call Trace:
[<c02d281d>] _spin_unlock_irq+0x5/0x7
[<c02d1742>] io_schedule+0xe/0x16
[<c01520ab>] sync_buffer+0x0/0x2e
[<c01520d6>] sync_buffer+0x2b/0x2e
[<c02d1e5d>] __wait_on_bit_lock+0x2a/0x51
[<c02d1ef9>] out_of_line_wait_on_bit_lock+0x75/0x7d
[<c01520ab>] sync_buffer+0x0/0x2e
[<c012956c>] wake_bit_function+0x0/0x3c
[<c01521ea>] __lock_buffer+0x20/0x23
[<c015222e>] ll_rw_block+0x41/0xa4
[<e0033348>] journal_commit_transaction+0x36f/0xeab
[jbd]
[<c02d281d>] _spin_unlock_irq+0x5/0x7
[<c02d15fd>] schedule+0x49f/0x4fd
[<c02d285c>] _spin_lock_irqsave+0x9/0xd
[<e0036e80>] kjournald+0xbd/0x20e [jbd]
[<c011743c>] schedule_tail+0x34/0x90
[<e0036769>] commit_timeout+0x0/0x5 [jbd]
[<c012953f>] autoremove_wake_function+0x0/0x2d
[<e0036dc3>] kjournald+0x0/0x20e [jbd]
[<c0101005>] kernel_thread_helper+0x5/0xb
kjournald D C1593698 2944 1208 1
1210 1206 (L-TLB)
de87ae74 c6747ac4 0000000a c1593698 c1593570 c7847f00
003d7b1c 00000000
c7847f00 003d7b1c 003d0900 00000000 de87aecc
00000000 de87aed4 de87ae7c
c02d1742 c13ef4d0 c01520d6 c02d1f34 c01520ab
de87aecc c43f5c68 de87aec8
Call Trace:
[<c02d1742>] io_schedule+0xe/0x16
[<c01520d6>] sync_buffer+0x2b/0x2e
[<c02d1f34>] __wait_on_bit+0x33/0x58
[<c01520ab>] sync_buffer+0x0/0x2e
[<c02d1fce>] out_of_line_wait_on_bit+0x75/0x7d
[<c01520ab>] sync_buffer+0x0/0x2e
[<c012956c>] wake_bit_function+0x0/0x3c
[<c015203c>] __wait_on_buffer+0x1c/0x1f
[<e00337e2>] journal_commit_transaction+0x809/0xeab
[jbd]
[<c02d281d>] _spin_unlock_irq+0x5/0x7
[<c02d15fd>] schedule+0x49f/0x4fd
[<e0036e80>] kjournald+0xbd/0x20e [jbd]
[<c011743c>] schedule_tail+0x34/0x90
[<e0036769>] commit_timeout+0x0/0x5 [jbd]
[<c012953f>] autoremove_wake_function+0x0/0x2d
[<e0036dc3>] kjournald+0x0/0x20e [jbd]
[<c0101005>] kernel_thread_helper+0x5/0xb
kjournald D C1555BD8 2924 1212 1
1540 1210 (L-TLB)
c1585e74 c02d285c 0000000a c1555bd8 c1555ab0 eb380300
003d7b1b 00000000
00200082 c15d1b9c 2d4cae00 00000000 c1585ecc
00000000 c1585ed4 c1585e7c
c02d1742 c13f0f40 c01520d6 c02d1f34 c01520ab
c1585ecc c43f5f40 c1585ec8
Call Trace:
[<c02d285c>] _spin_lock_irqsave+0x9/0xd
[<c02d1742>] io_schedule+0xe/0x16
[<c01520d6>] sync_buffer+0x2b/0x2e
[<c02d1f34>] __wait_on_bit+0x33/0x58
[<c01520ab>] sync_buffer+0x0/0x2e
[<c02d1fce>] out_of_line_wait_on_bit+0x75/0x7d
[<c01520ab>] sync_buffer+0x0/0x2e
[<c012956c>] wake_bit_function+0x0/0x3c
[<c015203c>] __wait_on_buffer+0x1c/0x1f
[<e00337e2>] journal_commit_transaction+0x809/0xeab
[jbd]
[<c02d281d>] _spin_unlock_irq+0x5/0x7
[<c02d15fd>] schedule+0x49f/0x4fd
[<c02d285c>] _spin_lock_irqsave+0x9/0xd
[<c0121239>] lock_timer_base+0x15/0x2f
[<e0036e80>] kjournald+0xbd/0x20e [jbd]
[<c011743c>] schedule_tail+0x34/0x90
[<e0036769>] commit_timeout+0x0/0x5 [jbd]
[<c012953f>] autoremove_wake_function+0x0/0x2d
[<e0036dc3>] kjournald+0x0/0x20e [jbd]
[<c0101005>] kernel_thread_helper+0x5/0xb
syslogd D C154ABD8 2312 1572 1
1575 1540 (NOTLB)
dbfbfe94 00000003 00000009 c154abd8 c154aab0 eafafa00
003d7b1b 00000000
eafafa00 003d7b1b 1cd94100 00000000 de7c2a00
de7c2a14 de7c2a84 dbfbfec0
e0036186 00249438 00000000 00000000 00000000
00000000 00000000 dbfbfed8
Call Trace:
[<e0036186>] log_wait_commit+0xaa/0xf3 [jbd]
[<c0116955>] __wake_up+0x2a/0x3d
[<c012953f>] autoremove_wake_function+0x0/0x2d
[<e00316c3>] journal_stop+0x1a0/0x1cd [jbd]
[<c016cb8b>] __writeback_single_inode+0x1be/0x31a
[<c013be53>] do_writepages+0x2b/0x32
[<c0137f64>] __filemap_fdatawrite_range+0x65/0x71
[<c016cd00>] sync_inode+0x19/0x2a
[<e0063cf9>] ext3_sync_file+0x9d/0xb0 [ext3]
[<c0151dab>] do_fsync+0x62/0xbb
[<c01029eb>] sysenter_past_esp+0x54/0x75
mingetty D C1593BD8 2712 1997 1
1998 1952 (NOTLB)
df2c3cc8 c6747754 00000009 c1593bd8 c1593ab0 c7c18800
003d7b1c 00000000
00000246 c13f11c8 003d0900 00000000 df2c3d20
00000000 df2c3d28 df2c3cd0
c02d1742 c13f11c8 c01520d6 c02d1f34 c01520ab
df2c3d20 c43f5c34 df2c3d1c
Call Trace:
[<c02d1742>] io_schedule+0xe/0x16
[<c01520d6>] sync_buffer+0x2b/0x2e
[<c02d1f34>] __wait_on_bit+0x33/0x58
[<c01520ab>] sync_buffer+0x0/0x2e
[<c02d1fce>] out_of_line_wait_on_bit+0x75/0x7d
[<c01520ab>] sync_buffer+0x0/0x2e
[<c012956c>] wake_bit_function+0x0/0x3c
[<c015203c>] __wait_on_buffer+0x1c/0x1f
[<c0152c5b>] __bread+0x8a/0x9d
[<e007146a>] ext3_xattr_get+0x14d/0x23b [ext3]
[<c01519ee>] __getblk+0x3d/0x23b
[<e00723e9>] ext3_xattr_security_get+0x38/0x42 [ext3]
[<c016b0aa>] generic_getxattr+0x3a/0x40
[<c019e6e2>] inode_doinit_with_dentry+0x171/0x494
[<c0164324>] d_splice_alias+0xb0/0xd1
[<e006b1c9>] ext3_lookup+0x72/0x77 [ext3]
[<c015b325>] do_lookup+0xa6/0x138
[<c015ce6b>] __link_path_walk+0x81a/0xc5d
[<c015d2f5>] link_path_walk+0x47/0xb9
[<c0135954>] audit_syscall_entry+0x118/0x13f
[<c015d685>] do_path_lookup+0x1dc/0x241
[<c015df3c>] __path_lookup_intent_open+0x42/0x72
[<c015dfbb>] path_lookup_open+0xf/0x13
[<c01586a9>] open_exec+0x1e/0xb3
[<c0135954>] audit_syscall_entry+0x118/0x13f
[<c0159bc6>] do_execve+0x3e/0x1ea
[<c01018c0>] sys_execve+0x2b/0x69
[<c0102a45>] syscall_call+0x7/0xb
kcheckpass D CE66F158 2728 12164 2366
2367 (NOTLB)
c3dc1d48 c17b7784 00000008 ce66f158 ce66f030 c7c18800
003d7b1c 00000000
c7c18800 003d7b1c 00000000 00000000 df0b5400
c3dc1d8c c13f1c48 df169650
e0031abf 00000000 dcb4ea60 c3e78388 00000000
d9d0ca40 00000000 00000000
Call Trace:
[<e0031abf>] do_get_write_access+0x2e1/0x4c1 [jbd]
[<c012956c>] wake_bit_function+0x0/0x3c
[<e0031cb7>] journal_get_write_access+0x18/0x26 [jbd]
[<e00652e6>] ext3_reserve_inode_write+0x2f/0x78
[ext3]
[<e0065358>] ext3_mark_inode_dirty+0x29/0x3f [ext3]
[<e0067aae>] ext3_dirty_inode+0x53/0x66 [ext3]
[<c016cee1>] __mark_inode_dirty+0x27/0x152
[<c0137dea>] do_generic_mapping_read+0x3b9/0x400
[<c013846d>] __generic_file_aio_read+0x148/0x18f
[<c0137351>] file_read_actor+0x0/0xe0
[<c01384ef>] generic_file_aio_read+0x3b/0x42
[<c014fcf3>] do_sync_read+0xb8/0xf3
[<c012953f>] autoremove_wake_function+0x0/0x2d
[<c014fc3b>] do_sync_read+0x0/0xf3
[<c01505bc>] vfs_read+0x9f/0x13e
[<c0150923>] sys_read+0x3c/0x63
[<c0102a45>] syscall_call+0x7/0xb
I've reported the same problem on FC5 kernels
(https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=186856)
too, assuming it was an FC only problem. But as it
turns out, kernel.org kernels too suffer from this,
though it took a while to prove that :(.
Surprisingly, I can't trigger it with my minimal
.config, however, when I use FC5's .config, within a
few days, this problem appears. Any particular
component you'd like me to isolate?
I think I ought to try 2.6.16.<latest>.
Thanks
Hari
PS: Pls keep me in the replies.
Send instant messages to your online friends http://au.messenger.yahoo.com
^ permalink raw reply [flat|nested] 12+ messages in thread* Re: 2.6.16.1 & D state processes 2006-04-17 23:49 2.6.16.1 & D state processes Srihari Vijayaraghavan @ 2006-04-18 4:03 ` Andrew Morton 2006-04-18 5:07 ` Srihari Vijayaraghavan 0 siblings, 1 reply; 12+ messages in thread From: Andrew Morton @ 2006-04-18 4:03 UTC (permalink / raw) To: Srihari Vijayaraghavan; +Cc: linux-kernel Srihari Vijayaraghavan <sriharivijayaraghavan@yahoo.com.au> wrote: > > I've reported the same problem on FC5 kernels > (https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=186856) > too, assuming it was an FC only problem. But as it > turns out, kernel.org kernels too suffer from this, > though it took a while to prove that :(. Your I/O system has lost some I/O requests: we sent write requests down, and no completion ever came back. It could be a broken device driver - please describe the I/O stack. Or it could be lost interrupts - could be broken platform interrupt setup, or broken hardware. ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-18 4:03 ` Andrew Morton @ 2006-04-18 5:07 ` Srihari Vijayaraghavan 2006-04-18 6:30 ` Mike Galbraith 0 siblings, 1 reply; 12+ messages in thread From: Srihari Vijayaraghavan @ 2006-04-18 5:07 UTC (permalink / raw) To: Andrew Morton; +Cc: linux-kernel --- Andrew Morton <akpm@osdl.org> wrote: > Srihari Vijayaraghavan > <sriharivijayaraghavan@yahoo.com.au> wrote: > [...] > Your I/O system has lost some I/O requests: we sent > write requests down, > and no completion ever came back. > > It could be a broken device driver - please describe > the I/O stack. Here's the lspci: 00:00.0 Host bridge: Intel Corporation 82845G/GL[Brookdale-G]/GE/PE DRAM Controller/Host-Hub Interface (rev 01) 00:02.0 VGA compatible controller: Intel Corporation 82845G/GL[Brookdale-G]/GE Chipset Integrated Graphics Device (rev 01) 00:1d.0 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #1 (rev 01) 00:1d.1 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #2 (rev 01) 00:1d.2 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #3 (rev 01) 00:1d.7 USB Controller: Intel Corporation 82801DB/DBM (ICH4/ICH4-M) USB2 EHCI Controller (rev 01) 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 81) 00:1f.0 ISA bridge: Intel Corporation 82801DB/DBL (ICH4/ICH4-L) LPC Interface Bridge (rev 01) 00:1f.1 IDE interface: Intel Corporation 82801DB (ICH4) IDE Controller (rev 01) 00:1f.3 SMBus: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) SMBus Controller (rev 01) 00:1f.5 Multimedia audio controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) AC'97 Audio Controller (rev 01) 02:08.0 Ethernet controller: Intel Corporation 82801DB PRO/100 VE (LOM) Ethernet Controller (rev 81) Here's the complete dmesg (with HDD & CD drives info too): Linux version 2.6.16.7 (*******@*********) (gcc version 4.1.0 20060304 (Red Hat 4.1.0-3)) #1 Fri Apr 14 21:44:33 EST 2006 BIOS-provided physical RAM map: BIOS-e820: 0000000000000000 - 000000000009f800 (usable) BIOS-e820: 000000000009f800 - 00000000000a0000 (reserved) BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 000000001f6f0000 (usable) BIOS-e820: 000000001f6f0000 - 000000001f6fb000 (ACPI data) BIOS-e820: 000000001f6fb000 - 000000001f700000 (ACPI NVS) BIOS-e820: 000000001f700000 - 000000001f780000 (usable) BIOS-e820: 000000001f780000 - 0000000020000000 (reserved) BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved) BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) BIOS-e820: 00000000ff800000 - 00000000ffc00000 (reserved) BIOS-e820: 00000000fffffc00 - 0000000100000000 (reserved) 0MB HIGHMEM available. 503MB LOWMEM available. found SMP MP-table at 000f62d0 On node 0 totalpages: 128896 DMA zone: 4096 pages, LIFO batch:0 DMA32 zone: 0 pages, LIFO batch:0 Normal zone: 124800 pages, LIFO batch:31 HighMem zone: 0 pages, LIFO batch:0 DMI present. ACPI: RSDP (v000 PTLTD ) @ 0x000f6360 ACPI: RSDT (v001 PTLTD RSDT 0x060400d0 LTP 0x00000000) @ 0x1f6f73ec ACPI: FADT (v001 IBM NETVISTA 0x060400d0 PTL 0x00000001) @ 0x1f6faee2 ACPI: TCPA (v001 IBM NETVISTA 0x060400d0 PTL 0x00000001) @ 0x1f6faf56 ACPI: MADT (v001 PTLTD APIC 0x060400d0 LTP 0x00000000) @ 0x1f6faf88 ACPI: BOOT (v001 PTLTD $SBFTBL$ 0x060400d0 LTP 0x00000001) @ 0x1f6fafd8 ACPI: DSDT (v001 IBM Yelotail 0x060400d0 MSFT 0x0100000e) @ 0x00000000 ACPI: PM-Timer IO Port: 0x1008 ACPI: Local APIC address 0xfee00000 ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled) Processor #0 15:2 APIC version 20 ACPI: LAPIC_NMI (acpi_id[0x00] high edge lint[0x1]) ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0]) IOAPIC[0]: apic_id 1, version 32, address 0xfec00000, GSI 0-23 ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) ACPI: IRQ9 used by override. Enabling APIC mode: Flat. Using 1 I/O APICs Using ACPI (MADT) for SMP configuration information Allocating PCI resources starting at 30000000 (gap: 20000000:dec00000) Built 1 zonelists Kernel command line: ro root=LABEL=/ rhgb console=ttyS0,115200 console=tty0 mapped APIC to ffffd000 (fee00000) mapped IOAPIC to ffffc000 (fec00000) Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Initializing CPU#0 CPU 0 irqstacks, hard=c03c8000 soft=c03c7000 PID hash table entries: 2048 (order: 11, 32768 bytes) Detected 2392.564 MHz processor. Using pmtmr for high-res timesource Console: colour VGA+ 80x25 Dentry cache hash table entries: 65536 (order: 6, 262144 bytes) Inode-cache hash table entries: 32768 (order: 5, 131072 bytes) Memory: 506324k/515584k available (1874k kernel code, 8616k reserved, 766k data, 176k init, 0k highmem) Checking if this processor honours the WP bit even in supervisor mode... Ok. Calibrating delay using timer specific routine.. 4792.76 BogoMIPS (lpj=9585538) Security Framework v1.0.0 initialized SELinux: Initializing. SELinux: Starting in permissive mode selinux_register_security: Registering secondary module capability Capability LSM initialized as secondary Mount-cache hash table entries: 512 CPU: After generic identify, caps: bfebfbff 00000000 00000000 00000000 00004400 00000000 00000000 CPU: After vendor identify, caps: bfebfbff 00000000 00000000 00000000 00004400 00000000 00000000 CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: After all inits, caps: bfebfbff 00000000 00000000 00000080 00004400 00000000 00000000 Intel machine check architecture supported. Intel machine check reporting enabled on CPU#0. CPU0: Intel P4/Xeon Extended MCE MSRs (12) available CPU0: Thermal monitoring enabled CPU: Intel(R) Pentium(R) 4 CPU 2.40GHz stepping 09 Checking 'hlt' instruction... OK. ENABLING IO-APIC IRQs ..TIMER: vector=0x31 apic1=0 pin1=0 apic2=-1 pin2=-1 checking if image is initramfs... it is Freeing initrd memory: 900k freed NET: Registered protocol family 16 ACPI: bus type pci registered PCI: PCI BIOS revision 2.10 entry at 0xfd98d, last bus=2 PCI: Using configuration type 1 ACPI: Subsystem revision 20060127 ACPI: Interpreter enabled ACPI: Using IOAPIC for interrupt routing ACPI: PCI Root Bridge [PCI0] (0000:00) PCI: Probing PCI hardware (bus 00) Boot video device is 0000:00:02.0 PCI quirk: region 1000-107f claimed by ICH4 ACPI/GPIO/TCO PCI quirk: region 1180-11bf claimed by ICH4 GPIO PCI: Ignoring BAR0-3 of IDE controller 0000:00:1f.1 PCI: Transparent bridge - 0000:00:1e.0 ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.SLOT._PRT] ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 7 9 10 *11 12 14 15) ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 5 7 *9 10 11 12 14 15) ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 *5 7 9 10 11 12 14 15) ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 7 9 *10 11 12 14 15) ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 7 *9 10 11 12 14 15) ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 7 9 10 11 12 14 15) *0, disabled. ACPI: PCI Interrupt Link [LNKG] (IRQs 3 4 5 7 9 10 11 12 14 15) *0, disabled. ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 7 *9 10 11 12 14 15) Linux Plug and Play Support v0.97 (c) Adam Belay pnp: PnP ACPI init pnp: PnP ACPI: found 15 devices usbcore: registered new driver usbfs usbcore: registered new driver hub PCI: Using ACPI for IRQ routing PCI: If a device doesn't work, try "pci=routeirq". If it helps, post a report PCI: Ignore bogus resource 6 [0:0] of 0000:00:02.0 PCI: Bridge: 0000:00:1e.0 IO window: 2000-2fff MEM window: c0100000-c01fffff PREFETCH window: disabled. PCI: Setting latency timer of device 0000:00:1e.0 to 64 Simple Boot Flag at 0x6c set to 0x1 IBM machine detected. Enabling interrupts during APM calls. apm: BIOS version 1.2 Flags 0x03 (Driver version 1.16ac) apm: overridden by ACPI. audit: initializing netlink socket (disabled) audit(1145359155.056:1): initialized Total HugeTLB memory allocated, 0 VFS: Disk quotas dquot_6.5.1 Dquot-cache hash table entries: 1024 (order 0, 4096 bytes) SELinux: Registering netfilter hooks Initializing Cryptographic API io scheduler noop registered io scheduler anticipatory registered io scheduler deadline registered io scheduler cfq registered (default) pci_hotplug: PCI Hot Plug PCI Core version: 0.5 ACPI: Processor [CPU0] (supports 8 throttling states) ACPI: Thermal Zone [THM0] (49 C) isapnp: Scanning for PnP cards... isapnp: No Plug & Play device found Real Time Clock Driver v1.12ac Linux agpgart interface v0.101 (c) Dave Jones agpgart: Detected an Intel 845G Chipset. agpgart: Detected 8060K stolen memory. agpgart: AGP aperture is 128M @ 0x88000000 PNP: PS/2 Controller [PNP0303:KBC,PNP0f13:PSM] at 0x60,0x64 irq 1,12 serio: i8042 AUX port at 0x60,0x64 irq 12 serio: i8042 KBD port at 0x60,0x64 irq 1 Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A 00:0c: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A 00:0d: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A RAMDISK driver initialized: 16 RAM disks of 16384K size 1024 blocksize Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx ICH4: IDE controller at PCI slot 0000:00:1f.1 PCI: Enabling device 0000:00:1f.1 (0005 -> 0007) ACPI: PCI Interrupt 0000:00:1f.1[A] -> GSI 18 (level, low) -> IRQ 16 ICH4: chipset revision 1 ICH4: not 100% native mode: will probe irqs later ide0: BM-DMA at 0x1860-0x1867, BIOS settings: hda:DMA, hdb:pio ide1: BM-DMA at 0x1868-0x186f, BIOS settings: hdc:DMA, hdd:DMA Probing IDE interface ide0... hda: IC35L060AVV207-0, ATA DISK drive ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 Probing IDE interface ide1... hdc: LITE-ON LTR-48246S, ATAPI CD/DVD-ROM drive hdd: HL-DT-ST CD-ROM GCR-8482B, ATAPI CD/DVD-ROM drive ide1 at 0x170-0x177,0x376 on irq 15 hda: max request size: 512KiB hda: 78156288 sectors (40016 MB) w/1821KiB Cache, CHS=16383/255/63, UDMA(100) hda: cache flushes supported hda: hda1 hda2 hda3 hda4 < hda5 hda6 hda7 hda8 hda9 hda10 hda11 > hdc: ATAPI 48X CD-ROM CD-R/RW drive, 2048kB Cache, UDMA(33) Uniform CD-ROM driver Revision: 3.20 hdd: ATAPI 48X CD-ROM drive, 128kB Cache, UDMA(33) ide-floppy driver 0.99.newide usbcore: registered new driver libusual usbcore: registered new driver hiddev usbcore: registered new driver usbhid drivers/usb/input/hid-core.c: v2.6:USB HID core driver mice: PS/2 mouse device common for all mice md: md driver 0.90.3 MAX_MD_DEVS=256, MD_SB_DISKS=27 md: bitmap version 4.39 NET: Registered protocol family 2 input: AT Translated Set 2 keyboard as /class/input/input0 IP route cache hash table entries: 4096 (order: 2, 16384 bytes) TCP established hash table entries: 16384 (order: 6, 262144 bytes) TCP bind hash table entries: 16384 (order: 6, 327680 bytes) TCP: Hash tables configured (established 16384 bind 16384) TCP reno registered TCP bic registered Initializing IPsec netlink socket NET: Registered protocol family 1 NET: Registered protocol family 17 Using IPI Shortcut mode ACPI wakeup devices: USB1 USB2 USB3 USBE SLOT KBC COMA COMB ACPI: (supports S0 S1 S3 S4 S5) Freeing unused kernel memory: 176k freed Write protecting the kernel read-only data: 354k input: PS/2 Generic Mouse as /class/input/input1 kjournald starting. Commit interval 5 seconds EXT3-fs: mounted filesystem with ordered data mode. security: 3 users, 6 roles, 1164 types, 137 bools, 1 sens, 256 cats security: 55 classes, 37899 rules SELinux: Completing initialization. SELinux: Setting up existing superblocks. SELinux: initialized (dev hda9, type ext3), uses xattr SELinux: initialized (dev tmpfs, type tmpfs), uses transition SIDs SELinux: initialized (dev debugfs, type debugfs), uses genfs_contexts SELinux: initialized (dev selinuxfs, type selinuxfs), uses genfs_contexts SELinux: initialized (dev mqueue, type mqueue), uses transition SIDs SELinux: initialized (dev hugetlbfs, type hugetlbfs), uses genfs_contexts SELinux: initialized (dev devpts, type devpts), uses transition SIDs SELinux: initialized (dev eventpollfs, type eventpollfs), uses genfs_contexts SELinux: initialized (dev inotifyfs, type inotifyfs), uses genfs_contexts SELinux: initialized (dev tmpfs, type tmpfs), uses transition SIDs SELinux: initialized (dev futexfs, type futexfs), uses genfs_contexts SELinux: initialized (dev pipefs, type pipefs), uses task SIDs SELinux: initialized (dev sockfs, type sockfs), uses task SIDs SELinux: initialized (dev proc, type proc), uses genfs_contexts SELinux: initialized (dev bdev, type bdev), uses genfs_contexts SELinux: initialized (dev rootfs, type rootfs), uses genfs_contexts SELinux: initialized (dev sysfs, type sysfs), uses genfs_contexts SELinux: initialized (dev usbfs, type usbfs), uses genfs_contexts hw_random: RNG not detected i810_smbus 0000:00:02.0: i810/i815 i2c device found. audit(1145323161.860:2): avc: denied { search } for pid=534 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] e100: Intel(R) PRO/100 Network Driver, 3.5.10-k2-NAPI e100: Copyright(c) 1999-2005 Intel Corporation ACPI: PCI Interrupt 0000:02:08.0[A] -> GSI 20 (level, low) -> IRQ 17 e100: eth0: e100_probe: addr 0xc0100000, irq 17, MAC addr 00:0D:60:30:03:E2 ACPI: PCI Interrupt 0000:00:1f.5[B] -> GSI 17 (level, low) -> IRQ 18 PCI: Setting latency timer of device 0000:00:1f.5 to 64 intel8x0_measure_ac97_clock: measured 53796 usecs intel8x0: clocking to 48000 audit(1145323163.088:7): avc: denied { search } for pid=623 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] USB Universal Host Controller Interface driver v2.3 ACPI: PCI Interrupt 0000:00:1d.0[A] -> GSI 16 (level, low) -> IRQ 19 PCI: Setting latency timer of device 0000:00:1d.0 to 64 uhci_hcd 0000:00:1d.0: UHCI Host Controller uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 1 uhci_hcd 0000:00:1d.0: irq 19, io base 0x00001800 usb usb1: configuration #1 chosen from 1 choice hub 1-0:1.0: USB hub found hub 1-0:1.0: 2 ports detected ACPI: PCI Interrupt 0000:00:1d.1[B] -> GSI 19 (level, low) -> IRQ 20 PCI: Setting latency timer of device 0000:00:1d.1 to 64 uhci_hcd 0000:00:1d.1: UHCI Host Controller uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 2 uhci_hcd 0000:00:1d.1: irq 20, io base 0x00001820 usb usb2: configuration #1 chosen from 1 choice hub 2-0:1.0: USB hub found hub 2-0:1.0: 2 ports detected audit(1145323165.993:192): avc: denied { search } for pid=902 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] ACPI: PCI Interrupt 0000:00:1d.2[C] -> GSI 18 (level, low) -> IRQ 16 PCI: Setting latency timer of device 0000:00:1d.2 to 64 uhci_hcd 0000:00:1d.2: UHCI Host Controller uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 3 uhci_hcd 0000:00:1d.2: irq 16, io base 0x00001840 usb usb3: configuration #1 chosen from 1 choice hub 3-0:1.0: USB hub found hub 3-0:1.0: 2 ports detected audit(1145323166.117:202): avc: denied { search } for pid=910 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] ACPI: PCI Interrupt 0000:00:1d.7[D] -> GSI 23 (level, low) -> IRQ 21 PCI: Setting latency timer of device 0000:00:1d.7 to 64 ehci_hcd 0000:00:1d.7: EHCI Host Controller PCI: cache line size of 128 is not supported by device 0000:00:1d.7 ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 4 ehci_hcd 0000:00:1d.7: irq 21, io mem 0xc0080000 ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004 usb usb4: configuration #1 chosen from 1 choice hub 4-0:1.0: USB hub found hub 4-0:1.0: 6 ports detected audit(1145323166.701:262): avc: denied { search } for pid=1007 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] Non-volatile memory driver v1.2 Floppy drive(s): fd0 is 1.44M FDC 0 is a post-1991 82077 audit(1145323167.305:307): avc: denied { search } for pid=1106 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] parport: PnPBIOS parport detected. parport0: PC-style at 0x378 (0x778), irq 7 [PCSPP,TRISTATE] lp0: using parport0 (interrupt-driven). lp0: console ready audit(1145323167.549:312): avc: denied { search } for pid=1126 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] ACPI: Power Button (FF) [PWRF] ACPI: Power Button (CM) [PWRB] ibm_acpi: ec object not found md: Autodetecting RAID arrays. md: autorun ... md: ... autorun DONE. audit(1145323168.401:327): avc: denied { search } for pid=1163 comm="pam_console_app" name="var" dev=hda9 ino=31906 scontext=system_u:system_r:pam_console_t:s0-s0:c0.c255 tcontext=system_u:object_r:file_t:s0 tclass=dir [...] EXT3 FS on hda9, internal journal audit(1145323169.865:337): avc: denied { sys_resource } for pid=1191 comm="mount" capability=24 scontext=system_u:system_r:mount_t:s0 tcontext=system_u:system_r:mount_t:s0 tclass=capability [...] kjournald starting. Commit interval 5 seconds EXT3 FS on hda2, internal journal EXT3-fs: mounted filesystem with ordered data mode. SELinux: initialized (dev hda2, type ext3), uses xattr SELinux: initialized (dev tmpfs, type tmpfs), uses transition SIDs kjournald starting. Commit interval 5 seconds EXT3 FS on hda6, internal journal EXT3-fs: mounted filesystem with ordered data mode. SELinux: initialized (dev hda6, type ext3), uses xattr kjournald starting. Commit interval 5 seconds EXT3 FS on hda7, internal journal EXT3-fs: mounted filesystem with ordered data mode. SELinux: initialized (dev hda7, type ext3), uses xattr kjournald starting. Commit interval 5 seconds EXT3 FS on hda3, internal journal EXT3-fs: mounted filesystem with ordered data mode. SELinux: initialized (dev hda3, type ext3), uses xattr kjournald starting. Commit interval 5 seconds EXT3 FS on hda5, internal journal EXT3-fs: mounted filesystem with ordered data mode. SELinux: initialized (dev hda5, type ext3), uses xattr kjournald starting. Commit interval 5 seconds EXT3 FS on hda8, internal journal EXT3-fs: mounted filesystem with ordered data mode. SELinux: initialized (dev hda8, type ext3), uses xattr audit(1145323170.461:339): avc: denied { sys_resource } for pid=1200 comm="mount" capability=24 scontext=system_u:system_r:mount_t:s0 tcontext=system_u:system_r:mount_t:s0 tclass=capability SELinux: initialized (dev ramfs, type ramfs), uses genfs_contexts NET: Registered protocol family 10 lo: Disabled Privacy Extensions IPv6 over IPv4 tunneling driver audit(1145323179.853:340): avc: denied { sys_resource } for pid=1292 comm="swapon" capability=24 scontext=system_u:system_r:fsadm_t:s0 tcontext=system_u:system_r:fsadm_t:s0 tclass=capability SELinux: initialized (dev binfmt_misc, type binfmt_misc), uses genfs_contexts ADDRCONF(NETDEV_UP): eth0: link is not ready e100: eth0: e100_watchdog: link up, 100Mbps, full-duplex ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready SELinux: initialized (dev rpc_pipefs, type rpc_pipefs), uses genfs_contexts Bluetooth: Core ver 2.8 NET: Registered protocol family 31 Bluetooth: HCI device and connection manager initialized Bluetooth: HCI socket layer initialized Bluetooth: L2CAP ver 2.8 Bluetooth: L2CAP socket layer initialized Bluetooth: HIDP (Human Interface Emulation) ver 1.1 SELinux: initialized (dev autofs, type autofs), uses genfs_contexts eth0: no IPv6 routers present [drm] Initialized drm 1.0.1 20051102 ACPI: PCI Interrupt 0000:00:02.0[A] -> GSI 16 (level, low) -> IRQ 19 [drm] Initialized i915 1.4.0 20060119 on minor 0 (If need more info such as hdparm etc. pls ask.) > Or it could be lost interrupts - could be broken > platform interrupt setup, > or broken hardware. Lost interrupts, surely: the clock was many hours behind actual time (actually a few days after the last long weekend). Broken hardware, highly unlikely. (I'll try to reproduce it on another identical machine.) With my minimal .config, all is well. With FC's .config (perhaps with a few subsystems - production & experimental, like Bluetooth, i2c etc.), however, the problem is reproduceable. Naturally, FC's .config loads many kernel drivers (538 symbols on my minimal .config vs 1800+ on FC5's). Because there are way too many symbols, I thought someone would be quick to point out which one of those would be to blame :-) by looking thro those processes. And oh, eventually machine recovers (minutes sometimes, hours other times), & it runs as if nothing happend, as long as you're not quick to power button. Sure, you can't login, or you can't execute commands, if you managed to login etc. But patience does bring the machine back every time :-). (On request I'd gladly provide my minimal .config & FC5's.) I think I should remove those symbols one at a time to narrow it down. That would take many days though. But it'll be worth a try, if there's no other way. Thanks Send instant messages to your online friends http://au.messenger.yahoo.com ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-18 5:07 ` Srihari Vijayaraghavan @ 2006-04-18 6:30 ` Mike Galbraith 2006-04-18 9:35 ` Srihari Vijayaraghavan 0 siblings, 1 reply; 12+ messages in thread From: Mike Galbraith @ 2006-04-18 6:30 UTC (permalink / raw) To: Srihari Vijayaraghavan; +Cc: Andrew Morton, linux-kernel On Tue, 2006-04-18 at 15:07 +1000, Srihari Vijayaraghavan wrote: > io scheduler noop registered > io scheduler anticipatory registered > io scheduler deadline registered > io scheduler cfq registered (default) ... > And oh, eventually machine recovers (minutes > sometimes, hours other times), & it runs as if nothing > happend, as long as you're not quick to power button. > Sure, you can't login, or you can't execute commands, > if you managed to login etc. But patience does bring > the machine back every time :-). Hmm. Recovers [odd] but takes long time [odder]. I'd try to eliminate io scheduler at this point. -Mike ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-18 6:30 ` Mike Galbraith @ 2006-04-18 9:35 ` Srihari Vijayaraghavan 2006-04-18 12:47 ` Mike Galbraith 0 siblings, 1 reply; 12+ messages in thread From: Srihari Vijayaraghavan @ 2006-04-18 9:35 UTC (permalink / raw) To: Mike Galbraith; +Cc: Andrew Morton, linux-kernel --- Mike Galbraith <efault@gmx.de> wrote: > On Tue, 2006-04-18 at 15:07 +1000, Srihari > Vijayaraghavan wrote: > > io scheduler cfq registered (default) > ... > Hmm. Recovers [odd] but takes long time [odder]. > I'd try to eliminate > io scheduler at this point. Interesting. Considering the minimal .config, where I haven't experienced this problem over a week uptime, also having CFQ as the default elevator, do you still believe CFQ is involved? (I guess if CFQ could be influenced by other kernel configurations, then perhaps another elevator might help. It's worth trying.) I'm on 2.6.16.7 full config now. I'll try a few things, if I were to observe problems: 1. Use deadline elevator 2. Start eliminating a few kernel configurations to narrow it down Thanks Send instant messages to your online friends http://au.messenger.yahoo.com ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-18 9:35 ` Srihari Vijayaraghavan @ 2006-04-18 12:47 ` Mike Galbraith 2006-04-20 5:54 ` Srihari Vijayaraghavan 0 siblings, 1 reply; 12+ messages in thread From: Mike Galbraith @ 2006-04-18 12:47 UTC (permalink / raw) To: Srihari Vijayaraghavan; +Cc: Andrew Morton, linux-kernel On Tue, 2006-04-18 at 19:35 +1000, Srihari Vijayaraghavan wrote: > --- Mike Galbraith <efault@gmx.de> wrote: > > On Tue, 2006-04-18 at 15:07 +1000, Srihari > > Vijayaraghavan wrote: > > > io scheduler cfq registered (default) > > ... > > Hmm. Recovers [odd] but takes long time [odder]. > > I'd try to eliminate > > io scheduler at this point. > > Interesting. Considering the minimal .config, where I > haven't experienced this problem over a week uptime, > also having CFQ as the default elevator, do you still > believe CFQ is involved? (I guess if CFQ could be > influenced by other kernel configurations, then > perhaps another elevator might help. It's worth > trying.) I don't know that CFQ is involved. With it recovering though, the only thing I could think of was a request stucking in the io scheduler's gizzard for some reason. It's just a suggestion, and one you can try without even rebooting. -Mike ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-18 12:47 ` Mike Galbraith @ 2006-04-20 5:54 ` Srihari Vijayaraghavan 2006-04-20 6:45 ` Mike Galbraith 0 siblings, 1 reply; 12+ messages in thread From: Srihari Vijayaraghavan @ 2006-04-20 5:54 UTC (permalink / raw) To: Mike Galbraith; +Cc: Andrew Morton, linux-kernel --- Mike Galbraith <efault@gmx.de> wrote: > On Tue, 2006-04-18 at 19:35 +1000, Srihari > Vijayaraghavan wrote: > > --- Mike Galbraith <efault@gmx.de> wrote: > > > On Tue, 2006-04-18 at 15:07 +1000, Srihari > > > Vijayaraghavan wrote: > > > > io scheduler cfq registered (default) > > > ... > > > Hmm. Recovers [odd] but takes long time > [odder]. > > > I'd try to eliminate > > > io scheduler at this point. > > > > Interesting. Considering the minimal .config, > where I > > haven't experienced this problem over a week > uptime, > > also having CFQ as the default elevator, do you > still > > believe CFQ is involved? (I guess if CFQ could be > > influenced by other kernel configurations, then > > perhaps another elevator might help. It's worth > > trying.) > > I don't know that CFQ is involved. With it > recovering though, the only > thing I could think of was a request stucking in the > io scheduler's > gizzard for some reason. > > It's just a suggestion, and one you can try without > even rebooting. Thanks for that. When it happened today on 2.6.16.7, I've started using deadline now. A couple of interesting things I've noticed after its recovery: 1. Executing "time sleep 1", takes more than 1 second. It reports real as 3 to 5 seconds, while my stop watch measures it as closer to 50 seconds. 2. Pressing & holding a key at the console doesn't produce repeating characters. Thought they might shed some light on the problem. (I ought to look at all the RTC & Time related settings between my minimal .config & FC5's, for I believe they're connected.) I haven't rebooted it yet. I'm sure it'll be back to normal after the reboot. Thanks Send instant messages to your online friends http://au.messenger.yahoo.com ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-20 5:54 ` Srihari Vijayaraghavan @ 2006-04-20 6:45 ` Mike Galbraith 2006-04-20 6:51 ` Mike Galbraith 0 siblings, 1 reply; 12+ messages in thread From: Mike Galbraith @ 2006-04-20 6:45 UTC (permalink / raw) To: Srihari Vijayaraghavan; +Cc: Andrew Morton, linux-kernel On Thu, 2006-04-20 at 15:54 +1000, Srihari Vijayaraghavan wrote: > A couple of interesting things I've noticed after its > recovery: > 1. Executing "time sleep 1", takes more than 1 second. > It reports real as 3 to 5 seconds, while my stop watch > measures it as closer to 50 seconds. Ah. Something bad happened to time keeping, and that likely screwed up (time dependent) io, not the other way around. > 2. Pressing & holding a key at the console doesn't > produce repeating characters. > > Thought they might shed some light on the problem. (I > ought to look at all the RTC & Time related settings > between my minimal .config & FC5's, for I believe > they're connected.) Good idea. What time source are you using? I'd try plain old pit. -Mike (As an aside, I suspect you're going to find that you've got dodgy hardware.) ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-20 6:45 ` Mike Galbraith @ 2006-04-20 6:51 ` Mike Galbraith 2006-05-03 7:04 ` Srihari Vijayaraghavan 0 siblings, 1 reply; 12+ messages in thread From: Mike Galbraith @ 2006-04-20 6:51 UTC (permalink / raw) To: Srihari Vijayaraghavan; +Cc: Andrew Morton, linux-kernel On Thu, 2006-04-20 at 08:45 +0200, Mike Galbraith wrote: > Good idea. What time source are you using? I'd try plain old pit. (Looking back, I see you're using pm timer.) ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-04-20 6:51 ` Mike Galbraith @ 2006-05-03 7:04 ` Srihari Vijayaraghavan 2006-05-03 7:18 ` Mike Galbraith 0 siblings, 1 reply; 12+ messages in thread From: Srihari Vijayaraghavan @ 2006-05-03 7:04 UTC (permalink / raw) To: Mike Galbraith; +Cc: Andrew Morton, linux-kernel --- Mike Galbraith <efault@gmx.de> wrote: > On Thu, 2006-04-20 at 08:45 +0200, Mike Galbraith > wrote: > > > Good idea. What time source are you using? I'd > try plain old pit. > > (Looking back, I see you're using pm timer.) Actually, I narrowed it down to "CONFIG_X86_UP_IOAPIC". When it isn't compiled in, all is well. (Yes it wasn't there in my minimal .config either. And yes, my BIOS is the latest available for this IBM P-IV NetVista desktop - BIOS as recent as Aug 2005, I'm sure.) Now I'll have to try myriad ioapic kernel boot parameters :-( to see if there is a setting in which it'll work nice. Any clues? Thanks ____________________________________________________ On Yahoo!7 Dating: It's free to join and check out our great singles! http://www.yahoo7.com.au/personals ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-05-03 7:04 ` Srihari Vijayaraghavan @ 2006-05-03 7:18 ` Mike Galbraith 2006-05-08 0:57 ` Srihari Vijayaraghavan 0 siblings, 1 reply; 12+ messages in thread From: Mike Galbraith @ 2006-05-03 7:18 UTC (permalink / raw) To: Srihari Vijayaraghavan; +Cc: Andrew Morton, linux-kernel On Wed, 2006-05-03 at 17:04 +1000, Srihari Vijayaraghavan wrote: > Any clues? Nope. ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: 2.6.16.1 & D state processes 2006-05-03 7:18 ` Mike Galbraith @ 2006-05-08 0:57 ` Srihari Vijayaraghavan 0 siblings, 0 replies; 12+ messages in thread From: Srihari Vijayaraghavan @ 2006-05-08 0:57 UTC (permalink / raw) To: Mike Galbraith, linux-kernel; +Cc: Andrew Morton --- Mike Galbraith <efault@gmx.de> wrote: > On Wed, 2006-05-03 at 17:04 +1000, Srihari > Vijayaraghavan wrote: > > > Any clues? > > Nope. OK. Thx. (Mike & Andrew, pls let me know if you want to be removed from the cc list.) I heard that a few colleagues are experiencing the same problem as well on their machines (all IBM NetVista P-IV desktops) on FC5, which goes to prove that I'm not alone. (I've suggested them to try "noapic" kernel boot parameter with FC5 kernel.) To my untrained eyes, this is the key difference between 2.6.IOAPIC & 2.6.no-IOAPIC in /proc/interrupts in ref to timer (which I suspect the source of the problem; correct me if I'm wrong): 2.6.16.14.IOAPIC: 0: NNNNN IO-APIC-edge timer 2.6.16.14.no-IOAPIC: 0: NNNNN XT-PIC timer Here's dmesg with apic=debug: Linux version 2.6.16.14 (xxxxxxx@yyyyyyy) (gcc version 4.1.0 20060304 (Red Hat 4.1.0-3)) #1 Fri May 5 14:41:31 EST 2006 BIOS-provided physical RAM map: BIOS-e820: 0000000000000000 - 000000000009f800 (usable) BIOS-e820: 000000000009f800 - 00000000000a0000 (reserved) BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 000000001f6f0000 (usable) BIOS-e820: 000000001f6f0000 - 000000001f6fb000 (ACPI data) BIOS-e820: 000000001f6fb000 - 000000001f700000 (ACPI NVS) BIOS-e820: 000000001f700000 - 000000001f780000 (usable) BIOS-e820: 000000001f780000 - 0000000020000000 (reserved) BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved) BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) BIOS-e820: 00000000ff800000 - 00000000ffc00000 (reserved) BIOS-e820: 00000000fffffc00 - 0000000100000000 (reserved) 503MB LOWMEM available. found SMP MP-table at 000f61d0 On node 0 totalpages: 128896 DMA zone: 4096 pages, LIFO batch:0 DMA32 zone: 0 pages, LIFO batch:0 Normal zone: 124800 pages, LIFO batch:31 HighMem zone: 0 pages, LIFO batch:0 DMI present. ACPI: RSDP (v000 PTLTD ) @ 0x000f6260 ACPI: RSDT (v001 PTLTD RSDT 0x060400d0 LTP 0x00000000) @ 0x1f6f7325 ACPI: FADT (v001 IBM NETVISTA 0x060400d0 PTL 0x00000001) @ 0x1f6faee2 ACPI: TCPA (v001 IBM NETVISTA 0x060400d0 PTL 0x00000001) @ 0x1f6faf56 ACPI: MADT (v001 PTLTD APIC 0x060400d0 LTP 0x00000000) @ 0x1f6faf88 ACPI: BOOT (v001 PTLTD $SBFTBL$ 0x060400d0 LTP 0x00000001) @ 0x1f6fafd8 ACPI: DSDT (v001 IBM Yelotail 0x060400d0 MSFT 0x0100000e) @ 0x00000000 ACPI: PM-Timer IO Port: 0x1008 ACPI: Local APIC address 0xfee00000 ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled) Processor #0 15:2 APIC version 20 ACPI: LAPIC_NMI (acpi_id[0x00] high edge lint[0x1]) ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0]) IOAPIC[0]: apic_id 1, version 32, address 0xfec00000, GSI 0-23 ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) ACPI: IRQ9 used by override. Enabling APIC mode: Flat. Using 1 I/O APICs Using ACPI (MADT) for SMP configuration information Allocating PCI resources starting at 30000000 (gap: 20000000:dec00000) Built 1 zonelists Kernel command line: ro root=LABEL=/ rhgb console=ttyS0,115200 console=tty0 apic=debug mapped APIC to ffffd000 (fee00000) mapped IOAPIC to ffffc000 (fec00000) Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Initializing CPU#0 CPU 0 irqstacks, hard=c02ff000 soft=c02fe000 PID hash table entries: 2048 (order: 11, 32768 bytes) Detected 2392.752 MHz processor. Using pmtmr for high-res timesource Console: colour VGA+ 80x25 Dentry cache hash table entries: 65536 (order: 6, 262144 bytes) Inode-cache hash table entries: 32768 (order: 5, 131072 bytes) Memory: 507228k/515584k available (1344k kernel code, 7700k reserved, 532k data, 136k init, 0k highmem) Checking if this processor honours the WP bit even in supervisor mode... Ok. Calibrating delay using timer specific routine.. 4792.69 BogoMIPS (lpj=9585392) Security Framework v1.0.0 initialized SELinux: Initializing. SELinux: Starting in permissive mode selinux_register_security: Registering secondary module capability Capability LSM initialized as secondary Mount-cache hash table entries: 512 CPU: After generic identify, caps: bfebfbff 00000000 00000000 00000000 00004400 00000000 00000000 CPU: After vendor identify, caps: bfebfbff 00000000 00000000 00000000 00004400 00000000 00000000 CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: After all inits, caps: bfebfbff 00000000 00000000 00000080 00004400 00000000 00000000 Intel machine check architecture supported. Intel machine check reporting enabled on CPU#0. CPU0: Intel P4/Xeon Extended MCE MSRs (12) available CPU: Intel(R) Pentium(R) 4 CPU 2.40GHz stepping 09 Checking 'hlt' instruction... OK. Getting VERSION: 50014 Getting VERSION: 50014 Getting ID: 0 Getting LVT0: 700 Getting LVT1: 400 enabled ExtINT on CPU#0 ENABLING IO-APIC IRQs init IO_APIC IRQs IO-APIC (apicid-pin) 1-16, 1-17, 1-18, 1-19, 1-20, 1-21, 1-22, 1-23 not connected. ..TIMER: vector=0x31 apic1=0 pin1=0 apic2=-1 pin2=-1 Using local APIC timer interrupts. calibrating APIC timer ... ..... CPU clock speed is 2392.2093 MHz. ..... host bus clock speed is 132.3671 MHz. checking if image is initramfs... it is Freeing initrd memory: 896k freed NET: Registered protocol family 16 ACPI: bus type pci registered PCI: PCI BIOS revision 2.10 entry at 0xfd98d, last bus=2 PCI: Using configuration type 1 ACPI: Subsystem revision 20060127 ACPI: Interpreter enabled ACPI: Using IOAPIC for interrupt routing ACPI: PCI Root Bridge [PCI0] (0000:00) PCI: Probing PCI hardware (bus 00) Boot video device is 0000:00:02.0 PCI quirk: region 1000-107f claimed by ICH4 ACPI/GPIO/TCO PCI quirk: region 1180-11bf claimed by ICH4 GPIO PCI: Ignoring BAR0-3 of IDE controller 0000:00:1f.1 PCI: Transparent bridge - 0000:00:1e.0 ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.SLOT._PRT] ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 7 9 10 *11 12 14 15) ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 5 7 *9 10 11 12 14 15) ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 *5 7 9 10 11 12 14 15) ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 7 9 *10 11 12 14 15) ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 7 *9 10 11 12 14 15) ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 7 9 10 11 12 14 15) *0, disabled. ACPI: PCI Interrupt Link [LNKG] (IRQs 3 4 5 7 9 10 11 12 14 15) *0, disabled. ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 7 *9 10 11 12 14 15) PCI: Using ACPI for IRQ routing PCI: If a device doesn't work, try "pci=routeirq". If it helps, post a report number of MP IRQ sources: 16. number of IO-APIC #1 registers: 24. testing the IO APIC....................... IO APIC #1...... .... register #00: 01000000 ....... : physical APIC id: 01 ....... : Delivery Type: 0 ....... : LTS : 0 .... register #01: 00178020 ....... : max redirection entries: 0017 ....... : PRQ implemented: 1 ....... : IO APIC version: 0020 .... register #02: 00000000 ....... : arbitration: 00 .... register #03: 00000001 ....... : Boot DT : 1 .... IRQ redirection table: NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect: 00 001 01 0 0 0 0 0 1 1 31 01 001 01 0 0 0 0 0 1 1 39 02 001 01 1 0 0 0 0 1 1 41 03 001 01 0 0 0 0 0 1 1 49 04 001 01 0 0 0 0 0 1 1 51 05 001 01 0 0 0 0 0 1 1 59 06 001 01 0 0 0 0 0 1 1 61 07 001 01 0 0 0 0 0 1 1 69 08 001 01 0 0 0 0 0 1 1 71 09 001 01 0 1 0 0 0 1 1 79 0a 001 01 0 0 0 0 0 1 1 81 0b 001 01 0 0 0 0 0 1 1 89 0c 001 01 0 0 0 0 0 1 1 91 0d 001 01 0 0 0 0 0 1 1 99 0e 001 01 0 0 0 0 0 1 1 A1 0f 001 01 0 0 0 0 0 1 1 A9 10 000 00 1 0 0 0 0 0 0 00 11 000 00 1 0 0 0 0 0 0 00 12 000 00 1 0 0 0 0 0 0 00 13 000 00 1 0 0 0 0 0 0 00 14 000 00 1 0 0 0 0 0 0 00 15 000 00 1 0 0 0 0 0 0 00 16 000 00 1 0 0 0 0 0 0 00 17 000 00 1 0 0 0 0 0 0 00 IRQ to pin mappings: IRQ0 -> 0:0 IRQ1 -> 0:1 IRQ2 -> 0:2 IRQ3 -> 0:3 IRQ4 -> 0:4 IRQ5 -> 0:5 IRQ6 -> 0:6 IRQ7 -> 0:7 IRQ8 -> 0:8 IRQ9 -> 0:9 IRQ10 -> 0:10 IRQ11 -> 0:11 IRQ12 -> 0:12 IRQ13 -> 0:13 IRQ14 -> 0:14 IRQ15 -> 0:15 .................................... done. PCI: Ignore bogus resource 6 [0:0] of 0000:00:02.0 PCI: Bridge: 0000:00:1e.0 IO window: 2000-2fff MEM window: c0100000-c01fffff PREFETCH window: disabled. PCI: Setting latency timer of device 0000:00:1e.0 to 64 Simple Boot Flag at 0x6c set to 0x1 Machine check exception polling timer started. audit: initializing netlink socket (disabled) audit(1147084640.752:1): initialized VFS: Disk quotas dquot_6.5.1 Dquot-cache hash table entries: 1024 (order 0, 4096 bytes) SELinux: Registering netfilter hooks Initializing Cryptographic API io scheduler noop registered io scheduler cfq registered (default) ... ##### End ##### Thanks Send instant messages to your online friends http://au.messenger.yahoo.com ^ permalink raw reply [flat|nested] 12+ messages in thread
end of thread, other threads:[~2006-05-08 0:57 UTC | newest] Thread overview: 12+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2006-04-17 23:49 2.6.16.1 & D state processes Srihari Vijayaraghavan 2006-04-18 4:03 ` Andrew Morton 2006-04-18 5:07 ` Srihari Vijayaraghavan 2006-04-18 6:30 ` Mike Galbraith 2006-04-18 9:35 ` Srihari Vijayaraghavan 2006-04-18 12:47 ` Mike Galbraith 2006-04-20 5:54 ` Srihari Vijayaraghavan 2006-04-20 6:45 ` Mike Galbraith 2006-04-20 6:51 ` Mike Galbraith 2006-05-03 7:04 ` Srihari Vijayaraghavan 2006-05-03 7:18 ` Mike Galbraith 2006-05-08 0:57 ` Srihari Vijayaraghavan
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox