* Re: [Bug #14141] order 2 page allocation failures in iwlagn
       [not found]   ` <200910021111.55749.elendil@planet.nl>
@ 2009-10-05  5:13     ` Frans Pop
  2009-10-05  6:50       ` Frans Pop
  0 siblings, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-05  5:13 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, linux-mm
On Friday 02 October 2009, Frans Pop wrote:
> On Thursday 01 October 2009, Rafael J. Wysocki wrote:
> > Bug-Entry     : http://bugzilla.kernel.org/show_bug.cgi?id=14141
> > Subject       : order 2 page allocation failures in iwlagn
> > Submitter     : Frans Pop <elendil@planet.nl>
> > Date          : 2009-09-06 7:40 (26 days old)
> > References    : http://marc.info/?l=linux-kernel&m=125222287419691&w=4
> > Handled-By    : Pekka Enberg <penberg@cs.helsinki.fi>
>
> I'm not sure about this.
>
> The error messages from failed allocations should now be a lot less as a
> result of this commit:
> commit f82a924cc88a5541df1d4b9d38a0968cd077a051
> Author: Reinette Chatre <reinette.chatre@intel.com>
> Date:   Thu Sep 17 10:43:56 2009 -0700
>     iwlwifi: reduce noise when skb allocation fails
>
> That commit is in mainline, and I'm not sure if it is important enough
> for a stable update (AFAICT it's not listed for 2.6.31.2).
>
> That commit is mostly cosmetic, but possibly the real regression is not
> in iwlagn but in the way memory is freed/defragmented. That aspect was
> also reported by Bartlomiej (#14016) and was extensively discussed
> (without a clear conclusion) here: http://lkml.org/lkml/2009/8/26/140.
I may be getting somewhere with this. I just got the allocation failures
included below on .32-rc3. Note that these are not the "fixable" failures
that got suppressed with the commit referenced above, but the "this could
affect networking" failures that are still reported.
What I was doing when I got them is also interesting:
- a kernel build
- a gitk for the kernel tree (with full history this uses ~40% of memory)
- by mistake I then started a _second_ gitk
The second gitk (which shows as 'wish8.5' in top) caused a massive swap
out which brought the system to a standstill for a while (with huge
latencies as well, including a completely stuck mouse cursor, which
happens rarely).
The system has 2GB RAM + 2GB swap, so IIUC there is no danger of getting
into an OOM as the first gitk can be swapped out completely.
I'll dig into this a bit more as it looks like this should be reproducible,
probably even without the kernel build. Next step is to see how .30 behaves
in the same situation.
Even if it is reproducible with .30, I wonder if the kernel shouldn't be
more robust in this situation. Currently it seems to allow one single
process to claim so much memory before swapping out that "normal" operation
of other processes is affected. I can understand that such a situation may
be hard to avoid on a very busy system where multiple processes start
claiming (a lot of) memory at roughly the same time, but I'd say it should
be avoidable if a single process is the culprit.
BTW, the system recovered completely, although that took some time (the
first gitk remained visible in top long after I closed its window; I think
because the system was busy swapping it back in before terminating it).
Cheers,
FJP
kcryptd: page allocation failure. order:2, mode:0x4020
Pid: 1483, comm: kcryptd Not tainted 2.6.32-rc3 #22
Call Trace:
 <IRQ>  [<ffffffff8107c3d5>] __alloc_pages_nodemask+0x5a2/0x5ec
 [<ffffffff81264892>] ? _spin_unlock+0x9/0xb
 [<ffffffff811e73cd>] ? __alloc_skb+0x3c/0x15b
 [<ffffffffa03202cb>] ? iwl_rx_allocate+0x8f/0x305 [iwlcore]
 [<ffffffff8107c431>] __get_free_pages+0x12/0x41
 [<ffffffff8109cb1a>] __kmalloc_track_caller+0x3b/0xed
 [<ffffffff811e73f7>] __alloc_skb+0x66/0x15b
 [<ffffffffa03202cb>] iwl_rx_allocate+0x8f/0x305 [iwlcore]
 [<ffffffffa0320557>] iwl_rx_replenish_now+0x16/0x23 [iwlcore]
 [<ffffffffa035c0c8>] iwl_rx_handle+0x3a8/0x3c1 [iwlagn]
 [<ffffffff81051add>] ? sched_clock_local+0x1c/0x80
 [<ffffffffa035c60d>] iwl_irq_tasklet_legacy+0x52c/0x7a4 [iwlagn]
 [<ffffffffa0317aaf>] ? __iwl_read32+0xa5/0xb4 [iwlcore]
 [<ffffffff8103efb8>] tasklet_action+0x71/0xbc
 [<ffffffff8103f837>] __do_softirq+0x96/0x11b
 [<ffffffff8100cabc>] call_softirq+0x1c/0x28
 [<ffffffff8100e5ef>] do_softirq+0x33/0x6b
 [<ffffffff8103f5c5>] irq_exit+0x36/0x75
 [<ffffffff8100dcf1>] do_IRQ+0xa3/0xba
 [<ffffffff8100c353>] ret_from_intr+0x0/0xa
 <EOI>  [<ffffffff811199dd>] ? scatterwalk_start+0x11/0x19
 [<ffffffff8111bbca>] ? blkcipher_walk_first+0x173/0x196
 [<ffffffff8111b67b>] ? blkcipher_walk_done+0xe6/0x1b8
 [<ffffffff8111bc35>] ? blkcipher_walk_virt+0x1a/0x1d
 [<ffffffffa02001cf>] ? crypto_cbc_encrypt+0x43/0x18e [cbc]
 [<ffffffff81127efd>] ? blk_recount_segments+0x1b/0x2c
 [<ffffffffa021371e>] ? aes_encrypt+0x0/0xf [aes_x86_64]
 [<ffffffff8111af64>] ? async_encrypt+0x38/0x3a
 [<ffffffffa01f7b54>] ? crypt_convert+0x1f9/0x28b [dm_crypt]
 [<ffffffffa01f8009>] ? kcryptd_crypt+0x423/0x449 [dm_crypt]
 [<ffffffffa01f7be6>] ? kcryptd_crypt+0x0/0x449 [dm_crypt]
 [<ffffffff81049bfd>] ? worker_thread+0x146/0x1d8
 [<ffffffff8104d706>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81049ab7>] ? worker_thread+0x0/0x1d8
 [<ffffffff8104d3f4>] ? kthread+0x7d/0x85
 [<ffffffff8100c9ba>] ? child_rip+0xa/0x20
 [<ffffffff8104d377>] ? kthread+0x0/0x85
 [<ffffffff8100c9b0>] ? child_rip+0x0/0x20
Mem-Info:
DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
CPU    1: hi:    0, btch:   1 usd:   0
DMA32 per-cpu:
CPU    0: hi:  186, btch:  31 usd: 171
CPU    1: hi:  186, btch:  31 usd: 177
active_anon:298532 inactive_anon:100163 isolated_anon:52
 active_file:3993 inactive_file:4001 isolated_file:12
 unevictable:399 dirty:0 writeback:76102 unstable:0 buffer:125
 free:14107 slab_reclaimable:4510 slab_unreclaimable:20421
 mapped:7949 shmem:0 pagetables:4437 bounce:0
DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3340kB inactive_anon:3608kB active_file:384kB
   inactive_file:472kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB
   dirty:0kB writeback:80kB mapped:256kB shmem:0kB slab_reclaimable:12kB slab_unreclaimable:104kB kernel_stack:0kB
   pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 1976 1976 1976
DMA32 free:48500kB min:5664kB low:7080kB high:8496kB active_anon:1190788kB inactive_anon:397044kB active_file:15588kB
   inactive_file:15532kB unevictable:1596kB isolated(anon):208kB isolated(file):48kB present:2023748kB mlocked:1596kB
   dirty:0kB writeback:304328kB mapped:31540kB shmem:0kB slab_reclaimable:18028kB slab_unreclaimable:81496kB kernel_stack:1672kB
   pagetables:17732kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 0 0
DMA: 19*4kB 13*8kB 3*16kB 7*32kB 11*64kB 11*128kB 5*256kB 0*512kB 0*1024kB 0*2048kB 1*4096kB = 7940kB
DMA32: 9299*4kB 1341*8kB 4*16kB 0*32kB 0*64kB 0*128kB 0*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 48500kB
98572 total pagecache pages
90213 pages in swap cache
Swap cache stats: add 175874, delete 85661, find 7850/8731
Free swap  = 1425944kB
Total swap = 2097144kB
518064 pages RAM
10350 pages reserved
82388 pages shared
437481 pages non-shared
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
swapper: page allocation failure. order:2, mode:0x4020
Pid: 0, comm: swapper Not tainted 2.6.32-rc3 #22
Call Trace:
 <IRQ>  [<ffffffff8107c3d5>] __alloc_pages_nodemask+0x5a2/0x5ec
 [<ffffffff81264892>] ? _spin_unlock+0x9/0xb
 [<ffffffff811e73cd>] ? __alloc_skb+0x3c/0x15b
 [<ffffffffa03202cb>] ? iwl_rx_allocate+0x8f/0x305 [iwlcore]
 [<ffffffff8107c431>] __get_free_pages+0x12/0x41
 [<ffffffff8109cb1a>] __kmalloc_track_caller+0x3b/0xed
 [<ffffffff811e73f7>] __alloc_skb+0x66/0x15b
 [<ffffffffa03202cb>] iwl_rx_allocate+0x8f/0x305 [iwlcore]
 [<ffffffffa0320557>] iwl_rx_replenish_now+0x16/0x23 [iwlcore]
 [<ffffffffa035c0c8>] iwl_rx_handle+0x3a8/0x3c1 [iwlagn]
 [<ffffffffa035c60d>] iwl_irq_tasklet_legacy+0x52c/0x7a4 [iwlagn]
 [<ffffffffa0317aaf>] ? __iwl_read32+0xa5/0xb4 [iwlcore]
 [<ffffffff8103efb8>] tasklet_action+0x71/0xbc
 [<ffffffff8103f837>] __do_softirq+0x96/0x11b
 [<ffffffff8100cabc>] call_softirq+0x1c/0x28
 [<ffffffff8100e5ef>] do_softirq+0x33/0x6b
 [<ffffffff8103f5c5>] irq_exit+0x36/0x75
 [<ffffffff8100dcf1>] do_IRQ+0xa3/0xba
 [<ffffffff8100c353>] ret_from_intr+0x0/0xa
 <EOI>  [<ffffffffa0278ec7>] ? acpi_idle_enter_simple+0xf9/0x127 [processor]
 [<ffffffffa0278ebd>] ? acpi_idle_enter_simple+0xef/0x127 [processor]
 [<ffffffff811da545>] ? cpuidle_idle_call+0x8c/0xc7
 [<ffffffff8100ae2e>] ? cpu_idle+0x55/0x8d
 [<ffffffff8125432d>] ? rest_init+0x61/0x63
 [<ffffffff81436c3e>] ? start_kernel+0x348/0x353
 [<ffffffff8143629a>] ? x86_64_start_reservations+0xaa/0xae
 [<ffffffff8143637f>] ? x86_64_start_kernel+0xe1/0xe8
Mem-Info:
DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
CPU    1: hi:    0, btch:   1 usd:   0
DMA32 per-cpu:
CPU    0: hi:  186, btch:  31 usd: 171
CPU    1: hi:  186, btch:  31 usd: 155
active_anon:297901 inactive_anon:99948 isolated_anon:52
 active_file:3920 inactive_file:3948 isolated_file:12
 unevictable:399 dirty:0 writeback:34634 unstable:0 buffer:125
 free:23390 slab_reclaimable:4510 slab_unreclaimable:11714
 mapped:7819 shmem:0 pagetables:4437 bounce:0
DMA free:7908kB min:40kB low:48kB high:60kB active_anon:3340kB inactive_anon:3608kB active_file:384kB
   inactive_file:472kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB
   dirty:0kB writeback:36kB mapped:256kB shmem:0kB slab_reclaimable:12kB slab_unreclaimable:12kB kernel_stack:0kB
   pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 1976 1976 1976
DMA32 free:85652kB min:5664kB low:7080kB high:8496kB active_anon:1188264kB inactive_anon:396184kB active_file:15296kB
   inactive_file:15320kB unevictable:1596kB isolated(anon):208kB isolated(file):48kB present:2023748kB mlocked:1596kB
   dirty:0kB writeback:138500kB mapped:31020kB shmem:0kB slab_reclaimable:18028kB slab_unreclaimable:46844kB
   kernel_stack:1672kB pagetables:17732kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 0 0
DMA: 17*4kB 12*8kB 4*16kB 6*32kB 11*64kB 11*128kB 5*256kB 0*512kB 0*1024kB 0*2048kB 1*4096kB = 7908kB
DMA32: 12419*4kB 4439*8kB 1*16kB 0*32kB 1*64kB 1*128kB 1*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 85652kB
97616 total pagecache pages
89394 pages in swap cache
Swap cache stats: add 175906, delete 86512, find 7850/8733
Free swap  = 1425864kB
Total swap = 2097144kB
518064 pages RAM
10350 pages reserved
82282 pages shared
428383 pages non-shared
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-05  5:13     ` [Bug #14141] order 2 page allocation failures in iwlagn Frans Pop
@ 2009-10-05  6:50       ` Frans Pop
  2009-10-05  8:54         ` Frans Pop
                           ` (2 more replies)
  0 siblings, 3 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-05  6:50 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, linux-mm
On Monday 05 October 2009, Frans Pop wrote:
> I'll dig into this a bit more as it looks like this should be
> reproducible, probably even without the kernel build. Next step is to
> see how .30 behaves in the same situation.
This looks conclusive. I tested .30 and .32-rc3 from clean reboots and
only starting gitk. I only started music playing in the background
(amarok) from an NFS share to ensure network activity.
With .32-rc3 I got 4 SKB allocation errors while starting the *second* gitk
instance. And the system was completely frozen with music stopped until gitk
finished loading.
With .30 I was able to start *three* gitk's (which meant 2 of them got
(partially) swapped out) without any allocation errors. And with the system
remaining relatively responsive. There was a short break in the music while
I started the 2nd instance, but it just continued playing afterwards. There
was also some mild latency in the mouse cursor, but nothing like the full
desktop freeze I get with .32-rc3.
With .30 I looked at /proc/buddyinfo while the 3rd gitk was being started,
and that looked fairly healthy all the time:
Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
Node 0, zone    DMA32    579     67     25      8      5      1      1      0      1      1      0
Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
Node 0, zone    DMA32    276     54     13     15      8     10      3      1      1      1      0
Node 0, zone      DMA      4      9     22     20     21     11      0      0      0      0      1
Node 0, zone    DMA32    119     45     24     18     12      4      5      2      1      1      0
Node 0, zone      DMA      4      9     22     20     21     11      0      0      0      0      1
Node 0, zone    DMA32    527     13      9      5      5      3      2      1      1      1      0
Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
Node 0, zone    DMA32   1375     24      7      7      8      5      1      1      0      1      0
Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
Node 0, zone    DMA32    329     21      3      3     17      8      5      1      0      1      0
With .32 it was obviously impossible to get that info due to the total
freeze of the desktop. Not sure if the scheduler changes in .32 contribute
to this. Guess I could find out by doing the same test with .31.
One thing I should mention: my swap is an LVM volume that's in a VG that's
on a LUKS encrypted partition.
Does this give you enough info to go on, or should I try a bisection?
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-05  6:50       ` Frans Pop
@ 2009-10-05  8:54         ` Frans Pop
  2009-10-05  8:57         ` Mel Gorman
  2009-10-11 23:10         ` Frans Pop
  2 siblings, 0 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-05  8:54 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, linux-mm
On Monday 05 October 2009, Frans Pop wrote:
> With .32 it was obviously impossible to get that info due to the total
> freeze of the desktop. Not sure if the scheduler changes in .32
> contribute to this. Guess I could find out by doing the same test with
> .31.
I've tried with .31.1 too now and there does seem to be a scheduler
component too. With .31.1 I also get the SKB allocation errors, but the
desktop freeze seems to be less severe than with .32-rc3. I would suggest
looking into that _after_ the allocation issue has been traced/solved.
I did manage to really (partially) hang up the desktop with .31.1: music
did not come back and the task manager of the KDE desktop remained frozen,
but I could still use konsole [1].
I suspect this is because I also got an OOPS in between the SKB failures:
IP: [<ffffffffa0444ea2>] rpcauth_checkverf+0x4e/0x5a[sunrpc]
PGD 77b83067 PUD 0
Oops: 0000 [#1] SMP
last sysfs file: /sys/class/power_supply/C23D/charge_full
CPU 0
Modules linked in: i915 drm i2c_algo_bit i2c_core ppdev parport_pc lp parport cpufreq_conservative
   cpufreq_userspace cpufreq_stats cpufreq_powersave ipv6 nfsd exportfs nfs lockd nfs_acl auth_rpcgss
   sunrpc ext2 coretemp hp_wmi acpi_cpufreq loop snd_hda_codec_analog snd_hda_intel snd_hda_codec
   arc4 snd_pcm_oss snd_mixer_oss ecb snd_pcm snd_seq_dummy snd_seq_oss iwlagn iwlcore snd_seq_midi
   pcmcia mac80211 snd_rawmidi usblp snd_seq_midi_event snd_seq pcspkr cfg80211 yenta_socket
   rsrc_nonstatic pcmcia_core psmouse snd_timer snd_seq_device rfkill serio_raw snd soundcore
   snd_page_alloc hp_accel lis3lv02d video container output wmi intel_agp input_polldev battery ac
   processor button joydev evdev ext3 jbd mbcache sha256_generic aes_x86_64 aes_generic cbc usbhid hid
   dm_crypt dm_mirror dm_region_hash dm_log dm_snapshot dm_mod sg sr_mod sd_mod cdrom ide_pci_generic piix
   ide_core pata_acpi uhci_hcd ata_piix ohci1394 sdhci_pci sdhci mmc_core led_class ieee1394 ricoh_mmc
   ata_generic ehci_hcd libta e1000e scsi_mod thermal fan thermal_sys [last unloaded: scsi_wait_scan]
Pid: 3226, comm: rpciod/0 Not tainted 2.6.31.1 #20 HP Compaq 2510p Notebook PC
RIP: 0010:[<ffffffffa0444ea2>]  [<ffffffffa0444ea2>]rpcauth_checkverf+0x4e/0x5a [sunrpc]
RSP: 0018:ffff88007aafbda0  EFLAGS: 00010246
RAX: 0000000400001000 RBX: ffff88003a718e40 RCX: 0000000000000001
RDX: ffff880038b821bc RSI: ffff880038b821c8 RDI: ffff8800618358c8
RBP: ffff88007aafbdc0 R08: 0000000000000000 R09: 0000000000000000
R10: ffff880001514d80 R11: ffff8800536401f0 R12: ffff8800618358c8
R13: ffff880038b821c8 R14: ffff880037bb4bd0 R15: ffffffffa04bf52b
FS:  0000000000000000(0000) GS:ffff880001504000(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000400001038 CR3: 0000000067ee5000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process rpciod/0 (pid: 3226, threadinfo ffff88007aafa000, task ffff88007c431670)
Stack:
 ffff88007aafbde0 ffff880037bb4bd0 ffff8800618358c8 ffff880061835958
<0> ffff88007aafbe00 ffffffffa043e24a ffff88007c4319e0 ffff8800618358c8
<0> ffff880061835970 ffff880061835958 0000000000000000 0000000000000001
Call Trace:
 [<ffffffffa043e24a>] call_decode+0x374/0x68e [sunrpc]
 [<ffffffffa044430e>] __rpc_execute+0x86/0x244 [sunrpc]
 [<ffffffffa04444f8>] ? rpc_async_schedule+0x0/0x12 [sunrpc]
 [<ffffffffa0444508>] rpc_async_schedule+0x10/0x12 [sunrpc]
 [<ffffffff81048bd5>] worker_thread+0x132/0x1ca
 [<ffffffff8104c657>] ? autoremove_wake_function+0x0/0x38
 [<ffffffff81048aa3>] ? worker_thread+0x0/0x1ca
 [<ffffffff8104c335>] kthread+0x8f/0x97
 [<ffffffff8100ca7a>] child_rip+0xa/0x20
 [<ffffffff8104c2a6>] ? kthread+0x0/0x97
 [<ffffffff8100ca70>] ? child_rip+0x0/0x20
Code: 30 0f b7 b7 06 01 00 00 48 89 d9 48 c7 c7 30 42
 45 a0 48 8b 40 10 48 8b 50 10 31 c0 e8 73 f8 e0 e0 48 8b 43 38 4c 89 ee 4c 89 e7 <ff> 50 38 41 59 5b 41 5c 41 5d c9 c3 55 48 89 e5 41 55 49 89 f5
RIP  [<ffffffffa0444ea2>] rpcauth_checkverf+0x4e/0x5a [sunrpc]
 RSP <ffff88007aafbda0>
CR2: 0000000400001038
Not sure whether it's worth following up on that as a separate issue.
Cheers,
FJP
[1] KDE's task manager freezing for short periods is normal for me while
amarok is blocked by NFS. This normally only happens when I start amarok
for the first time, but it does explain how the NFS oops can have the
same effect.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-05  6:50       ` Frans Pop
  2009-10-05  8:54         ` Frans Pop
@ 2009-10-05  8:57         ` Mel Gorman
  2009-10-05 21:34           ` Frans Pop
  2009-10-11 23:10         ` Frans Pop
  2 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-05  8:57 UTC (permalink / raw)
  To: Frans Pop
  Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, linux-mm
On Mon, Oct 05, 2009 at 08:50:58AM +0200, Frans Pop wrote:
> On Monday 05 October 2009, Frans Pop wrote:
> > I'll dig into this a bit more as it looks like this should be
> > reproducible, probably even without the kernel build. Next step is to
> > see how .30 behaves in the same situation.
> 
> This looks conclusive. I tested .30 and .32-rc3 from clean reboots and
> only starting gitk. I only started music playing in the background
> (amarok) from an NFS share to ensure network activity.
> 
> With .32-rc3 I got 4 SKB allocation errors while starting the *second* gitk
> instance. And the system was completely frozen with music stopped until gitk
> finished loading.
> 
> With .30 I was able to start *three* gitk's (which meant 2 of them got
> (partially) swapped out) without any allocation errors. And with the system
> remaining relatively responsive. There was a short break in the music while
> I started the 2nd instance, but it just continued playing afterwards. There
> was also some mild latency in the mouse cursor, but nothing like the full
> desktop freeze I get with .32-rc3.
> 
> With .30 I looked at /proc/buddyinfo while the 3rd gitk was being started,
> and that looked fairly healthy all the time:
> Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
> Node 0, zone    DMA32    579     67     25      8      5      1      1      0      1      1      0
> Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
> Node 0, zone    DMA32    276     54     13     15      8     10      3      1      1      1      0
> Node 0, zone      DMA      4      9     22     20     21     11      0      0      0      0      1
> Node 0, zone    DMA32    119     45     24     18     12      4      5      2      1      1      0
> Node 0, zone      DMA      4      9     22     20     21     11      0      0      0      0      1
> Node 0, zone    DMA32    527     13      9      5      5      3      2      1      1      1      0
> Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
> Node 0, zone    DMA32   1375     24      7      7      8      5      1      1      0      1      0
> Node 0, zone      DMA      5      9     22     20     21     11      0      0      0      0      1
> Node 0, zone    DMA32    329     21      3      3     17      8      5      1      0      1      0
> 
> With .32 it was obviously impossible to get that info due to the total
> freeze of the desktop. Not sure if the scheduler changes in .32 contribute
> to this. Guess I could find out by doing the same test with .31.
> 
> One thing I should mention: my swap is an LVM volume that's in a VG that's
> on a LUKS encrypted partition.
> 
> Does this give you enough info to go on, or should I try a bisection?
> 
I'll be trying to reproduce it, but it's unlikely I'll manage to
reproduce it reliably as there may be a specific combination of hardware
necessary as well. What I'm going to try is writing a module that
allocates order-5 every second GFP_ATOMIC and see can I reproduce using
scenarios similar to yours but it'll take some time with no guarantee of
success. If you could bisect it, it would be fantastic.
Thanks
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-05  8:57         ` Mel Gorman
@ 2009-10-05 21:34           ` Frans Pop
  2009-10-06  0:04             ` David Rientjes
  0 siblings, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-05 21:34 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, linux-mm, David Rientjes
On Monday 05 October 2009, Mel Gorman wrote:
> On Mon, Oct 05, 2009 at 08:50:58AM +0200, Frans Pop wrote:
> > On Monday 05 October 2009, Frans Pop wrote:
> > > I'll dig into this a bit more as it looks like this should be
> > > reproducible, probably even without the kernel build. Next step is
> > > to see how .30 behaves in the same situation.
> >
> > This looks conclusive. I tested .30 and .32-rc3 from clean reboots and
> > only starting gitk. I only started music playing in the background
> > (amarok) from an NFS share to ensure network activity.
> >
> > With .32-rc3 I got 4 SKB allocation errors while starting the *second*
> > gitk instance. And the system was completely frozen with music stopped
> > until gitk finished loading.
> >
> > With .30 I was able to start *three* gitk's (which meant 2 of them got
> > (partially) swapped out) without any allocation errors. And with the
> > system remaining relatively responsive. There was a short break in the
> > music while I started the 2nd instance, but it just continued playing
> > afterwards. There was also some mild latency in the mouse cursor, but
> > nothing like the full desktop freeze I get with .32-rc3.
> >
> > One thing I should mention: my swap is an LVM volume that's in a VG
> > that's on a LUKS encrypted partition.
> >
> > Does this give you enough info to go on, or should I try a bisection?
>
> I'll be trying to reproduce it, but it's unlikely I'll manage to
> reproduce it reliably as there may be a specific combination of hardware
> necessary as well. What I'm going to try is writing a module that
> allocates order-5 every second GFP_ATOMIC and see can I reproduce using
> scenarios similar to yours but it'll take some time with no guarantee of
> success. If you could bisect it, it would be fantastic.
And the winner is:
2ff05b2b4eac2e63d345fc731ea151a060247f53 is first bad commit
commit 2ff05b2b4eac2e63d345fc731ea151a060247f53
Author: David Rientjes <rientjes@google.com>
Date:   Tue Jun 16 15:32:56 2009 -0700
    oom: move oom_adj value from task_struct to mm_struct
I'm confident that the bisection is good. The test case was very reliable 
while zooming in on the merge from akpm.
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-05 21:34           ` Frans Pop
@ 2009-10-06  0:04             ` David Rientjes
  2009-10-06  1:25               ` KOSAKI Motohiro
                                 ` (2 more replies)
  0 siblings, 3 replies; 88+ messages in thread
From: David Rientjes @ 2009-10-06  0:04 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Pekka Enberg, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, linux-mm
On Mon, 5 Oct 2009, Frans Pop wrote:
> And the winner is:
> 2ff05b2b4eac2e63d345fc731ea151a060247f53 is first bad commit
> commit 2ff05b2b4eac2e63d345fc731ea151a060247f53
> Author: David Rientjes <rientjes@google.com>
> Date:   Tue Jun 16 15:32:56 2009 -0700
> 
>     oom: move oom_adj value from task_struct to mm_struct
> 
> I'm confident that the bisection is good. The test case was very reliable 
> while zooming in on the merge from akpm.
> 
I doubt it for two reasons: (i) this commit was reverted in 0753ba0 since 
2.6.31-rc7 and is no longer in the kernel, and (ii) these are GFP_ATOMIC 
allocations which would be unaffected by oom killer scores.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-06  0:04             ` David Rientjes
@ 2009-10-06  1:25               ` KOSAKI Motohiro
  2009-10-06  8:53               ` Mel Gorman
  2009-10-06 10:23               ` Frans Pop
  2 siblings, 0 replies; 88+ messages in thread
From: KOSAKI Motohiro @ 2009-10-06  1:25 UTC (permalink / raw)
  To: David Rientjes
  Cc: kosaki.motohiro, Frans Pop, Mel Gorman, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	linux-mm
> On Mon, 5 Oct 2009, Frans Pop wrote:
> 
> > And the winner is:
> > 2ff05b2b4eac2e63d345fc731ea151a060247f53 is first bad commit
> > commit 2ff05b2b4eac2e63d345fc731ea151a060247f53
> > Author: David Rientjes <rientjes@google.com>
> > Date:   Tue Jun 16 15:32:56 2009 -0700
> > 
> >     oom: move oom_adj value from task_struct to mm_struct
> > 
> > I'm confident that the bisection is good. The test case was very reliable 
> > while zooming in on the merge from akpm.
> > 
> 
> I doubt it for two reasons: (i) this commit was reverted in 0753ba0 since 
> 2.6.31-rc7 and is no longer in the kernel, and (ii) these are GFP_ATOMIC 
> allocations which would be unaffected by oom killer scores.
I agree. this patch is pretty obvious correct. it was reverted by
one unfortunately regression.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-06  0:04             ` David Rientjes
  2009-10-06  1:25               ` KOSAKI Motohiro
@ 2009-10-06  8:53               ` Mel Gorman
  2009-10-06  9:14                 ` David Rientjes
  2009-10-06 10:23               ` Frans Pop
  2 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-06  8:53 UTC (permalink / raw)
  To: David Rientjes
  Cc: Frans Pop, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Pekka Enberg, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, linux-mm
On Mon, Oct 05, 2009 at 05:04:55PM -0700, David Rientjes wrote:
> On Mon, 5 Oct 2009, Frans Pop wrote:
> 
> > And the winner is:
> > 2ff05b2b4eac2e63d345fc731ea151a060247f53 is first bad commit
> > commit 2ff05b2b4eac2e63d345fc731ea151a060247f53
> > Author: David Rientjes <rientjes@google.com>
> > Date:   Tue Jun 16 15:32:56 2009 -0700
> > 
> >     oom: move oom_adj value from task_struct to mm_struct
> > 
> > I'm confident that the bisection is good. The test case was very reliable 
> > while zooming in on the merge from akpm.
> > 
> 
> I doubt it for two reasons: (i) this commit was reverted in 0753ba0 since 
> 2.6.31-rc7 and is no longer in the kernel, and (ii) these are GFP_ATOMIC 
> allocations which would be unaffected by oom killer scores.
> 
However, the problem was reported to start showing up in 2.6.31-rc1 so
while it might not be *the* patch, it might be making the type of change
that caused more fragmentation. This patch adjusted the size of
mm_struct and maybe it was enough to change the "order" required for the
slab. Maybe there are other slabs that have changed size as well in that
timeframe.
Frans, what is the size of mm_struct before and after this patch was
applied? Find it with either
grep mm_struct /proc/slabinfo
and if the information is not available there, try
cat /sys/kernel/slab/mm_struct/slab_size and
/sys/kernel/slab/mm_struct/order
Thanks
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-06  8:53               ` Mel Gorman
@ 2009-10-06  9:14                 ` David Rientjes
  2009-10-06  9:22                   ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: David Rientjes @ 2009-10-06  9:14 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Pekka Enberg, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, linux-mm
On Tue, 6 Oct 2009, Mel Gorman wrote:
> > > And the winner is:
> > > 2ff05b2b4eac2e63d345fc731ea151a060247f53 is first bad commit
> > > commit 2ff05b2b4eac2e63d345fc731ea151a060247f53
> > > Author: David Rientjes <rientjes@google.com>
> > > Date:   Tue Jun 16 15:32:56 2009 -0700
> > > 
> > >     oom: move oom_adj value from task_struct to mm_struct
> > > 
> > > I'm confident that the bisection is good. The test case was very reliable 
> > > while zooming in on the merge from akpm.
> > > 
> > 
> > I doubt it for two reasons: (i) this commit was reverted in 0753ba0 since 
> > 2.6.31-rc7 and is no longer in the kernel, and (ii) these are GFP_ATOMIC 
> > allocations which would be unaffected by oom killer scores.
> > 
> 
> However, the problem was reported to start showing up in 2.6.31-rc1 so
> while it might not be *the* patch, it might be making the type of change
> that caused more fragmentation. This patch adjusted the size of
> mm_struct and maybe it was enough to change the "order" required for the
> slab. Maybe there are other slabs that have changed size as well in that
> timeframe.
> 
> Frans, what is the size of mm_struct before and after this patch was
> applied? Find it with either
> 
> grep mm_struct /proc/slabinfo
> 
> and if the information is not available there, try
> 
> cat /sys/kernel/slab/mm_struct/slab_size and
> /sys/kernel/slab/mm_struct/order
> 
If that's the case and the problem still persists in 2.6.31-rc7 as 
reported, then you'd need to compare the current slab order for both 
mm_struct and signal_struct to the previously known working kernel 
since the latter is where oom_adj was moved.  (You'd still have to check 
the former to see if there were any mm_struct additions between rc1 and 
rc7 between the commit and revert, though.)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-06  9:14                 ` David Rientjes
@ 2009-10-06  9:22                   ` Mel Gorman
  0 siblings, 0 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-06  9:22 UTC (permalink / raw)
  To: David Rientjes
  Cc: Frans Pop, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Pekka Enberg, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, linux-mm
On Tue, Oct 06, 2009 at 02:14:26AM -0700, David Rientjes wrote:
> On Tue, 6 Oct 2009, Mel Gorman wrote:
> 
> > > > And the winner is:
> > > > 2ff05b2b4eac2e63d345fc731ea151a060247f53 is first bad commit
> > > > commit 2ff05b2b4eac2e63d345fc731ea151a060247f53
> > > > Author: David Rientjes <rientjes@google.com>
> > > > Date:   Tue Jun 16 15:32:56 2009 -0700
> > > > 
> > > >     oom: move oom_adj value from task_struct to mm_struct
> > > > 
> > > > I'm confident that the bisection is good. The test case was very reliable 
> > > > while zooming in on the merge from akpm.
> > > > 
> > > 
> > > I doubt it for two reasons: (i) this commit was reverted in 0753ba0 since 
> > > 2.6.31-rc7 and is no longer in the kernel, and (ii) these are GFP_ATOMIC 
> > > allocations which would be unaffected by oom killer scores.
> > > 
> > 
> > However, the problem was reported to start showing up in 2.6.31-rc1 so
> > while it might not be *the* patch, it might be making the type of change
> > that caused more fragmentation. This patch adjusted the size of
> > mm_struct and maybe it was enough to change the "order" required for the
> > slab. Maybe there are other slabs that have changed size as well in that
> > timeframe.
> > 
> > Frans, what is the size of mm_struct before and after this patch was
> > applied? Find it with either
> > 
> > grep mm_struct /proc/slabinfo
> > 
> > and if the information is not available there, try
> > 
> > cat /sys/kernel/slab/mm_struct/slab_size and
> > /sys/kernel/slab/mm_struct/order
> > 
> 
> If that's the case and the problem still persists in 2.6.31-rc7 as 
> reported, then you'd need to compare the current slab order for both 
> mm_struct and signal_struct to the previously known working kernel 
> since the latter is where oom_adj was moved.  (You'd still have to check 
> the former to see if there were any mm_struct additions between rc1 and 
> rc7 between the commit and revert, though.)
> 
Best to just grab all of slabinfo for a poke around. I know task_struct
has increases in size since 2.6.29 but not enough on the machines I've
changed to make a difference to the order of pages requested. It might
be different on the problem machines.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-06  0:04             ` David Rientjes
  2009-10-06  1:25               ` KOSAKI Motohiro
  2009-10-06  8:53               ` Mel Gorman
@ 2009-10-06 10:23               ` Frans Pop
  2 siblings, 0 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-06 10:23 UTC (permalink / raw)
  To: David Rientjes
  Cc: Mel Gorman, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Pekka Enberg, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, linux-mm
On Tuesday 06 October 2009, David Rientjes wrote:
> On Mon, 5 Oct 2009, Frans Pop wrote:
> > And the winner is:
> > 2ff05b2b4eac2e63d345fc731ea151a060247f53 is first bad commit
> > commit 2ff05b2b4eac2e63d345fc731ea151a060247f53
> > Author: David Rientjes <rientjes@google.com>
> > Date:   Tue Jun 16 15:32:56 2009 -0700
> >
> >     oom: move oom_adj value from task_struct to mm_struct
> >
> > I'm confident that the bisection is good. The test case was very
> > reliable while zooming in on the merge from akpm.
>
> I doubt it for two reasons: (i) this commit was reverted in 0753ba0
> since 2.6.31-rc7 and is no longer in the kernel, and (ii) these are
> GFP_ATOMIC allocations which would be unaffected by oom killer scores.
OK. Looks like I have been getting some false "good" results. I've been 
redoing part of the bisect and am getting close to a new candidate. Will 
explain further when I have that.
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-05  6:50       ` Frans Pop
  2009-10-05  8:54         ` Frans Pop
  2009-10-05  8:57         ` Mel Gorman
@ 2009-10-11 23:10         ` Frans Pop
  2009-10-11 23:36           ` Frans Pop
  2009-10-12 13:43           ` Mel Gorman
  2 siblings, 2 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-11 23:10 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	linux-mm
Sorry for going quiet on this issue for a few days, but I have been 
spending *a lot* of time on it. I've done what amounts to 5 bisection 
rounds at ~20 minutes per iteration and in total over 80 boots.
The problem with my first bisection was that there are *at least two* 
changes at the root of this issue, both committed between .30 and .30-rc1. 
Because of this a normal bisection will not lead to a reliable result and 
even with my last effort I can only narrow it down to two different areas, 
and not 100% to specific commits.
The two identified areas are:
1) a wireless merge which causes the SKB errors to appear in the first
   place, but not always;
2) an mm merge which makes the SKB errors occur *much* quicker; IMHO this
   is the change that also causes the regressions reported by Pekka and
   Karol.
So below my results. The issue is both complex and subtle. Now it's up to 
you, domain experts for both mm *and* wireless/networking, to make sense of 
it all and come up with suggestions on how to proceed.
I've improved my test and it's now a lot more reliable, but there are still 
timing influences. Also, because this is all merge-window stuff, I'm 
hitting quite a few minor and major regressions between commits that can 
affect tests.
Please study the information below carefully. I know it's long, but I think 
this issue justifies that.
On Monday 05 October 2009, Frans Pop wrote:
> This looks conclusive. I tested .30 and .32-rc3 from clean reboots and
> only starting gitk. I only started music playing in the background
> (amarok) from an NFS share to ensure network activity.
>
> With .32-rc3 I got 4 SKB allocation errors while starting the *second*
> gitk instance. And the system was completely frozen with music stopped
> until gitk finished loading.
With .32-rc3, .31.1 and vanilla .31 I will get multiple SKB allocation 
errors the *first time* I run the test, *every* time.
> With .30 I was able to start *three* gitk's (which meant 2 of them got
> (partially) swapped out) without any allocation errors. And with the
> system remaining relatively responsive. There was a short break in the
> music while I started the 2nd instance, but it just continued playing
> afterwards. There was also some mild latency in the mouse cursor, but
> nothing like the full desktop freeze I get with .32-rc3.
With both .30.2 and vanilla .30 I have *never* been able to get any SKB 
allocation errors. No matter how often I repeat the test.
So, the start and end position are 100% reproducible. Problem is that this 
changes during the bisection. At some point the test will fail (no SKB 
errors) the first time I run it, but it will fail on the second or third 
attempt.
Apparently at some point memory must already be fragmented (or higher 
orders already used up) to some extend for the errors to trigger.
TEST METHOD
-----------
As a normal bisection (I tried 3 times...) did not lead anywhere, I had to 
think of an alternative approach. I decided to start by manually selecting 
merges by Linus into mainline. The advantage is that that makes the 
bisection linear and makes it a lot easier to see patterns.
After narrowing down to a specific merge, I bisected (again semi-manually) 
inside that merge.
Because I suspected there were multiple changes involved, I deliberately 
tried to find two points:
- where do I first start seeing SKB errors at all, even if it is only at
  the second or third try;
- where do I start getting SKB errors reliably on the first try.
I worked from "good" to "bad", i.e. I started at .30. The merges were not 
chosen completely randomly. From the first 3 bisections I strongly 
suspected the first 'net-next' merge and the first 'akpm' merge, but I did 
make sure to confirm that suspicion.
TEST DESCRIPTION
----------------
The test I've ended up using is:
1) clean boot
2) start music in amarok from NFS share; use very long song to avoid file
   changes and thus ensure a fluent stream of network data during the test
3) start 'gitk v2.6.29..master &' - to use up some memory
4) start first 'gitk master &' - after this all normal memory is as good as
   used up, with minor swap; this never resulted in SKB errors
5) start second 'gitk master &' - this causes heavy swapping (>700 MB) and
   is the real test
6) if there were no SKB errors after 5), kill the gitk processes and repeat
   steps 3) to 5). I've done this up to 4 times in some cases
7) if the results are not clear or when there is doubt later, repeat from
   step 1) with same kernel
Memory after initial 'gitk v2.6.29..master &':
             total       used       free     shared    buffers     cached
Mem:       2030776    1153008     877768          0      41572     333968
-/+ buffers/cache:     777468    1253308
Swap:      2097144          0    2097144
Memory after first 'gitk master &':
             total       used       free     shared    buffers     cached
Mem:       2030776    1979040      51736          0      35684     238420
-/+ buffers/cache:    1704936     325840
Swap:      2097144      21876    2075268
Memory after second 'gitk master &' (with .30.2):
             total       used       free     shared    buffers     cached
Mem:       2030776    2011608      19168          0      21836      92336
-/+ buffers/cache:    1897436     133340
Swap:      2097144     776160    1320984
OVERVIEW OF RESULTS
-------------------
Below I list the most relevant merges and commits. Note that they are 
listed in commit order; my kernel version shows the order of testing.
For the commits I tested the test results are listed on the next line.
The first number on that line consists of the test series + the iteration 
(and also identifies the kernel I used).
A "+" means I got no SKB errors, a "-" that I did get them. A "|" means I 
rebooted for a second series of tests.
v2.6.30-2330-gdb8e7f1	'x86-fixes-for-linus' of linux-2.6-tip
	1.1	+++	iwlagn sw-error during first test
v2.6.30-4127-g0fa2133	'merge' of powerpc (last merge before net-next-2.6)
	1.2	+++
v2.6.30-5398-g2ed0e21	net-next-2.6 (mega-merge!)
	1.4	+-	system reboot fails after testing
v2.6.30-5517-g609106b	'merge' of powerpc
	1.3	+-	system reboot fails after testing
v2.6.30-5927-gf83b1e6	'for-linus' of linux1394-2.6 (last merge before akpm)
	2.2	++-
v2.6.30-6111-g517d086	'akpm'
	2.1	-|-
BISECTION OF net-next-2.6 MERGE
-------------------------------
Note that this merge was based not on .30 vanilla, but partly on 
v2.6.30-rc1 and partly on v2.6.30-rc6.
I think this had an influence on the latencies I saw (i.e. because some 
post-rc6 bug fixes were not present it changes the general behavior of the 
system during the swapping). For example: with v2.6.30-4127-g0fa2133 the 
system remained more responsive (smaller music skips) than with 
v2.6.30-rc1-1219-g82d0481.
I started again by testing merges, this time those by David.
v2.6.30-rc1-1219-g82d0481	'master' of wireless-next-2.6
	1.5	++++	bad latencies
v2.6.30-rc6-660-gbb803cf	'master' of net-2.6
v2.6.30-rc6-808-g45ea4ea	'master' of wireless-next-2.6
v2.6.30-rc6-850-gc649c0e	'master' of net-2.6
v2.6.30-rc6-922-g3f1f39c	'linux-2.6.31.y' of wimax
v2.6.30-rc6-999-gb2f8f75	'master' of net-2.6
v2.6.30-rc6-1028-ga8c617e	'net-next' of lksctp-dev
	1.7	++++|++++|++++
	I went back to this one twice because the bisection inside the
	next merge (see below) did not give a clear result.
v2.6.30-rc6-1103-gb1bc81a	'master' of wireless-next-2.6
	1.8	+-
v2.6.30-rc6-1224-g84503dd	'master' of wireless-next-2.6
	1.6	+-
So the problem started in the v2.6.30-rc6-1103-gb1bc81a merge.
I was unable to narrow it down to an exact commit; AFAICT the remaining 
ones (between v2.6.30-rc6-1028-g8fc0fee and v2.6.30-rc6-1032-g7ba10a8) are 
uninteresting. But it *must* be in this area!
For a good overview of the area, use 'gitk 3f1f39c4..b1bc81a0'.
v2.6.30-rc6-1028-g8fc0fee	cfg80211: use key size constants
	1.11	++++
v2.6.30-rc6-1031-g1bb5633	iwmc3200wifi: fix printk format
	1.14	+++-	not quite conclusive...
v2.6.30-rc6-1032-g7ba10a8	mac80211: fix transposed min/max CW values
	1.13	-
	This is a bugfix for aa837ee1d from an earlier merge! Could this maybe
	influence the test results in between? There are various SKB related
	changes there, for example: dfbf97f3..e5b9215e.
v2.6.30-rc6-1037-g2c5b9e5	wireless: libertas: fix unaligned accesses
	1.12	+-
v2.6.30-rc6-1044-g729e9c7	cfg80211: fix for duplicate userspace replies
	1.10	+-
v2.6.30-rc6-1075-gc587de0	iwlwifi: unify station management
	1.9	++-|+-
v2.6.30-rc6-1076-gd14d444	iwl3945: port allow skb allocation in tasklet
	I thought this was a prime candidate, but as you can see several commits
	before failed too. Still worth looking at I think!
BISECTION of akpm (mm) MERGE
----------------------------
So here I went looking for "where does the test start failing on the first 
try". Again, I was unable to narrow it down to a single commit.
For a good overview of the area, use 'gitk f83b1e61..517d0869'.
v2.6.30-5466-ga1dd268	mm: use alloc_pages_exact in alloc_large_system_hash
	2.3	+-
v2.6.30-5478-ge9bb35d	mm: setup_per_zone_inactive_ratio - fix comment and..
	2.5	+-
v2.6.30-5486-g35282a2	migration: only migrate_prep() once per move_pages()
	2.6	-|+|-	not quite conclusive...
v2.6.30-5492-gbce7394	page-allocator: reset wmark_min and inactive ratio..
	2.4	-|-
WHERE NEXT?
===========
I think the results confirm there is definitely an issue here and that my 
test is reliable and consistent enough to show it. And as it currently is 
the only test we have...
I hope that the info above is enough for the mm and wireless domain 
experts to identify likely candidates in the areas I've identified.
The next step could be trying specific reverts or debug patches, either on 
top of current git, or 2.6.31, or inside the identified areas.
I'll run anything you care to throw at me and will try to provide any 
additional info you need, but at this point it's up to you.
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-11 23:10         ` Frans Pop
@ 2009-10-11 23:36           ` Frans Pop
  2009-10-12 13:43           ` Mel Gorman
  1 sibling, 0 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-11 23:36 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	linux-mm
On Monday 12 October 2009, Frans Pop wrote:
> BISECTION of akpm (mm) MERGE
> ----------------------------
> So here I went looking for "where does the test start failing on the
> first try". Again, I was unable to narrow it down to a single commit.
Note that this merge is based on mainline at v2.6.30-5415-g03347e2, so a
number of merges "drop out" once I  started bisecting into this merge. But
that point is still *after* the net-next-2.6 merge, which is all that's
really relevant for this issue.
> For a good overview of the area, use 'gitk f83b1e61..517d0869'.
>
> v2.6.30-5466-ga1dd268   mm: use alloc_pages_exact in alloc_large_system_hash
>	2.3     +- 
> v2.6.30-5478-ge9bb35d   mm: setup_per_zone_inactive_ratio - fix comment and..
>	2.5     +- 
> v2.6.30-5486-g35282a2   migration: only migrate_prep() once per move_pages()
>	2.6     -|+|-   not quite conclusive... 
> v2.6.30-5492-gbce7394   page-allocator: reset wmark_min and inactive ratio..
>	2.4     -|- 
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-11 23:10         ` Frans Pop
  2009-10-11 23:36           ` Frans Pop
@ 2009-10-12 13:43           ` Mel Gorman
  2009-10-12 17:32             ` Frans Pop
  1 sibling, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-12 13:43 UTC (permalink / raw)
  To: Frans Pop
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Mon, Oct 12, 2009 at 01:10:25AM +0200, Frans Pop wrote:
> Sorry for going quiet on this issue for a few days, but I have been 
> spending *a lot* of time on it. I've done what amounts to 5 bisection 
> rounds at ~20 minutes per iteration and in total over 80 boots.
> 
> The problem with my first bisection was that there are *at least two* 
> changes at the root of this issue, both committed between .30 and .30-rc1. 
> Because of this a normal bisection will not lead to a reliable result and 
> even with my last effort I can only narrow it down to two different areas, 
> and not 100% to specific commits.
> 
Thanks very much for your detailed work on this.
> The two identified areas are:
> 1) a wireless merge which causes the SKB errors to appear in the first
>    place, but not always;
> 2) an mm merge which makes the SKB errors occur *much* quicker; IMHO this
>    is the change that also causes the regressions reported by Pekka and
>    Karol.
> 
> So below my results. The issue is both complex and subtle. Now it's up to 
> you, domain experts for both mm *and* wireless/networking, to make sense of 
> it all and come up with suggestions on how to proceed.
> 
> I've improved my test and it's now a lot more reliable, but there are still 
> timing influences.
The timing influences is probably because kswapd is working from the
time memory gets full. High-order allocation failures would cause it to
start reclaiming at that order so it's a race always to see can it do
its work before an atomic allocation fails or not.
> Also, because this is all merge-window stuff, I'm 
> hitting quite a few minor and major regressions between commits that can 
> affect tests.
> 
> Please study the information below carefully. I know it's long, but I think 
> this issue justifies that.
> 
Agreed. I'll be looking at commits, both wireless and mm but obviously
anything I saw about wireless needs to be taken with a generous dose of
salt.
> On Monday 05 October 2009, Frans Pop wrote:
> > This looks conclusive. I tested .30 and .32-rc3 from clean reboots and
> > only starting gitk. I only started music playing in the background
> > (amarok) from an NFS share to ensure network activity.
> >
> > With .32-rc3 I got 4 SKB allocation errors while starting the *second*
> > gitk instance. And the system was completely frozen with music stopped
> > until gitk finished loading.
> 
> With .32-rc3, .31.1 and vanilla .31 I will get multiple SKB allocation 
> errors the *first time* I run the test, *every* time.
> 
So, this remains a current problem that wasn't solved by accident.
> > With .30 I was able to start *three* gitk's (which meant 2 of them got
> > (partially) swapped out) without any allocation errors. And with the
> > system remaining relatively responsive. There was a short break in the
> > music while I started the 2nd instance, but it just continued playing
> > afterwards. There was also some mild latency in the mouse cursor, but
> > nothing like the full desktop freeze I get with .32-rc3.
> 
> With both .30.2 and vanilla .30 I have *never* been able to get any SKB 
> allocation errors. No matter how often I repeat the test.
> 
> So, the start and end position are 100% reproducible. Problem is that this 
> changes during the bisection. At some point the test will fail (no SKB 
> errors) the first time I run it, but it will fail on the second or third 
> attempt.
> Apparently at some point memory must already be fragmented (or higher 
> orders already used up) to some extend for the errors to trigger.
> 
That is a reasonable assessment. It could be because
1. Something in the intevening commits greatly increases the number of
   GFP_ATOMIC allocations that are occuring. It's a pity that the allocator
   tracepoints are not available in those kernels. It would have made
   investigating this theory easier.
2. kswapd is no longer reclaiming high-order pages as well as it used
   to be it due to changes in kswapd itself or lumpy reclaim
3. Fragmentation avoidance has been broken in some subtle manner
I think 3 is particularly unlikely and am expecting it to be 1 or 2.
> TEST METHOD
> -----------
> As a normal bisection (I tried 3 times...) did not lead anywhere, I had to 
> think of an alternative approach. I decided to start by manually selecting 
> merges by Linus into mainline. The advantage is that that makes the 
> bisection linear and makes it a lot easier to see patterns.
> After narrowing down to a specific merge, I bisected (again semi-manually) 
> inside that merge.
> 
> Because I suspected there were multiple changes involved, I deliberately 
> tried to find two points:
> - where do I first start seeing SKB errors at all, even if it is only at
>   the second or third try;
> - where do I start getting SKB errors reliably on the first try.
> 
> I worked from "good" to "bad", i.e. I started at .30. The merges were not 
> chosen completely randomly. From the first 3 bisections I strongly 
> suspected the first 'net-next' merge and the first 'akpm' merge, but I did 
> make sure to confirm that suspicion.
> 
A very good approach.
> TEST DESCRIPTION
> ----------------
> The test I've ended up using is:
> 1) clean boot
> 2) start music in amarok from NFS share; use very long song to avoid file
>    changes and thus ensure a fluent stream of network data during the test
> 3) start 'gitk v2.6.29..master &' - to use up some memory
> 4) start first 'gitk master &' - after this all normal memory is as good as
>    used up, with minor swap; this never resulted in SKB errors
> 5) start second 'gitk master &' - this causes heavy swapping (>700 MB) and
>    is the real test
> 6) if there were no SKB errors after 5), kill the gitk processes and repeat
>    steps 3) to 5). I've done this up to 4 times in some cases
> 7) if the results are not clear or when there is doubt later, repeat from
>    step 1) with same kernel
> 
> Memory after initial 'gitk v2.6.29..master &':
>              total       used       free     shared    buffers     cached
> Mem:       2030776    1153008     877768          0      41572     333968
> -/+ buffers/cache:     777468    1253308
> Swap:      2097144          0    2097144
> 
> Memory after first 'gitk master &':
>              total       used       free     shared    buffers     cached
> Mem:       2030776    1979040      51736          0      35684     238420
> -/+ buffers/cache:    1704936     325840
> Swap:      2097144      21876    2075268
> 
> Memory after second 'gitk master &' (with .30.2):
>              total       used       free     shared    buffers     cached
> Mem:       2030776    2011608      19168          0      21836      92336
> -/+ buffers/cache:    1897436     133340
> Swap:      2097144     776160    1320984
> 
> OVERVIEW OF RESULTS
> -------------------
> Below I list the most relevant merges and commits. Note that they are 
> listed in commit order; my kernel version shows the order of testing.
> 
> For the commits I tested the test results are listed on the next line.
> The first number on that line consists of the test series + the iteration 
> (and also identifies the kernel I used).
> A "+" means I got no SKB errors, a "-" that I did get them. A "|" means I 
> rebooted for a second series of tests.
> 
> v2.6.30-2330-gdb8e7f1	'x86-fixes-for-linus' of linux-2.6-tip
> 	1.1	+++	iwlagn sw-error during first test
> v2.6.30-4127-g0fa2133	'merge' of powerpc (last merge before net-next-2.6)
> 	1.2	+++
> v2.6.30-5398-g2ed0e21	net-next-2.6 (mega-merge!)
> 	1.4	+-	system reboot fails after testing
> v2.6.30-5517-g609106b	'merge' of powerpc
> 	1.3	+-	system reboot fails after testing
> v2.6.30-5927-gf83b1e6	'for-linus' of linux1394-2.6 (last merge before akpm)
> 	2.2	++-
> v2.6.30-6111-g517d086	'akpm'
> 	2.1	-|-
> 
> BISECTION OF net-next-2.6 MERGE
> -------------------------------
> Note that this merge was based not on .30 vanilla, but partly on 
> v2.6.30-rc1 and partly on v2.6.30-rc6.
> I think this had an influence on the latencies I saw (i.e. because some 
> post-rc6 bug fixes were not present it changes the general behavior of the 
> system during the swapping). For example: with v2.6.30-4127-g0fa2133 the 
> system remained more responsive (smaller music skips) than with 
> v2.6.30-rc1-1219-g82d0481.
> 
> I started again by testing merges, this time those by David.
> 
> v2.6.30-rc1-1219-g82d0481	'master' of wireless-next-2.6
> 	1.5	++++	bad latencies
The bad latencies might imply that there are a lot more allocations
going on than there used to be. Maybe it was just because of a wireless
bug though that was later fixed.
> v2.6.30-rc6-660-gbb803cf	'master' of net-2.6
> v2.6.30-rc6-808-g45ea4ea	'master' of wireless-next-2.6
> v2.6.30-rc6-850-gc649c0e	'master' of net-2.6
> v2.6.30-rc6-922-g3f1f39c	'linux-2.6.31.y' of wimax
> v2.6.30-rc6-999-gb2f8f75	'master' of net-2.6
> v2.6.30-rc6-1028-ga8c617e	'net-next' of lksctp-dev
> 	1.7	++++|++++|++++
> 	I went back to this one twice because the bisection inside the
> 	next merge (see below) did not give a clear result.
> v2.6.30-rc6-1103-gb1bc81a	'master' of wireless-next-2.6
> 	1.8	+-
> v2.6.30-rc6-1224-g84503dd	'master' of wireless-next-2.6
> 	1.6	+-
> 
> So the problem started in the v2.6.30-rc6-1103-gb1bc81a merge.
> I was unable to narrow it down to an exact commit; AFAICT the remaining 
> ones (between v2.6.30-rc6-1028-g8fc0fee and v2.6.30-rc6-1032-g7ba10a8) are 
> uninteresting. But it *must* be in this area!
> 
> For a good overview of the area, use 'gitk 3f1f39c4..b1bc81a0'.
> 
> v2.6.30-rc6-1028-g8fc0fee	cfg80211: use key size constants
> 	1.11	++++
> v2.6.30-rc6-1031-g1bb5633	iwmc3200wifi: fix printk format
> 	1.14	+++-	not quite conclusive...
> v2.6.30-rc6-1032-g7ba10a8	mac80211: fix transposed min/max CW values
> 	1.13	-
> 	This is a bugfix for aa837ee1d from an earlier merge! Could this maybe
> 	influence the test results in between? There are various SKB related
> 	changes there, for example: dfbf97f3..e5b9215e.
Maybe. Your commit id's are different to what I see. Maybe it's because your
tree has been shuffled around a bit but after some digging around in this
general area, I saw this patch
4752c93c30 iwlcore: Allow skb allocation from tasklet
This patch increases the number of GFP_ATOMIC allocations that can occur by
allocating GFP_ATOMIC in some cases and GFP_KERNEL in others. Previously,
only GFP_KERNEL was used and I didn't realise this allocation method was
so recent. Problems of this sort have cropped up before and while there
are later changes that suppress some of these warnings, I believe this is
a strong candidate for where the allocation failures started appearing.
> v2.6.30-rc6-1037-g2c5b9e5	wireless: libertas: fix unaligned accesses
> 	1.12	+-
> v2.6.30-rc6-1044-g729e9c7	cfg80211: fix for duplicate userspace replies
> 	1.10	+-
> v2.6.30-rc6-1075-gc587de0	iwlwifi: unify station management
> 	1.9	++-|+-
> v2.6.30-rc6-1076-gd14d444	iwl3945: port allow skb allocation in tasklet
> 	I thought this was a prime candidate, but as you can see several commits
> 	before failed too. Still worth looking at I think!
> 
Your commit IDs are different to what I see but it's the commit merge at
b1bc81a0ef86b86fa410dd303d84c8c7bd09a64d. I agree that the last commit
(d14d44407b9f06e3cf967fcef28ccb780caf0583) could make the problem worse
because it expands the use of GFP_ATOMIC for another driver.
> BISECTION of akpm (mm) MERGE
> ----------------------------
> So here I went looking for "where does the test start failing on the first 
> try". Again, I was unable to narrow it down to a single commit.
> 
> For a good overview of the area, use 'gitk f83b1e61..517d0869'.
> 
> v2.6.30-5466-ga1dd268	mm: use alloc_pages_exact in alloc_large_system_hash
> 	2.3	+-
> v2.6.30-5478-ge9bb35d	mm: setup_per_zone_inactive_ratio - fix comment and..
> 	2.5	+-
> v2.6.30-5486-g35282a2	migration: only migrate_prep() once per move_pages()
> 	2.6	-|+|-	not quite conclusive...
> v2.6.30-5492-gbce7394	page-allocator: reset wmark_min and inactive ratio..
> 	2.4	-|-
> 
While I didn't spot anything too out of the ordinary here, they did occur
shortly after a number of other page allocator related patches.  One small
thing I noticed there is that kswapd is getting woken up less now than it did
previously. Generally, I wouldn't have expected it to make a difference but
it's possible that kswapd is not being woken up to reclaim at a higher order
than it was previously. I have a patch for this below. It'd be nice if you
could apply it and see do fewer allocation failures occur on current mainline.
> WHERE NEXT?
> ===========
> I think the results confirm there is definitely an issue here and that my 
> test is reliable and consistent enough to show it. And as it currently is 
> the only test we have...
> 
> I hope that the info above is enough for the mm and wireless domain 
> experts to identify likely candidates in the areas I've identified.
> 
> The next step could be trying specific reverts or debug patches, either on 
> top of current git, or 2.6.31, or inside the identified areas.
> I'll run anything you care to throw at me and will try to provide any 
> additional info you need, but at this point it's up to you.
> 
For the wireless people in mainline - iwl_rx_replenish_now() is doing
a GFP_ATOMIC allocation that does not use __GFP_NOWARN. As part of
investigating allocation failures, iwl_rx_allocate() was taught to
distinguish between a benign and serious allocation failure - serious
being there are very few RX buffers left and packet loss could occur soon
(see commit f82a924cc88a5541df1d4b9d38a0968cd077a051). I think this GFP mask
should be made GFP_ATOMIC|__GFP_NOWARN so that warnings only appear when the
failure is serious, dump stack after the warning if you need it. I have a
feeling that almost all these warnings have been benign and are related to
the introduction of GFP_ATOMIC being used so heavily to move more expensive
allocations to the tasklet (presumably to reduce user-visible latency).
Frans, could you try the following kswapd-related patch please? I'd be
interested in seeing if the number of allocation failure warnings are
reduced with it. After that, could you edit
drivers/net/wireless/iwlwifi/iwl-rx.c and make the GFP_ATOMIC in
iwl_rx_replenish_now() GFP_ATOMIC|__GFP_NOWARN and see do any of the
"serious" allocation failure messages appear.
Thanks again for your persistence.
==== CUT HERE ====
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-12 13:43           ` Mel Gorman
@ 2009-10-12 17:32             ` Frans Pop
  2009-10-12 18:43               ` Mel Gorman
  2009-10-13 20:38               ` Frans Pop
  0 siblings, 2 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-12 17:32 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Monday 12 October 2009, Mel Gorman wrote:
> Maybe. Your commit id's are different to what I see. Maybe it's because
> your tree has been shuffled around a bit
No, the commit IDs should be identical. My tree is just plain mainline.
Just to make sure... You did remove the "g" from the IDs, right?
So v2.6.30-rc6-1103-gb1bc81a becomes 'b1bc81a' and if you do
'git describe b1bc81a' you really should end up with the same IDs I have.
> but after some digging around in this general area, I saw this patch
>
> 4752c93c30 iwlcore: Allow skb allocation from tasklet
That is v2.6.30-rc6-773-g4752c93, which is part of the first wireless
merge I tested and where I saw no issues. But see below.
> This patch increases the number of GFP_ATOMIC allocations that can occur
> by allocating GFP_ATOMIC in some cases and GFP_KERNEL in others.
> Previously, only GFP_KERNEL was used and I didn't realise this
> allocation method was so recent. Problems of this sort have cropped up
> before and while there are later changes that suppress some of these
> warnings, I believe this is a strong candidate for where the allocation
> failures started appearing.
>
> > v2.6.30-rc6-1032-g7ba10a8       mac80211: fix transposed min/max CW values
> >     1.13    -
> >     This is a bugfix for aa837ee1d from an earlier merge! Could this maybe
There's a typo here. That ID should be: aa837e1d.
> >     influence the test results in between? There are various SKB related
> >     changes there, for example: dfbf97f3..e5b9215e.
> > v2.6.30-rc6-1037-g2c5b9e5	wireless: libertas: fix unaligned accesses
> > 	1.12    +-
> > v2.6.30-rc6-1044-g729e9c7	cfg80211: fix for duplicate userspace replies
> >     1.10    +- 
> > v2.6.30-rc6-1075-gc587de0	iwlwifi: unify station management
> >     1.9     ++-|+-
> > v2.6.30-rc6-1076-gd14d444	iwl3945: port allow skb allocation in tasklet
> >     I thought this was a prime candidate, but as you can see 
> >     several commits before failed too. Still worth looking at I think!
>
> Your commit IDs are different to what I see but it's the commit merge at
> b1bc81a0ef86b86fa410dd303d84c8c7bd09a64d. I agree that the last commit
> (d14d44407b9f06e3cf967fcef28ccb780caf0583) could make the problem worse
> because it expands the use of GFP_ATOMIC for another driver.
No, that was a mistake of mine. d14d444 is in a driver I don't even compile.
The one you identified (which is the same change for iwlagn) is much more
interesting.
I really do think that v2.6.30-rc6-1032-g7ba10a8 could play a role here.
That's a fix for v2.6.30-rc1-1131-gaa837e1. So that bug was introduced
_before_ the merge 82d0481 and may thus well explain both the latencies I
saw _and_ why that merge tested without problems. And that would also go a
long way to explain my test results.
So I'm going to retest 82d0481 with 7ba10a8 cherry-picked on top.
> > BISECTION of akpm (mm) MERGE
> > ----------------------------
[...]
> While I didn't spot anything too out of the ordinary here, they did
> occur shortly after a number of other page allocator related patches. 
> One small thing I noticed there is that kswapd is getting woken up less
> now than it did previously. Generally, I wouldn't have expected it to
> make a difference but it's possible that kswapd is not being woken up to
> reclaim at a higher order than it was previously. I have a patch for
> this below. It'd be nice if you could apply it and see do fewer
> allocation failures occur on current mainline.
I'll give that patch a try and report back.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-12 17:32             ` Frans Pop
@ 2009-10-12 18:43               ` Mel Gorman
  2009-10-13 20:38               ` Frans Pop
  1 sibling, 0 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-12 18:43 UTC (permalink / raw)
  To: Frans Pop
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Mon, Oct 12, 2009 at 07:32:11PM +0200, Frans Pop wrote:
> On Monday 12 October 2009, Mel Gorman wrote:
> > Maybe. Your commit id's are different to what I see. Maybe it's because
> > your tree has been shuffled around a bit
> 
> No, the commit IDs should be identical. My tree is just plain mainline.
> 
> Just to make sure... You did remove the "g" from the IDs, right?
> So v2.6.30-rc6-1103-gb1bc81a becomes 'b1bc81a' and if you do
> 'git describe b1bc81a' you really should end up with the same IDs I have.
> 
Bah, that's what I was doing all right. No excuse, that was just plain
stupid of me.
> > but after some digging around in this general area, I saw this patch
> >
> > 4752c93c30 iwlcore: Allow skb allocation from tasklet
> 
> That is v2.6.30-rc6-773-g4752c93, which is part of the first wireless
> merge I tested and where I saw no issues. But see below.
> 
While there were no issues at that point, I think it might have been the
beginning of a few patches that made things progressively worse. It is
possible there is more than one patch causing trouble here and bisecting
each of them is unlikely to be an option. More on this later though.
> > This patch increases the number of GFP_ATOMIC allocations that can occur
> > by allocating GFP_ATOMIC in some cases and GFP_KERNEL in others.
> > Previously, only GFP_KERNEL was used and I didn't realise this
> > allocation method was so recent. Problems of this sort have cropped up
> > before and while there are later changes that suppress some of these
> > warnings, I believe this is a strong candidate for where the allocation
> > failures started appearing.
> >
> > > v2.6.30-rc6-1032-g7ba10a8       mac80211: fix transposed min/max CW values
> > >     1.13    -
> > >     This is a bugfix for aa837ee1d from an earlier merge! Could this maybe
> 
> There's a typo here. That ID should be: aa837e1d.
> 
> > >     influence the test results in between? There are various SKB related
> > >     changes there, for example: dfbf97f3..e5b9215e.
> > > v2.6.30-rc6-1037-g2c5b9e5	wireless: libertas: fix unaligned accesses
> > > 	1.12    +-
> > > v2.6.30-rc6-1044-g729e9c7	cfg80211: fix for duplicate userspace replies
> > >     1.10    +- 
> > > v2.6.30-rc6-1075-gc587de0	iwlwifi: unify station management
> > >     1.9     ++-|+-
> > > v2.6.30-rc6-1076-gd14d444	iwl3945: port allow skb allocation in tasklet
> > >     I thought this was a prime candidate, but as you can see 
> > >     several commits before failed too. Still worth looking at I think!
> >
> > Your commit IDs are different to what I see but it's the commit merge at
> > b1bc81a0ef86b86fa410dd303d84c8c7bd09a64d. I agree that the last commit
> > (d14d44407b9f06e3cf967fcef28ccb780caf0583) could make the problem worse
> > because it expands the use of GFP_ATOMIC for another driver.
> 
> No, that was a mistake of mine. d14d444 is in a driver I don't even compile.
> The one you identified (which is the same change for iwlagn) is much more
> interesting.
> 
I had forgotten what model your card was and assumed it must have been based
on this driver for the problem to get worse for you that point.
> I really do think that v2.6.30-rc6-1032-g7ba10a8 could play a role here.
> That's a fix for v2.6.30-rc1-1131-gaa837e1. So that bug was introduced
> _before_ the merge 82d0481 and may thus well explain both the latencies I
> saw _and_ why that merge tested without problems. And that would also go a
> long way to explain my test results.
Very good point.
> So I'm going to retest 82d0481 with 7ba10a8 cherry-picked on top.
> 
Great.
> > > BISECTION of akpm (mm) MERGE
> > > ----------------------------
> [...]
> > While I didn't spot anything too out of the ordinary here, they did
> > occur shortly after a number of other page allocator related patches. 
> > One small thing I noticed there is that kswapd is getting woken up less
> > now than it did previously. Generally, I wouldn't have expected it to
> > make a difference but it's possible that kswapd is not being woken up to
> > reclaim at a higher order than it was previously. I have a patch for
> > this below. It'd be nice if you could apply it and see do fewer
> > allocation failures occur on current mainline.
> 
> I'll give that patch a try and report back.
> 
Thanks a lot.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-12 17:32             ` Frans Pop
  2009-10-12 18:43               ` Mel Gorman
@ 2009-10-13 20:38               ` Frans Pop
  2009-10-14 10:30                 ` Mel Gorman
  1 sibling, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-13 20:38 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Monday 12 October 2009, Frans Pop wrote:
> On Monday 12 October 2009, Mel Gorman wrote:
> > but after some digging around in this general area, I saw this patch
> >
> > 4752c93c30 iwlcore: Allow skb allocation from tasklet
>
> That is v2.6.30-rc6-773-g4752c93, which is part of the first wireless
> merge I tested and where I saw no issues. But see below.
>
> > This patch increases the number of GFP_ATOMIC allocations that can
> > occur by allocating GFP_ATOMIC in some cases and GFP_KERNEL in others.
> > Previously, only GFP_KERNEL was used and I didn't realise this
> > allocation method was so recent. Problems of this sort have cropped up
> > before and while there are later changes that suppress some of these
> > warnings, I believe this is a strong candidate for where the
> > allocation failures started appearing.
I have tried reverting this patch and that does make a significant 
difference, but the results are still not really conclusive.
I tested the revert on top of:
- the first net-next-2.6 merge (2ed0e21), i.e. before the mm merge
- 2.6.31.1
In both cases I no longer get SKB errors, but instead (?) I get firmware 
errors:
iwlagn 0000:10:00.0: Microcode SW error detected.  Restarting 0x2000000.
So on the wireless side it does look as if there is more than one change 
involved. Remember that with .30 I don't get any errors, only relatively 
mild latencies and skips in the music.
> I really do think that v2.6.30-rc6-1032-g7ba10a8 could play a role here.
> That's a fix for v2.6.30-rc1-1131-gaa837e1. So that bug was introduced
> _before_ the merge 82d0481 and may thus well explain both the latencies
> I saw _and_ why that merge tested without problems. And that would also
> go a long way to explain my test results.
> So I'm going to retest 82d0481 with 7ba10a8 cherry-picked on top.
                         ^^^^^^^-- should be 45ea4ea
I've tried this but still don't get any SKB errors, so that bug does not 
seem to make a difference.
> > > BISECTION of akpm (mm) MERGE
> > > ----------------------------
> > While I didn't spot anything too out of the ordinary here, they did
> > occur shortly after a number of other page allocator related patches.
> > One small thing I noticed there is that kswapd is getting woken up
> > less now than it did previously. Generally, I wouldn't have expected
> > it to make a difference but it's possible that kswapd is not being
> > woken up to reclaim at a higher order than it was previously. I have a
> > patch for this below. It'd be nice if you could apply it and see do
> > fewer allocation failures occur on current mainline.
>
> I'll give that patch a try and report back.
With your patch on .32-rc4 I still get the SKB errors, so it does not seem 
to help. The only change there may have been is that the desktop was 
frozen longer than without the patch, but that is an impression, not a 
hard fact.
Although identifying the problem on the wireless side is important, I still 
feel that tracing the mm change should have priority as it influences much 
more than just iwlagn, as the other reports prove.
> > After that, could you edit drivers/net/wireless/iwlwifi/iwl-rx.c and
> > make the GFP_ATOMIC in iwl_rx_replenish_now() GFP_ATOMIC|__GFP_NOWARN
> > and see do any of the "serious" allocation failure messages appear.
For the above reason I've not yet tried this. It seems to me that this 
change will not really solve anything, but just suppress errors.
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-13 20:38               ` Frans Pop
@ 2009-10-14 10:30                 ` Mel Gorman
  2009-10-14 13:10                   ` Frans Pop
  2009-10-14 16:28                   ` reinette chatre
  0 siblings, 2 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-14 10:30 UTC (permalink / raw)
  To: Frans Pop
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Tue, Oct 13, 2009 at 10:38:37PM +0200, Frans Pop wrote:
> On Monday 12 October 2009, Frans Pop wrote:
> > On Monday 12 October 2009, Mel Gorman wrote:
> > > but after some digging around in this general area, I saw this patch
> > >
> > > 4752c93c30 iwlcore: Allow skb allocation from tasklet
> >
> > That is v2.6.30-rc6-773-g4752c93, which is part of the first wireless
> > merge I tested and where I saw no issues. But see below.
> >
> > > This patch increases the number of GFP_ATOMIC allocations that can
> > > occur by allocating GFP_ATOMIC in some cases and GFP_KERNEL in others.
> > > Previously, only GFP_KERNEL was used and I didn't realise this
> > > allocation method was so recent. Problems of this sort have cropped up
> > > before and while there are later changes that suppress some of these
> > > warnings, I believe this is a strong candidate for where the
> > > allocation failures started appearing.
> 
> I have tried reverting this patch and that does make a significant 
> difference, but the results are still not really conclusive.
> I tested the revert on top of:
> - the first net-next-2.6 merge (2ed0e21), i.e. before the mm merge
> - 2.6.31.1
> 
I think this is very significant. Either that change needs to be backed
out or more likely, __GFP_NOWARN needs to be specified and warnings
*only* printed when the RX buffers are really low. My expectation would
be that some GFP_ATOMIC allocations fail during refill but the fact they
fail wakes kswapd to reclaim order-2 pages while the RX buffers in the
pool are consumed.
> In both cases I no longer get SKB errors, but instead (?) I get firmware 
> errors:
> iwlagn 0000:10:00.0: Microcode SW error detected.  Restarting 0x2000000.
> 
I am no wireless expert, but that looks like an separate problem to me.
I don't see how an allocation failure could trigger errors in the
microcode.
I really really hate to say it, but this might need a separate bisection
with 4752c93c30 either reverted or patched as I do below.
> So on the wireless side it does look as if there is more than one change 
> involved. Remember that with .30 I don't get any errors, only relatively 
> mild latencies and skips in the music.
> 
2.6.31 does not appear to have done wireless any favours.
> > I really do think that v2.6.30-rc6-1032-g7ba10a8 could play a role here.
> > That's a fix for v2.6.30-rc1-1131-gaa837e1. So that bug was introduced
> > _before_ the merge 82d0481 and may thus well explain both the latencies
> > I saw _and_ why that merge tested without problems. And that would also
> > go a long way to explain my test results.
> > So I'm going to retest 82d0481 with 7ba10a8 cherry-picked on top.
>                          ^^^^^^^-- should be 45ea4ea
> 
> I've tried this but still don't get any SKB errors, so that bug does not 
> seem to make a difference.
> 
> > > > BISECTION of akpm (mm) MERGE
> > > > ----------------------------
> > > While I didn't spot anything too out of the ordinary here, they did
> > > occur shortly after a number of other page allocator related patches.
> > > One small thing I noticed there is that kswapd is getting woken up
> > > less now than it did previously. Generally, I wouldn't have expected
> > > it to make a difference but it's possible that kswapd is not being
> > > woken up to reclaim at a higher order than it was previously. I have a
> > > patch for this below. It'd be nice if you could apply it and see do
> > > fewer allocation failures occur on current mainline.
> >
> > I'll give that patch a try and report back.
> 
> With your patch on .32-rc4 I still get the SKB errors, so it does not seem 
> to help. The only change there may have been is that the desktop was 
> frozen longer than without the patch, but that is an impression, not a 
> hard fact.
> 
Actually, that's fairly interesting and I think justifies pushing the
patch. Direct reclaim can stall processes in a user-visible manner which kswapd
is meant to avoid in the majority of cases but is tricky to quantify without
instrumenting the kernel to measure direct reclaim frequency and latency (I
have WIP tracepoints for this but it's still a WIP). If you notice shorter
stalls with the patch applied, it means that kswapd really did need to be
informed of the problems.
> Although identifying the problem on the wireless side is important, I still 
> feel that tracing the mm change should have priority as it influences much 
> more than just iwlagn, as the other reports prove.
> 
There still has not been a mm-change identified that makes fragmentation
significantly worse. The majority of the wireless reports have been in this
driver and I think we have the problem commit there. The only other is a
firmware loading problem in e100 after resume that fails to make an atomic
order-5 fail. It's possible that something has changed in resume in the
2.6.31 window there - maybe something like drivers now reload during resume
where they didn't previously or less memory being pushed to swap during resume.
> > > After that, could you edit drivers/net/wireless/iwlwifi/iwl-rx.c and
> > > make the GFP_ATOMIC in iwl_rx_replenish_now() GFP_ATOMIC|__GFP_NOWARN
> > > and see do any of the "serious" allocation failure messages appear.
> 
> For the above reason I've not yet tried this. It seems to me that this 
> change will not really solve anything, but just suppress errors.
> 
I disagree. Harmless allocation errors get suppressed but it still warns when
things get really bad. See the following patch that suppresses the
warnings from GFP_ATOMIC but warns for GFP_KERNEL failures and dumps a
stack on serious allocation failure.
We either need a patch like this or the
GFP_ATOMIC-direct-with-refills-from-tasklet patch needs to be reverted.
=== CUT HERE ===
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 10:30                 ` Mel Gorman
@ 2009-10-14 13:10                   ` Frans Pop
  2009-10-14 15:40                     ` Mel Gorman
                                       ` (2 more replies)
  2009-10-14 16:28                   ` reinette chatre
  1 sibling, 3 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-14 13:10 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
[-- Attachment #1: Type: text/plain, Size: 4524 bytes --]
On Wednesday 14 October 2009, Mel Gorman wrote:
> I think this is very significant. Either that change needs to be backed
> out or more likely, __GFP_NOWARN needs to be specified and warnings
> *only* printed when the RX buffers are really low. My expectation would
> be that some GFP_ATOMIC allocations fail during refill but the fact they
> fail wakes kswapd to reclaim order-2 pages while the RX buffers in the
> pool are consumed.
Sorry I did not actually mention this, but the SKB failures I get with .32 
have loads of the "Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 
free buffers remaining." errors. That's why I don't think your patch will 
help anything.
zgrep "Only 0 free buffers remaining" /var/log/kern.log* | wc -l
84
OK, they are all GPF_ATOMIC and not GPF_KERNEL, but they also almost all 
have "0 free buffers"! Next to the 84 warnings for 0 remaining I only have 
one with "3 free buffers" and one with "1 free buffers".
And that does not even count the rate limitting:
Oct 12 20:15:07 aragorn kernel: __ratelimit: 45 callbacks suppressed
Oct 12 20:25:19 aragorn kernel: __ratelimit: 27 callbacks suppressed
Oct 12 20:25:20 aragorn kernel: __ratelimit: 2 callbacks suppressed
Attached the kernel log for one test I did with .32.
> > In both cases I no longer get SKB errors, but instead (?) I get
> > firmware errors:
> > iwlagn 0000:10:00.0: Microcode SW error detected.  Restarting
> > 0x2000000.
>
> I am no wireless expert, but that looks like an separate problem to me.
> I don't see how an allocation failure could trigger errors in the
> microcode.
Yes, it is a separate problem, but it is still significant that reverting 
that patch triggers them in the extreme swap situation.
> > With your patch on .32-rc4 I still get the SKB errors, so it does not
> > seem to help. The only change there may have been is that the desktop
> > was frozen longer than without the patch, but that is an impression,
> > not a hard fact.
>
> Actually, that's fairly interesting and I think justifies pushing the
> patch. Direct reclaim can stall processes in a user-visible manner which
> kswapd is meant to avoid in the majority of cases but is tricky to
> quantify without instrumenting the kernel to measure direct reclaim
> frequency and latency (I have WIP tracepoints for this but it's still a
> WIP). If you notice shorter stalls with the patch applied, it means that
> kswapd really did need to be informed of the problems.
No, I thought I saw _longer_ stalls with your patch applied...
> There still has not been a mm-change identified that makes fragmentation
> significantly worse.
My bisection shows a very clear point, even if not an individual commit, in 
the 'akpm' merge where SKB errors suddenly become *much* more frequent and 
easy to trigger.
I'm sorry to say this, but the fact that nothing has been identified yet is 
IMO the result of a lack of effort, not because there is no such change.
> The majority of the wireless reports have been in 
> this driver and I think we have the problem commit there. The only other
> is a firmware loading problem in e100 after resume that fails to make an
> atomic order-5 fail.
Not exactly true. Bartlomiej's report was about ipw2200, so there are at 
least 3 different drivers involved, two wireless and one wired. Besides 
that one report is related to heavy swap, one to resume and one to driver 
reload.
So it's much more likely that there is some common regression (in mm) that 
affected all three than that there are three unrelated regressions.
And although both of the others did extremely high allocations, they both 
started appearing in the same timeframe. And Bart's very first report 
linked it to mm changes.
> It's possible that something has changed in resume 
> in the 2.6.31 window there - maybe something like drivers now reload
> during resume where they didn't previously or less memory being pushed
> to swap during resume.
IMO you're sticking your head in the sand here. 
I'm not saying that mm is the only issue here, but I'm convinced that there 
_is_ an mm change that has contributed in a major way to these issues, 
even if we've not yet been able to identify it.
> -			    net_ratelimit())
> +			    net_ratelimit()) {
>  				IWL_CRIT(priv, "Failed to allocate SKB buffer with %s. Only %u free
> buffers remaining.\n", priority == GFP_ATOMIC ?  "GFP_ATOMIC" :
> "GFP_KERNEL",
Haven't you broken the test 'priority == GFP_ATOMIC' here by setting 
priority to GFP_ATOMIC|__GFP_NOWARN?
Cheers,
FJP
[-- Attachment #2: kern.log --]
[-- Type: text/x-log, Size: 114900 bytes --]
Oct 12 17:13:07 aragorn kernel: Linux version 2.6.32-rc4 (root@aragorn) (gcc version 4.3.2 (Debian 4.3.2-1.1) ) #29 SMP Mon Oct 12 13:12:50 CEST 2009
Oct 12 17:13:07 aragorn kernel: Command line: root=/dev/mapper/main-root ro vga=791 quiet
Oct 12 17:13:07 aragorn kernel: KERNEL supported cpus:
Oct 12 17:13:07 aragorn kernel:   Intel GenuineIntel
Oct 12 17:13:07 aragorn kernel:   AMD AuthenticAMD
Oct 12 17:13:07 aragorn kernel:   Centaur CentaurHauls
Oct 12 17:13:07 aragorn kernel: BIOS-provided physical RAM map:
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 0000000000100000 - 000000007e7b0000 (usable)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 000000007e7b0000 - 000000007e7c5400 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 000000007e7c5400 - 000000007e7e7fb8 (ACPI NVS)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 000000007e7e7fb8 - 000000007f000000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 00000000fed20000 - 00000000fed9a000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 00000000feda0000 - 00000000fedc0000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 00000000ffb00000 - 00000000ffc00000 (reserved)
Oct 12 17:13:07 aragorn kernel:  BIOS-e820: 00000000fff00000 - 0000000100000000 (reserved)
Oct 12 17:13:07 aragorn kernel: DMI 2.4 present.
Oct 12 17:13:07 aragorn kernel: last_pfn = 0x7e7b0 max_arch_pfn = 0x400000000
Oct 12 17:13:07 aragorn kernel: MTRR default type: uncachable
Oct 12 17:13:07 aragorn kernel: MTRR fixed ranges enabled:
Oct 12 17:13:07 aragorn kernel:   00000-9FFFF write-back
Oct 12 17:13:07 aragorn kernel:   A0000-BFFFF uncachable
Oct 12 17:13:07 aragorn kernel:   C0000-D3FFF write-protect
Oct 12 17:13:07 aragorn kernel:   D4000-EFFFF uncachable
Oct 12 17:13:07 aragorn kernel:   F0000-FFFFF write-protect
Oct 12 17:13:07 aragorn kernel: MTRR variable ranges enabled:
Oct 12 17:13:07 aragorn kernel:   0 base 000000000 mask F80000000 write-back
Oct 12 17:13:07 aragorn kernel:   1 base 07F000000 mask FFF000000 uncachable
Oct 12 17:13:07 aragorn kernel:   2 base 07E800000 mask FFF800000 uncachable
Oct 12 17:13:07 aragorn kernel:   3 base 0FEDA0000 mask FFFFE0000 uncachable
Oct 12 17:13:07 aragorn kernel:   4 disabled
Oct 12 17:13:07 aragorn kernel:   5 disabled
Oct 12 17:13:07 aragorn kernel:   6 disabled
Oct 12 17:13:07 aragorn kernel:   7 disabled
Oct 12 17:13:07 aragorn kernel: x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
Oct 12 17:13:07 aragorn kernel: initial memory mapped : 0 - 20000000
Oct 12 17:13:07 aragorn kernel: init_memory_mapping: 0000000000000000-000000007e7b0000
Oct 12 17:13:07 aragorn kernel:  0000000000 - 007e600000 page 2M
Oct 12 17:13:07 aragorn kernel:  007e600000 - 007e7b0000 page 4k
Oct 12 17:13:07 aragorn kernel: kernel direct mapping tables up to 7e7b0000 @ 8000-c000
Oct 12 17:13:07 aragorn kernel: RAMDISK: 37a79000 - 37fef2f8
Oct 12 17:13:07 aragorn kernel: ACPI: RSDP 00000000000f7960 00024 (v02 HP    )
Oct 12 17:13:07 aragorn kernel: ACPI: XSDT 000000007e7c81c8 0007C (v01 HPQOEM SLIC-MPC 00000001 HP   00000001)
Oct 12 17:13:07 aragorn kernel: ACPI: FACP 000000007e7c8084 000F4 (v04 HP     30C9     00000003 HP   00000001)
Oct 12 17:13:07 aragorn kernel: ACPI: DSDT 000000007e7c8538 13484 (v01 HP       nc2500 00010000 MSFT 03000001)
Oct 12 17:13:07 aragorn kernel: ACPI: FACS 000000007e7e7d80 00040
Oct 12 17:13:07 aragorn kernel: ACPI: SLIC 000000007e7c8244 00176 (v01 HPQOEM SLIC-MPC 00000001 HP   00000001)
Oct 12 17:13:07 aragorn kernel: ACPI: HPET 000000007e7c83bc 00038 (v01 HP     30C9     00000001 HP   00000001)
Oct 12 17:13:07 aragorn kernel: ACPI: APIC 000000007e7c83f4 00068 (v01 HP     30C9     00000001 HP   00000001)
Oct 12 17:13:07 aragorn kernel: ACPI: MCFG 000000007e7c845c 0003C (v01 HP     30C9     00000001 HP   00000001)
Oct 12 17:13:07 aragorn kernel: ACPI: TCPA 000000007e7c8498 00032 (v02 HP     30C9     00000001 HP   00000001)
Oct 12 17:13:07 aragorn kernel: ACPI: ASF! 000000007e7c84cc 00069 (v16 HP     CHIMAYU  00000001 HP   00000000)
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7db9bc 002BE (v01 HP       HPQPAT 00000001 MSFT 03000001)
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7dc640 0025F (v01 HP      Cpu0Tst 00003000 INTL 20060317)
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7dc89f 000A6 (v01 HP      Cpu1Tst 00003000 INTL 20060317)
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7dc945 004D7 (v01 HP        CpuPm 00003000 INTL 20060317)
Oct 12 17:13:07 aragorn kernel: ACPI: Local APIC address 0xfee00000
Oct 12 17:13:07 aragorn kernel: (7 early reservations) ==> bootmem [0000000000 - 007e7b0000]
Oct 12 17:13:07 aragorn kernel:   #0 [0000000000 - 0000001000]   BIOS data page ==> [0000000000 - 0000001000]
Oct 12 17:13:07 aragorn kernel:   #1 [0000006000 - 0000008000]       TRAMPOLINE ==> [0000006000 - 0000008000]
Oct 12 17:13:07 aragorn kernel:   #2 [0001000000 - 00015bdb30]    TEXT DATA BSS ==> [0001000000 - 00015bdb30]
Oct 12 17:13:07 aragorn kernel:   #3 [0037a79000 - 0037fef2f8]          RAMDISK ==> [0037a79000 - 0037fef2f8]
Oct 12 17:13:07 aragorn kernel:   #4 [000009fc00 - 0000100000]    BIOS reserved ==> [000009fc00 - 0000100000]
Oct 12 17:13:07 aragorn kernel:   #5 [00015be000 - 00015be18c]              BRK ==> [00015be000 - 00015be18c]
Oct 12 17:13:07 aragorn kernel:   #6 [0000008000 - 000000a000]          PGTABLE ==> [0000008000 - 000000a000]
Oct 12 17:13:07 aragorn kernel:  [ffffea0000000000-ffffea0001ffffff] PMD -> [ffff880001a00000-ffff8800039fffff] on node 0
Oct 12 17:13:07 aragorn kernel: Zone PFN ranges:
Oct 12 17:13:07 aragorn kernel:   DMA      0x00000000 -> 0x00001000
Oct 12 17:13:07 aragorn kernel:   DMA32    0x00001000 -> 0x00100000
Oct 12 17:13:07 aragorn kernel:   Normal   0x00100000 -> 0x00100000
Oct 12 17:13:07 aragorn kernel: Movable zone start PFN for each node
Oct 12 17:13:07 aragorn kernel: early_node_map[2] active PFN ranges
Oct 12 17:13:07 aragorn kernel:     0: 0x00000000 -> 0x0000009f
Oct 12 17:13:07 aragorn kernel:     0: 0x00000100 -> 0x0007e7b0
Oct 12 17:13:07 aragorn kernel: On node 0 totalpages: 517967
Oct 12 17:13:07 aragorn kernel:   DMA zone: 64 pages used for memmap
Oct 12 17:13:07 aragorn kernel:   DMA zone: 101 pages reserved
Oct 12 17:13:07 aragorn kernel:   DMA zone: 3834 pages, LIFO batch:0
Oct 12 17:13:07 aragorn kernel:   DMA32 zone: 8031 pages used for memmap
Oct 12 17:13:07 aragorn kernel:   DMA32 zone: 505937 pages, LIFO batch:31
Oct 12 17:13:07 aragorn kernel: ACPI: PM-Timer IO Port: 0x1008
Oct 12 17:13:07 aragorn kernel: ACPI: Local APIC address 0xfee00000
Oct 12 17:13:07 aragorn kernel: ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
Oct 12 17:13:07 aragorn kernel: ACPI: LAPIC (acpi_id[0x02] lapic_id[0x01] enabled)
Oct 12 17:13:07 aragorn kernel: ACPI: LAPIC_NMI (acpi_id[0x01] high edge lint[0x1])
Oct 12 17:13:07 aragorn kernel: ACPI: LAPIC_NMI (acpi_id[0x02] high edge lint[0x1])
Oct 12 17:13:07 aragorn kernel: ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0])
Oct 12 17:13:07 aragorn kernel: IOAPIC[0]: apic_id 1, version 32, address 0xfec00000, GSI 0-23
Oct 12 17:13:07 aragorn kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
Oct 12 17:13:07 aragorn kernel: ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
Oct 12 17:13:07 aragorn kernel: ACPI: IRQ0 used by override.
Oct 12 17:13:07 aragorn kernel: ACPI: IRQ2 used by override.
Oct 12 17:13:07 aragorn kernel: ACPI: IRQ9 used by override.
Oct 12 17:13:07 aragorn kernel: Using ACPI (MADT) for SMP configuration information
Oct 12 17:13:07 aragorn kernel: ACPI: HPET id: 0x8086a201 base: 0xfed00000
Oct 12 17:13:07 aragorn kernel: SMP: Allowing 2 CPUs, 0 hotplug CPUs
Oct 12 17:13:07 aragorn kernel: nr_irqs_gsi: 24
Oct 12 17:13:07 aragorn kernel: PM: Registered nosave memory: 000000000009f000 - 00000000000a0000
Oct 12 17:13:07 aragorn kernel: PM: Registered nosave memory: 00000000000a0000 - 00000000000e0000
Oct 12 17:13:07 aragorn kernel: PM: Registered nosave memory: 00000000000e0000 - 0000000000100000
Oct 12 17:13:07 aragorn kernel: Allocating PCI resources starting at 7f000000 (gap: 7f000000:7fc00000)
Oct 12 17:13:07 aragorn kernel: NR_CPUS:8 nr_cpumask_bits:8 nr_cpu_ids:2 nr_node_ids:1
Oct 12 17:13:07 aragorn kernel: PERCPU: Embedded 27 pages/cpu @ffff880001600000 s81752 r8192 d20648 u1048576
Oct 12 17:13:07 aragorn kernel: pcpu-alloc: s81752 r8192 d20648 u1048576 alloc=1*2097152
Oct 12 17:13:07 aragorn kernel: pcpu-alloc: [0] 0 1 
Oct 12 17:13:07 aragorn kernel: Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 509771
Oct 12 17:13:07 aragorn kernel: Kernel command line: root=/dev/mapper/main-root ro vga=791 quiet
Oct 12 17:13:07 aragorn kernel: PID hash table entries: 4096 (order: 3, 32768 bytes)
Oct 12 17:13:07 aragorn kernel: Dentry cache hash table entries: 262144 (order: 9, 2097152 bytes)
Oct 12 17:13:07 aragorn kernel: Inode-cache hash table entries: 131072 (order: 8, 1048576 bytes)
Oct 12 17:13:07 aragorn kernel: Initializing CPU#0
Oct 12 17:13:07 aragorn kernel: Checking aperture...
Oct 12 17:13:07 aragorn kernel: No AGP bridge found
Oct 12 17:13:07 aragorn kernel: Memory: 2023496k/2072256k available (2758k kernel code, 388k absent, 47724k reserved, 1927k data, 508k init)
Oct 12 17:13:07 aragorn kernel: SLUB: Genslabs=13, HWalign=64, Order=0-3, MinObjects=0, CPUs=2, Nodes=1
Oct 12 17:13:07 aragorn kernel: Hierarchical RCU implementation.
Oct 12 17:13:07 aragorn kernel: NR_IRQS:512
Oct 12 17:13:07 aragorn kernel: Extended CMOS year: 2000
Oct 12 17:13:07 aragorn kernel: Console: colour dummy device 80x25
Oct 12 17:13:07 aragorn kernel: console [tty0] enabled
Oct 12 17:13:07 aragorn kernel: hpet clockevent registered
Oct 12 17:13:07 aragorn kernel: HPET: 3 timers in total, 0 timers will be used for per-cpu timer
Oct 12 17:13:07 aragorn kernel: Fast TSC calibration using PIT
Oct 12 17:13:07 aragorn kernel: Detected 1330.067 MHz processor.
Oct 12 17:13:07 aragorn kernel: Calibrating delay loop (skipped), value calculated using timer frequency.. 2660.13 BogoMIPS (lpj=5320268)
Oct 12 17:13:07 aragorn kernel: Security Framework initialized
Oct 12 17:13:07 aragorn kernel: SELinux:  Disabled at boot.
Oct 12 17:13:07 aragorn kernel: Mount-cache hash table entries: 256
Oct 12 17:13:07 aragorn kernel: CPU: L1 I cache: 32K, L1 D cache: 32K
Oct 12 17:13:07 aragorn kernel: CPU: L2 cache: 2048K
Oct 12 17:13:07 aragorn kernel: CPU: Physical Processor ID: 0
Oct 12 17:13:07 aragorn kernel: CPU: Processor Core ID: 0
Oct 12 17:13:07 aragorn kernel: mce: CPU supports 6 MCE banks
Oct 12 17:13:07 aragorn kernel: CPU0: Thermal monitoring handled by SMI
Oct 12 17:13:07 aragorn kernel: using mwait in idle threads.
Oct 12 17:13:07 aragorn kernel: Performance Events: Core2 events, Intel PMU driver.
Oct 12 17:13:07 aragorn kernel: ... version:                2
Oct 12 17:13:07 aragorn kernel: ... bit width:              40
Oct 12 17:13:07 aragorn kernel: ... generic registers:      2
Oct 12 17:13:07 aragorn kernel: ... value mask:             000000ffffffffff
Oct 12 17:13:07 aragorn kernel: ... max period:             000000007fffffff
Oct 12 17:13:07 aragorn kernel: ... fixed-purpose events:   3
Oct 12 17:13:07 aragorn kernel: ... event mask:             0000000700000003
Oct 12 17:13:07 aragorn kernel: ACPI: Core revision 20090903
Oct 12 17:13:07 aragorn kernel: ftrace: converting mcount calls to 0f 1f 44 00 00
Oct 12 17:13:07 aragorn kernel: ftrace: allocating 13965 entries in 55 pages
Oct 12 17:13:07 aragorn kernel: Setting APIC routing to flat
Oct 12 17:13:07 aragorn kernel: ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1
Oct 12 17:13:07 aragorn kernel: CPU0: Intel(R) Core(TM)2 Duo CPU     U7700  @ 1.33GHz stepping 0d
Oct 12 17:13:07 aragorn kernel: Booting processor 1 APIC 0x1 ip 0x6000
Oct 12 17:13:07 aragorn kernel: Initializing CPU#1
Oct 12 17:13:07 aragorn kernel: Calibrating delay using timer specific routine.. 2659.98 BogoMIPS (lpj=5319962)
Oct 12 17:13:07 aragorn kernel: CPU: L1 I cache: 32K, L1 D cache: 32K
Oct 12 17:13:07 aragorn kernel: CPU: L2 cache: 2048K
Oct 12 17:13:07 aragorn kernel: CPU: Physical Processor ID: 0
Oct 12 17:13:07 aragorn kernel: CPU: Processor Core ID: 1
Oct 12 17:13:07 aragorn kernel: mce: CPU supports 6 MCE banks
Oct 12 17:13:07 aragorn kernel: CPU1: Thermal monitoring enabled (TM2)
Oct 12 17:13:07 aragorn kernel: CPU1: Intel(R) Core(TM)2 Duo CPU     U7700  @ 1.33GHz stepping 0d
Oct 12 17:13:07 aragorn kernel: checking TSC synchronization [CPU#0 -> CPU#1]: passed.
Oct 12 17:13:07 aragorn kernel: Brought up 2 CPUs
Oct 12 17:13:07 aragorn kernel: Total of 2 processors activated (5320.11 BogoMIPS).
Oct 12 17:13:07 aragorn kernel: CPU0 attaching sched-domain:
Oct 12 17:13:07 aragorn kernel:  domain 0: span 0-1 level MC
Oct 12 17:13:07 aragorn kernel:   groups: 0 1
Oct 12 17:13:07 aragorn kernel: CPU1 attaching sched-domain:
Oct 12 17:13:07 aragorn kernel:  domain 0: span 0-1 level MC
Oct 12 17:13:07 aragorn kernel:   groups: 1 0
Oct 12 17:13:07 aragorn kernel: NET: Registered protocol family 16
Oct 12 17:13:07 aragorn kernel: ACPI FADT declares the system doesn't support PCIe ASPM, so disable it
Oct 12 17:13:07 aragorn kernel: ACPI: bus type pci registered
Oct 12 17:13:07 aragorn kernel: PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
Oct 12 17:13:07 aragorn kernel: PCI: Not using MMCONFIG.
Oct 12 17:13:07 aragorn kernel: PCI: Using configuration type 1 for base access
Oct 12 17:13:07 aragorn kernel: bio: create slab <bio-0> at 0
Oct 12 17:13:07 aragorn kernel: ACPI: EC: Look up EC in DSDT
Oct 12 17:13:07 aragorn kernel: ACPI: Interpreter enabled
Oct 12 17:13:07 aragorn kernel: ACPI: (supports S0 S3 S4 S5)
Oct 12 17:13:07 aragorn kernel: ACPI: Using IOAPIC for interrupt routing
Oct 12 17:13:07 aragorn kernel: PCI: MCFG configuration 0: base f8000000 segment 0 buses 0 - 63
Oct 12 17:13:07 aragorn kernel: PCI: MCFG area at f8000000 reserved in ACPI motherboard resources
Oct 12 17:13:07 aragorn kernel: PCI: Using MMCONFIG at f8000000 - fbffffff
Oct 12 17:13:07 aragorn kernel: ACPI: EC: GPE = 0x16, I/O: command/status = 0x66, data = 0x62
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C29F] (on)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C1C7] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C3AD] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C3B0] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C3C3] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C3C4] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C3C5] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C3C6] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: Power Resource [C3C7] (off)
Oct 12 17:13:07 aragorn kernel: ACPI: No dock devices found.
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Root Bridge [C003] (0000:00)
Oct 12 17:13:07 aragorn kernel: pci 0000:00:02.0: reg 10 64bit mmio: [0xe0400000-0xe04fffff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:02.0: reg 18 64bit mmio pref: [0xd0000000-0xdfffffff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:02.0: reg 20 io port: [0x2000-0x2007]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:02.1: reg 10 64bit mmio: [0xe0500000-0xe05fffff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.0: reg 10 64bit mmio: [0xe0600000-0xe060000f]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.0: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.0: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.2: reg 10 io port: [0x2008-0x200f]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.2: reg 14 io port: [0x2010-0x2013]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.2: reg 18 io port: [0x2018-0x201f]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.2: reg 1c io port: [0x2020-0x2023]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.2: reg 20 io port: [0x2030-0x203f]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.3: reg 10 io port: [0x2040-0x2047]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:03.3: reg 14 32bit mmio: [0xe0601000-0xe0601fff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:19.0: reg 10 32bit mmio: [0xe0620000-0xe063ffff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:19.0: reg 14 32bit mmio: [0xe0640000-0xe0640fff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:19.0: reg 18 io port: [0x2060-0x207f]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:19.0: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:00:19.0: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1a.0: reg 20 io port: [0x2080-0x209f]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1a.1: reg 20 io port: [0x20a0-0x20bf]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1a.7: reg 10 32bit mmio: [0xe0641000-0xe06413ff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1a.7: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1a.7: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1b.0: reg 10 64bit mmio: [0xe0644000-0xe0647fff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1b.0: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1b.0: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1d.0: reg 20 io port: [0x20c0-0x20df]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1d.1: reg 20 io port: [0x20e0-0x20ff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1d.2: reg 20 io port: [0x2100-0x211f]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1d.7: reg 10 32bit mmio: [0xe0648000-0xe06483ff]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1d.7: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1d.7: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.0: quirk: region 1000-107f claimed by ICH6 ACPI/GPIO/TCO
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.0: quirk: region 1100-113f claimed by ICH6 GPIO
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.0: ICH7 LPC Generic IO decode 1 PIO at 0500 (mask 007f)
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.0: ICH7 LPC Generic IO decode 4 PIO at 02e8 (mask 0007)
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.1: reg 10 io port: [0x00-0x07]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.1: reg 14 io port: [0x00-0x03]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.1: reg 18 io port: [0x00-0x07]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.1: reg 1c io port: [0x00-0x03]
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1f.1: reg 20 io port: [0x2120-0x212f]
Oct 12 17:13:07 aragorn kernel: pci 0000:10:00.0: reg 10 64bit mmio: [0xe0000000-0xe0001fff]
Oct 12 17:13:07 aragorn kernel: pci 0000:10:00.0: PME# supported from D0 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:10:00.0: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1: bridge 32bit mmio: [0xe0000000-0xe00fffff]
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0: reg 10 32bit mmio: [0xe0100000-0xe0100fff]
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0: supports D1 D2
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0: PME# supported from D0 D1 D2 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.1: reg 10 32bit mmio: [0xe0101000-0xe01017ff]
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.1: supports D1 D2
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.1: PME# supported from D0 D1 D2 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.1: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.2: reg 10 32bit mmio: [0xe0102000-0xe01020ff]
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.2: supports D1 D2
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.2: PME# supported from D0 D1 D2 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.2: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.3: reg 10 32bit mmio: [0xe0103000-0xe01030ff]
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.3: supports D1 D2
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.3: PME# supported from D0 D1 D2 D3hot D3cold
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.3: PME# disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1e.0: transparent bridge
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1e.0: bridge 32bit mmio: [0xe0100000-0xe03fffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:00: on NUMA node 0
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Routing Table [\_SB_.C003._PRT]
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Routing Table [\_SB_.C003.C0B2._PRT]
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Routing Table [\_SB_.C003.C11F._PRT]
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Routing Table [\_SB_.C003.C133._PRT]
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Link [C12F] (IRQs *10 11)
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Link [C130] (IRQs *10 11)
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Link [C131] (IRQs 10 *11)
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Link [C132] (IRQs 10 11) *5
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Link [C142] (IRQs *10 11)
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Link [C143] (IRQs 10 11) *0, disabled.
Oct 12 17:13:07 aragorn kernel: ACPI: PCI Interrupt Link [C144] (IRQs 10 *11)
Oct 12 17:13:07 aragorn kernel: ACPI Exception: AE_NOT_FOUND, Evaluating _PRS (20090903/pci_link-184)
Oct 12 17:13:07 aragorn kernel: vgaarb: device added: PCI:0000:00:02.0,decodes=io+mem,owns=io+mem,locks=none
Oct 12 17:13:07 aragorn kernel: vgaarb: loaded
Oct 12 17:13:07 aragorn kernel: usbcore: registered new interface driver usbfs
Oct 12 17:13:07 aragorn kernel: usbcore: registered new interface driver hub
Oct 12 17:13:07 aragorn kernel: usbcore: registered new device driver usb
Oct 12 17:13:07 aragorn kernel: PCI: Using ACPI for IRQ routing
Oct 12 17:13:07 aragorn kernel: hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0
Oct 12 17:13:07 aragorn kernel: hpet0: 3 comparators, 64-bit 14.318180 MHz counter
Oct 12 17:13:07 aragorn kernel: Switching to clocksource tsc
Oct 12 17:13:07 aragorn kernel: pnp: PnP ACPI init
Oct 12 17:13:07 aragorn kernel: ACPI: bus type pnp registered
Oct 12 17:13:07 aragorn kernel: pnp: PnP ACPI: found 14 devices
Oct 12 17:13:07 aragorn kernel: ACPI: ACPI bus type pnp unregistered
Oct 12 17:13:07 aragorn kernel: system 00:00: iomem range 0x0-0x9ffff could not be reserved
Oct 12 17:13:07 aragorn kernel: (The fact that a range could not be reserved is generally harmless.)
Oct 12 17:13:07 aragorn kernel: system 00:00: iomem range 0xe0000-0xfffff could not be reserved
Oct 12 17:13:07 aragorn kernel: system 00:00: iomem range 0x100000-0x7e7fffff could not be reserved
Oct 12 17:13:07 aragorn kernel: system 00:0a: ioport range 0x500-0x55f has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0a: ioport range 0x800-0x80f has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0a: iomem range 0xffb00000-0xffbfffff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0a: iomem range 0xfff00000-0xffffffff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: ioport range 0x4d0-0x4d1 has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: ioport range 0x2f8-0x2ff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: ioport range 0x3f8-0x3ff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: ioport range 0x1000-0x107f has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: ioport range 0x1100-0x113f has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: ioport range 0x1200-0x121f has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: iomem range 0xf8000000-0xfbffffff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: iomem range 0xfec00000-0xfec000ff could not be reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: iomem range 0xfed20000-0xfed3ffff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: iomem range 0xfed45000-0xfed8ffff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0c: iomem range 0xfed90000-0xfed99fff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0d: iomem range 0xcee00-0xcffff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0d: iomem range 0xd2000-0xd3fff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0d: iomem range 0xfeda0000-0xfedbffff has been reserved
Oct 12 17:13:07 aragorn kernel: system 00:0d: iomem range 0xfee00000-0xfee00fff has been reserved
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0: PCI bridge, secondary bus 0000:08
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0:   IO window: disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0:   MEM window: disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0:   PREFETCH window: disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1: PCI bridge, secondary bus 0000:10
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1:   IO window: disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1:   MEM window: 0xe0000000-0xe00fffff
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1:   PREFETCH window: disabled
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0: CardBus bridge, secondary bus 0000:03
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0:   IO window: 0x003000-0x0030ff
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0:   IO window: 0x003400-0x0034ff
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0:   PREFETCH window: 0x80000000-0x83ffffff
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0:   MEM window: 0x84000000-0x87ffffff
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1e.0: PCI bridge, secondary bus 0000:02
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1e.0:   IO window: 0x3000-0x3fff
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1e.0:   MEM window: 0xe0100000-0xe03fffff
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1e.0:   PREFETCH window: 0x80000000-0x83ffffff
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1: PCI INT B -> GSI 17 (level, low) -> IRQ 17
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1c.1: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: pci 0000:00:1e.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: pci 0000:02:06.0: PCI INT A -> GSI 18 (level, low) -> IRQ 18
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:00: resource 0 io:  [0x00-0xffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:00: resource 1 mem: [0x000000-0xffffffffffffffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:10: resource 1 mem: [0xe0000000-0xe00fffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:02: resource 0 io:  [0x3000-0x3fff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:02: resource 1 mem: [0xe0100000-0xe03fffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:02: resource 2 pref mem [0x80000000-0x83ffffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:02: resource 3 io:  [0x00-0xffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:02: resource 4 mem: [0x000000-0xffffffffffffffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:03: resource 0 io:  [0x3000-0x30ff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:03: resource 1 io:  [0x3400-0x34ff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:03: resource 2 pref mem [0x80000000-0x83ffffff]
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:03: resource 3 mem: [0x84000000-0x87ffffff]
Oct 12 17:13:07 aragorn kernel: NET: Registered protocol family 2
Oct 12 17:13:07 aragorn kernel: IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
Oct 12 17:13:07 aragorn kernel: TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
Oct 12 17:13:07 aragorn kernel: TCP bind hash table entries: 65536 (order: 9, 2097152 bytes)
Oct 12 17:13:07 aragorn kernel: TCP: Hash tables configured (established 262144 bind 65536)
Oct 12 17:13:07 aragorn kernel: TCP reno registered
Oct 12 17:13:07 aragorn kernel: NET: Registered protocol family 1
Oct 12 17:13:07 aragorn kernel: Trying to unpack rootfs image as initramfs...
Oct 12 17:13:07 aragorn kernel: Freeing initrd memory: 5592k freed
Oct 12 17:13:07 aragorn kernel: audit: initializing netlink socket (disabled)
Oct 12 17:13:07 aragorn kernel: type=2000 audit(1255360345.659:1): initialized
Oct 12 17:13:07 aragorn kernel: VFS: Disk quotas dquot_6.5.2
Oct 12 17:13:07 aragorn kernel: Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
Oct 12 17:13:07 aragorn kernel: msgmni has been set to 3964
Oct 12 17:13:07 aragorn kernel: alg: No test for stdrng (krng)
Oct 12 17:13:07 aragorn kernel: io scheduler noop registered
Oct 12 17:13:07 aragorn kernel: io scheduler anticipatory registered
Oct 12 17:13:07 aragorn kernel: io scheduler deadline registered
Oct 12 17:13:07 aragorn kernel: io scheduler cfq registered (default)
Oct 12 17:13:07 aragorn kernel: pci 0000:00:02.0: Boot video device
Oct 12 17:13:07 aragorn kernel: pcieport-driver 0000:00:1c.0: irq 24 for MSI/MSI-X
Oct 12 17:13:07 aragorn kernel: pcieport-driver 0000:00:1c.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: pcieport-driver 0000:00:1c.1: irq 25 for MSI/MSI-X
Oct 12 17:13:07 aragorn kernel: pcieport-driver 0000:00:1c.1: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: vesafb: framebuffer at 0xd0000000, mapped to 0xffffc90004100000, using 3072k, total 7616k
Oct 12 17:13:07 aragorn kernel: vesafb: mode is 1024x768x16, linelength=2048, pages=3
Oct 12 17:13:07 aragorn kernel: vesafb: scrolling: redraw
Oct 12 17:13:07 aragorn kernel: vesafb: Truecolor: size=0:5:6:5, shift=0:11:5:0
Oct 12 17:13:07 aragorn kernel: Console: switching to colour frame buffer device 128x48
Oct 12 17:13:07 aragorn kernel: fb0: VESA VGA frame buffer device
Oct 12 17:13:07 aragorn kernel: Linux agpgart interface v0.103
Oct 12 17:13:07 aragorn kernel: Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
Oct 12 17:13:07 aragorn kernel: serial 0000:00:03.3: PCI INT B -> GSI 17 (level, low) -> IRQ 17
Oct 12 17:13:07 aragorn kernel: 0000:00:03.3: ttyS0 at I/O 0x2040 (irq = 17) is a 16550A
Oct 12 17:13:07 aragorn kernel: brd: module loaded
Oct 12 17:13:07 aragorn kernel: PNP: PS/2 Controller [PNP0303:C29C,PNP0f13:C29D] at 0x60,0x64 irq 1,12
Oct 12 17:13:07 aragorn kernel: i8042.c: Detected active multiplexing controller, rev 1.1.
Oct 12 17:13:07 aragorn kernel: serio: i8042 KBD port at 0x60,0x64 irq 1
Oct 12 17:13:07 aragorn kernel: serio: i8042 AUX0 port at 0x60,0x64 irq 12
Oct 12 17:13:07 aragorn kernel: serio: i8042 AUX1 port at 0x60,0x64 irq 12
Oct 12 17:13:07 aragorn kernel: serio: i8042 AUX2 port at 0x60,0x64 irq 12
Oct 12 17:13:07 aragorn kernel: serio: i8042 AUX3 port at 0x60,0x64 irq 12
Oct 12 17:13:07 aragorn kernel: mice: PS/2 mouse device common for all mice
Oct 12 17:13:07 aragorn kernel: Driver 'rtc_cmos' needs updating - please use bus_type methods
Oct 12 17:13:07 aragorn kernel: rtc_cmos 00:06: RTC can wake from S4
Oct 12 17:13:07 aragorn kernel: rtc_cmos 00:06: rtc core: registered rtc_cmos as rtc0
Oct 12 17:13:07 aragorn kernel: rtc0: alarms up to one month, y3k, 114 bytes nvram, hpet irqs
Oct 12 17:13:07 aragorn kernel: cpuidle: using governor ladder
Oct 12 17:13:07 aragorn kernel: cpuidle: using governor menu
Oct 12 17:13:07 aragorn kernel: TCP cubic registered
Oct 12 17:13:07 aragorn kernel: NET: Registered protocol family 17
Oct 12 17:13:07 aragorn kernel: registered taskstats version 1
Oct 12 17:13:07 aragorn kernel: rtc_cmos 00:06: setting system clock to 2009-10-12 15:12:26 UTC (1255360346)
Oct 12 17:13:07 aragorn kernel: Freeing unused kernel memory: 508k freed
Oct 12 17:13:07 aragorn kernel: input: AT Translated Set 2 keyboard as /class/input/input0
Oct 12 17:13:07 aragorn kernel: fan PNP0C0B:00: registered as cooling_device0
Oct 12 17:13:07 aragorn kernel: ACPI: Fan [C3B1] (off)
Oct 12 17:13:07 aragorn kernel: fan PNP0C0B:01: registered as cooling_device1
Oct 12 17:13:07 aragorn kernel: ACPI: Fan [C3B2] (off)
Oct 12 17:13:07 aragorn kernel: fan PNP0C0B:02: registered as cooling_device2
Oct 12 17:13:07 aragorn kernel: ACPI: Fan [C3C8] (off)
Oct 12 17:13:07 aragorn kernel: fan PNP0C0B:03: registered as cooling_device3
Oct 12 17:13:07 aragorn kernel: ACPI: Fan [C3C9] (off)
Oct 12 17:13:07 aragorn kernel: fan PNP0C0B:04: registered as cooling_device4
Oct 12 17:13:07 aragorn kernel: ACPI: Fan [C3CA] (off)
Oct 12 17:13:07 aragorn kernel: fan PNP0C0B:05: registered as cooling_device5
Oct 12 17:13:07 aragorn kernel: ACPI: Fan [C3CB] (off)
Oct 12 17:13:07 aragorn kernel: fan PNP0C0B:06: registered as cooling_device6
Oct 12 17:13:07 aragorn kernel: ACPI: Fan [C3CC] (off)
Oct 12 17:13:07 aragorn kernel: thermal LNXTHERM:01: registered as thermal_zone0
Oct 12 17:13:07 aragorn kernel: ACPI: Thermal Zone [TZ6] (25 C)
Oct 12 17:13:07 aragorn kernel: thermal LNXTHERM:02: registered as thermal_zone1
Oct 12 17:13:07 aragorn kernel: ACPI: Thermal Zone [TZ0] (55 C)
Oct 12 17:13:07 aragorn kernel: thermal LNXTHERM:03: registered as thermal_zone2
Oct 12 17:13:07 aragorn kernel: ACPI: Thermal Zone [TZ1] (57 C)
Oct 12 17:13:07 aragorn kernel: thermal LNXTHERM:04: registered as thermal_zone3
Oct 12 17:13:07 aragorn kernel: ACPI: Thermal Zone [TZ3] (45 C)
Oct 12 17:13:07 aragorn kernel: thermal LNXTHERM:05: registered as thermal_zone4
Oct 12 17:13:07 aragorn kernel: ACPI: Thermal Zone [TZ4] (33 C)
Oct 12 17:13:07 aragorn kernel: thermal LNXTHERM:06: registered as thermal_zone5
Oct 12 17:13:07 aragorn kernel: ACPI: Thermal Zone [TZ5] (49 C)
Oct 12 17:13:07 aragorn kernel: e1000e: Intel(R) PRO/1000 Network Driver - 1.0.2-k2
Oct 12 17:13:07 aragorn kernel: e1000e: Copyright (c) 1999-2008 Intel Corporation.
Oct 12 17:13:07 aragorn kernel: e1000e 0000:00:19.0: PCI INT A -> GSI 22 (level, low) -> IRQ 22
Oct 12 17:13:07 aragorn kernel: e1000e 0000:00:19.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: e1000e 0000:00:19.0: irq 26 for MSI/MSI-X
Oct 12 17:13:07 aragorn kernel: ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
Oct 12 17:13:07 aragorn kernel: SCSI subsystem initialized
Oct 12 17:13:07 aragorn kernel: libata version 3.00 loaded.
Oct 12 17:13:07 aragorn kernel: ricoh-mmc: Ricoh MMC Controller disabling driver
Oct 12 17:13:07 aragorn kernel: ricoh-mmc: Copyright(c) Philip Langdale
Oct 12 17:13:07 aragorn kernel: ricoh-mmc: Ricoh MMC controller found at 0000:02:06.3 [1180:0843] (rev 11)
Oct 12 17:13:07 aragorn kernel: ricoh-mmc: Controller is now disabled.
Oct 12 17:13:07 aragorn kernel: ohci1394 0000:02:06.1: PCI INT B -> GSI 19 (level, low) -> IRQ 19
Oct 12 17:13:07 aragorn kernel: sdhci: Secure Digital Host Controller Interface driver
Oct 12 17:13:07 aragorn kernel: sdhci: Copyright(c) Pierre Ossman
Oct 12 17:13:07 aragorn kernel: ohci1394: fw-host0: OHCI-1394 1.1 (PCI): IRQ=[19]  MMIO=[e0101000-e01017ff]  Max Packet=[2048]  IR/IT contexts=[4/4]
Oct 12 17:13:07 aragorn kernel: sdhci-pci 0000:02:06.2: SDHCI controller found [1180:0822] (rev 21)
Oct 12 17:13:07 aragorn kernel: sdhci-pci 0000:02:06.2: PCI INT C -> GSI 20 (level, low) -> IRQ 20
Oct 12 17:13:07 aragorn kernel: Registered led device: mmc0::
Oct 12 17:13:07 aragorn kernel: mmc0: SDHCI controller on PCI [0000:02:06.2] using PIO
Oct 12 17:13:07 aragorn kernel: 0000:00:19.0: eth0: (PCI Express:2.5GB/s:Width x1) 00:1e:68:5e:3b:04
Oct 12 17:13:07 aragorn kernel: 0000:00:19.0: eth0: Intel(R) PRO/1000 Network Connection
Oct 12 17:13:07 aragorn kernel: 0000:00:19.0: eth0: MAC: 6, PHY: 6, PBA No: ffffff-0ff
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: PCI INT C -> GSI 18 (level, low) -> IRQ 18
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: EHCI Host Controller
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: new USB bus registered, assigned bus number 1
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: debug port 1
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: cache line size of 32 is not supported
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: irq 18, io mem 0xe0641000
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1a.7: USB 2.0 started, EHCI 1.00
Oct 12 17:13:07 aragorn kernel: usb usb1: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 1-0:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 1-0:1.0: 4 ports detected
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: PCI INT A -> GSI 20 (level, low) -> IRQ 20
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: EHCI Host Controller
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: new USB bus registered, assigned bus number 2
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: debug port 1
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: cache line size of 32 is not supported
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: irq 20, io mem 0xe0648000
Oct 12 17:13:07 aragorn kernel: ehci_hcd 0000:00:1d.7: USB 2.0 started, EHCI 1.00
Oct 12 17:13:07 aragorn kernel: usb usb2: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 2-0:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 2-0:1.0: 6 ports detected
Oct 12 17:13:07 aragorn kernel: pata_acpi 0000:00:03.2: PCI INT C -> GSI 18 (level, low) -> IRQ 18
Oct 12 17:13:07 aragorn kernel: pata_acpi 0000:00:03.2: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: pata_acpi 0000:00:03.2: PCI INT C disabled
Oct 12 17:13:07 aragorn kernel: pata_acpi 0000:00:1f.1: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 12 17:13:07 aragorn kernel: pata_acpi 0000:00:1f.1: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: pata_acpi 0000:00:1f.1: PCI INT A disabled
Oct 12 17:13:07 aragorn kernel: ata_piix 0000:00:1f.1: version 2.13
Oct 12 17:13:07 aragorn kernel: ata_piix 0000:00:1f.1: quirky BIOS, skipping spindown on poweroff and hibernation
Oct 12 17:13:07 aragorn kernel: ata_piix 0000:00:1f.1: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 12 17:13:07 aragorn kernel: ata_piix 0000:00:1f.1: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: uhci_hcd: USB Universal Host Controller Interface driver
Oct 12 17:13:07 aragorn kernel: Uniform Multi-Platform E-IDE driver
Oct 12 17:13:07 aragorn kernel: scsi0 : ata_piix
Oct 12 17:13:07 aragorn kernel: scsi1 : ata_piix
Oct 12 17:13:07 aragorn kernel: ata1: PATA max UDMA/100 cmd 0x1f0 ctl 0x3f6 bmdma 0x2120 irq 14
Oct 12 17:13:07 aragorn kernel: ata2: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0x2128 irq 15
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.0: UHCI Host Controller
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.0: new USB bus registered, assigned bus number 3
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.0: irq 16, io base 0x00002080
Oct 12 17:13:07 aragorn kernel: usb usb3: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 3-0:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 3-0:1.0: 2 ports detected
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.1: PCI INT B -> GSI 17 (level, low) -> IRQ 17
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.1: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.1: UHCI Host Controller
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.1: new USB bus registered, assigned bus number 4
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1a.1: irq 17, io base 0x000020a0
Oct 12 17:13:07 aragorn kernel: usb usb4: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 4-0:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 4-0:1.0: 2 ports detected
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.0: PCI INT A -> GSI 20 (level, low) -> IRQ 20
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.0: UHCI Host Controller
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 5
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.0: irq 20, io base 0x000020c0
Oct 12 17:13:07 aragorn kernel: usb usb5: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 5-0:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 5-0:1.0: 2 ports detected
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.1: PCI INT B -> GSI 22 (level, low) -> IRQ 22
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.1: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.1: UHCI Host Controller
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 6
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.1: irq 22, io base 0x000020e0
Oct 12 17:13:07 aragorn kernel: usb usb6: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 6-0:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 6-0:1.0: 2 ports detected
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.2: PCI INT C -> GSI 18 (level, low) -> IRQ 18
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.2: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.2: UHCI Host Controller
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 7
Oct 12 17:13:07 aragorn kernel: uhci_hcd 0000:00:1d.2: irq 18, io base 0x00002100
Oct 12 17:13:07 aragorn kernel: usb usb7: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 7-0:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 7-0:1.0: 2 ports detected
Oct 12 17:13:07 aragorn kernel: ata2: port disabled. ignoring.
Oct 12 17:13:07 aragorn kernel: ata1.00: ATA-7: SAMSUNG HS122JC, GQ100-04, max UDMA/100
Oct 12 17:13:07 aragorn kernel: ata1.00: 234441648 sectors, multi 16: LBA 
Oct 12 17:13:07 aragorn kernel: ata1.01: ATAPI: MATSHITADVD-RAM UJ-852S, 1.02, max MWDMA2
Oct 12 17:13:07 aragorn kernel: ata1.00: configured for UDMA/100
Oct 12 17:13:07 aragorn kernel: ata1.01: configured for MWDMA2
Oct 12 17:13:07 aragorn kernel: scsi 0:0:0:0: Direct-Access     ATA      SAMSUNG HS122JC  GQ10 PQ: 0 ANSI: 5
Oct 12 17:13:07 aragorn kernel: scsi 0:0:1:0: CD-ROM            MATSHITA DVD-RAM UJ-852S  1.02 PQ: 0 ANSI: 5
Oct 12 17:13:07 aragorn kernel: sd 0:0:0:0: [sda] 234441648 512-byte logical blocks: (120 GB/111 GiB)
Oct 12 17:13:07 aragorn kernel: sd 0:0:0:0: [sda] Write Protect is off
Oct 12 17:13:07 aragorn kernel: sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
Oct 12 17:13:07 aragorn kernel: sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Oct 12 17:13:07 aragorn kernel:  sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 >
Oct 12 17:13:07 aragorn kernel: sd 0:0:0:0: [sda] Attached SCSI disk
Oct 12 17:13:07 aragorn kernel: sr0: scsi3-mmc drive: 24x/24x writer dvd-ram cd/rw xa/form2 cdda tray
Oct 12 17:13:07 aragorn kernel: Uniform CD-ROM driver Revision: 3.20
Oct 12 17:13:07 aragorn kernel: sr 0:0:1:0: Attached scsi CD-ROM sr0
Oct 12 17:13:07 aragorn kernel: sd 0:0:0:0: Attached scsi generic sg0 type 0
Oct 12 17:13:07 aragorn kernel: sr 0:0:1:0: Attached scsi generic sg1 type 5
Oct 12 17:13:07 aragorn kernel: usb 1-2: new high speed USB device using ehci_hcd and address 3
Oct 12 17:13:07 aragorn kernel: usb 1-2: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: hub 1-2:1.0: USB hub found
Oct 12 17:13:07 aragorn kernel: hub 1-2:1.0: 4 ports detected
Oct 12 17:13:07 aragorn kernel: usb 3-1: new full speed USB device using uhci_hcd and address 2
Oct 12 17:13:07 aragorn kernel: device-mapper: ioctl: 4.15.0-ioctl (2009-04-01) initialised: dm-devel@redhat.com
Oct 12 17:13:07 aragorn kernel: usb 3-1: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: ieee1394: Host added: ID:BUS[0-00:1023]  GUID[001b249929192210]
Oct 12 17:13:07 aragorn kernel: usb 5-2: new full speed USB device using uhci_hcd and address 2
Oct 12 17:13:07 aragorn kernel: usb 5-2: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: usb 1-2.2: new full speed USB device using ehci_hcd and address 4
Oct 12 17:13:07 aragorn kernel: usb 1-2.2: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: usb 1-2.3: new low speed USB device using ehci_hcd and address 5
Oct 12 17:13:07 aragorn kernel: usb 1-2.3: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: usbcore: registered new interface driver hiddev
Oct 12 17:13:07 aragorn kernel: input: Logitech USB Receiver as /class/input/input1
Oct 12 17:13:07 aragorn kernel: generic-usb 0003:046D:C50D.0001: input: USB HID v1.10 Mouse [Logitech USB Receiver] on usb-0000:00:1a.7-2.3/input0
Oct 12 17:13:07 aragorn kernel: usbcore: registered new interface driver usbhid
Oct 12 17:13:07 aragorn kernel: usbhid: v2.6:USB HID core driver
Oct 12 17:13:07 aragorn kernel: usb 1-2.4: new low speed USB device using ehci_hcd and address 6
Oct 12 17:13:07 aragorn kernel: usb 1-2.4: configuration #1 chosen from 1 choice
Oct 12 17:13:07 aragorn kernel: input: USB Compliant Keyboard as /class/input/input2
Oct 12 17:13:07 aragorn kernel: generic-usb 0003:05A4:9841.0002: input: USB HID v1.10 Keyboard [USB Compliant Keyboard] on usb-0000:00:1a.7-2.4/input0
Oct 12 17:13:07 aragorn kernel: input: USB Compliant Keyboard as /class/input/input3
Oct 12 17:13:07 aragorn kernel: generic-usb 0003:05A4:9841.0003: input: USB HID v1.10 Device [USB Compliant Keyboard] on usb-0000:00:1a.7-2.4/input1
Oct 12 17:13:07 aragorn kernel: PM: Starting manual resume from disk
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: udevd version 125 started
Oct 12 17:13:07 aragorn kernel: input: Sleep Button as /class/input/input4
Oct 12 17:13:07 aragorn kernel: ACPI: Sleep Button [C2BF]
Oct 12 17:13:07 aragorn kernel: input: Lid Switch as /class/input/input5
Oct 12 17:13:07 aragorn kernel: ACPI: Lid Switch [C155]
Oct 12 17:13:07 aragorn kernel: input: Power Button as /class/input/input6
Oct 12 17:13:07 aragorn kernel: ACPI: Power Button [PWRF]
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7dbd42 0027F (v01 HP      Cpu0Ist 00003000 INTL 20060317)
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7dc046 005FA (v01 HP      Cpu0Cst 00003001 INTL 20060317)
Oct 12 17:13:07 aragorn kernel: ACPI: AC Adapter [C23B] (on-line)
Oct 12 17:13:07 aragorn kernel: ACPI: WMI: Mapper loaded
Oct 12 17:13:07 aragorn kernel: Monitor-Mwait will be used to enter C-1 state
Oct 12 17:13:07 aragorn kernel: Monitor-Mwait will be used to enter C-2 state
Oct 12 17:13:07 aragorn kernel: Marking TSC unstable due to TSC halts in idle
Oct 12 17:13:07 aragorn kernel: processor LNXCPU:00: registered as cooling_device7
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7dbc7a 000C8 (v01 HP      Cpu1Ist 00003000 INTL 20060317)
Oct 12 17:13:07 aragorn kernel: ACPI: SSDT 000000007e7dbfc1 00085 (v01 HP      Cpu1Cst 00003000 INTL 20060317)
Oct 12 17:13:07 aragorn kernel: Switching to clocksource hpet
Oct 12 17:13:07 aragorn kernel: agpgart-intel 0000:00:00.0: Intel 965GM Chipset
Oct 12 17:13:07 aragorn kernel: agpgart-intel 0000:00:00.0: detected 7676K stolen memory
Oct 12 17:13:07 aragorn kernel: agpgart-intel 0000:00:00.0: AGP aperture is 256M @ 0xd0000000
Oct 12 17:13:07 aragorn kernel: processor LNXCPU:01: registered as cooling_device8
Oct 12 17:13:07 aragorn kernel: ACPI: Battery Slot [C23D] (battery present)
Oct 12 17:13:07 aragorn kernel: lis3lv02d: hardware type NC2510 found
Oct 12 17:13:07 aragorn kernel: lis3lv02d: 2-byte sensor found
Oct 12 17:13:07 aragorn kernel: input: ST LIS3LV02DL Accelerometer as /class/input/input7
Oct 12 17:13:07 aragorn kernel: Registered led device: hp::hddprotect
Oct 12 17:13:07 aragorn kernel: input: PC Speaker as /class/input/input8
Oct 12 17:13:07 aragorn kernel: cfg80211: Using static regulatory domain info
Oct 12 17:13:07 aragorn kernel: cfg80211: Regulatory domain: EU
Oct 12 17:13:07 aragorn kernel: ^I(start_freq - end_freq @ bandwidth), (max_antenna_gain, max_eirp)
Oct 12 17:13:07 aragorn kernel: ^I(2402000 KHz - 2482000 KHz @ 40000 KHz), (600 mBi, 2000 mBm)
Oct 12 17:13:07 aragorn kernel: ^I(5170000 KHz - 5190000 KHz @ 40000 KHz), (600 mBi, 2300 mBm)
Oct 12 17:13:07 aragorn kernel: ^I(5190000 KHz - 5210000 KHz @ 40000 KHz), (600 mBi, 2300 mBm)
Oct 12 17:13:07 aragorn kernel: ^I(5210000 KHz - 5230000 KHz @ 40000 KHz), (600 mBi, 2300 mBm)
Oct 12 17:13:07 aragorn kernel: ^I(5230000 KHz - 5330000 KHz @ 40000 KHz), (600 mBi, 2000 mBm)
Oct 12 17:13:07 aragorn kernel: ^I(5490000 KHz - 5710000 KHz @ 40000 KHz), (600 mBi, 3000 mBm)
Oct 12 17:13:07 aragorn kernel: cfg80211: Calling CRDA for country: EU
Oct 12 17:13:07 aragorn kernel: cfg80211: Calling CRDA for country: EU
Oct 12 17:13:07 aragorn kernel: input: PS/2 Generic Mouse as /class/input/input9
Oct 12 17:13:07 aragorn kernel: usblp0: USB Bidirectional printer dev 4 if 0 alt 1 proto 2 vid 0x03F0 pid 0x3102
Oct 12 17:13:07 aragorn kernel: usbcore: registered new interface driver usblp
Oct 12 17:13:07 aragorn kernel: yenta_cardbus 0000:02:06.0: CardBus bridge found [103c:30c9]
Oct 12 17:13:07 aragorn kernel: yenta_cardbus 0000:02:06.0: ISA IRQ mask 0x0cb8, PCI irq 18
Oct 12 17:13:07 aragorn kernel: yenta_cardbus 0000:02:06.0: Socket status: 30000006
Oct 12 17:13:07 aragorn kernel: pci_bus 0000:02: Raising subordinate bus# of parent bus (#02) from #03 to #06
Oct 12 17:13:07 aragorn kernel: yenta_cardbus 0000:02:06.0: pcmcia: parent PCI bridge I/O window: 0x3000 - 0x3fff
Oct 12 17:13:07 aragorn kernel: yenta_cardbus 0000:02:06.0: pcmcia: parent PCI bridge Memory window: 0xe0100000 - 0xe03fffff
Oct 12 17:13:07 aragorn kernel: yenta_cardbus 0000:02:06.0: pcmcia: parent PCI bridge Memory window: 0x80000000 - 0x83ffffff
Oct 12 17:13:07 aragorn kernel: Synaptics Touchpad, model: 1, fw: 6.3, id: 0x1a0b1, caps: 0xa04711/0xa00000
Oct 12 17:13:07 aragorn kernel: input: SynPS/2 Synaptics TouchPad as /class/input/input10
Oct 12 17:13:07 aragorn kernel: iwlagn: Intel(R) Wireless WiFi Link AGN driver for Linux, 1.3.27kd
Oct 12 17:13:07 aragorn kernel: iwlagn: Copyright(c) 2003-2009 Intel Corporation
Oct 12 17:13:07 aragorn kernel: iwlagn 0000:10:00.0: PCI INT A -> GSI 17 (level, low) -> IRQ 17
Oct 12 17:13:07 aragorn kernel: iwlagn 0000:10:00.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: iwlagn 0000:10:00.0: Detected Intel Wireless WiFi Link 4965AGN REV=0x4
Oct 12 17:13:07 aragorn kernel: iwlagn 0000:10:00.0: Tunable channels: 11 802.11bg, 13 802.11a channels
Oct 12 17:13:07 aragorn kernel: iwlagn 0000:10:00.0: irq 27 for MSI/MSI-X
Oct 12 17:13:07 aragorn kernel: phy0: Selected rate control algorithm 'iwl-agn-rs'
Oct 12 17:13:07 aragorn kernel: HDA Intel 0000:00:1b.0: power state changed by ACPI to D0
Oct 12 17:13:07 aragorn kernel: HDA Intel 0000:00:1b.0: PCI INT A -> GSI 17 (level, low) -> IRQ 17
Oct 12 17:13:07 aragorn kernel: HDA Intel 0000:00:1b.0: setting latency timer to 64
Oct 12 17:13:07 aragorn kernel: input: HDA Digital PCBeep as /class/input/input11
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-1, internal journal
Oct 12 17:13:07 aragorn kernel: loop: module loaded
Oct 12 17:13:07 aragorn kernel: input: HP WMI hotkeys as /class/input/input12
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-5, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-2, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-3, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-4, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-6, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-9, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-12, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-8, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: kjournald starting.  Commit interval 5 seconds
Oct 12 17:13:07 aragorn kernel: EXT3 FS on dm-10, internal journal
Oct 12 17:13:07 aragorn kernel: EXT3-fs: mounted filesystem with ordered data mode.
Oct 12 17:13:07 aragorn kernel: Adding 2097144k swap on /dev/mapper/main-swap.  Priority:-1 extents:1 across:2097144k 
Oct 12 17:13:07 aragorn kernel: iwlagn 0000:10:00.0: firmware: requesting iwlwifi-4965-2.ucode
Oct 12 17:13:07 aragorn kernel: iwlagn 0000:10:00.0: loaded firmware version 228.57.2.23
Oct 12 17:13:07 aragorn kernel: Registered led device: iwl-phy0::radio
Oct 12 17:13:07 aragorn kernel: Registered led device: iwl-phy0::assoc
Oct 12 17:13:07 aragorn kernel: Registered led device: iwl-phy0::RX
Oct 12 17:13:07 aragorn kernel: Registered led device: iwl-phy0::TX
Oct 12 17:13:07 aragorn kernel: wlan0: deauthenticating from 00:14:c1:38:e5:15 by local choice (reason=3)
Oct 12 17:13:07 aragorn kernel: wlan0: direct probe to AP 00:14:c1:38:e5:15 (try 1)
Oct 12 17:13:07 aragorn kernel: wlan0: direct probe responded
Oct 12 17:13:07 aragorn kernel: wlan0: authenticate with AP 00:14:c1:38:e5:15 (try 1)
Oct 12 17:13:07 aragorn kernel: wlan0: authenticated
Oct 12 17:13:07 aragorn kernel: wlan0: associate with AP 00:14:c1:38:e5:15 (try 1)
Oct 12 17:13:07 aragorn kernel: wlan0: RX AssocResp from 00:14:c1:38:e5:15 (capab=0x411 status=0 aid=1)
Oct 12 17:13:07 aragorn kernel: wlan0: associated
Oct 12 17:13:07 aragorn kernel: RPC: Registered udp transport module.
Oct 12 17:13:07 aragorn kernel: RPC: Registered tcp transport module.
Oct 12 17:13:07 aragorn kernel: RPC: Registered tcp NFSv4.1 backchannel transport module.
Oct 12 17:13:07 aragorn kernel: Installing knfsd (copyright (C) 1996 okir@monad.swb.de).
Oct 12 17:13:07 aragorn kernel: NET: Registered protocol family 10
Oct 12 17:13:07 aragorn kernel: lo: Disabled Privacy Extensions
Oct 12 17:13:09 aragorn kernel: lp: driver loaded but no devices found
Oct 12 17:13:09 aragorn kernel: ppdev: user-space parallel port driver
Oct 12 17:13:15 aragorn kernel: wlan0: no IPv6 routers present
Oct 12 17:13:20 aragorn kernel: [drm] Initialized drm 1.1.0 20060810
Oct 12 17:13:20 aragorn kernel: pci 0000:00:02.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 12 17:13:20 aragorn kernel: pci 0000:00:02.0: setting latency timer to 64
Oct 12 17:13:20 aragorn kernel: pci 0000:00:02.0: irq 28 for MSI/MSI-X
Oct 12 17:13:20 aragorn kernel: acpi device:00: registered as cooling_device9
Oct 12 17:13:20 aragorn kernel: input: Video Bus as /class/input/input13
Oct 12 17:13:20 aragorn kernel: ACPI: Video Device [C09A] (multi-head: yes  rom: no  post: no)
Oct 12 17:13:20 aragorn kernel: [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0
Oct 12 17:16:11 aragorn kernel: sr0: CDROM not ready.  Make sure there is a disc in the drive.
Oct 12 17:16:11 aragorn kernel: sr0: CDROM not ready.  Make sure there is a disc in the drive.
Oct 12 17:55:15 aragorn kernel: sr0: CDROM not ready.  Make sure there is a disc in the drive.
Oct 12 17:55:15 aragorn kernel: sr0: CDROM not ready.  Make sure there is a disc in the drive.
Oct 12 18:10:35 aragorn kernel: e1000e 0000:00:19.0: irq 26 for MSI/MSI-X
Oct 12 18:10:35 aragorn kernel: e1000e 0000:00:19.0: irq 26 for MSI/MSI-X
Oct 12 18:10:35 aragorn kernel: ADDRCONF(NETDEV_UP): eth0: link is not ready
Oct 12 18:10:37 aragorn kernel: e1000e: eth0 NIC Link is Up 100 Mbps Full Duplex, Flow Control: RX/TX
Oct 12 18:10:37 aragorn kernel: 0000:00:19.0: eth0: 10/100 speed: disabling TSO
Oct 12 18:10:37 aragorn kernel: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Oct 12 18:10:39 aragorn kernel: wlan0: deauthenticating from 00:14:c1:38:e5:15 by local choice (reason=3)
Oct 12 18:10:48 aragorn kernel: eth0: no IPv6 routers present
Oct 12 20:13:29 aragorn kernel: Registered led device: iwl-phy0::radio
Oct 12 20:13:29 aragorn kernel: Registered led device: iwl-phy0::assoc
Oct 12 20:13:29 aragorn kernel: Registered led device: iwl-phy0::RX
Oct 12 20:13:29 aragorn kernel: Registered led device: iwl-phy0::TX
Oct 12 20:13:29 aragorn kernel: ADDRCONF(NETDEV_UP): wlan0: link is not ready
Oct 12 20:13:29 aragorn kernel: wlan0: direct probe to AP 00:14:c1:38:e5:15 (try 1)
Oct 12 20:13:29 aragorn kernel: wlan0: direct probe responded
Oct 12 20:13:29 aragorn kernel: wlan0: authenticate with AP 00:14:c1:38:e5:15 (try 1)
Oct 12 20:13:29 aragorn kernel: wlan0: authenticated
Oct 12 20:13:29 aragorn kernel: wlan0: associate with AP 00:14:c1:38:e5:15 (try 1)
Oct 12 20:13:29 aragorn kernel: wlan0: RX ReassocResp from 00:14:c1:38:e5:15 (capab=0x411 status=0 aid=0)
Oct 12 20:13:29 aragorn kernel: wlan0: invalid aid value 0; bits 15:14 not set
Oct 12 20:13:29 aragorn kernel: wlan0: associated
Oct 12 20:13:29 aragorn kernel: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
Oct 12 20:13:30 aragorn kernel: wlan0: deauthenticating from 00:14:c1:38:e5:15 by local choice (reason=3)
Oct 12 20:13:31 aragorn kernel: wlan0: deauthenticating from 00:14:c1:38:e5:15 by local choice (reason=3)
Oct 12 20:13:31 aragorn kernel: wlan0: direct probe to AP 00:14:c1:38:e5:15 (try 1)
Oct 12 20:13:31 aragorn kernel: wlan0: direct probe responded
Oct 12 20:13:31 aragorn kernel: wlan0: authenticate with AP 00:14:c1:38:e5:15 (try 1)
Oct 12 20:13:31 aragorn kernel: wlan0: authenticated
Oct 12 20:13:31 aragorn kernel: wlan0: associate with AP 00:14:c1:38:e5:15 (try 1)
Oct 12 20:13:31 aragorn kernel: wlan0: RX ReassocResp from 00:14:c1:38:e5:15 (capab=0x411 status=0 aid=1)
Oct 12 20:13:31 aragorn kernel: wlan0: associated
Oct 12 20:13:40 aragorn kernel: wlan0: no IPv6 routers present
Oct 12 20:15:06 aragorn kernel: swapper: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 0, comm: swapper Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7243>] iwl_rx_handle+0x3ad/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8125732b>] ? ip_rcv+0x2b8/0x2ef
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0362b95>] ? __iwl_read32+0xaa/0xb9 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02b1fc9>] ? acpi_idle_enter_simple+0xfe/0x12c [processor]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa02b1fbf>] ? acpi_idle_enter_simple+0xf4/0x12c [processor]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81220131>] ? cpuidle_idle_call+0x98/0xf3
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100aed0>] ? cpu_idle+0x5a/0x92
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812aa175>] ? start_secondary+0x17a/0x17f
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 187
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 159
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102886 isolated_anon:32
Oct 12 20:15:07 aragorn kernel:  active_file:10730 inactive_file:10749 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:35291 unstable:0 buffer:50
Oct 12 20:15:07 aragorn kernel:  free:3047 slab_reclaimable:3967 slab_unreclaimable:11608
Oct 12 20:15:07 aragorn kernel:  mapped:21571 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4260kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407564kB active_file:42904kB inactive_file:42868kB unevictable:1600kB isolated(anon):128kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:141164kB mapped:86256kB shmem:76kB slab_reclaimable:15864kB slab_unreclaimable:46424kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 549*4kB 196*8kB 0*16kB 0*32kB 1*64kB 0*128kB 0*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4340kB
Oct 12 20:15:07 aragorn kernel: 59407 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 37518 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 205147, delete 167629, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1361488kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93830 pages shared
Oct 12 20:15:07 aragorn kernel: 433823 pages non-shared
Oct 12 20:15:07 aragorn kernel: iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
Oct 12 20:15:07 aragorn kernel: swapper: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 0, comm: swapper Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7243>] iwl_rx_handle+0x3ad/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0362b95>] ? __iwl_read32+0xaa/0xb9 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02b1fc9>] ? acpi_idle_enter_simple+0xfe/0x12c [processor]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa02b1fbf>] ? acpi_idle_enter_simple+0xf4/0x12c [processor]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81220131>] ? cpuidle_idle_call+0x98/0xf3
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100aed0>] ? cpu_idle+0x5a/0x92
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8129e042>] ? rest_init+0x66/0x68
Oct 12 20:15:07 aragorn kernel:  [<ffffffff814a9c48>] ? start_kernel+0x34d/0x358
Oct 12 20:15:07 aragorn kernel:  [<ffffffff814a929a>] ? x86_64_start_reservations+0xaa/0xae
Oct 12 20:15:07 aragorn kernel:  [<ffffffff814a937f>] ? x86_64_start_kernel+0xe1/0xe8
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 187
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 167
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102888 isolated_anon:32
Oct 12 20:15:07 aragorn kernel:  active_file:10737 inactive_file:10749 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:35575 unstable:0 buffer:50
Oct 12 20:15:07 aragorn kernel:  free:2981 slab_reclaimable:3935 slab_unreclaimable:11684
Oct 12 20:15:07 aragorn kernel:  mapped:21563 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:3996kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407572kB active_file:42932kB inactive_file:42868kB unevictable:1600kB isolated(anon):128kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:142300kB mapped:86224kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:46728kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 609*4kB 135*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4028kB
Oct 12 20:15:07 aragorn kernel: 59639 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 37774 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 205403, delete 167629, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1360464kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93766 pages shared
Oct 12 20:15:07 aragorn kernel: 433957 pages non-shared
Oct 12 20:15:07 aragorn kernel: iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 3 free buffers remaining.
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7218>] iwl_rx_handle+0x382/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8125732b>] ? ip_rcv+0x2b8/0x2ef
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0362b95>] ? __iwl_read32+0xaa/0xb9 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 200
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102838 isolated_anon:65
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36939 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11957
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407376kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):256kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147756kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47820kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:131 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60755 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39168 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206820, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354820kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7243>] iwl_rx_handle+0x3ad/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8125732b>] ? ip_rcv+0x2b8/0x2ef
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0362b95>] ? __iwl_read32+0xaa/0xb9 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 190
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102848 isolated_anon:66
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36970 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11978
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407412kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):264kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147880kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47904kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:165 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60786 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39195 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206846, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354716kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7218>] iwl_rx_handle+0x382/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122c0e6>] ? skb_dequeue+0x60/0x6c
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 190
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102848 isolated_anon:66
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36970 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11978
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407412kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):264kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147880kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47904kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60786 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39195 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206846, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354716kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7218>] iwl_rx_handle+0x382/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122c0e6>] ? skb_dequeue+0x60/0x6c
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 190
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102848 isolated_anon:66
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36970 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11978
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407412kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):264kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147880kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47904kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60786 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39195 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206846, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354716kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7218>] iwl_rx_handle+0x382/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122c0e6>] ? skb_dequeue+0x60/0x6c
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 190
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102848 isolated_anon:66
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36970 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11978
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407412kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):264kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147880kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47904kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60786 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39195 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206846, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354716kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7218>] iwl_rx_handle+0x382/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122c0e6>] ? skb_dequeue+0x60/0x6c
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 190
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102848 isolated_anon:66
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36970 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11978
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407412kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):264kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147880kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47904kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60786 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39195 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206846, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354716kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7218>] iwl_rx_handle+0x382/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122c0e6>] ? skb_dequeue+0x60/0x6c
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 190
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102848 isolated_anon:66
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36970 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11978
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407412kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):264kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147880kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47904kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60786 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39195 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206846, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354716kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: kcryptd: page allocation failure. order:2, mode:0x4020
Oct 12 20:15:07 aragorn kernel: Pid: 1523, comm: kcryptd Not tainted 2.6.32-rc4 #29
Oct 12 20:15:07 aragorn kernel: Call Trace:
Oct 12 20:15:07 aragorn kernel:  <IRQ>  [<ffffffff810a9d87>] __alloc_pages_nodemask+0x5b9/0x632
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810a9e17>] __get_free_pages+0x17/0x46
Oct 12 20:15:07 aragorn kernel:  [<ffffffff810cecff>] __kmalloc_track_caller+0x4e/0x146
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] ? iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122d96d>] __alloc_skb+0x6b/0x161
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b583>] iwl_rx_allocate+0x94/0x30a [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa036b814>] iwl_rx_replenish_now+0x1b/0x28 [iwlcore]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b7218>] iwl_rx_handle+0x382/0x3c6 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa03b778d>] iwl_irq_tasklet_legacy+0x531/0x7a9 [iwlagn]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8122c0e6>] ? skb_dequeue+0x60/0x6c
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81048abf>] tasklet_action+0x76/0xc1
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a31a>] __do_softirq+0xdd/0x197
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100ccdc>] call_softirq+0x1c/0x28
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100e81c>] do_softirq+0x38/0x70
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8104a164>] irq_exit+0x3b/0x7a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff812af5d5>] do_IRQ+0xad/0xc4
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100c553>] ret_from_intr+0x0/0xa
Oct 12 20:15:07 aragorn kernel:  <EOI>  [<ffffffffa02419f4>] ? enc128+0x67f/0x80b [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa024273a>] ? aes_encrypt+0x12/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa022f2cc>] ? crypto_cbc_encrypt+0x131/0x193 [cbc]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0242728>] ? aes_encrypt+0x0/0x14 [aes_x86_64]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81152601>] ? async_encrypt+0x3d/0x3f
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201bbd>] ? crypt_convert+0x1fe/0x290 [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0202077>] ? kcryptd_crypt+0x428/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058e77>] ? worker_thread+0x195/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffffa0201c4f>] ? kcryptd_crypt+0x0/0x44e [dm_crypt]
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105cb19>] ? autoremove_wake_function+0x0/0x3d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff81058ce2>] ? worker_thread+0x0/0x22d
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c79b>] ? kthread+0x82/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbda>] ? child_rip+0xa/0x20
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8105c719>] ? kthread+0x0/0x8a
Oct 12 20:15:07 aragorn kernel:  [<ffffffff8100cbd0>] ? child_rip+0x0/0x20
Oct 12 20:15:07 aragorn kernel: Mem-Info:
Oct 12 20:15:07 aragorn kernel: DMA per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:    0, btch:   1 usd:   0
Oct 12 20:15:07 aragorn kernel: DMA32 per-cpu:
Oct 12 20:15:07 aragorn kernel: CPU    0: hi:  186, btch:  31 usd: 161
Oct 12 20:15:07 aragorn kernel: CPU    1: hi:  186, btch:  31 usd: 190
Oct 12 20:15:07 aragorn kernel: active_anon:306195 inactive_anon:102848 isolated_anon:66
Oct 12 20:15:07 aragorn kernel:  active_file:10600 inactive_file:10613 isolated_file:0
Oct 12 20:15:07 aragorn kernel:  unevictable:400 dirty:0 writeback:36970 unstable:0 buffer:51
Oct 12 20:15:07 aragorn kernel:  free:3171 slab_reclaimable:3935 slab_unreclaimable:11978
Oct 12 20:15:07 aragorn kernel:  mapped:21315 shmem:19 pagetables:4243 bounce:0
Oct 12 20:15:07 aragorn kernel: DMA free:7928kB min:40kB low:48kB high:60kB active_anon:3880kB inactive_anon:3980kB active_file:16kB inactive_file:128kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15336kB mlocked:0kB dirty:0kB writeback:0kB mapped:28kB shmem:0kB slab_reclaimable:4kB slab_unreclaimable:8kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 1976 1976 1976
Oct 12 20:15:07 aragorn kernel: DMA32 free:4756kB min:5664kB low:7080kB high:8496kB active_anon:1220900kB inactive_anon:407412kB active_file:42384kB inactive_file:42324kB unevictable:1600kB isolated(anon):264kB isolated(file):0kB present:2023748kB mlocked:1600kB dirty:0kB writeback:147880kB mapped:85232kB shmem:76kB slab_reclaimable:15736kB slab_unreclaimable:47904kB kernel_stack:1440kB pagetables:16956kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Oct 12 20:15:07 aragorn kernel: lowmem_reserve[]: 0 0 0 0
Oct 12 20:15:07 aragorn kernel: DMA: 6*4kB 6*8kB 5*16kB 5*32kB 5*64kB 3*128kB 1*256kB 1*512kB 2*1024kB 0*2048kB 1*4096kB = 7928kB
Oct 12 20:15:07 aragorn kernel: DMA32: 923*4kB 17*8kB 0*16kB 1*32kB 0*64kB 1*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 4756kB
Oct 12 20:15:07 aragorn kernel: 60786 total pagecache pages
Oct 12 20:15:07 aragorn kernel: 39195 pages in swap cache
Oct 12 20:15:07 aragorn kernel: Swap cache stats: add 206846, delete 167651, find 11222/12777
Oct 12 20:15:07 aragorn kernel: Free swap  = 1354716kB
Oct 12 20:15:07 aragorn kernel: Total swap = 2097144kB
Oct 12 20:15:07 aragorn kernel: 518064 pages RAM
Oct 12 20:15:07 aragorn kernel: 10503 pages reserved
Oct 12 20:15:07 aragorn kernel: 93541 pages shared
Oct 12 20:15:07 aragorn kernel: 433976 pages non-shared
Oct 12 20:15:07 aragorn kernel: __ratelimit: 45 callbacks suppressed
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 13:10                   ` Frans Pop
@ 2009-10-14 15:40                     ` Mel Gorman
  2009-10-14 16:13                       ` Frans Pop
  2009-10-14 18:34                       ` Frans Pop
  2009-10-14 16:30                     ` reinette chatre
  2009-10-18 23:33                     ` Frans Pop
  2 siblings, 2 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-14 15:40 UTC (permalink / raw)
  To: Frans Pop
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Wed, Oct 14, 2009 at 03:10:08PM +0200, Frans Pop wrote:
> On Wednesday 14 October 2009, Mel Gorman wrote:
> > I think this is very significant. Either that change needs to be backed
> > out or more likely, __GFP_NOWARN needs to be specified and warnings
> > *only* printed when the RX buffers are really low. My expectation would
> > be that some GFP_ATOMIC allocations fail during refill but the fact they
> > fail wakes kswapd to reclaim order-2 pages while the RX buffers in the
> > pool are consumed.
> 
> Sorry I did not actually mention this, but the SKB failures I get with .32 
> have loads of the "Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 
> free buffers remaining." errors. That's why I don't think your patch will 
> help anything.
> 
> zgrep "Only 0 free buffers remaining" /var/log/kern.log* | wc -l
> 84
> 
> OK, they are all GPF_ATOMIC and not GPF_KERNEL, but they also almost all 
> have "0 free buffers"! Next to the 84 warnings for 0 remaining I only have 
> one with "3 free buffers" and one with "1 free buffers".
> 
This is fairly important. It shows that the refills are not keeping up
with the GFP_ATOMIC usage. I'm not sure what to do with this. As the
driver introduced GFP_ATOMIC usage at all, I'm tempted to say revert the
changes in the driver that makes use of GFP_ATOMIC but I'm not the
maintainer. They could also consider having a GFP_ATOMIC-optimistic,
GFP_KERNEL-if-no-buffers-free-and-directly-allocating with GFP_KERNEL
refills always happening in the tasklet.
However, it might be just avoiding the MM problem on my part. It's possible
that if I figure out what went wrong in mm and drivers use of GFP_ATOMIC
will be swept under the carpet.
> And that does not even count the rate limitting:
> Oct 12 20:15:07 aragorn kernel: __ratelimit: 45 callbacks suppressed
> Oct 12 20:25:19 aragorn kernel: __ratelimit: 27 callbacks suppressed
> Oct 12 20:25:20 aragorn kernel: __ratelimit: 2 callbacks suppressed
> 
> Attached the kernel log for one test I did with .32.
> 
> > > In both cases I no longer get SKB errors, but instead (?) I get
> > > firmware errors:
> > > iwlagn 0000:10:00.0: Microcode SW error detected.  Restarting
> > > 0x2000000.
> >
> > I am no wireless expert, but that looks like an separate problem to me.
> > I don't see how an allocation failure could trigger errors in the
> > microcode.
> 
> Yes, it is a separate problem, but it is still significant that reverting 
> that patch triggers them in the extreme swap situation.
> 
True.
> > > With your patch on .32-rc4 I still get the SKB errors, so it does not
> > > seem to help. The only change there may have been is that the desktop
> > > was frozen longer than without the patch, but that is an impression,
> > > not a hard fact.
> >
> > Actually, that's fairly interesting and I think justifies pushing the
> > patch. Direct reclaim can stall processes in a user-visible manner which
> > kswapd is meant to avoid in the majority of cases but is tricky to
> > quantify without instrumenting the kernel to measure direct reclaim
> > frequency and latency (I have WIP tracepoints for this but it's still a
> > WIP). If you notice shorter stalls with the patch applied, it means that
> > kswapd really did need to be informed of the problems.
> 
> No, I thought I saw _longer_ stalls with your patch applied...
> 
Sorry, I misinterpreted. If the stalls are longer, it likely means that
kswapd is doing more work and causing more IO when applied as it tries to
get order-2 pages free. You said you still got SKB errors. Were there any
significant change to the number of failures or can that be told?
> > There still has not been a mm-change identified that makes fragmentation
> > significantly worse.
> 
> My bisection shows a very clear point, even if not an individual commit, in 
> the 'akpm' merge where SKB errors suddenly become *much* more frequent and 
> easy to trigger.
> I'm sorry to say this, but the fact that nothing has been identified yet is 
> IMO the result of a lack of effort, not because there is no such change.
> 
I apologise if I've given that impression. I've been starting at the commits
but could not find an obvious candidate within the page allocator itself which
is why I've been looking at other areas. I put together a hack that allocated
order-2 atomics at a constant rate and order-5 atomics at a lower rate to
try replicate the problem without drivers. I ran some workloads but I wasn't
able to get reliable figures that would have allowed me to investigate further.
> > The majority of the wireless reports have been in 
> > this driver and I think we have the problem commit there. The only other
> > is a firmware loading problem in e100 after resume that fails to make an
> > atomic order-5 fail.
> 
> Not exactly true. Bartlomiej's report was about ipw2200, so there are at 
> least 3 different drivers involved, two wireless and one wired. Besides 
> that one report is related to heavy swap, one to resume and one to driver 
> reload.
> So it's much more likely that there is some common regression (in mm) that 
> affected all three than that there are three unrelated regressions.
Very very likely, I'm not denying this.
> And although both of the others did extremely high allocations, they both 
> started appearing in the same timeframe. And Bart's very first report 
> linked it to mm changes.
> 
> > It's possible that something has changed in resume 
> > in the 2.6.31 window there - maybe something like drivers now reload
> > during resume where they didn't previously or less memory being pushed
> > to swap during resume.
> 
> IMO you're sticking your head in the sand here. 
No. If I was sticking my head in the sand, I would have dismissed this
entirely as "GFP_ATOMIC allocations can fail boo hoo hoo deal with it".
What I'm trying to identify what changed that would affect fragmentation
but that is not within the page allocator itself - largely because with
the exception of the patch I gave you, I couldn't find obvious breakage.
You highlighted the first akpm merge so lets look closer at that as I don't
think there is anything more I can do with the wireless driver other than the
suggestions made already. I looked at this already but I felt fixing GFP_ATOMIC
in wireless was the more likely fix. 
Here is what you said about the merge.
====
For a good overview of the area, use 'gitk f83b1e61..517d0869'.
v2.6.30-5466-ga1dd268   mm: use alloc_pages_exact in alloc_large_system_hash
        2.3     +-
v2.6.30-5478-ge9bb35d   mm: setup_per_zone_inactive_ratio - fix comment and..
        2.5     +-
v2.6.30-5486-g35282a2   migration: only migrate_prep() once per move_pages()
        2.6     -|+|-   not quite conclusive...
v2.6.30-5492-gbce7394   page-allocator: reset wmark_min and inactive ratio..
        2.4     -|-
====
This is what I found. The following were the possible commits that might
be causing the problem.
d239171..72807a7 -- page allocator
	These are the bulk of the page-allocator changes that happened int
	the 2.6.30..2.6.31 cycle. It's also the location of the change to
	kswapd that I sent you a patch for. If there was a marked increase
	in the number of failures before and after this patchset, it means
	that I was wrong about the problem not being in the page allocator
	and I have to go back and keep looking. However, you report that
	commit e9bb35d   mm: setup_per_zone_inactive_ratio - fix comment
	had relatively good results - relatively being that it didn't fail
	on the first try. In my head, these patches have been struck off the
	list of possibilities and is why I've been looking in other subsystems.
56e49d2..f166777 -- reclaim
	I would have considered this strong candidates except again, the last
	good commit happened after this point. If other obvious candidates
	don't crop up, it might be worth double checking within this range, particularly
	commit 56e49d2 vmscan: evict use-once pages first
	as it is targeted at streaming-IO workloads which would include
	your music workload. This commit also will cleanly revert on
	mainline so is relatively easy to test
5c87ead..e9bb35d -- inactive ratio changes
	These patches should be harmless but just in case, please
	compare the output of
	# grep inactive_ratio /proc/zoneinfo
	on 2.6.30 and 2.6.31 and make sure the ratios are the same.
e9bb35d..bce7394 -- various changes
	According to your analysis, this is the most likely location of
	the problem commit.
	Commit b70d94e altered how zonelists were selected during
	allocation. This was tested fairly heavily but if the testing
	missed something, it would mean that some allocations are not
	using the zones they should be.  However, my expectation would
	be that mistakes here would have severe consequences affecting a
	large number of people. This does not revert cleanly but there is
	an untested patch below that should do the job. While it's hard to
	imagine this patch being the problem, it's the most likely commit
	with the range of commits your analysis identified.
	Commit bc75d33 is totally harmless but it mentions min_free_kbytes. I
	checked on my machine to make sure min_free_kbytes was the same on both
	2.6.30 and 2.6.31. Can you check that this is true for your machine? If
	min_free_kbytes decreased, it could explain GFP_ATOMIC failures.
	An extremely unlikely candidate is 75927af8. For this to be a problem,
	much of your userspace would have to be calling madvise() with
	stupid parameters and depending on it silently ignore the
	parameters
	A vague potential candidate for swapless systems is 69c85481 but
	your machine has swap so it can't be this.
	Commit bce7394 affects min_free_kbytes but only on hotplug so it
	can't be this either for your machine
After this point, your analysis indicates that things are already broken
but lets look at some of the candidates anyway.  Out of curiousity,
was CONFIG_UNEVICTABLE_LRU unset in your .config for 2.6.30? I could
only find your 2.6.31 .config. If it was, it might be worth reverting
6837765963f1723e80ca97b1fae660f3a60d77df and unsetting it in 2.6.31 and
seeing what happens.
Commit 8cab4754d24a0f2e05920170c845bd84472814c6 keeps pages on the active
lists for longer than 2.6.30 did. It's possible the fewer reclaim decisions
is delaying lumpy reclaim.
CONFIG_NUMA is not set in your config, so the zone_reclaim() changes
around 24cf72518c79cdcda486ed26074ff8151291cf65 can be discounted.
Commit ee993b135ec75a93bd5c45e636bb210d2975159b altered how lumpy
reclaim works but it should have been harmless. It does not cleanly
revert but it's easy to manually revert.
I didn't spot any other patches that might be potential problems in the
commits.
> I'm not saying that mm is the only issue here, but I'm convinced that there 
> _is_ an mm change that has contributed in a major way to these issues, 
> even if we've not yet been able to identify it.
> 
> > -			    net_ratelimit())
> > +			    net_ratelimit()) {
> >  				IWL_CRIT(priv, "Failed to allocate SKB buffer with %s. Only %u free
> > buffers remaining.\n", priority == GFP_ATOMIC ?  "GFP_ATOMIC" :
> > "GFP_KERNEL",
> 
> Haven't you broken the test 'priority == GFP_ATOMIC' here by setting 
> priority to GFP_ATOMIC|__GFP_NOWARN?
> 
Yes, I did, but as you say that this error message is showing up and buffers
are all depleted, it's not even close to being the right fix. It'd only
be relevant if that error message was showing up with buffers remaining in
the queue.
Revert commit b70d94ee
---
diff --git a/include/linux/gfp.h b/include/linux/gfp.h
index 557bdad..3a94e4b 100644
--- a/include/linux/gfp.h
+++ b/include/linux/gfp.h
@@ -21,8 +21,7 @@ struct vm_area_struct;
 #define __GFP_DMA	((__force gfp_t)0x01u)
 #define __GFP_HIGHMEM	((__force gfp_t)0x02u)
 #define __GFP_DMA32	((__force gfp_t)0x04u)
-#define __GFP_MOVABLE	((__force gfp_t)0x08u)  /* Page is movable */
-#define GFP_ZONEMASK	(__GFP_DMA|__GFP_HIGHMEM|__GFP_DMA32|__GFP_MOVABLE)
+
 /*
  * Action modifiers - doesn't change the zoning
  *
@@ -52,6 +51,7 @@ struct vm_area_struct;
 #define __GFP_HARDWALL   ((__force gfp_t)0x20000u) /* Enforce hardwall cpuset memory allocs */
 #define __GFP_THISNODE	((__force gfp_t)0x40000u)/* No fallback, no policies */
 #define __GFP_RECLAIMABLE ((__force gfp_t)0x80000u) /* Page is reclaimable */
+#define __GFP_MOVABLE	((__force gfp_t)0x100000u)  /* Page is movable */
 
 #ifdef CONFIG_KMEMCHECK
 #define __GFP_NOTRACK	((__force gfp_t)0x200000u)  /* Don't track with kmemcheck */
@@ -128,105 +128,24 @@ static inline int allocflags_to_migratetype(gfp_t gfp_flags)
 		((gfp_flags & __GFP_RECLAIMABLE) != 0);
 }
 
-#ifdef CONFIG_HIGHMEM
-#define OPT_ZONE_HIGHMEM ZONE_HIGHMEM
-#else
-#define OPT_ZONE_HIGHMEM ZONE_NORMAL
-#endif
-
+static inline enum zone_type gfp_zone(gfp_t flags)
+{
 #ifdef CONFIG_ZONE_DMA
-#define OPT_ZONE_DMA ZONE_DMA
-#else
-#define OPT_ZONE_DMA ZONE_NORMAL
+	if (flags & __GFP_DMA)
+		return ZONE_DMA;
 #endif
-
 #ifdef CONFIG_ZONE_DMA32
-#define OPT_ZONE_DMA32 ZONE_DMA32
-#else
-#define OPT_ZONE_DMA32 ZONE_NORMAL
-#endif
-
-/*
- * GFP_ZONE_TABLE is a word size bitstring that is used for looking up the
- * zone to use given the lowest 4 bits of gfp_t. Entries are ZONE_SHIFT long
- * and there are 16 of them to cover all possible combinations of
- * __GFP_DMA, __GFP_DMA32, __GFP_MOVABLE and __GFP_HIGHMEM
- *
- * The zone fallback order is MOVABLE=>HIGHMEM=>NORMAL=>DMA32=>DMA.
- * But GFP_MOVABLE is not only a zone specifier but also an allocation
- * policy. Therefore __GFP_MOVABLE plus another zone selector is valid.
- * Only 1bit of the lowest 3 bit (DMA,DMA32,HIGHMEM) can be set to "1".
- *
- *       bit       result
- *       =================
- *       0x0    => NORMAL
- *       0x1    => DMA or NORMAL
- *       0x2    => HIGHMEM or NORMAL
- *       0x3    => BAD (DMA+HIGHMEM)
- *       0x4    => DMA32 or DMA or NORMAL
- *       0x5    => BAD (DMA+DMA32)
- *       0x6    => BAD (HIGHMEM+DMA32)
- *       0x7    => BAD (HIGHMEM+DMA32+DMA)
- *       0x8    => NORMAL (MOVABLE+0)
- *       0x9    => DMA or NORMAL (MOVABLE+DMA)
- *       0xa    => MOVABLE (Movable is valid only if HIGHMEM is set too)
- *       0xb    => BAD (MOVABLE+HIGHMEM+DMA)
- *       0xc    => DMA32 (MOVABLE+HIGHMEM+DMA32)
- *       0xd    => BAD (MOVABLE+DMA32+DMA)
- *       0xe    => BAD (MOVABLE+DMA32+HIGHMEM)
- *       0xf    => BAD (MOVABLE+DMA32+HIGHMEM+DMA)
- *
- * ZONES_SHIFT must be <= 2 on 32 bit platforms.
- */
-
-#if 16 * ZONES_SHIFT > BITS_PER_LONG
-#error ZONES_SHIFT too large to create GFP_ZONE_TABLE integer
+	if (flags & __GFP_DMA32)
+		return ZONE_DMA32;
 #endif
-
-#define GFP_ZONE_TABLE ( \
-	(ZONE_NORMAL << 0 * ZONES_SHIFT)				\
-	| (OPT_ZONE_DMA << __GFP_DMA * ZONES_SHIFT) 			\
-	| (OPT_ZONE_HIGHMEM << __GFP_HIGHMEM * ZONES_SHIFT)		\
-	| (OPT_ZONE_DMA32 << __GFP_DMA32 * ZONES_SHIFT)			\
-	| (ZONE_NORMAL << __GFP_MOVABLE * ZONES_SHIFT)			\
-	| (OPT_ZONE_DMA << (__GFP_MOVABLE | __GFP_DMA) * ZONES_SHIFT)	\
-	| (ZONE_MOVABLE << (__GFP_MOVABLE | __GFP_HIGHMEM) * ZONES_SHIFT)\
-	| (OPT_ZONE_DMA32 << (__GFP_MOVABLE | __GFP_DMA32) * ZONES_SHIFT)\
-)
-
-/*
- * GFP_ZONE_BAD is a bitmap for all combination of __GFP_DMA, __GFP_DMA32
- * __GFP_HIGHMEM and __GFP_MOVABLE that are not permitted. One flag per
- * entry starting with bit 0. Bit is set if the combination is not
- * allowed.
- */
-#define GFP_ZONE_BAD ( \
-	1 << (__GFP_DMA | __GFP_HIGHMEM)				\
-	| 1 << (__GFP_DMA | __GFP_DMA32)				\
-	| 1 << (__GFP_DMA32 | __GFP_HIGHMEM)				\
-	| 1 << (__GFP_DMA | __GFP_DMA32 | __GFP_HIGHMEM)		\
-	| 1 << (__GFP_MOVABLE | __GFP_HIGHMEM | __GFP_DMA)		\
-	| 1 << (__GFP_MOVABLE | __GFP_DMA32 | __GFP_DMA)		\
-	| 1 << (__GFP_MOVABLE | __GFP_DMA32 | __GFP_HIGHMEM)		\
-	| 1 << (__GFP_MOVABLE | __GFP_DMA32 | __GFP_DMA | __GFP_HIGHMEM)\
-)
-
-static inline enum zone_type gfp_zone(gfp_t flags)
-{
-	enum zone_type z;
-	int bit = flags & GFP_ZONEMASK;
-
-	z = (GFP_ZONE_TABLE >> (bit * ZONES_SHIFT)) &
-					 ((1 << ZONES_SHIFT) - 1);
-
-	if (__builtin_constant_p(bit))
-		MAYBE_BUILD_BUG_ON((GFP_ZONE_BAD >> bit) & 1);
-	else {
-#ifdef CONFIG_DEBUG_VM
-		BUG_ON((GFP_ZONE_BAD >> bit) & 1);
+	if ((flags & (__GFP_HIGHMEM | __GFP_MOVABLE)) ==
+			(__GFP_HIGHMEM | __GFP_MOVABLE))
+		return ZONE_MOVABLE;
+#ifdef CONFIG_HIGHMEM
+	if (flags & __GFP_HIGHMEM)
+		return ZONE_HIGHMEM;
 #endif
-	}
-	return z;
+	return ZONE_NORMAL;
 }
 
 /*
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 15:40                     ` Mel Gorman
@ 2009-10-14 16:13                       ` Frans Pop
  2009-10-14 18:34                       ` Frans Pop
  1 sibling, 0 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-14 16:13 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Wednesday 14 October 2009, Mel Gorman wrote:
> You highlighted the first akpm merge so lets look closer at that as I
> don't think there is anything more I can do with the wireless driver
> other than the suggestions made already.
Thanks a lot for that analysis Mel. I'll see if I can come up with 
additional data based of the info you provide here.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 10:30                 ` Mel Gorman
  2009-10-14 13:10                   ` Frans Pop
@ 2009-10-14 16:28                   ` reinette chatre
  2009-10-14 16:50                     ` Mel Gorman
  1 sibling, 1 reply; 88+ messages in thread
From: reinette chatre @ 2009-10-14 16:28 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
Hi Mel,
On Wed, 2009-10-14 at 03:30 -0700, Mel Gorman wrote:
> From 5fb9f897117bf2701f9fdebe4d008dbe34358ab9 Mon Sep 17 00:00:00 2001
> From: Mel Gorman <mel@csn.ul.ie>
> Date: Wed, 14 Oct 2009 11:19:57 +0100
> Subject: [PATCH] iwlwifi: Suppress warnings related to GFP_ATOMIC allocations that do not matter
> 
> iwlwifi refills RX buffers in two ways - a direct method using GFP_ATOMIC
> and a tasklet method using GFP_KERNEL. There are a number of RX buffers and
> there are only serious issues when there are no RX buffers left. The driver
> explicitly warns when refills are failing and the buffers are low but it
> always warns when a GFP_ATOMIC allocation fails even when there is no
> packet loss as a result.
No, it does not always warn when a GFP_ATOMIC allocation fails. Please
check earlier in iwl_rx_allocate() we have:
if (rxq->free_count > RX_LOW_WATERMARK)
	priority |= __GFP_NOWARN;
So it will suppress warnings as long as we have buffers available.
We do want to see warnings if memory is below watermark and allocation
fails - your patch prevents these warnings from appearing.
> This patch specifies __GFP_NOWARN for the direct refill method that uses
> GFP_ATOMIC. To help identify where allocation failures might be coming
> from, the stack is dumped when the RX queue is dangerously low.
> 
> Signed-off-by: Mel Gorman <mel@csn.ul.ie>
> --- 
>  drivers/net/wireless/iwlwifi/iwl-rx.c |    6 ++++--
>  1 file changed, 4 insertions(+), 2 deletions(-)
> 
> diff --git a/Documentation/trace/postprocess/trace-pagealloc-postprocess.pl b/Documentation/trace/postprocess/trace-pagealloc-postprocess.pl
> old mode 100644
> new mode 100755
> diff --git a/drivers/net/wireless/iwlwifi/iwl-rx.c b/drivers/net/wireless/iwlwifi/iwl-rx.c
> index 8e1bb53..f91a108 100644
> --- a/drivers/net/wireless/iwlwifi/iwl-rx.c
> +++ b/drivers/net/wireless/iwlwifi/iwl-rx.c
> @@ -260,10 +260,12 @@ void iwl_rx_allocate(struct iwl_priv *priv, gfp_t priority)
>  			if (net_ratelimit())
>  				IWL_DEBUG_INFO(priv, "Failed to allocate SKB buffer.\n");
>  			if ((rxq->free_count <= RX_LOW_WATERMARK) &&
> -			    net_ratelimit())
> +			    net_ratelimit()) {
>  				IWL_CRIT(priv, "Failed to allocate SKB buffer with %s. Only %u free buffers remaining.\n",
>  					 priority == GFP_ATOMIC ?  "GFP_ATOMIC" : "GFP_KERNEL",
>  					 rxq->free_count);
> +				dump_stack();
> +			}
>  			/* We don't reschedule replenish work here -- we will
>  			 * call the restock method and if it still needs
>  			 * more buffers it will schedule replenish */
> @@ -320,7 +322,7 @@ EXPORT_SYMBOL(iwl_rx_replenish);
>  
>  void iwl_rx_replenish_now(struct iwl_priv *priv)
>  {
> -	iwl_rx_allocate(priv, GFP_ATOMIC);
> +	iwl_rx_allocate(priv, GFP_ATOMIC|__GFP_NOWARN);
>  
>  	iwl_rx_queue_restock(priv);
>  }
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 13:10                   ` Frans Pop
  2009-10-14 15:40                     ` Mel Gorman
@ 2009-10-14 16:30                     ` reinette chatre
  2009-10-18 23:33                     ` Frans Pop
  2 siblings, 0 replies; 88+ messages in thread
From: reinette chatre @ 2009-10-14 16:30 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org, Kalle Valo
On Wed, 2009-10-14 at 06:10 -0700, Frans Pop wrote:
> On Wednesday 14 October 2009, Mel Gorman wrote:
> > The majority of the wireless reports have been in 
> > this driver and I think we have the problem commit there. The only other
> > is a firmware loading problem in e100 after resume that fails to make an
> > atomic order-5 fail.
> 
> Not exactly true. Bartlomiej's report was about ipw2200, so there are at 
> least 3 different drivers involved, two wireless and one wired. Besides 
> that one report is related to heavy swap, one to resume and one to driver 
> reload.
Another report arrived today. Please see
http://thread.gmane.org/gmane.linux.kernel.wireless.general/40858 - it
is an order-5 allocation failure during driver reload.
Reinette
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 16:28                   ` reinette chatre
@ 2009-10-14 16:50                     ` Mel Gorman
  2009-10-14 20:41                       ` reinette chatre
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-14 16:50 UTC (permalink / raw)
  To: reinette chatre
  Cc: Frans Pop, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
On Wed, Oct 14, 2009 at 09:28:00AM -0700, reinette chatre wrote:
> Hi Mel,
> 
> On Wed, 2009-10-14 at 03:30 -0700, Mel Gorman wrote:
> > From 5fb9f897117bf2701f9fdebe4d008dbe34358ab9 Mon Sep 17 00:00:00 2001
> > From: Mel Gorman <mel@csn.ul.ie>
> > Date: Wed, 14 Oct 2009 11:19:57 +0100
> > Subject: [PATCH] iwlwifi: Suppress warnings related to GFP_ATOMIC allocations that do not matter
> > 
> > iwlwifi refills RX buffers in two ways - a direct method using GFP_ATOMIC
> > and a tasklet method using GFP_KERNEL. There are a number of RX buffers and
> > there are only serious issues when there are no RX buffers left. The driver
> > explicitly warns when refills are failing and the buffers are low but it
> > always warns when a GFP_ATOMIC allocation fails even when there is no
> > packet loss as a result.
> 
> 
> No, it does not always warn when a GFP_ATOMIC allocation fails. Please
> check earlier in iwl_rx_allocate() we have:
> 
> if (rxq->free_count > RX_LOW_WATERMARK)
> 	priority |= __GFP_NOWARN;
> 
> So it will suppress warnings as long as we have buffers available.
> 
> We do want to see warnings if memory is below watermark and allocation
> fails - your patch prevents these warnings from appearing.
> 
Yeah, the patch is balls and is not the way forward.
What is your take on GFP_ATOMIC-direct deleting the pool before the tasklet
can refill it with GFP_KERNEL? Should direct allocation be falling back to
calling with GFP_KERNEL when the pool has been depleted instead of failing?
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 15:40                     ` Mel Gorman
  2009-10-14 16:13                       ` Frans Pop
@ 2009-10-14 18:34                       ` Frans Pop
  2009-10-14 23:56                         ` Mel Gorman
  1 sibling, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-14 18:34 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
Some initial results; all negative I'm afraid.
On Wednesday 14 October 2009, Mel Gorman wrote:
> This is what I found. The following were the possible commits that might
> be causing the problem.
> 56e49d2..f166777 -- reclaim
>         I would have considered this strong candidates except again, the
>         last good commit happened after this point. If other obvious
>         candidates don't crop up, it might be worth double checking
>         within this range, particularly commit 56e49d2 vmscan: evict
>         use-once pages first as it is targeted at streaming-IO workloads
>         which would include your music workload.
Reverted 56e49d2 on top of .31.1; no change.
> 5c87ead..e9bb35d -- inactive ratio changes
>         These patches should be harmless but just in case, please
>         compare the output of
>         # grep inactive_ratio /proc/zoneinfo
>         on 2.6.30 and 2.6.31 and make sure the ratios are the same.
The same for both (and for .32). DMA: 1; DMA32: 3
>         Commit b70d94e altered how zonelists were selected during
>         allocation. This was tested fairly heavily but if the testing
>         missed something, it would mean that some allocations are not
>         using the zones they should be.
Reverted on top of .31.1; no change.
>         Commit bc75d33 is totally harmless but it mentions
>         min_free_kbytes. I checked on my machine to make sure
>         min_free_kbytes was the same on both 2.6.30 and 2.6.31. Can you
>         check that this is true for your machine? If min_free_kbytes
>         decreased, it could explain GFP_ATOMIC failures.
Virtually identical. .30: 5704; .31/.32: 5711
> After this point, your analysis indicates that things are already broken
> but lets look at some of the candidates anyway.  Out of curiousity,
> was CONFIG_UNEVICTABLE_LRU unset in your .config for 2.6.30? I could
> only find your 2.6.31 .config. If it was, it might be worth reverting
> 6837765963f1723e80ca97b1fae660f3a60d77df and unsetting it in 2.6.31 and
> seeing what happens.
CONFIG_UNEVICTABLE_LRU was set and during bisections I've always accepted 
the default, which was "y".
> Commit ee993b135ec75a93bd5c45e636bb210d2975159b altered how lumpy
> reclaim works but it should have been harmless. It does not cleanly
> revert but it's easy to manually revert.
Reverted on top of .31.1; no change.
I'll do some more digging in the 'akpm' merge.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 16:50                     ` Mel Gorman
@ 2009-10-14 20:41                       ` reinette chatre
  2009-10-14 21:33                         ` Frans Pop
  2009-10-15  2:02                         ` Frans Pop
  0 siblings, 2 replies; 88+ messages in thread
From: reinette chatre @ 2009-10-14 20:41 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
On Wed, 2009-10-14 at 09:50 -0700, Mel Gorman wrote:
> What is your take on GFP_ATOMIC-direct deleting the pool before the tasklet
> can refill it with GFP_KERNEL?
I am not sure I understand your question. We attempt to reclaim a
received buffer on every receive, and with a queue size of 256 + 64 we
assume to have a pretty big buffer to deal with cases when allocations
fail. So, technically, for us to get into this situation where we start
seeing these allocation failures there would have been more than 200
times in which GFP_ATOMIC allocations failed that we did _not_ see since
we only see those warnings when there are less than 8 free buffers
remaining. More on this below ...
>  Should direct allocation be falling back to
> calling with GFP_KERNEL when the pool has been depleted instead of failing?
This is the intention of the current implementation. In the tasklet we
run iwl_rx_replenish_now(), which attempts the GFP_ATOMIC allocations
first by calling iwl_rx_allocate() with the GFP_ATOMIC flag. No
particular action is taken when this fails (apart from the error
message), but if the buffers are running low then iwl_rx_queue_restock()
(which is also called from iwl_rx_replenish_now()) will queue work that
will do the allocation with GFP_KERNEL.
We do queue the GFP_KERNEL allocations when there are only a few buffers
remaining in the queue (8 right now) ... maybe we can make this higher?
I am not sure if this will help in what you are trying to figure out
here, but would it help to play with the numbers here? That is, in
iwl_rx_queue_restock() we have:
if (rxq->free_count <= RX_LOW_WATERMARK)
	queue_work(priv->workqueue, &priv->rx_replenish);
Would it help here to make that value higher? Maybe queue the GFP_KERNEL
allocation when there are, for example, 50 or 100 free buffers
remaining? 
Reinette
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 20:41                       ` reinette chatre
@ 2009-10-14 21:33                         ` Frans Pop
  2009-10-14 21:55                           ` reinette chatre
  2009-10-15  2:02                         ` Frans Pop
  1 sibling, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-14 21:33 UTC (permalink / raw)
  To: reinette chatre
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
On Wednesday 14 October 2009, reinette chatre wrote:
> We do queue the GFP_KERNEL allocations when there are only a few buffers
> remaining in the queue (8 right now) ...
Are you sure of this? I have zero messages in my logs about allocation 
failures with GFP_KERNEL, but I do have plenty with "Only 0 free buffers 
remaining" with GFP_ATOMIC.
Does that indicate a bug or could they fall under the ratelimit somehow?
Or do I misunderstand the logic?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 21:33                         ` Frans Pop
@ 2009-10-14 21:55                           ` reinette chatre
  0 siblings, 0 replies; 88+ messages in thread
From: reinette chatre @ 2009-10-14 21:55 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
On Wed, 2009-10-14 at 14:33 -0700, Frans Pop wrote:
> On Wednesday 14 October 2009, reinette chatre wrote:
> > We do queue the GFP_KERNEL allocations when there are only a few buffers
> > remaining in the queue (8 right now) ...
> 
> Are you sure of this? I have zero messages in my logs about allocation 
> failures with GFP_KERNEL, but I do have plenty with "Only 0 free buffers 
> remaining" with GFP_ATOMIC.
That does make sense to me. We do not expect allocations with GFP_KERNEL
to fail. Considering how I understand how things work I am considering
the following scenario:
* start with system low on available memory
* now introduce incoming traffic (causing the RX code to run)
* upon receipt of frame we attempt an allocation (to reclaim the buffer)
with GFP_ATOMIC (state: num RX buffer free > watermark)
  * this fails since memory is not available
  * num RX buffer free reduces
  * does _not_ queue replenishment of buffers with GFP_KERNEL
* repeat above until we hit the watermark (currently 8)
* upon receipt of frame we attempt an allocation (to reclaim the buffer)
with GFP_ATOMIC (state: num RX buffer free <= watermark) 
  * this fails (now user sees big warning)
  * queue replenishment of buffers with GFP_KERNEL
Essentially what I suspect could happen is that
we do attempt to replenish the buffers with GFP_KERNEL after several
failures with GFP_ATOMIC, but at that point we have already run out
completely.
One way to test this theory is to queue the GFP_KERNEL allocation
earlier (when we still have a significant number of RX buffers
available), 8 may turn out to be too small.
> Does that indicate a bug or could they fall under the ratelimit somehow?
In your kernel log I do see that the driver's error messages related to
GFP_ATOMIC are rate limited (we see many more "order-2 allocation
failure" messages than the "Failed to allocate" messages). All of these
allocation failures are from the "replenish_now" code though, which is
GFP_ATOMIC. So even though we do not see the "Failed to allocate" errors
(which are rate limited) it seems that all allocation failures are from
that (the GFP_ATOMIC) code.
Reinette
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 18:34                       ` Frans Pop
@ 2009-10-14 23:56                         ` Mel Gorman
  2009-10-15 20:15                           ` Frans Pop
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-14 23:56 UTC (permalink / raw)
  To: Frans Pop
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Wed, Oct 14, 2009 at 08:34:56PM +0200, Frans Pop wrote:
> Some initial results; all negative I'm afraid.
> 
These are highly unlikely candidates. I say highly unlikely because they
are before the page allocator patches when your analysis indicated
things were ok.
Commit 70ac23c readahead: sequential mmap readahead
	This affects readahead for mmap() and could have an impact on the
	number of allocations made by the streaming IO. This might be
	generating more bursty network traffic in 2.6.31 than 2.6.30 and
	affecting the allocation apttern enough to cause problems
Commit 2fad6f5 readahead: enforce full readahead size on async mmap readahead
	Another readahead change that may affect the rate of network
	traffic being generated when streaming IO over the network
Commit 10be0b3 readahead: introduce context readahead algorithm
	By using readahead in more situations, it again may be affecting
	the burst rate of network traffic and the rate of GFP_ATOMIC arrivals
Commit 78dc583 vmscan: low order lumpy reclaim also should use PAGEOUT_IO_SYNC
	Very low probability that this is a problem, but it affects
	lumpy reclaim and so has to be considered. It's an awkward
	revert but I think the most important part is just to revert the
	condition that checks if congestion_wait() should be called or not
I relooked at the page allocator patches themselves just in case. Of the
patches in there, I came up with
Commit 11e33f6 page allocator: break up the allocator entry point into fast and slow paths
	This is possibly the most disruptive patch in the set. It should
	not have affected behaviour but the complexity of the patch is
	quite high. I did spot an oddity whereby a process exiting making
	a __GFP_NOFAIL allocation can ignore watermarks. It's unlikely
	this is the problem but as the journal layer uses __GFP_NOFAIL,
	you never know - it might be pushing things down low enough for
	other watermark checks to fail. Patch is below. This is also the
	patch that cause kswapd to wake up less. I sent a patch for that
	problem but I still don't know if it reduced the number of
	failures for you or not.
Commit f2260e6 page allocator: update NR_FREE_PAGES only as necessary
	This patch affects the timing of when NR_FREE_PAGES is updated.
	The reclaim algorithm makes decisions based on this NR_FREE_PAGES
	value.	Crucially, the value can determine if the anon list is force
	scanned or not. The window during which this can make a difference
	should be extremely small but maybe it's enough to make a difference.
Outside the range of commits suspected of causing problems was the
following. It's extremely low probability
Commit 8aa7e84 Fix congestion_wait() sync/async vs read/write confusion
	This patch alters the call to congestion_wait() in the page
	allocator. Frankly, I don't get the change but it might worth
	checking if replacing BLK_RW_ASYNC with WRITE on top of 2.6.31
	makes any difference
After a lot more eyeballing, the best next candidate within mm is the
following patch. Should be tested on it's own and in combination with
the wakeup-kswapd patch sent before.
====
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 20:41                       ` reinette chatre
  2009-10-14 21:33                         ` Frans Pop
@ 2009-10-15  2:02                         ` Frans Pop
  2009-10-15 15:29                           ` reinette chatre
  1 sibling, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-15  2:02 UTC (permalink / raw)
  To: reinette chatre
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
On Wednesday 14 October 2009, reinette chatre wrote:
> We do queue the GFP_KERNEL allocations when there are only a few buffers
> remaining in the queue (8 right now) ... maybe we can make this higher?
I've tried increasing it to 50. Here's the result for a single test:
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 25 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 48 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 48 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 48 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
__ratelimit: 1 callbacks suppressed
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
__ratelimit: 97 callbacks suppressed
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 44 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
This is with current mainline (v2.6.32-rc4-149-ga3ccf63).
The log file timestamps don't tell much as the logging gets delayed,
so they all end up at the same time. Maybe I should enable the kernel
timestamps so we can see how far apart these failures are.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-15  2:02                         ` Frans Pop
@ 2009-10-15 15:29                           ` reinette chatre
  2009-10-15 19:41                             ` Frans Pop
  0 siblings, 1 reply; 88+ messages in thread
From: reinette chatre @ 2009-10-15 15:29 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
On Wed, 2009-10-14 at 19:02 -0700, Frans Pop wrote:
> On Wednesday 14 October 2009, reinette chatre wrote:
> > We do queue the GFP_KERNEL allocations when there are only a few buffers
> > remaining in the queue (8 right now) ... maybe we can make this higher?
> 
> I've tried increasing it to 50. Here's the result for a single test:
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 25 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 48 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 48 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 48 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> __ratelimit: 1 callbacks suppressed
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> __ratelimit: 97 callbacks suppressed
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 44 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> 
> This is with current mainline (v2.6.32-rc4-149-ga3ccf63).
> The log file timestamps don't tell much as the logging gets delayed,
> so they all end up at the same time. Maybe I should enable the kernel
> timestamps so we can see how far apart these failures are.
If you can get accurate timing it will be very useful. I am interested
to see how quickly it goes from "48 free buffers" to "0 free buffers".
Thank you
Reinette
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-15 15:29                           ` reinette chatre
@ 2009-10-15 19:41                             ` Frans Pop
  2009-10-16 17:21                               ` reinette chatre
  2009-10-17  5:42                               ` reinette chatre
  0 siblings, 2 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-15 19:41 UTC (permalink / raw)
  To: reinette chatre
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
[-- Attachment #1: Type: text/plain, Size: 1150 bytes --]
On Thursday 15 October 2009, reinette chatre wrote:
> > The log file timestamps don't tell much as the logging gets delayed,
> > so they all end up at the same time. Maybe I should enable the kernel
> > timestamps so we can see how far apart these failures are.
>
> If you can get accurate timing it will be very useful. I am interested
> to see how quickly it goes from "48 free buffers" to "0 free buffers".
Attached the dmesg for three consecutive test runs (i.e. without 
rebooting). Not that the 2nd one includes only "0 free buffers" messages, 
even though the behavior (point where desktop freezes and music stops) 
looked similar.
Not sure if you can tell all that much from the data.
N.B. You may want to clean this up in iwlwifi code:
iwl-dev.h:#include "iwl-fh.h"
iwl-dev.h:#define RX_LOW_WATERMARK 8
iwl-fh.h:#define RX_LOW_WATERMARK 8
I.e: RX_LOW_WATERMARK is defined in iwl-dev.h even though that includes 
iwl-fh.h where it's also defined. The same may be true for other defines.
I think this gave me an incorrect result the first time I increased the 
limit as I only changed one of the two files (iwl-dev.h IIRC).
Cheers,
FJP
[-- Attachment #2: dmesg.tgz --]
[-- Type: application/x-tgz, Size: 44980 bytes --]
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 23:56                         ` Mel Gorman
@ 2009-10-15 20:15                           ` Frans Pop
  2009-10-16  9:39                             ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-15 20:15 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Thursday 15 October 2009, Mel Gorman wrote:
> After a lot more eyeballing, the best next candidate within mm is the
> following patch. Should be tested on it's own and in combination with
> the wakeup-kswapd patch sent before.
>
> From 4e8b5217f51a00caee527e4e8d8e46fe9f82b482 Mon Sep 17 00:00:00 2001
> From: Mel Gorman <mel@csn.ul.ie>
> Date: Thu, 15 Oct 2009 00:17:05 +0100
> Subject: [PATCH] page allocator: Direct reclaim should always obey
> watermarks
>
> ALLOC_NO_WATERMARKS should be cleared when trying to allocate from the
> free-lists after a direct reclaim.
I've tested the two patches together and this seems like a definite
improvement. I still get SKB errors on the first test, but the desktop
freezes are a lot shorter and the total time needed to load the 3rd gitk
goes down from ~2:15 to ~1:15. The counter in gitk of the number of
loaded commits goes up visibly faster and with fewer halts.
This is on top of current mainline with the RX_LOW_WATERMARK in iwlagn
at it's current value (8).
Here are the allocation failures for 2 consecutive tests. Note that the
first test still shows quite a lot of failures, but the second test hardly
had any at all (I still had music skips though).
[  232.845116] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  232.845116] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  232.873009] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  232.884545] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  240.121577] __ratelimit: 26 callbacks suppressed
[  240.121634] __ratelimit: 6 callbacks suppressed
[  240.124006] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 6 free buffers remaining.
[  304.335767] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  304.335767] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  304.374729] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  309.446481] __ratelimit: 5 callbacks suppressed
[  309.450197] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
[  525.912934] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 5 free buffers remaining.
[  525.953939] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 7 free buffers remaining.
[  536.058171] __ratelimit: 1 callbacks suppressed
Thanks,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-15 20:15                           ` Frans Pop
@ 2009-10-16  9:39                             ` Mel Gorman
  0 siblings, 0 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-16  9:39 UTC (permalink / raw)
  To: Frans Pop
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Thu, Oct 15, 2009 at 10:15:09PM +0200, Frans Pop wrote:
> On Thursday 15 October 2009, Mel Gorman wrote:
> > After a lot more eyeballing, the best next candidate within mm is the
> > following patch. Should be tested on it's own and in combination with
> > the wakeup-kswapd patch sent before.
> >
> > From 4e8b5217f51a00caee527e4e8d8e46fe9f82b482 Mon Sep 17 00:00:00 2001
> > From: Mel Gorman <mel@csn.ul.ie>
> > Date: Thu, 15 Oct 2009 00:17:05 +0100
> > Subject: [PATCH] page allocator: Direct reclaim should always obey
> > watermarks
> >
> > ALLOC_NO_WATERMARKS should be cleared when trying to allocate from the
> > free-lists after a direct reclaim.
> 
> I've tested the two patches together and this seems like a definite
> improvement.
You probably don't need the mental image, but this made me do a happy
dance. Certainly helps my cold!
> I still get SKB errors on the first test, but the desktop
> freezes are a lot shorter and the total time needed to load the 3rd gitk
> goes down from ~2:15 to ~1:15. The counter in gitk of the number of
> loaded commits goes up visibly faster and with fewer halts.
> 
This brings us close to the state of affairs before the akpm merge.
There might still be something missing in either the mm area or the wireless
driver but any improvement is better than none.
> This is on top of current mainline with the RX_LOW_WATERMARK in iwlagn
> at it's current value (8).
> 
> Here are the allocation failures for 2 consecutive tests. Note that the
> first test still shows quite a lot of failures, but the second test hardly
> had any at all (I still had music skips though).
> 
So, we are still dealing with three problems.
1. GFP_ATOMICS were introduced to the wireless driver in the
   2.6.30..2.6.31 timeframe. It has been more or less identified as being
   the tasklet off-loading and the pools being depleted too easily. This
   still needs to be fixed. 
2. There is also some firmware reloading problem of an unknown source
3. There was an mm regression that made GFP_ATOMIC failures much worse.
   This is at least partially due to tasks exiting being able to go below the
   watermarks and kswapd not being woken up when it should be.  This could
   be the source of the allocation failures on resume that have nothing to
   do with the iwlagn wireless driver.
I am going to put together the pair of patches against mainline with a
recommendation they be also applied for 2.6.31.5. I'll keep looking to
see can I spot another possible candidate for GFP_ATOMIC being worse
than it was.
> [  232.845116] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> [  232.845116] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> [  232.873009] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> [  232.884545] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> [  240.121577] __ratelimit: 26 callbacks suppressed
> [  240.121634] __ratelimit: 6 callbacks suppressed
> [  240.124006] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 6 free buffers remaining.
> [  304.335767] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> [  304.335767] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> [  304.374729] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> [  309.446481] __ratelimit: 5 callbacks suppressed
> [  309.450197] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 0 free buffers remaining.
> 
> [  525.912934] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 5 free buffers remaining.
> [  525.953939] iwlagn 0000:10:00.0: Failed to allocate SKB buffer with GFP_ATOMIC. Only 7 free buffers remaining.
> [  536.058171] __ratelimit: 1 callbacks suppressed
> 
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-15 19:41                             ` Frans Pop
@ 2009-10-16 17:21                               ` reinette chatre
  2009-10-17  5:42                               ` reinette chatre
  1 sibling, 0 replies; 88+ messages in thread
From: reinette chatre @ 2009-10-16 17:21 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
On Thu, 2009-10-15 at 12:41 -0700, Frans Pop wrote:
> On Thursday 15 October 2009, reinette chatre wrote:
> > > The log file timestamps don't tell much as the logging gets delayed,
> > > so they all end up at the same time. Maybe I should enable the kernel
> > > timestamps so we can see how far apart these failures are.
> >
> > If you can get accurate timing it will be very useful. I am interested
> > to see how quickly it goes from "48 free buffers" to "0 free buffers".
> 
> Attached the dmesg for three consecutive test runs (i.e. without 
> rebooting). Not that the 2nd one includes only "0 free buffers" messages, 
> even though the behavior (point where desktop freezes and music stops) 
> looked similar.
Thank you very much. I am studying it.
> Not sure if you can tell all that much from the data.
> 
> N.B. You may want to clean this up in iwlwifi code:
> iwl-dev.h:#include "iwl-fh.h"
> iwl-dev.h:#define RX_LOW_WATERMARK 8
> iwl-fh.h:#define RX_LOW_WATERMARK 8
> 
> I.e: RX_LOW_WATERMARK is defined in iwl-dev.h even though that includes 
> iwl-fh.h where it's also defined. The same may be true for other defines.
Sorry about that. The patch below will fix that. I will send it
separately to wireless list.
>From 7cc8e6482b359eef5ce099457037a237d355b5b1 Mon Sep 17 00:00:00 2001
From: Reinette Chatre <reinette.chatre@intel.com>
Date: Fri, 16 Oct 2009 10:11:10 -0700
Subject: [PATCH] iwlwifi: remove duplicate defines
RX_FREE_BUFFERS and RX_LOW_WATERMARK are currently defined in four places.
Based on how files are included we only need the definition in iwl-fh.h
Signed-off-by: Reinette Chatre <reinette.chatre@intel.com>
Reported-by: Frans Pop <elendil@planet.nl>
---
 drivers/net/wireless/iwlwifi/iwl-3945-hw.h |    6 ------
 drivers/net/wireless/iwlwifi/iwl-3945.h    |    6 ------
 drivers/net/wireless/iwlwifi/iwl-dev.h     |    6 ------
 3 files changed, 0 insertions(+), 18 deletions(-)
diff --git a/drivers/net/wireless/iwlwifi/iwl-3945-hw.h b/drivers/net/wireless/iwlwifi/iwl-3945-hw.h
index ccdac69..6fd10d4 100644
--- a/drivers/net/wireless/iwlwifi/iwl-3945-hw.h
+++ b/drivers/net/wireless/iwlwifi/iwl-3945-hw.h
@@ -248,12 +248,6 @@ struct iwl3945_eeprom {
 #define TFD_CTL_PAD_SET(n)         (n << 28)
 #define TFD_CTL_PAD_GET(ctl)       (ctl >> 28)
 
-/*
- * RX related structures and functions
- */
-#define RX_FREE_BUFFERS 64
-#define RX_LOW_WATERMARK 8
-
 /* Sizes and addresses for instruction and data memory (SRAM) in
  * 3945's embedded processor.  Driver access is via HBUS_TARG_MEM_* regs. */
 #define IWL39_RTC_INST_LOWER_BOUND		(0x000000)
diff --git a/drivers/net/wireless/iwlwifi/iwl-3945.h b/drivers/net/wireless/iwlwifi/iwl-3945.h
index f3907c1..84fa0d7 100644
--- a/drivers/net/wireless/iwlwifi/iwl-3945.h
+++ b/drivers/net/wireless/iwlwifi/iwl-3945.h
@@ -130,12 +130,6 @@ struct iwl3945_frame {
 #define SN_TO_SEQ(ssn) (((ssn) << 4) & IEEE80211_SCTL_SEQ)
 #define MAX_SN ((IEEE80211_SCTL_SEQ) >> 4)
 
-/*
- * RX related structures and functions
- */
-#define RX_FREE_BUFFERS 64
-#define RX_LOW_WATERMARK 8
-
 #define SUP_RATE_11A_MAX_NUM_CHANNELS  8
 #define SUP_RATE_11B_MAX_NUM_CHANNELS  4
 #define SUP_RATE_11G_MAX_NUM_CHANNELS  12
diff --git a/drivers/net/wireless/iwlwifi/iwl-dev.h b/drivers/net/wireless/iwlwifi/iwl-dev.h
index 1378654..0fa0cf5 100644
--- a/drivers/net/wireless/iwlwifi/iwl-dev.h
+++ b/drivers/net/wireless/iwlwifi/iwl-dev.h
@@ -406,12 +406,6 @@ struct iwl_host_cmd {
 	u8 id;
 };
 
-/*
- * RX related structures and functions
- */
-#define RX_FREE_BUFFERS 64
-#define RX_LOW_WATERMARK 8
-
 #define SUP_RATE_11A_MAX_NUM_CHANNELS  8
 #define SUP_RATE_11B_MAX_NUM_CHANNELS  4
 #define SUP_RATE_11G_MAX_NUM_CHANNELS  12
-- 
1.5.6.3
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-15 19:41                             ` Frans Pop
  2009-10-16 17:21                               ` reinette chatre
@ 2009-10-17  5:42                               ` reinette chatre
  2009-10-27 11:10                                 ` Frans Pop
  1 sibling, 1 reply; 88+ messages in thread
From: reinette chatre @ 2009-10-17  5:42 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
[-- Attachment #1: Type: text/plain, Size: 1102 bytes --]
Hi Frans,
On Thu, 2009-10-15 at 12:41 -0700, Frans Pop wrote:
> On Thursday 15 October 2009, reinette chatre wrote:
> > > The log file timestamps don't tell much as the logging gets delayed,
> > > so they all end up at the same time. Maybe I should enable the kernel
> > > timestamps so we can see how far apart these failures are.
> >
> > If you can get accurate timing it will be very useful. I am interested
> > to see how quickly it goes from "48 free buffers" to "0 free buffers".
> 
> Attached the dmesg for three consecutive test runs (i.e. without 
> rebooting). Not that the 2nd one includes only "0 free buffers" messages, 
> even though the behavior (point where desktop freezes and music stops) 
> looked similar.
> 
> Not sure if you can tell all that much from the data.
> 
Prompted by this thread we are in process of moving allocation to paged
skb. This will definitely reduce the allocation size (from order 2 to
order 1) and hopefully help with this problem also. Could you please try
with the attached two patches? They are based on 2.6.32-rc4.
Thank you very much
Reinette
[-- Attachment #2: 0001-iwlwifi-use-paged-Rx.patch --]
[-- Type: text/x-patch, Size: 0 bytes --]
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-14 13:10                   ` Frans Pop
  2009-10-14 15:40                     ` Mel Gorman
  2009-10-14 16:30                     ` reinette chatre
@ 2009-10-18 23:33                     ` Frans Pop
  2009-10-19  0:36                       ` Pekka Enberg
  2009-10-19 14:01                       ` Mel Gorman
  2 siblings, 2 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-18 23:33 UTC (permalink / raw)
  To: Mel Gorman
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
Another long mail, sorry.
On Wednesday 14 October 2009, Frans Pop wrote:
> > There still has not been a mm-change identified that makes
> > fragmentation significantly worse.
>
> My bisection shows a very clear point, even if not an individual commit,
> in the 'akpm' merge where SKB errors suddenly become *much* more
> frequent and easy to trigger.
> I'm sorry to say this, but the fact that nothing has been identified yet
> is IMO the result of a lack of effort, not because there is no such
> change.
I was wrong. It turns out that I was creating the variations in the test 
results around the akpm merge myself by tiny changes in the way I ran the 
tests. It took another round of about 30 compilations and tests purely in 
this range to show that, but those same tests also made me aware of other 
patterns I should look at.
Until a few days ago I was concentrating on "do I see SKB allocation errors 
or not". Since then I've also been looking more consciously at when they 
happen, at disk access patterns and at desktop freeze patterns.
I think I did mention before that this whole issue is rather subtle :-/
So, my apologies for finguering the wrong area for so long, but it looked 
solid given the info available at the time.
On Thursday 15 October 2009, Mel Gorman wrote:
> Outside the range of commits suspected of causing problems was the
> following. It's extremely low probability
>
> Commit 8aa7e84 Fix congestion_wait() sync/async vs read/write confusion
>         This patch alters the call to congestion_wait() in the page
>         allocator. Frankly, I don't get the change but it might worth
>         checking if replacing BLK_RW_ASYNC with WRITE on top of 2.6.31
>         makes any difference
This is the real culprit. Mel: thanks very much for looking beyond the 
area I identified. Your overview of mm changes was exactly what I needed 
and really helped a lot during my later tests.
This commit definitely causes most of the problems; confirmed by reverting 
it on top of 2.6.31 (also requires reverting 373c0a7e, which is a later 
build fix).
The rest of this mail gives details on my tests and how I reached the above 
conclusion.
TEST BASELINE (2.6.30)
======================
I mentioned in an earlier mail that I run three instances of gitk for my 
tests. Loading gitk seems to consist of 3 phases:
1) general initial scan of the repository (branches?)
2) reading commits: commit counter increases
3) reading references (including bisection good/bad points) and
   uncommitted changes
Below times and comments per stage when the test is run with 2.6.30. As my 
test starts after a clean boot, buffers are mostly empty.
1st instance: 'gitk v2.6.29..master' (preparation)
1) ~20 seconds; user interface is mostly blank
2) ~5 seconds to read 35.000 commits; user interface is updated and counter
   increases steadily as they are read
3) ~10 seconds; "branch"/"follows"/"precedes" info and tags are filled
   in; fairly heavy disk activity
2st instance: 'gitk master' (preparation)
1) 0 seconds (because data is already buffered)
2) ~25 seconds to read 167500 commits; counter increases steadily
3) 1-2 seconds (because data is already buffered)
3st instance: 'gitk master' (the actual test)
1) 0 seconds because data is already buffered
2) ~55 seconds due to swapping overhead; minor music skip around commit
   110.000; counter slower after 90.000, some short halts, but generally
   increases steadily; moderate disk activity
3) ~55-60 seconds; because buffers have been emptied data must by read
   again, with swapping; very heavy disk activity; fairly long music
   skip (15-20 seconds), but no SKB allocation errors
So, the loading of the 3rd instance takes 1.5 minutes longer than the 
second because of the swapping. And phase 3) is most affected by it.
AFTER WIRELESS CHANGE
=====================
After commit 4752c93c30 ("iwlcore: Allow skb allocation from tasklet") I 
start getting the SKB errors. They can be triggered reliably if the whole 
test is repeated 1 or 2 times, but generally not the first time the test 
is run.
Or so I thought for a long time.
It turns out that I will get SKB errors during the first run if I'm
"sloppy" in the test execution. For example if I wait too long before 
switching from the last gitk instance to konsole where I have 
a 'tail -f /var/log/kern.log' running.
Another factor is the state of the repository: do I have master checked 
out, or an older branch, or am I in the middle of a bisection. This 
influences how data is read from the disk and thus the test results.
A last factor may be the size of the kernel I'm using: my test/bisect 
kernel is significantly smaller than my regular kernel.
If the test is run completely cleanly, I will not get SKB errors during the 
first run. Also, this change does not affect the timings of the test at 
all: the total load time of the 3rd instance is still ~1:55 and music 
skips happen in roughly the same places. The pattern of disk activity also 
remains unchanged.
If I do *not* run the test cleanly, any SKB errors during the first test 
run will always be during phase 3), never during phase 2). This is what I 
saw during tests in the 'akpm' range, and explains the inconsistent 
results there.
After discovering this I've made a copy of the git repo so that I always 
test using the exact same state and tightened my test procedure.
AFTER congestion_wait CHANGE
============================
If I test commit 9f2d8be, which is just before the congestion_wait() 
change, I still get the same pattern as described above. But when I test 
with 8aa7e84 ("Fix congestion_wait() sync/async vs read/write confusion"), 
things change dramatically when the 3rd gitk instance is started.
During the 2nd phase I see the first SKB allocation errors with a music 
skip between reading commits 95.000 and 110.000.
About commit 115.000 there is a very long pause during which the counter 
does not increase, music stops and the desktop freezes completely. The 
first 30 seconds of that freeze there is only very low disk activity (which 
seems strange); the next 25 seconds there suddenly is very high disk  
activity during which things gradually unfreeze and more SKB errors are 
displayed. After that the commit counter runs up fairly steadily again.
Phase 2) ends at ~1:45. Phase 3) (with more SKB errors) ends at ~2:05.
So this change almost doubles the time needed for phase 2) and causes SKB 
allocation errors to occur during that phase. Also, before this commit the 
desktop freezes are much shorter and less severe. With this change the 
desktop is completely unusable for almost a minute during phase 2), with 
even the mouse pointer frozen solid.
Note that phase 3) becomes shorter, but that the total time needed to load 
the 3rd instance increases by about 10-15 seconds.
Note: -rc2 and -rc3 had broken NFS, so I had to cherry-pick 3 NFS commits 
from -rc4 on top of the commits I wanted to test.
WITH congestion_wait CHANGE REVERTED
====================================
I've done quite a few tests of 2.6.31 with 373c0a7e and 8aa7e847 reverted 
to confirm that's really the culprit. I've done this for .31-rc3, .31-rc4,
.31-rc5, .31 and .31.1.
In all cases the huge freeze in phase 2) is gone and the general behavior 
and timings are again as it was after the wireless change. During most 
tests I did not get any SKB allocation errors during phase 2) or phase 3).
However with .31-rc5, .31 and .31.1 I have had some tests where I would see 
a few SKB allocation errors during phase 3) (which is somewhat likely), 
but also during phase 2). At this point I'm unsure whether this is just 
noise, or maybe a minor influence from some change merged after .31-rc4.
Looking through the commits there are several mm/page allocation changes.
For now I suggest ignoring this though as the impact (if any) is very minor 
and it is not reproducible reliably enough.
Next I'll retest Mel's patches and also test Reinette's patches.
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-18 23:33                     ` Frans Pop
@ 2009-10-19  0:36                       ` Pekka Enberg
  2009-10-19  2:44                         ` Frans Pop
  2009-10-19  2:52                         ` [Bug #14141] order 2 page allocation failures in iwlagn Jens Axboe
  2009-10-19 14:01                       ` Mel Gorman
  1 sibling, 2 replies; 88+ messages in thread
From: Pekka Enberg @ 2009-10-19  0:36 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
(Adding Jens to CC.)
On Wednesday 14 October 2009, Frans Pop wrote:
> > > There still has not been a mm-change identified that makes
> > > fragmentation significantly worse.
On Mon, 2009-10-19 at 01:33 +0200, Frans Pop wrote:
> > My bisection shows a very clear point, even if not an individual commit,
> > in the 'akpm' merge where SKB errors suddenly become *much* more
> > frequent and easy to trigger.
> > I'm sorry to say this, but the fact that nothing has been identified yet
> > is IMO the result of a lack of effort, not because there is no such
> > change.
> 
> I was wrong. It turns out that I was creating the variations in the test 
> results around the akpm merge myself by tiny changes in the way I ran the 
> tests. It took another round of about 30 compilations and tests purely in 
> this range to show that, but those same tests also made me aware of other 
> patterns I should look at.
> 
> Until a few days ago I was concentrating on "do I see SKB allocation errors 
> or not". Since then I've also been looking more consciously at when they 
> happen, at disk access patterns and at desktop freeze patterns.
> 
> I think I did mention before that this whole issue is rather subtle :-/
> So, my apologies for finguering the wrong area for so long, but it looked 
> solid given the info available at the time.
> 
> On Thursday 15 October 2009, Mel Gorman wrote:
> > Outside the range of commits suspected of causing problems was the
> > following. It's extremely low probability
> >
> > Commit 8aa7e84 Fix congestion_wait() sync/async vs read/write confusion
> >         This patch alters the call to congestion_wait() in the page
> >         allocator. Frankly, I don't get the change but it might worth
> >         checking if replacing BLK_RW_ASYNC with WRITE on top of 2.6.31
> >         makes any difference
> 
> This is the real culprit. Mel: thanks very much for looking beyond the 
> area I identified. Your overview of mm changes was exactly what I needed 
> and really helped a lot during my later tests.
> 
> This commit definitely causes most of the problems; confirmed by reverting 
> it on top of 2.6.31 (also requires reverting 373c0a7e, which is a later 
> build fix).
Mel/Jens, any ideas why commit 8aa7e84 makes us run out of high order
pages? Should we be using BLK_RW_SYNC in mm/page_alloc.c instead of
BLK_RW_ASYNC?
			Pekka
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index bf72055..fa8380a 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -1727,7 +1727,7 @@ __alloc_pages_high_priority(gfp_t gfp_mask, unsigned int order,
 			preferred_zone, migratetype);
 
 		if (!page && gfp_mask & __GFP_NOFAIL)
-			congestion_wait(BLK_RW_ASYNC, HZ/50);
+			congestion_wait(BLK_RW_SYNC, HZ/50);
 	} while (!page && (gfp_mask & __GFP_NOFAIL));
 
 	return page;
@@ -1898,7 +1898,7 @@ rebalance:
 	pages_reclaimed += did_some_progress;
 	if (should_alloc_retry(gfp_mask, order, pages_reclaimed)) {
 		/* Wait for some write requests to complete then retry */
-		congestion_wait(BLK_RW_ASYNC, HZ/50);
+		congestion_wait(BLK_RW_SYNC, HZ/50);
 		goto rebalance;
 	}
 
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19  0:36                       ` Pekka Enberg
@ 2009-10-19  2:44                         ` Frans Pop
  2009-10-19  9:49                           ` [Bug #14141] order 2 page allocation failures (generic) Tobi Oetiker
  2009-10-19  2:52                         ` [Bug #14141] order 2 page allocation failures in iwlagn Jens Axboe
  1 sibling, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-19  2:44 UTC (permalink / raw)
  To: Pekka Enberg
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
On Monday 19 October 2009, Pekka Enberg wrote:
> On Wednesday 14 October 2009, Frans Pop wrote:
> > On Thursday 15 October 2009, Mel Gorman wrote:
> > > Outside the range of commits suspected of causing problems was the
> > > following. It's extremely low probability
> > >
> > > Commit 8aa7e84 Fix congestion_wait() sync/async vs read/write
> > > confusion This patch alters the call to congestion_wait() in the
> > > page allocator. Frankly, I don't get the change but it might worth
> > > checking if replacing BLK_RW_ASYNC with WRITE on top of 2.6.31 makes
> > > any difference
> >
> > This is the real culprit. Mel: thanks very much for looking beyond the
> > area I identified. Your overview of mm changes was exactly what I
> > needed and really helped a lot during my later tests.
> >
> > This commit definitely causes most of the problems; confirmed by
> > reverting it on top of 2.6.31 (also requires reverting 373c0a7e, which
> > is a later build fix).
>
> Mel/Jens, any ideas why commit 8aa7e84 makes us run out of high order
> pages? Should we be using BLK_RW_SYNC in mm/page_alloc.c instead of
> BLK_RW_ASYNC?
I'm starting to think that this commit may not be directly related to high 
order allocation failures. The fact that I'm seeing SKB allocation 
failures earlier because of this commit could be just a side effect.
It could be that instead the main impact of this commit is on encrypted 
file system and/or encrypted swap (kcryptd).
Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm 
only reading from NFS that's unlikely).
Reason for thinking this is that reverting it makes no difference for Karol 
[1]. It will be interesting to see if it does make a difference for Sven 
Geggus [2].
/me wonders if we'll ever get to the bottom of this...
[1] http://lkml.org/lkml/2009/10/18/138
[2] http://lkml.org/lkml/2009/10/17/113
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19  0:36                       ` Pekka Enberg
  2009-10-19  2:44                         ` Frans Pop
@ 2009-10-19  2:52                         ` Jens Axboe
  1 sibling, 0 replies; 88+ messages in thread
From: Jens Axboe @ 2009-10-19  2:52 UTC (permalink / raw)
  To: Pekka Enberg
  Cc: Frans Pop, Mel Gorman, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm
On Mon, Oct 19 2009, Pekka Enberg wrote:
> (Adding Jens to CC.)
> 
> On Wednesday 14 October 2009, Frans Pop wrote:
> > > > There still has not been a mm-change identified that makes
> > > > fragmentation significantly worse.
> 
> On Mon, 2009-10-19 at 01:33 +0200, Frans Pop wrote:
> > > My bisection shows a very clear point, even if not an individual commit,
> > > in the 'akpm' merge where SKB errors suddenly become *much* more
> > > frequent and easy to trigger.
> > > I'm sorry to say this, but the fact that nothing has been identified yet
> > > is IMO the result of a lack of effort, not because there is no such
> > > change.
> > 
> > I was wrong. It turns out that I was creating the variations in the test 
> > results around the akpm merge myself by tiny changes in the way I ran the 
> > tests. It took another round of about 30 compilations and tests purely in 
> > this range to show that, but those same tests also made me aware of other 
> > patterns I should look at.
> > 
> > Until a few days ago I was concentrating on "do I see SKB allocation errors 
> > or not". Since then I've also been looking more consciously at when they 
> > happen, at disk access patterns and at desktop freeze patterns.
> > 
> > I think I did mention before that this whole issue is rather subtle :-/
> > So, my apologies for finguering the wrong area for so long, but it looked 
> > solid given the info available at the time.
> > 
> > On Thursday 15 October 2009, Mel Gorman wrote:
> > > Outside the range of commits suspected of causing problems was the
> > > following. It's extremely low probability
> > >
> > > Commit 8aa7e84 Fix congestion_wait() sync/async vs read/write confusion
> > >         This patch alters the call to congestion_wait() in the page
> > >         allocator. Frankly, I don't get the change but it might worth
> > >         checking if replacing BLK_RW_ASYNC with WRITE on top of 2.6.31
> > >         makes any difference
> > 
> > This is the real culprit. Mel: thanks very much for looking beyond the 
> > area I identified. Your overview of mm changes was exactly what I needed 
> > and really helped a lot during my later tests.
> > 
> > This commit definitely causes most of the problems; confirmed by reverting 
> > it on top of 2.6.31 (also requires reverting 373c0a7e, which is a later 
> > build fix).
> 
> Mel/Jens, any ideas why commit 8aa7e84 makes us run out of high order
> pages? Should we be using BLK_RW_SYNC in mm/page_alloc.c instead of
> BLK_RW_ASYNC?
No, I think that is definitely broken since the page freeing should be
using async writes. If the commit in question is making the difference
and the below does indeed fix it, I think that's primarliy due to timing
issues and the general brokeness of the congestion bits. With the below
change, you essentially guarenteed to be waiting 20ms every time and
it's quite likely that that is enough to change the picture.
So I'd like elsewhere for the real problem, it's not likely to be caused
by the sync vs async bits themselves.
-- 
Jens Axboe
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19  2:44                         ` Frans Pop
@ 2009-10-19  9:49                           ` Tobi Oetiker
  2009-10-19  9:54                             ` Pekka Enberg
  2009-10-19 13:31                             ` Mel Gorman
  0 siblings, 2 replies; 88+ messages in thread
From: Tobi Oetiker @ 2009-10-19  9:49 UTC (permalink / raw)
  To: Frans Pop
  Cc: Pekka Enberg, Mel Gorman, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe
Today Frans Pop wrote:
>
> I'm starting to think that this commit may not be directly related to high
> order allocation failures. The fact that I'm seeing SKB allocation
> failures earlier because of this commit could be just a side effect.
> It could be that instead the main impact of this commit is on encrypted
> file system and/or encrypted swap (kcryptd).
>
> Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm
> only reading from NFS that's unlikely).
I have updated a fileserver to 2.6.31 today and I see page
allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
So I guess the problem must be quite generic:
Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
Oct 19 08:59:16 johan kernel: [30120.685647] __ratelimit: 13 callbacks suppressed [kern.warning]
Oct 19 08:59:16 johan kernel: [30120.685654] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
Oct 19 08:59:16 johan kernel: [30120.685660] Pid: 6071, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
Oct 19 08:59:16 johan kernel: [30120.685663] Call Trace: [kern.warning]
Oct 19 08:59:16 johan kernel: [30120.685666]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
Oct 19 09:36:31 johan kernel: [32355.708345] __ratelimit: 16 callbacks suppressed [kern.warning]
Oct 19 09:36:31 johan kernel: [32355.708352] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
Oct 19 09:36:31 johan kernel: [32355.708358] Pid: 6087, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
Oct 19 09:36:31 johan kernel: [32355.708361] Call Trace: [kern.warning]
Oct 19 09:36:31 johan kernel: [32355.708364]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
Oct 19 10:52:01 johan kernel: [36885.358312] __ratelimit: 31 callbacks suppressed [kern.warning]
Oct 19 10:52:01 johan kernel: [36885.358319] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
Oct 19 10:52:01 johan kernel: [36885.358325] Pid: 6057, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
Oct 19 10:52:01 johan kernel: [36885.358327] Call Trace: [kern.warning]
Oct 19 10:52:01 johan kernel: [36885.358331]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
Oct 19 11:12:01 johan kernel: [38085.163831] events/3: page allocation failure. order:5, mode:0x4020 [kern.warning]
Oct 19 11:12:01 johan kernel: [38085.163840] Pid: 18, comm: events/3 Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
Oct 19 11:12:01 johan kernel: [38085.163843] Call Trace: [kern.warning]
Oct 19 11:12:01 johan kernel: [38085.163846]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19  9:49                           ` [Bug #14141] order 2 page allocation failures (generic) Tobi Oetiker
@ 2009-10-19  9:54                             ` Pekka Enberg
  2009-10-19 14:01                               ` Karol Lewandowski
  2009-10-19 13:31                             ` Mel Gorman
  1 sibling, 1 reply; 88+ messages in thread
From: Pekka Enberg @ 2009-10-19  9:54 UTC (permalink / raw)
  To: Tobi Oetiker
  Cc: Frans Pop, Mel Gorman, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe
On Mon, 2009-10-19 at 11:49 +0200, Tobi Oetiker wrote:
> Today Frans Pop wrote:
> 
> >
> > I'm starting to think that this commit may not be directly related to high
> > order allocation failures. The fact that I'm seeing SKB allocation
> > failures earlier because of this commit could be just a side effect.
> > It could be that instead the main impact of this commit is on encrypted
> > file system and/or encrypted swap (kcryptd).
> >
> > Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm
> > only reading from NFS that's unlikely).
> 
> I have updated a fileserver to 2.6.31 today and I see page
> allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> So I guess the problem must be quite generic:
Yup, it almost certainly is. Does this patch help?
http://lkml.org/lkml/2009/10/16/89
Frans, did you ever get around retesting with just the above patch
applied?
			Pekka
> Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> 
> Oct 19 08:59:16 johan kernel: [30120.685647] __ratelimit: 13 callbacks suppressed [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685654] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685660] Pid: 6071, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685663] Call Trace: [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685666]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> Oct 19 09:36:31 johan kernel: [32355.708345] __ratelimit: 16 callbacks suppressed [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708352] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708358] Pid: 6087, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708361] Call Trace: [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708364]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> Oct 19 10:52:01 johan kernel: [36885.358312] __ratelimit: 31 callbacks suppressed [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358319] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358325] Pid: 6057, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358327] Call Trace: [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358331]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> Oct 19 11:12:01 johan kernel: [38085.163831] events/3: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 11:12:01 johan kernel: [38085.163840] Pid: 18, comm: events/3 Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 11:12:01 johan kernel: [38085.163843] Call Trace: [kern.warning]
> Oct 19 11:12:01 johan kernel: [38085.163846]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> 
> 
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19  9:49                           ` [Bug #14141] order 2 page allocation failures (generic) Tobi Oetiker
  2009-10-19  9:54                             ` Pekka Enberg
@ 2009-10-19 13:31                             ` Mel Gorman
  2009-10-19 13:40                               ` Tobias Oetiker
  1 sibling, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-19 13:31 UTC (permalink / raw)
  To: Tobi Oetiker
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe
On Mon, Oct 19, 2009 at 11:49:08AM +0200, Tobi Oetiker wrote:
> Today Frans Pop wrote:
> 
> >
> > I'm starting to think that this commit may not be directly related to high
> > order allocation failures. The fact that I'm seeing SKB allocation
> > failures earlier because of this commit could be just a side effect.
> > It could be that instead the main impact of this commit is on encrypted
> > file system and/or encrypted swap (kcryptd).
> >
> > Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm
> > only reading from NFS that's unlikely).
> 
> I have updated a fileserver to 2.6.31 today and I see page
> allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> So I guess the problem must be quite generic:
> 
> 
> Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
What's the rest of the stack trace? I'm wondering where a large number
of order-5 GFP_ATOMIC allocations are coming from. It seems different to
the e100 problem where there is one GFP_ATOMIC allocation while the
firmware is being loaded.
Thanks
> 
> Oct 19 08:59:16 johan kernel: [30120.685647] __ratelimit: 13 callbacks suppressed [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685654] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685660] Pid: 6071, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685663] Call Trace: [kern.warning]
> Oct 19 08:59:16 johan kernel: [30120.685666]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> Oct 19 09:36:31 johan kernel: [32355.708345] __ratelimit: 16 callbacks suppressed [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708352] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708358] Pid: 6087, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708361] Call Trace: [kern.warning]
> Oct 19 09:36:31 johan kernel: [32355.708364]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> Oct 19 10:52:01 johan kernel: [36885.358312] __ratelimit: 31 callbacks suppressed [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358319] nfsd: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358325] Pid: 6057, comm: nfsd Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358327] Call Trace: [kern.warning]
> Oct 19 10:52:01 johan kernel: [36885.358331]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> Oct 19 11:12:01 johan kernel: [38085.163831] events/3: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 11:12:01 johan kernel: [38085.163840] Pid: 18, comm: events/3 Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 11:12:01 johan kernel: [38085.163843] Call Trace: [kern.warning]
> Oct 19 11:12:01 johan kernel: [38085.163846]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> 
> 
> 
> -- 
> Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
> http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
> 
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 13:31                             ` Mel Gorman
@ 2009-10-19 13:40                               ` Tobias Oetiker
  2009-10-19 14:09                                 ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-19 13:40 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Mel Gorman wrote:
> On Mon, Oct 19, 2009 at 11:49:08AM +0200, Tobi Oetiker wrote:
> > Today Frans Pop wrote:
> >
> > >
> > > I'm starting to think that this commit may not be directly related to high
> > > order allocation failures. The fact that I'm seeing SKB allocation
> > > failures earlier because of this commit could be just a side effect.
> > > It could be that instead the main impact of this commit is on encrypted
> > > file system and/or encrypted swap (kcryptd).
> > >
> > > Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm
> > > only reading from NFS that's unlikely).
> >
> > I have updated a fileserver to 2.6.31 today and I see page
> > allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> > So I guess the problem must be quite generic:
> >
> >
> > Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> >
>
> What's the rest of the stack trace? I'm wondering where a large number
> of order-5 GFP_ATOMIC allocations are coming from. It seems different to
> the e100 problem where there is one GFP_ATOMIC allocation while the
> firmware is being loaded.
Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684157]  [<ffffffff810da7e5>] __alloc_pages_nodemask+0x135/0x140 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684164]  [<ffffffff815065b4>] ? _spin_unlock_bh+0x14/0x20 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684170]  [<ffffffff8110b368>] kmalloc_large_node+0x68/0xc0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684175]  [<ffffffff8110f15a>] __kmalloc_node_track_caller+0x11a/0x180 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684181]  [<ffffffff8140ffd2>] ? skb_copy+0x32/0xa0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684185]  [<ffffffff8140d8b6>] __alloc_skb+0x76/0x180 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684205]  [<ffffffff8140ffd2>] skb_copy+0x32/0xa0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684221]  [<ffffffffa050f33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684227]  [<ffffffff81416a6d>] dev_queue_xmit_nit+0x10d/0x170 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684231]  [<ffffffff81416f79>] dev_hard_start_xmit+0x189/0x1c0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684236]  [<ffffffff8142f071>] __qdisc_run+0x1a1/0x230 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684240]  [<ffffffff81418a88>] dev_queue_xmit+0x238/0x310 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684246]  [<ffffffff8144864b>] ip_finish_output+0x11b/0x2f0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684250]  [<ffffffff814488a9>] ip_output+0x89/0xd0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684254]  [<ffffffff814478c0>] ip_local_out+0x20/0x30 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684258]  [<ffffffff814481ab>] ip_queue_xmit+0x22b/0x3f0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684264]  [<ffffffff8145d5e5>] tcp_transmit_skb+0x345/0x4e0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684269]  [<ffffffff8145eaf6>] tcp_write_xmit+0xb6/0x2e0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684273]  [<ffffffff8145ed8b>] __tcp_push_pending_frames+0x2b/0xa0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684277]  [<ffffffff8145b249>] tcp_rcv_established+0x459/0x6d0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684282]  [<ffffffff814630bd>] tcp_v4_do_rcv+0x12d/0x140 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684285]  [<ffffffff8146365e>] tcp_v4_rcv+0x58e/0x7c0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684289]  [<ffffffff8144276d>] ip_local_deliver_finish+0x11d/0x2b0 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684293]  [<ffffffff8144293b>] ip_local_deliver+0x3b/0x90 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684297]  [<ffffffff81442ad6>] ip_rcv_finish+0x146/0x420 [kern.warning]
Oct 19 07:10:02 johan kernel: [23565.684301]  [<ffffffff8144304b>] ip_rcv+0x29b/0x370 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684304]  [<ffffffff81418f9a>] netif_receive_skb+0x38a/0x4d0 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684308]  [<ffffffff81419268>] napi_skb_finish+0x48/0x60 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684311]  [<ffffffff81419724>] napi_gro_receive+0x34/0x40 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684330]  [<ffffffffa006b623>] tg3_rx+0x373/0x4b0 [tg3] [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684339]  [<ffffffffa006cbf0>] tg3_poll_work+0x70/0xf0 [tg3] [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684347]  [<ffffffffa006ccae>] tg3_poll+0x3e/0xe0 [tg3] [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684350]  [<ffffffff814198d2>] net_rx_action+0x102/0x210 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684357]  [<ffffffff81061d24>] __do_softirq+0xc4/0x1f0 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684362]  [<ffffffff8101314c>] call_softirq+0x1c/0x30 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684365]  [<ffffffff81014945>] do_softirq+0x55/0x90 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684369]  [<ffffffff8106116b>] irq_exit+0x7b/0x90 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684372]  [<ffffffff81013e93>] do_IRQ+0x73/0xe0 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684378]  [<ffffffff81012993>] ret_from_intr+0x0/0x11 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684381]  <EOI>  [<ffffffff810318b6>] ? native_safe_halt+0x6/0x10 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684391]  [<ffffffff81019cd8>] ? default_idle+0x48/0xe0 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684396]  [<ffffffff8150929d>] ? __atomic_notifier_call_chain+0xd/0x10 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684400]  [<ffffffff815092b1>] ? atomic_notifier_call_chain+0x11/0x20 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684404]  [<ffffffff810107c8>] ? cpu_idle+0x98/0xe0 [kern.warning]
Oct 19 07:10:04 johan kernel: [23565.684410]  [<ffffffff81500d95>] ? start_secondary+0x95/0xc0 [kern.warning]
if you need more, I can send you a whole bunch of them ...
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19  9:54                             ` Pekka Enberg
@ 2009-10-19 14:01                               ` Karol Lewandowski
  2009-10-19 14:06                                 ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Karol Lewandowski @ 2009-10-19 14:01 UTC (permalink / raw)
  To: Pekka Enberg
  Cc: Tobi Oetiker, Frans Pop, Mel Gorman, David Rientjes,
	KOSAKI Motohiro, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, Mohamed Abbas, John W. Linville, linux-mm,
	jens.axboe
On Mon, Oct 19, 2009 at 12:54:11PM +0300, Pekka Enberg wrote:
> On Mon, 2009-10-19 at 11:49 +0200, Tobi Oetiker wrote:
> > I have updated a fileserver to 2.6.31 today and I see page
> > allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> > So I guess the problem must be quite generic:
> 
> Yup, it almost certainly is. Does this patch help?
> 
> http://lkml.org/lkml/2009/10/16/89
This patch seems to help in some cases.  Before applying this patch I
was able to trigger alloc failures on different machine by booting
kernel with "mem=256MB" and doing:
  $ gitk on-full-tree &
  # rmmod e100
  ... wait for few MBs in swap
  # modprobe e100; ifup --force ethX
So here this patch helped -- with it, I was unable to trigger page
allocation failures (testing was short, tough).  However, as I said
here[1], I applied both of Mel's patches (including this one) and that
didn't help my orginal issue (failures after suspend).
[1] http://lkml.org/lkml/2009/10/17/109
Thanks.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-18 23:33                     ` Frans Pop
  2009-10-19  0:36                       ` Pekka Enberg
@ 2009-10-19 14:01                       ` Mel Gorman
  2009-10-19 16:18                         ` Chris Mason
  1 sibling, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-19 14:01 UTC (permalink / raw)
  To: Frans Pop
  Cc: David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Mon, Oct 19, 2009 at 01:33:29AM +0200, Frans Pop wrote:
> Another long mail, sorry.
> 
> On Wednesday 14 October 2009, Frans Pop wrote:
> > > There still has not been a mm-change identified that makes
> > > fragmentation significantly worse.
> >
> > My bisection shows a very clear point, even if not an individual commit,
> > in the 'akpm' merge where SKB errors suddenly become *much* more
> > frequent and easy to trigger.
> > I'm sorry to say this, but the fact that nothing has been identified yet
> > is IMO the result of a lack of effort, not because there is no such
> > change.
> 
> I was wrong. It turns out that I was creating the variations in the test 
> results around the akpm merge myself by tiny changes in the way I ran the 
> tests. It took another round of about 30 compilations and tests purely in 
> this range to show that, but those same tests also made me aware of other 
> patterns I should look at.
> 
Once again, thanks for persisting with this for so long. That many tests
and searching is a miserable undertaking.
> Until a few days ago I was concentrating on "do I see SKB allocation errors 
> or not". Since then I've also been looking more consciously at when they 
> happen, at disk access patterns and at desktop freeze patterns.
> 
> I think I did mention before that this whole issue is rather subtle :-/
Indeed
> So, my apologies for finguering the wrong area for so long, but it looked 
> solid given the info available at the time.
> 
> On Thursday 15 October 2009, Mel Gorman wrote:
> > Outside the range of commits suspected of causing problems was the
> > following. It's extremely low probability
> >
> > Commit 8aa7e84 Fix congestion_wait() sync/async vs read/write confusion
> >         This patch alters the call to congestion_wait() in the page
> >         allocator. Frankly, I don't get the change but it might worth
> >         checking if replacing BLK_RW_ASYNC with WRITE on top of 2.6.31
> >         makes any difference
> 
> This is the real culprit. Mel: thanks very much for looking beyond the 
> area I identified. Your overview of mm changes was exactly what I needed 
> and really helped a lot during my later tests.
> 
I'm surprised this made such a big difference which is why I described
it as "extremely low probability". It implies that the real problem isn't
fragmentation per-se but the timing of when pages get consumed.
Maybe what has really changed is how long direct reclaimers wait before trying
to allocate again. After the commit, if direct reclaimers are waiting longer
between direct reclaim attempts, it might mean that the GFP_KERNEL reclaimers
of high-order pages are doing less work before and hurting parallel GFP_ATOMIC
users. Jens, does this sound plausible?
> This commit definitely causes most of the problems; confirmed by reverting 
> it on top of 2.6.31 (also requires reverting 373c0a7e, which is a later 
> build fix).
> 
> The rest of this mail gives details on my tests and how I reached the above 
> conclusion.
> 
> TEST BASELINE (2.6.30)
> ======================
> I mentioned in an earlier mail that I run three instances of gitk for my 
> tests. Loading gitk seems to consist of 3 phases:
> 1) general initial scan of the repository (branches?)
> 2) reading commits: commit counter increases
> 3) reading references (including bisection good/bad points) and
>    uncommitted changes
> 
> Below times and comments per stage when the test is run with 2.6.30. As my 
> test starts after a clean boot, buffers are mostly empty.
> 
> 1st instance: 'gitk v2.6.29..master' (preparation)
> 1) ~20 seconds; user interface is mostly blank
> 2) ~5 seconds to read 35.000 commits; user interface is updated and counter
>    increases steadily as they are read
> 3) ~10 seconds; "branch"/"follows"/"precedes" info and tags are filled
>    in; fairly heavy disk activity
> 
> 2st instance: 'gitk master' (preparation)
> 1) 0 seconds (because data is already buffered)
> 2) ~25 seconds to read 167500 commits; counter increases steadily
> 3) 1-2 seconds (because data is already buffered)
> 
> 3st instance: 'gitk master' (the actual test)
> 1) 0 seconds because data is already buffered
> 2) ~55 seconds due to swapping overhead; minor music skip around commit
>    110.000; counter slower after 90.000, some short halts, but generally
>    increases steadily; moderate disk activity
> 3) ~55-60 seconds; because buffers have been emptied data must by read
>    again, with swapping; very heavy disk activity; fairly long music
>    skip (15-20 seconds), but no SKB allocation errors
> 
> So, the loading of the 3rd instance takes 1.5 minutes longer than the 
> second because of the swapping. And phase 3) is most affected by it.
> 
> AFTER WIRELESS CHANGE
> =====================
> After commit 4752c93c30 ("iwlcore: Allow skb allocation from tasklet") I 
> start getting the SKB errors. They can be triggered reliably if the whole 
> test is repeated 1 or 2 times, but generally not the first time the test 
> is run.
It's up to the wireless driver maintainer what to do here, but it seems
like that patch needs to be reverted and thought about some more before
trying again.
> 
> Or so I thought for a long time.
> It turns out that I will get SKB errors during the first run if I'm
> "sloppy" in the test execution. For example if I wait too long before 
> switching from the last gitk instance to konsole where I have 
> a 'tail -f /var/log/kern.log' running.
So the timing is critical of when the high-order atomic allocations
start kicking in.
> Another factor is the state of the repository: do I have master checked 
> out, or an older branch, or am I in the middle of a bisection. This 
> influences how data is read from the disk and thus the test results.
> A last factor may be the size of the kernel I'm using: my test/bisect 
> kernel is significantly smaller than my regular kernel.
> 
> If the test is run completely cleanly, I will not get SKB errors during the 
> first run. Also, this change does not affect the timings of the test at 
> all: the total load time of the 3rd instance is still ~1:55 and music 
> skips happen in roughly the same places. The pattern of disk activity also 
> remains unchanged.
> 
> If I do *not* run the test cleanly, any SKB errors during the first test 
> run will always be during phase 3), never during phase 2). This is what I 
> saw during tests in the 'akpm' range, and explains the inconsistent 
> results there.
> 
> After discovering this I've made a copy of the git repo so that I always 
> test using the exact same state and tightened my test procedure.
> 
> AFTER congestion_wait CHANGE
> ============================
> If I test commit 9f2d8be, which is just before the congestion_wait() 
> change, I still get the same pattern as described above. But when I test 
> with 8aa7e84 ("Fix congestion_wait() sync/async vs read/write confusion"), 
> things change dramatically when the 3rd gitk instance is started.
> 
So, assuming this is a timing problem, this commit affects the timing of
when pages are consumed by processes doing direct reclaim.
> During the 2nd phase I see the first SKB allocation errors with a music 
> skip between reading commits 95.000 and 110.000.
> About commit 115.000 there is a very long pause during which the counter 
> does not increase, music stops and the desktop freezes completely. The 
> first 30 seconds of that freeze there is only very low disk activity (which 
> seems strange);
I'm just going to have to depend on Jens here. Jens, the congestion_wait() is
on BLK_RW_ASYNC after the commit. Reclaim usually writes pages asynchronously
but lumpy reclaim actually waits of pages to write out synchronously so
it's not always async.
Either way, reclaim is usually worried about writing pages but it would appear
after this change that a lot of read activity can also stall a process in
direct reclaim. What might be happening in Frans's particular case is that the
tasklet that allocates high-order pages for the RX buffers is getting stalled
by congestion caused by other processes doing reads from the filesystem.
While it makes sense from a congestion point of view to halt the IO, the
reclaim operations from direct reclaimers is getting delayed for long enough
to cause problems for GFP_ATOMIC.
Does this sound plausible to you? If so, what's the best way of
addressing this? Changing congestion_wait back to WRITE (assuming that
works for Frans)? Changing it to SYNC (again, assuming it actually
works) or a revert?
> the next 25 seconds there suddenly is very high disk  
> activity during which things gradually unfreeze and more SKB errors are 
> displayed. After that the commit counter runs up fairly steadily again.
> 
> Phase 2) ends at ~1:45. Phase 3) (with more SKB errors) ends at ~2:05.
> 
> So this change almost doubles the time needed for phase 2) and causes SKB 
> allocation errors to occur during that phase. Also, before this commit the 
> desktop freezes are much shorter and less severe. With this change the 
> desktop is completely unusable for almost a minute during phase 2), with 
> even the mouse pointer frozen solid.
> Note that phase 3) becomes shorter, but that the total time needed to load 
> the 3rd instance increases by about 10-15 seconds.
> 
> Note: -rc2 and -rc3 had broken NFS, so I had to cherry-pick 3 NFS commits 
> from -rc4 on top of the commits I wanted to test.
> 
> WITH congestion_wait CHANGE REVERTED
> ====================================
> I've done quite a few tests of 2.6.31 with 373c0a7e and 8aa7e847 reverted 
> to confirm that's really the culprit. I've done this for .31-rc3, .31-rc4,
> .31-rc5, .31 and .31.1.
> 
> In all cases the huge freeze in phase 2) is gone and the general behavior 
> and timings are again as it was after the wireless change. During most 
> tests I did not get any SKB allocation errors during phase 2) or phase 3).
> 
> However with .31-rc5, .31 and .31.1 I have had some tests where I would see 
> a few SKB allocation errors during phase 3) (which is somewhat likely), 
> but also during phase 2). At this point I'm unsure whether this is just 
> noise, or maybe a minor influence from some change merged after .31-rc4.
> Looking through the commits there are several mm/page allocation changes.
> 
It could still be kswapd not being woken up often enough after direct
reclaimers. I took a look through the commits but none of the mm or
allocator changes struck me as likely candidates for making
fragmentation worse or altering the timing.
> For now I suggest ignoring this though as the impact (if any) is very minor 
> and it is not reproducible reliably enough.
> 
> Next I'll retest Mel's patches and also test Reinette's patches.
> 
Of the two patches, only the kswapd one should have any significance. As
David pointed out, the second patch is essentially a no-op as it should
not have been possible to enter direct reclaim with ALLOC_NO_WATERMARKS
set.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 14:01                               ` Karol Lewandowski
@ 2009-10-19 14:06                                 ` Mel Gorman
  2009-10-19 17:09                                   ` Karol Lewandowski
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-19 14:06 UTC (permalink / raw)
  To: Karol Lewandowski
  Cc: Pekka Enberg, Tobi Oetiker, Frans Pop, David Rientjes,
	KOSAKI Motohiro, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe
On Mon, Oct 19, 2009 at 04:01:45PM +0200, Karol Lewandowski wrote:
> On Mon, Oct 19, 2009 at 12:54:11PM +0300, Pekka Enberg wrote:
> > On Mon, 2009-10-19 at 11:49 +0200, Tobi Oetiker wrote:
> > > I have updated a fileserver to 2.6.31 today and I see page
> > > allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> > > So I guess the problem must be quite generic:
> > 
> > Yup, it almost certainly is. Does this patch help?
> > 
> > http://lkml.org/lkml/2009/10/16/89
> 
> This patch seems to help in some cases.  Before applying this patch I
> was able to trigger alloc failures on different machine by booting
> kernel with "mem=256MB" and doing:
> 
>   $ gitk on-full-tree &
>   # rmmod e100
>   ... wait for few MBs in swap
>   # modprobe e100; ifup --force ethX
> 
> So here this patch helped -- with it, I was unable to trigger page
> allocation failures (testing was short, tough).  However, as I said
> here[1], I applied both of Mel's patches (including this one) and that
> didn't help my orginal issue (failures after suspend).
> 
> [1] http://lkml.org/lkml/2009/10/17/109
> 
Can you test with my kswapd patch applied and commits 373c0a7e,8aa7e847
reverted please?
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 13:40                               ` Tobias Oetiker
@ 2009-10-19 14:09                                 ` Mel Gorman
  2009-10-19 14:16                                   ` Tobias Oetiker
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-19 14:09 UTC (permalink / raw)
  To: Tobias Oetiker
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe
On Mon, Oct 19, 2009 at 03:40:05PM +0200, Tobias Oetiker wrote:
> Hi Mel,
> 
> Today Mel Gorman wrote:
> 
> > On Mon, Oct 19, 2009 at 11:49:08AM +0200, Tobi Oetiker wrote:
> > > Today Frans Pop wrote:
> > >
> > > >
> > > > I'm starting to think that this commit may not be directly related to high
> > > > order allocation failures. The fact that I'm seeing SKB allocation
> > > > failures earlier because of this commit could be just a side effect.
> > > > It could be that instead the main impact of this commit is on encrypted
> > > > file system and/or encrypted swap (kcryptd).
> > > >
> > > > Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm
> > > > only reading from NFS that's unlikely).
> > >
> > > I have updated a fileserver to 2.6.31 today and I see page
> > > allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> > > So I guess the problem must be quite generic:
> > >
> > >
> > > Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> > >
> >
> > What's the rest of the stack trace? I'm wondering where a large number
> > of order-5 GFP_ATOMIC allocations are coming from. It seems different to
> > the e100 problem where there is one GFP_ATOMIC allocation while the
> > firmware is being loaded.
> 
> Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684157]  [<ffffffff810da7e5>] __alloc_pages_nodemask+0x135/0x140 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684164]  [<ffffffff815065b4>] ? _spin_unlock_bh+0x14/0x20 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684170]  [<ffffffff8110b368>] kmalloc_large_node+0x68/0xc0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684175]  [<ffffffff8110f15a>] __kmalloc_node_track_caller+0x11a/0x180 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684181]  [<ffffffff8140ffd2>] ? skb_copy+0x32/0xa0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684185]  [<ffffffff8140d8b6>] __alloc_skb+0x76/0x180 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684205]  [<ffffffff8140ffd2>] skb_copy+0x32/0xa0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684221]  [<ffffffffa050f33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
Is the MTU set very high between the host and virtualised machine?
Can you test please with the patch at http://lkml.org/lkml/2009/10/16/89
applied and with commits 373c0a7e and 8aa7e847 reverted please?
> Oct 19 07:10:02 johan kernel: [23565.684231]  [<ffffffff81416f79>] dev_hard_start_xmit+0x189/0x1c0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684236]  [<ffffffff8142f071>] __qdisc_run+0x1a1/0x230 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684240]  [<ffffffff81418a88>] dev_queue_xmit+0x238/0x310 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684246]  [<ffffffff8144864b>] ip_finish_output+0x11b/0x2f0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684250]  [<ffffffff814488a9>] ip_output+0x89/0xd0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684254]  [<ffffffff814478c0>] ip_local_out+0x20/0x30 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684258]  [<ffffffff814481ab>] ip_queue_xmit+0x22b/0x3f0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684264]  [<ffffffff8145d5e5>] tcp_transmit_skb+0x345/0x4e0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684269]  [<ffffffff8145eaf6>] tcp_write_xmit+0xb6/0x2e0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684273]  [<ffffffff8145ed8b>] __tcp_push_pending_frames+0x2b/0xa0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684277]  [<ffffffff8145b249>] tcp_rcv_established+0x459/0x6d0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684282]  [<ffffffff814630bd>] tcp_v4_do_rcv+0x12d/0x140 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684285]  [<ffffffff8146365e>] tcp_v4_rcv+0x58e/0x7c0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684289]  [<ffffffff8144276d>] ip_local_deliver_finish+0x11d/0x2b0 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684293]  [<ffffffff8144293b>] ip_local_deliver+0x3b/0x90 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684297]  [<ffffffff81442ad6>] ip_rcv_finish+0x146/0x420 [kern.warning]
> Oct 19 07:10:02 johan kernel: [23565.684301]  [<ffffffff8144304b>] ip_rcv+0x29b/0x370 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684304]  [<ffffffff81418f9a>] netif_receive_skb+0x38a/0x4d0 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684308]  [<ffffffff81419268>] napi_skb_finish+0x48/0x60 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684311]  [<ffffffff81419724>] napi_gro_receive+0x34/0x40 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684330]  [<ffffffffa006b623>] tg3_rx+0x373/0x4b0 [tg3] [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684339]  [<ffffffffa006cbf0>] tg3_poll_work+0x70/0xf0 [tg3] [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684347]  [<ffffffffa006ccae>] tg3_poll+0x3e/0xe0 [tg3] [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684350]  [<ffffffff814198d2>] net_rx_action+0x102/0x210 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684357]  [<ffffffff81061d24>] __do_softirq+0xc4/0x1f0 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684362]  [<ffffffff8101314c>] call_softirq+0x1c/0x30 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684365]  [<ffffffff81014945>] do_softirq+0x55/0x90 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684369]  [<ffffffff8106116b>] irq_exit+0x7b/0x90 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684372]  [<ffffffff81013e93>] do_IRQ+0x73/0xe0 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684378]  [<ffffffff81012993>] ret_from_intr+0x0/0x11 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684381]  <EOI>  [<ffffffff810318b6>] ? native_safe_halt+0x6/0x10 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684391]  [<ffffffff81019cd8>] ? default_idle+0x48/0xe0 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684396]  [<ffffffff8150929d>] ? __atomic_notifier_call_chain+0xd/0x10 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684400]  [<ffffffff815092b1>] ? atomic_notifier_call_chain+0x11/0x20 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684404]  [<ffffffff810107c8>] ? cpu_idle+0x98/0xe0 [kern.warning]
> Oct 19 07:10:04 johan kernel: [23565.684410]  [<ffffffff81500d95>] ? start_secondary+0x95/0xc0 [kern.warning]
> 
> if you need more, I can send you a whole bunch of them ...
> 
I'm assuming they are all more or less the same.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 14:09                                 ` Mel Gorman
@ 2009-10-19 14:16                                   ` Tobias Oetiker
  2009-10-19 14:59                                     ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-19 14:16 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Mel Gorman wrote:
> On Mon, Oct 19, 2009 at 03:40:05PM +0200, Tobias Oetiker wrote:
> > Hi Mel,
> >
> > Today Mel Gorman wrote:
> >
> > > On Mon, Oct 19, 2009 at 11:49:08AM +0200, Tobi Oetiker wrote:
> > > > Today Frans Pop wrote:
> > > >
> > > > >
> > > > > I'm starting to think that this commit may not be directly related to high
> > > > > order allocation failures. The fact that I'm seeing SKB allocation
> > > > > failures earlier because of this commit could be just a side effect.
> > > > > It could be that instead the main impact of this commit is on encrypted
> > > > > file system and/or encrypted swap (kcryptd).
> > > > >
> > > > > Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm
> > > > > only reading from NFS that's unlikely).
> > > >
> > > > I have updated a fileserver to 2.6.31 today and I see page
> > > > allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> > > > So I guess the problem must be quite generic:
> > > >
> > > >
> > > > Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> > > > Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> > > > Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> > > > Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> > > >
> > >
> > > What's the rest of the stack trace? I'm wondering where a large number
> > > of order-5 GFP_ATOMIC allocations are coming from. It seems different to
> > > the e100 problem where there is one GFP_ATOMIC allocation while the
> > > firmware is being loaded.
> >
> > Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684157]  [<ffffffff810da7e5>] __alloc_pages_nodemask+0x135/0x140 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684164]  [<ffffffff815065b4>] ? _spin_unlock_bh+0x14/0x20 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684170]  [<ffffffff8110b368>] kmalloc_large_node+0x68/0xc0 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684175]  [<ffffffff8110f15a>] __kmalloc_node_track_caller+0x11a/0x180 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684181]  [<ffffffff8140ffd2>] ? skb_copy+0x32/0xa0 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684185]  [<ffffffff8140d8b6>] __alloc_skb+0x76/0x180 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684205]  [<ffffffff8140ffd2>] skb_copy+0x32/0xa0 [kern.warning]
> > Oct 19 07:10:02 johan kernel: [23565.684221]  [<ffffffffa050f33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
>
> Is the MTU set very high between the host and virtualised machine?
>
> Can you test please with the patch at http://lkml.org/lkml/2009/10/16/89
> applied and with commits 373c0a7e and 8aa7e847 reverted please?
if you can send me a consolidated patch which does apply to
2.6.31.4 I will be glad to try ...
your patch in http://lkml.org/lkml/2009/10/16/89 seems not to be
for 2.6.31 ... I assume it would be but then again I I don't realy
understand the code so this is just pattern matching ...
--- a/mm/page_alloc.c   2009-10-05 19:12:06.000000000 +0200
+++ b/mm/page_alloc.c   2009-10-19 14:52:15.000000000 +0200
@@ -1763,6 +1763,7 @@
        if (NUMA_BUILD && (gfp_mask & GFP_THISNODE) == GFP_THISNODE)
                goto nopage;
+restart:
        wake_all_kswapd(order, zonelist, high_zoneidx);
        /*
@@ -1772,7 +1773,6 @@
         */
        alloc_flags = gfp_to_alloc_flags(gfp_mask);
-restart:
        /* This is the last chance, in general, before the goto nopage. */
        page = get_page_from_freelist(gfp_mask, nodemask, order, zonelist,
                        high_zoneidx, alloc_flags & ~ALLOC_NO_WATERMARKS,
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 14:16                                   ` Tobias Oetiker
@ 2009-10-19 14:59                                     ` Mel Gorman
  2009-10-19 20:12                                       ` Tobias Oetiker
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-19 14:59 UTC (permalink / raw)
  To: Tobias Oetiker
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
On Mon, Oct 19, 2009 at 04:16:36PM +0200, Tobias Oetiker wrote:
> Hi Mel,
> 
> Today Mel Gorman wrote:
> 
> > On Mon, Oct 19, 2009 at 03:40:05PM +0200, Tobias Oetiker wrote:
> > > Hi Mel,
> > >
> > > Today Mel Gorman wrote:
> > >
> > > > On Mon, Oct 19, 2009 at 11:49:08AM +0200, Tobi Oetiker wrote:
> > > > > Today Frans Pop wrote:
> > > > >
> > > > > >
> > > > > > I'm starting to think that this commit may not be directly related to high
> > > > > > order allocation failures. The fact that I'm seeing SKB allocation
> > > > > > failures earlier because of this commit could be just a side effect.
> > > > > > It could be that instead the main impact of this commit is on encrypted
> > > > > > file system and/or encrypted swap (kcryptd).
> > > > > >
> > > > > > Besides mm the commit also touches dm-crypt (and nfs/write.c, but as I'm
> > > > > > only reading from NFS that's unlikely).
> > > > >
> > > > > I have updated a fileserver to 2.6.31 today and I see page
> > > > > allocation failures from several parts of the system ... mostly nfs though ... (it is a nfs server).
> > > > > So I guess the problem must be quite generic:
> > > > >
> > > > >
> > > > > Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> > > > > Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> > > > > Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> > > > > Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> > > > >
> > > >
> > > > What's the rest of the stack trace? I'm wondering where a large number
> > > > of order-5 GFP_ATOMIC allocations are coming from. It seems different to
> > > > the e100 problem where there is one GFP_ATOMIC allocation while the
> > > > firmware is being loaded.
> > >
> > > Oct 19 07:10:02 johan kernel: [23565.684110] swapper: page allocation failure. order:5, mode:0x4020 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684118] Pid: 0, comm: swapper Not tainted 2.6.31-02063104-generic #02063104 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684121] Call Trace: [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684124]  <IRQ>  [<ffffffff810da5a2>] __alloc_pages_slowpath+0x3b2/0x4c0 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684157]  [<ffffffff810da7e5>] __alloc_pages_nodemask+0x135/0x140 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684164]  [<ffffffff815065b4>] ? _spin_unlock_bh+0x14/0x20 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684170]  [<ffffffff8110b368>] kmalloc_large_node+0x68/0xc0 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684175]  [<ffffffff8110f15a>] __kmalloc_node_track_caller+0x11a/0x180 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684181]  [<ffffffff8140ffd2>] ? skb_copy+0x32/0xa0 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684185]  [<ffffffff8140d8b6>] __alloc_skb+0x76/0x180 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684205]  [<ffffffff8140ffd2>] skb_copy+0x32/0xa0 [kern.warning]
> > > Oct 19 07:10:02 johan kernel: [23565.684221]  [<ffffffffa050f33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
> >
> > Is the MTU set very high between the host and virtualised machine?
> >
> > Can you test please with the patch at http://lkml.org/lkml/2009/10/16/89
> > applied and with commits 373c0a7e and 8aa7e847 reverted please?
> 
> if you can send me a consolidated patch which does apply to
> 2.6.31.4 I will be glad to try ...
> 
Sure
==== CUT HERE ====
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19 14:01                       ` Mel Gorman
@ 2009-10-19 16:18                         ` Chris Mason
  2009-10-19 17:01                           ` Christoph Hellwig
                                             ` (4 more replies)
  0 siblings, 5 replies; 88+ messages in thread
From: Chris Mason @ 2009-10-19 16:18 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Mon, Oct 19, 2009 at 03:01:52PM +0100, Mel Gorman wrote:
> 
> > During the 2nd phase I see the first SKB allocation errors with a music 
> > skip between reading commits 95.000 and 110.000.
> > About commit 115.000 there is a very long pause during which the counter 
> > does not increase, music stops and the desktop freezes completely. The 
> > first 30 seconds of that freeze there is only very low disk activity (which 
> > seems strange);
> 
> I'm just going to have to depend on Jens here. Jens, the congestion_wait() is
> on BLK_RW_ASYNC after the commit. Reclaim usually writes pages asynchronously
> but lumpy reclaim actually waits of pages to write out synchronously so
> it's not always async.
Waiting doesn't make it synchronous from the elevator point of view ;)
If you're using WB_SYNC_NONE, it's a async write.  WB_SYNC_ALL makes it
a sync write.  I only see WB_SYNC_NONE in vmscan.c, so we should be
using the async congestion wait.  (the exception is xfs which always
does async writes).
But I'm honestly not 100% sure.  Looking back through the emails, the
test case is doing IO on top of a whole lot of things on top of
dm-crypt?  I just tried to figure out if dm-crypt is turning the async
IO into sync IOs, but didn't quite make sense of it.
Could you also please include which filesystems were being abused during
the test and how?  Reading through the emails, I think you've got:
gitk being run 3 times on some FS (NFS?)
streaming reads on NFS
swap on dm-crypt
If other filesystems are being used, please correct me.  Also please
include if they are on crypto or straight block device.
> 
> Either way, reclaim is usually worried about writing pages but it would appear
> after this change that a lot of read activity can also stall a process in
> direct reclaim. What might be happening in Frans's particular case is that the
> tasklet that allocates high-order pages for the RX buffers is getting stalled
> by congestion caused by other processes doing reads from the filesystem.
> While it makes sense from a congestion point of view to halt the IO, the
> reclaim operations from direct reclaimers is getting delayed for long enough
> to cause problems for GFP_ATOMIC.
The congestion_wait code either waits for congestion to clear or for
a given timeout.  The part that isn't clear is if before the patch
we waited a very short time (congestion cleared quickly) or a very long
time (we hit the timeout or congestion cleared slowly).
The easiest way to tell is to just replace the congestion_wait() calls
in direct reclaim with schedule_timeout_interruptible(10), test, then
schedule_timeout_interruptible(HZ/20), then test again.
> 
> Does this sound plausible to you? If so, what's the best way of
> addressing this? Changing congestion_wait back to WRITE (assuming that
> works for Frans)? Changing it to SYNC (again, assuming it actually
> works) or a revert?
I don't think changing it to SYNC is a good plan unless we're actually
doing sync io.  It would be better to just wait on one of the pages that
you've sent down (or its hashed waitqueue since the page can go away).
-chris
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19 16:18                         ` Chris Mason
  2009-10-19 17:01                           ` Christoph Hellwig
@ 2009-10-19 17:01                           ` Christoph Hellwig
  2009-10-19 21:57                             ` Chris Mason
  2009-10-20 10:48                           ` Mel Gorman
                                             ` (2 subsequent siblings)
  4 siblings, 1 reply; 88+ messages in thread
From: Christoph Hellwig @ 2009-10-19 17:01 UTC (permalink / raw)
  To: Chris Mason, Mel Gorman, Frans Pop, David Rientjes,
	KOSAKI Motohiro, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Pekka Enberg, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	Jens Axboe, John W. Linville, linux-mm
On Tue, Oct 20, 2009 at 01:18:15AM +0900, Chris Mason wrote:
> Waiting doesn't make it synchronous from the elevator point of view ;)
> If you're using WB_SYNC_NONE, it's a async write.  WB_SYNC_ALL makes it
> a sync write.  I only see WB_SYNC_NONE in vmscan.c, so we should be
> using the async congestion wait.  (the exception is xfs which always
> does async writes).
That's only because those people who did the global sweep did not bother
to convert it or even tell the list about it.  I have a patch in my
QA queue to change it..
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19 16:18                         ` Chris Mason
@ 2009-10-19 17:01                           ` Christoph Hellwig
  2009-10-19 17:01                           ` Christoph Hellwig
                                             ` (3 subsequent siblings)
  4 siblings, 0 replies; 88+ messages in thread
From: Christoph Hellwig @ 2009-10-19 17:01 UTC (permalink / raw)
  To: Chris Mason, Mel Gorman, Frans Pop, David Rientjes,
	KOSAKI Motohiro
On Tue, Oct 20, 2009 at 01:18:15AM +0900, Chris Mason wrote:
> Waiting doesn't make it synchronous from the elevator point of view ;)
> If you're using WB_SYNC_NONE, it's a async write.  WB_SYNC_ALL makes it
> a sync write.  I only see WB_SYNC_NONE in vmscan.c, so we should be
> using the async congestion wait.  (the exception is xfs which always
> does async writes).
That's only because those people who did the global sweep did not bother
to convert it or even tell the list about it.  I have a patch in my
QA queue to change it..
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 14:06                                 ` Mel Gorman
@ 2009-10-19 17:09                                   ` Karol Lewandowski
  2009-10-20  1:47                                     ` Karol Lewandowski
  0 siblings, 1 reply; 88+ messages in thread
From: Karol Lewandowski @ 2009-10-19 17:09 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Karol Lewandowski, Pekka Enberg, Tobi Oetiker, Frans Pop,
	David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Mohamed Abbas, John W. Linville,
	linux-mm, jens.axboe
On Mon, Oct 19, 2009 at 03:06:19PM +0100, Mel Gorman wrote:
> Can you test with my kswapd patch applied and commits 373c0a7e,8aa7e847
> reverted please?
It seems that your patch and Frans' reverts together *do* make
difference.
With these patches I haven't been able to trigger failures so far
(in about 6 attempts). I'll continue testing and let you know if
anything changes.
If nothing changes this looks like fix for my problem.
Thanks.  Thanks a lot!
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 14:59                                     ` Mel Gorman
@ 2009-10-19 20:12                                       ` Tobias Oetiker
  2009-10-19 20:17                                         ` Tobias Oetiker
  0 siblings, 1 reply; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-19 20:12 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Mel Gorman wrote:
> >
> > if you can send me a consolidated patch which does apply to
> > 2.6.31.4 I will be glad to try ...
> >
>
> Sure
>
> ==== CUT HERE ====
>
> From 6c0215af3b7c39ef7b8083ea38ca3ad93cd3f51f Mon Sep 17 00:00:00 2001
> From: Mel Gorman <mel@csn.ul.ie>
> Date: Mon, 19 Oct 2009 15:40:43 +0100
> Subject: [PATCH] Kick off kswapd after direct reclaim and revert congestion changes
>
> The following patch is http://lkml.org/lkml/2009/10/16/89 on top of
> 2.6.31.4 as well as patches 373c0a7e and 8aa7e847 reverted.
it seems to help ... the server has been running for 3 hours now
without incident, but then again it is not as active as during the
day, ... will report tomorrow.
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 20:12                                       ` Tobias Oetiker
@ 2009-10-19 20:17                                         ` Tobias Oetiker
  2009-10-20 10:57                                           ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-19 20:17 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Tobias Oetiker wrote:
> Hi Mel,
>
> Today Mel Gorman wrote:
>
> > >
> > > if you can send me a consolidated patch which does apply to
> > > 2.6.31.4 I will be glad to try ...
> > >
> >
> > Sure
> >
> > ==== CUT HERE ====
> >
> > From 6c0215af3b7c39ef7b8083ea38ca3ad93cd3f51f Mon Sep 17 00:00:00 2001
> > From: Mel Gorman <mel@csn.ul.ie>
> > Date: Mon, 19 Oct 2009 15:40:43 +0100
> > Subject: [PATCH] Kick off kswapd after direct reclaim and revert congestion changes
> >
> > The following patch is http://lkml.org/lkml/2009/10/16/89 on top of
> > 2.6.31.4 as well as patches 373c0a7e and 8aa7e847 reverted.
>
> it seems to help ... the server has been running for 3 hours now
> without incident, but then again it is not as active as during the
> day, ... will report tomorrow.
while I was writing, the system found that the patch does not realy
help:
Oct 19 22:09:52 johan kernel: [11157.121506] smtpd: page allocation failure. order:5, mode:0x4020 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121514] Pid: 19324, comm: smtpd Tainted: G      D    2.6.31.4-oep #1 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121518] Call Trace: [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121521]  <IRQ>  [<ffffffff810cb599>] __alloc_pages_nodemask+0x549/0x650 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121563]  [<ffffffffa02bde3b>] ? __nf_ct_refresh_acct+0xab/0x110 [nf_conntrack] [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121572]  [<ffffffffa02a8337>] ? ipt_do_table+0x2f7/0x610 [ip_tables] [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121580]  [<ffffffff810fac18>] kmalloc_large_node+0x68/0xc0 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121585]  [<ffffffff810fe90a>] __kmalloc_node_track_caller+0x11a/0x180 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121592]  [<ffffffff813ebd42>] ? skb_copy+0x32/0xa0 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121596]  [<ffffffff813e9606>] __alloc_skb+0x76/0x180 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121600]  [<ffffffff813ebd42>] skb_copy+0x32/0xa0 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121615]  [<ffffffffa07dd33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121620]  [<ffffffff813f2512>] dev_hard_start_xmit+0x142/0x320 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121632]  [<ffffffff8140a2c1>] __qdisc_run+0x1a1/0x230 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121637]  [<ffffffff813f41e0>] dev_queue_xmit+0x2b0/0x3a0 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121642]  [<ffffffff8142349b>] ip_finish_output+0x11b/0x2f0 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121646]  [<ffffffff814236f9>] ip_output+0x89/0xd0 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121650]  [<ffffffff81422710>] ip_local_out+0x20/0x30 [kern.warning]
Oct 19 22:09:52 johan kernel: [11157.121654]  [<ffffffff81422ffb>] ip_queue_xmit+0x22b/0x3f0 [kern.warning]
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19 17:01                           ` Christoph Hellwig
@ 2009-10-19 21:57                             ` Chris Mason
  0 siblings, 0 replies; 88+ messages in thread
From: Chris Mason @ 2009-10-19 21:57 UTC (permalink / raw)
  To: Christoph Hellwig
  Cc: Mel Gorman, Frans Pop, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, Mohamed Abbas, Jens Axboe, John W. Linville,
	linux-mm
On Mon, Oct 19, 2009 at 01:01:15PM -0400, Christoph Hellwig wrote:
> On Tue, Oct 20, 2009 at 01:18:15AM +0900, Chris Mason wrote:
> > Waiting doesn't make it synchronous from the elevator point of view ;)
> > If you're using WB_SYNC_NONE, it's a async write.  WB_SYNC_ALL makes it
> > a sync write.  I only see WB_SYNC_NONE in vmscan.c, so we should be
> > using the async congestion wait.  (the exception is xfs which always
> > does async writes).
> 
> That's only because those people who did the global sweep did not bother
> to convert it or even tell the list about it.  I have a patch in my
> QA queue to change it..
Yes, we just didn't realize XFS was missed.  Sorry.  I wasn't trying to
blame xfs for being behind, just mentioning that we've got about 10
different variables here and I'm having a hard time figuring out which
ones to push on.
-chris
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 17:09                                   ` Karol Lewandowski
@ 2009-10-20  1:47                                     ` Karol Lewandowski
  0 siblings, 0 replies; 88+ messages in thread
From: Karol Lewandowski @ 2009-10-20  1:47 UTC (permalink / raw)
  To: Karol Lewandowski
  Cc: Mel Gorman, Pekka Enberg, Tobi Oetiker, Frans Pop, David Rientjes,
	KOSAKI Motohiro, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe
On Mon, Oct 19, 2009 at 07:09:47PM +0200, Karol Lewandowski wrote:
> On Mon, Oct 19, 2009 at 03:06:19PM +0100, Mel Gorman wrote:
> > Can you test with my kswapd patch applied and commits 373c0a7e,8aa7e847
> > reverted please?
> 
> It seems that your patch and Frans' reverts together *do* make
> difference.
> 
> With these patches I haven't been able to trigger failures so far
> (in about 6 attempts). I'll continue testing and let you know if
> anything changes.
Damn it.
I'm sorry to inform you that yes, I still get failures (less often,
but still).
Thanks.
e100: Intel(R) PRO/100 Network Driver, 3.5.24-k2-NAPI
e100: Copyright(c) 1999-2006 Intel Corporation
e100 0000:00:03.0: PCI INT A -> Link[LNKC] -> GSI 9 (level, low) -> IRQ 9
e100 0000:00:03.0: PME# disabled
e100: eth0: e100_probe: addr 0xe8120000, irq 9, MAC addr 00:10:a4:89:e8:84
ifconfig: page allocation failure. order:5, mode:0x8020
Pid: 5151, comm: ifconfig Not tainted 2.6.31+frans2+mel-00002-g90702f9-dirty #2
Call Trace:
 [<c015c4e1>] ? __alloc_pages_nodemask+0x423/0x468
 [<c0104de7>] ? dma_generic_alloc_coherent+0x4a/0xab
 [<c0104d9d>] ? dma_generic_alloc_coherent+0x0/0xab
 [<d1614b6f>] ? e100_alloc_cbs+0xc7/0x174 [e100]
 [<d1615bfe>] ? e100_up+0x1b/0xf5 [e100]
 [<d1615cef>] ? e100_open+0x17/0x41 [e100]
 [<c02f871f>] ? dev_open+0x8f/0xc5
 [<c02f7ed9>] ? dev_change_flags+0xa2/0x155
 [<c032daa6>] ? devinet_ioctl+0x22a/0x51c
 [<c02ebabe>] ? sock_ioctl+0x0/0x1e4
 [<c02ebc7e>] ? sock_ioctl+0x1c0/0x1e4
 [<c02ebabe>] ? sock_ioctl+0x0/0x1e4
 [<c017f23a>] ? vfs_ioctl+0x16/0x4a
 [<c017fb01>] ? do_vfs_ioctl+0x48a/0x4c1
 [<c0168137>] ? handle_mm_fault+0x1e0/0x42c
 [<c0348c6b>] ? do_page_fault+0x2ce/0x2e4
 [<c017fb64>] ? sys_ioctl+0x2c/0x42
 [<c0102748>] ? sysenter_do_call+0x12/0x26
Mem-Info:
DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
Normal per-cpu:
CPU    0: hi:   90, btch:  15 usd:  35
Active_anon:14778 active_file:10836 inactive_anon:22033
 inactive_file:11854 unevictable:0 dirty:6 writeback:0 unstable:0
 free:1031 slab:2083 mapped:6193 pagetables:417 bounce:0
DMA free:1096kB min:124kB low:152kB high:184kB active_anon:528kB inactive_anon:3440kB active_file:1076kB inactive_file:5580kB unevictable:0kB present:15868kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 238 238
Normal free:3028kB min:1908kB low:2384kB high:2860kB active_anon:58584kB inactive_anon:84692kB active_file:42268kB inactive_file:41836kB unevictable:0kB present:243776kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 0 0
DMA: 46*4kB 0*8kB 5*16kB 0*32kB 1*64kB 0*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 1096kB
Normal: 135*4kB 213*8kB 21*16kB 4*32kB 5*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 3028kB
25927 total pagecache pages
3010 pages in swap cache
Swap cache stats: add 205613, delete 202603, find 63665/79800
Free swap  = 485236kB
Total swap = 514040kB
65520 pages RAM
1663 pages reserved
14633 pages shared
52919 pages non-shared
ifconfig: page allocation failure. order:5, mode:0x8020
Pid: 5151, comm: ifconfig Not tainted 2.6.31+frans2+mel-00002-g90702f9-dirty #2
Call Trace:
 [<c015c4e1>] ? __alloc_pages_nodemask+0x423/0x468
 [<c0104de7>] ? dma_generic_alloc_coherent+0x4a/0xab
 [<c0104d9d>] ? dma_generic_alloc_coherent+0x0/0xab
 [<d1614b6f>] ? e100_alloc_cbs+0xc7/0x174 [e100]
 [<d1615bfe>] ? e100_up+0x1b/0xf5 [e100]
 [<d1615cef>] ? e100_open+0x17/0x41 [e100]
 [<c02f871f>] ? dev_open+0x8f/0xc5
 [<c02f7ed9>] ? dev_change_flags+0xa2/0x155
 [<c032daa6>] ? devinet_ioctl+0x22a/0x51c
 [<c02ebabe>] ? sock_ioctl+0x0/0x1e4
 [<c02ebc7e>] ? sock_ioctl+0x1c0/0x1e4
 [<c02ebabe>] ? sock_ioctl+0x0/0x1e4
 [<c017f23a>] ? vfs_ioctl+0x16/0x4a
 [<c017fb01>] ? do_vfs_ioctl+0x48a/0x4c1
 [<c0175fd1>] ? vfs_write+0xf4/0x105
 [<c017fb64>] ? sys_ioctl+0x2c/0x42
 [<c0102748>] ? sysenter_do_call+0x12/0x26
Mem-Info:
DMA per-cpu:
CPU    0: hi:    0, btch:   1 usd:   0
Normal per-cpu:
CPU    0: hi:   90, btch:  15 usd:  67
Active_anon:14760 active_file:10798 inactive_anon:22052
 inactive_file:11862 unevictable:0 dirty:6 writeback:30 unstable:0
 free:1031 slab:2083 mapped:6187 pagetables:417 bounce:0
DMA free:1096kB min:124kB low:152kB high:184kB active_anon:528kB inactive_anon:3440kB active_file:1076kB inactive_file:5580kB unevictable:0kB present:15868kB pages_scanned:0 all_unreclaimable? no
lowmem_reserve[]: 0 238 238
Normal free:3028kB min:1908kB low:2384kB high:2860kB active_anon:58512kB inactive_anon:84768kB active_file:42116kB inactive_file:41868kB unevictable:0kB present:243776kB pages_scanned:100 all_unreclaimable? no
lowmem_reserve[]: 0 0 0
DMA: 46*4kB 0*8kB 5*16kB 0*32kB 1*64kB 0*128kB 1*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 1096kB
Normal: 135*4kB 213*8kB 21*16kB 4*32kB 5*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 3028kB
25924 total pagecache pages
3037 pages in swap cache
Swap cache stats: add 205644, delete 202607, find 63666/79802
Free swap  = 485116kB
Total swap = 514040kB
65520 pages RAM
1663 pages reserved
14638 pages shared
52896 pages non-shared
e100 0000:00:03.0: firmware: requesting e100/d101s_ucode.bin
ADDRCONF(NETDEV_UP): eth0: link is not ready
e100: eth0 NIC Link is Up 100 Mbps Full Duplex
ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
eth0: no IPv6 routers present
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19 16:18                         ` Chris Mason
                                             ` (2 preceding siblings ...)
  2009-10-20 10:48                           ` Mel Gorman
@ 2009-10-20 10:48                           ` Mel Gorman
  2009-10-26 21:06                             ` Frans Pop
  2009-10-25 18:54                           ` Frans Pop
  4 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-20 10:48 UTC (permalink / raw)
  To: Chris Mason, Frans Pop, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, Mohamed Abbas, Jens Axboe, John W. Linville,
	linux-mm
On Tue, Oct 20, 2009 at 01:18:15AM +0900, Chris Mason wrote:
> On Mon, Oct 19, 2009 at 03:01:52PM +0100, Mel Gorman wrote:
> > 
> > > During the 2nd phase I see the first SKB allocation errors with a music 
> > > skip between reading commits 95.000 and 110.000.
> > > About commit 115.000 there is a very long pause during which the counter 
> > > does not increase, music stops and the desktop freezes completely. The 
> > > first 30 seconds of that freeze there is only very low disk activity (which 
> > > seems strange);
> > 
> > I'm just going to have to depend on Jens here. Jens, the congestion_wait() is
> > on BLK_RW_ASYNC after the commit. Reclaim usually writes pages asynchronously
> > but lumpy reclaim actually waits of pages to write out synchronously so
> > it's not always async.
> 
> Waiting doesn't make it synchronous from the elevator point of view ;)
> If you're using WB_SYNC_NONE, it's a async write.  WB_SYNC_ALL makes it
> a sync write.  I only see WB_SYNC_NONE in vmscan.c, so we should be
> using the async congestion wait.  (the exception is xfs which always
> does async writes).
> 
Right, reclaim always queues the pages for async IO but for lumpy reclaim,
it calls wait_on_page_writeback() but as you say, from an elevator point of
view, it's still async.
> But I'm honestly not 100% sure.  Looking back through the emails, the
> test case is doing IO on top of a whole lot of things on top of
> dm-crypt?  I just tried to figure out if dm-crypt is turning the async
> IO into sync IOs, but didn't quite make sense of it.
> 
I'm not overly sure either.
> Could you also please include which filesystems were being abused during
> the test and how?  Reading through the emails, I think you've got:
> 
> gitk being run 3 times on some FS (NFS?)
> streaming reads on NFS
> swap on dm-crypt
> 
> If other filesystems are being used, please correct me.  Also please
> include if they are on crypto or straight block device.
> 
I've attached a patch below that should allow us to cheat. When it's applied,
it outputs who called congestion_wait(), how long the timeout was and how
long it waited for. By comparing before and after sleep times, we should
be able to see which of the callers has significantly changed and if
it's something easily addressable.
> > Either way, reclaim is usually worried about writing pages but it would appear
> > after this change that a lot of read activity can also stall a process in
> > direct reclaim. What might be happening in Frans's particular case is that the
> > tasklet that allocates high-order pages for the RX buffers is getting stalled
> > by congestion caused by other processes doing reads from the filesystem.
> > While it makes sense from a congestion point of view to halt the IO, the
> > reclaim operations from direct reclaimers is getting delayed for long enough
> > to cause problems for GFP_ATOMIC.
> 
> The congestion_wait code either waits for congestion to clear or for
> a given timeout.  The part that isn't clear is if before the patch
> we waited a very short time (congestion cleared quickly) or a very long
> time (we hit the timeout or congestion cleared slowly).
> 
Using the instrumentation patch, I found with a very basic test that we
are waiting for short periods of time more often with the patch applied
      1 congestion_wait   rw=1 delay 6 timeout 25 :: before commit
      7 kswapd            congestion_wait   rw=1 delay 0 timeout 25 :: before commit
     32 kswapd            congestion_wait sync=0 delay 0 timeout 25 :: after commit
     61 kswapd            congestion_wait   rw=1 delay 1 timeout 25 :: before commit
    133 kswapd            congestion_wait sync=0 delay 1 timeout 25 :: after commit
     16 kswapd            congestion_wait   rw=1 delay 2 timeout 25 :: before commit
     70 kswapd            congestion_wait sync=0 delay 2 timeout 25 :: after commit
      1 try_to_free_pages congestion_wait sync=0 delay 2 timeout 25 :: after commit
     17 kswapd            congestion_wait   rw=1 delay 3 timeout 25 :: before commit
     28 kswapd            congestion_wait sync=0 delay 3 timeout 25 :: after commit
      1 try_to_free_pages congestion_wait sync=0 delay 3 timeout 25 :: after commit
     23 kswapd            congestion_wait   rw=1 delay 4 timeout 25 :: before commit
     16 kswapd            congestion_wait sync=0 delay 4 timeout 25 :: after commit
      5 try_to_free_pages congestion_wait sync=0 delay 4 timeout 25 :: after commit
     20 kswapd            congestion_wait   rw=1 delay 5 timeout 25 :: before commit
     18 kswapd            congestion_wait sync=0 delay 5 timeout 25 :: after commit
      3 try_to_free_pages congestion_wait sync=0 delay 5 timeout 25 :: after commit
     21 kswapd            congestion_wait   rw=1 delay 6 timeout 25 :: before commit
      8 kswapd            congestion_wait sync=0 delay 6 timeout 25 :: after commit
      2 try_to_free_pages congestion_wait sync=0 delay 6 timeout 25 :: after commit
     13 kswapd            congestion_wait   rw=1 delay 7 timeout 25 :: before commit
     12 kswapd            congestion_wait sync=0 delay 7 timeout 25 :: after commit
      2 try_to_free_pages congestion_wait sync=0 delay 7 timeout 25 :: after commit
      8 kswapd            congestion_wait   rw=1 delay 8 timeout 25 :: before commit
      7 kswapd            congestion_wait sync=0 delay 8 timeout 25 :: after commit
      9 kswapd            congestion_wait   rw=1 delay 9 timeout 25 :: before commit
      5 kswapd            congestion_wait sync=0 delay 9 timeout 25 :: after commit
      2 try_to_free_pages congestion_wait sync=0 delay 9 timeout 25 :: after commit
      4 kswapd            congestion_wait   rw=1 delay 10 timeout 25 :: before commit
      5 kswapd            congestion_wait sync=0 delay 10 timeout 25 :: after commit
      1 try_to_free_pages congestion_wait sync=0 delay 10 timeout 25 :: after commit
[... remaining output snipped ...]
The before and after commit are really 2.6.31 and 2.6.31-patch-reverted.
The first column is how many times we delayed for that length of time.
To generate the output, I just took the console log from both kernels with
a basic test, put the congestion_wait lines into two separate files and
cat congestion-*-sorted | sort -n -k5 | uniq -c
to give a count of how many times we delayed for a particular caller.
> The easiest way to tell is to just replace the congestion_wait() calls
> in direct reclaim with schedule_timeout_interruptible(10), test, then
> schedule_timeout_interruptible(HZ/20), then test again.
> 
Reclaim can also call congestion_wait() and maybe the problem isn't
within the page allocator at all but that it's indirectly affected by
timing.
> > 
> > Does this sound plausible to you? If so, what's the best way of
> > addressing this? Changing congestion_wait back to WRITE (assuming that
> > works for Frans)? Changing it to SYNC (again, assuming it actually
> > works) or a revert?
> 
> I don't think changing it to SYNC is a good plan unless we're actually
> doing sync io.  It would be better to just wait on one of the pages that
> you've sent down (or its hashed waitqueue since the page can go away).
> 
Frans, is there any chance you could apply the following patch and get
the console logs for a vanilla kernel and with the congestion patches
reverted? I'm hoping it'll be able to tell us which of the callers has
significantly changed in timing. If there is one caller that has
significantly changed, it might be enough to address just that caller.
=====
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19 16:18                         ` Chris Mason
  2009-10-19 17:01                           ` Christoph Hellwig
  2009-10-19 17:01                           ` Christoph Hellwig
@ 2009-10-20 10:48                           ` Mel Gorman
  2009-10-20 10:48                           ` Mel Gorman
  2009-10-25 18:54                           ` Frans Pop
  4 siblings, 0 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-20 10:48 UTC (permalink / raw)
  To: Chris Mason, Frans Pop, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki
On Tue, Oct 20, 2009 at 01:18:15AM +0900, Chris Mason wrote:
> On Mon, Oct 19, 2009 at 03:01:52PM +0100, Mel Gorman wrote:
> > 
> > > During the 2nd phase I see the first SKB allocation errors with a music 
> > > skip between reading commits 95.000 and 110.000.
> > > About commit 115.000 there is a very long pause during which the counter 
> > > does not increase, music stops and the desktop freezes completely. The 
> > > first 30 seconds of that freeze there is only very low disk activity (which 
> > > seems strange);
> > 
> > I'm just going to have to depend on Jens here. Jens, the congestion_wait() is
> > on BLK_RW_ASYNC after the commit. Reclaim usually writes pages asynchronously
> > but lumpy reclaim actually waits of pages to write out synchronously so
> > it's not always async.
> 
> Waiting doesn't make it synchronous from the elevator point of view ;)
> If you're using WB_SYNC_NONE, it's a async write.  WB_SYNC_ALL makes it
> a sync write.  I only see WB_SYNC_NONE in vmscan.c, so we should be
> using the async congestion wait.  (the exception is xfs which always
> does async writes).
> 
Right, reclaim always queues the pages for async IO but for lumpy reclaim,
it calls wait_on_page_writeback() but as you say, from an elevator point of
view, it's still async.
> But I'm honestly not 100% sure.  Looking back through the emails, the
> test case is doing IO on top of a whole lot of things on top of
> dm-crypt?  I just tried to figure out if dm-crypt is turning the async
> IO into sync IOs, but didn't quite make sense of it.
> 
I'm not overly sure either.
> Could you also please include which filesystems were being abused during
> the test and how?  Reading through the emails, I think you've got:
> 
> gitk being run 3 times on some FS (NFS?)
> streaming reads on NFS
> swap on dm-crypt
> 
> If other filesystems are being used, please correct me.  Also please
> include if they are on crypto or straight block device.
> 
I've attached a patch below that should allow us to cheat. When it's applied,
it outputs who called congestion_wait(), how long the timeout was and how
long it waited for. By comparing before and after sleep times, we should
be able to see which of the callers has significantly changed and if
it's something easily addressable.
> > Either way, reclaim is usually worried about writing pages but it would appear
> > after this change that a lot of read activity can also stall a process in
> > direct reclaim. What might be happening in Frans's particular case is that the
> > tasklet that allocates high-order pages for the RX buffers is getting stalled
> > by congestion caused by other processes doing reads from the filesystem.
> > While it makes sense from a congestion point of view to halt the IO, the
> > reclaim operations from direct reclaimers is getting delayed for long enough
> > to cause problems for GFP_ATOMIC.
> 
> The congestion_wait code either waits for congestion to clear or for
> a given timeout.  The part that isn't clear is if before the patch
> we waited a very short time (congestion cleared quickly) or a very long
> time (we hit the timeout or congestion cleared slowly).
> 
Using the instrumentation patch, I found with a very basic test that we
are waiting for short periods of time more often with the patch applied
      1 congestion_wait   rw=1 delay 6 timeout 25 :: before commit
      7 kswapd            congestion_wait   rw=1 delay 0 timeout 25 :: before commit
     32 kswapd            congestion_wait sync=0 delay 0 timeout 25 :: after commit
     61 kswapd            congestion_wait   rw=1 delay 1 timeout 25 :: before commit
    133 kswapd            congestion_wait sync=0 delay 1 timeout 25 :: after commit
     16 kswapd            congestion_wait   rw=1 delay 2 timeout 25 :: before commit
     70 kswapd            congestion_wait sync=0 delay 2 timeout 25 :: after commit
      1 try_to_free_pages congestion_wait sync=0 delay 2 timeout 25 :: after commit
     17 kswapd            congestion_wait   rw=1 delay 3 timeout 25 :: before commit
     28 kswapd            congestion_wait sync=0 delay 3 timeout 25 :: after commit
      1 try_to_free_pages congestion_wait sync=0 delay 3 timeout 25 :: after commit
     23 kswapd            congestion_wait   rw=1 delay 4 timeout 25 :: before commit
     16 kswapd            congestion_wait sync=0 delay 4 timeout 25 :: after commit
      5 try_to_free_pages congestion_wait sync=0 delay 4 timeout 25 :: after commit
     20 kswapd            congestion_wait   rw=1 delay 5 timeout 25 :: before commit
     18 kswapd            congestion_wait sync=0 delay 5 timeout 25 :: after commit
      3 try_to_free_pages congestion_wait sync=0 delay 5 timeout 25 :: after commit
     21 kswapd            congestion_wait   rw=1 delay 6 timeout 25 :: before commit
      8 kswapd            congestion_wait sync=0 delay 6 timeout 25 :: after commit
      2 try_to_free_pages congestion_wait sync=0 delay 6 timeout 25 :: after commit
     13 kswapd            congestion_wait   rw=1 delay 7 timeout 25 :: before commit
     12 kswapd            congestion_wait sync=0 delay 7 timeout 25 :: after commit
      2 try_to_free_pages congestion_wait sync=0 delay 7 timeout 25 :: after commit
      8 kswapd            congestion_wait   rw=1 delay 8 timeout 25 :: before commit
      7 kswapd            congestion_wait sync=0 delay 8 timeout 25 :: after commit
      9 kswapd            congestion_wait   rw=1 delay 9 timeout 25 :: before commit
      5 kswapd            congestion_wait sync=0 delay 9 timeout 25 :: after commit
      2 try_to_free_pages congestion_wait sync=0 delay 9 timeout 25 :: after commit
      4 kswapd            congestion_wait   rw=1 delay 10 timeout 25 :: before commit
      5 kswapd            congestion_wait sync=0 delay 10 timeout 25 :: after commit
      1 try_to_free_pages congestion_wait sync=0 delay 10 timeout 25 :: after commit
[... remaining output snipped ...]
The before and after commit are really 2.6.31 and 2.6.31-patch-reverted.
The first column is how many times we delayed for that length of time.
To generate the output, I just took the console log from both kernels with
a basic test, put the congestion_wait lines into two separate files and
cat congestion-*-sorted | sort -n -k5 | uniq -c
to give a count of how many times we delayed for a particular caller.
> The easiest way to tell is to just replace the congestion_wait() calls
> in direct reclaim with schedule_timeout_interruptible(10), test, then
> schedule_timeout_interruptible(HZ/20), then test again.
> 
Reclaim can also call congestion_wait() and maybe the problem isn't
within the page allocator at all but that it's indirectly affected by
timing.
> > 
> > Does this sound plausible to you? If so, what's the best way of
> > addressing this? Changing congestion_wait back to WRITE (assuming that
> > works for Frans)? Changing it to SYNC (again, assuming it actually
> > works) or a revert?
> 
> I don't think changing it to SYNC is a good plan unless we're actually
> doing sync io.  It would be better to just wait on one of the pages that
> you've sent down (or its hashed waitqueue since the page can go away).
> 
Frans, is there any chance you could apply the following patch and get
the console logs for a vanilla kernel and with the congestion patches
reverted? I'm hoping it'll be able to tell us which of the callers has
significantly changed in timing. If there is one caller that has
significantly changed, it might be enough to address just that caller.
=====
>From 757999066dc41f2e053d59589c673052fc7c1a65 Mon Sep 17 00:00:00 2001
From: Mel Gorman <mel@csn.ul.ie>
Date: Tue, 20 Oct 2009 11:01:57 +0100
Subject: [PATCH] Instrument congestion_wait
This patch instruments how long congestion_wait() really waited for a
given caller.
diff --git a/mm/backing-dev.c b/mm/backing-dev.c
index 3d3accb..fc945e0 100644
--- a/mm/backing-dev.c
+++ b/mm/backing-dev.c
@@ -10,6 +10,7 @@
 #include <linux/module.h>
 #include <linux/writeback.h>
 #include <linux/device.h>
+#include <linux/kallsyms.h>
 
 void default_unplug_io_fn(struct backing_dev_info *bdi, struct page *page)
 {
@@ -729,6 +730,11 @@ EXPORT_SYMBOL(set_bdi_congested);
  */
 long congestion_wait(int sync, long timeout)
 {
+	unsigned long jiffies_start = jiffies;
+	char *module;
+	char buf[128];
+	const char *symbol;
+	unsigned long offset, symbolsize;
 	long ret;
 	DEFINE_WAIT(wait);
 	wait_queue_head_t *wqh = &congestion_wqh[sync];
@@ -736,6 +742,13 @@ long congestion_wait(int sync, long timeout)
 	prepare_to_wait(wqh, &wait, TASK_UNINTERRUPTIBLE);
 	ret = io_schedule_timeout(timeout);
 	finish_wait(wqh, &wait);
+
+	symbol = kallsyms_lookup(_RET_IP_, &symbolsize, &offset, &module, buf),
+	printk(KERN_INFO "%-20s congestion_wait sync=%d delay %lu timeout %ld\n",
+		symbol,
+		sync,
+		jiffies - jiffies_start,
+		timeout);
 	return ret;
 }
 EXPORT_SYMBOL(congestion_wait);
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-19 20:17                                         ` Tobias Oetiker
@ 2009-10-20 10:57                                           ` Mel Gorman
  2009-10-20 11:44                                             ` Tobias Oetiker
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-20 10:57 UTC (permalink / raw)
  To: Tobias Oetiker
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
On Mon, Oct 19, 2009 at 10:17:06PM +0200, Tobias Oetiker wrote:
> Hi Mel,
> 
> Today Tobias Oetiker wrote:
> 
> > Hi Mel,
> >
> > Today Mel Gorman wrote:
> >
> > > >
> > > > if you can send me a consolidated patch which does apply to
> > > > 2.6.31.4 I will be glad to try ...
> > > >
> > >
> > > Sure
> > >
> > > ==== CUT HERE ====
> > >
> > > From 6c0215af3b7c39ef7b8083ea38ca3ad93cd3f51f Mon Sep 17 00:00:00 2001
> > > From: Mel Gorman <mel@csn.ul.ie>
> > > Date: Mon, 19 Oct 2009 15:40:43 +0100
> > > Subject: [PATCH] Kick off kswapd after direct reclaim and revert congestion changes
> > >
> > > The following patch is http://lkml.org/lkml/2009/10/16/89 on top of
> > > 2.6.31.4 as well as patches 373c0a7e and 8aa7e847 reverted.
> >
> > it seems to help ... the server has been running for 3 hours now
> > without incident, but then again it is not as active as during the
> > day, ... will report tomorrow.
> 
> while I was writing, the system found that the patch does not realy
> help:
> 
> Oct 19 22:09:52 johan kernel: [11157.121506] smtpd: page allocation failure. order:5, mode:0x4020 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121514] Pid: 19324, comm: smtpd Tainted: G      D    2.6.31.4-oep #1 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121518] Call Trace: [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121521]  <IRQ>  [<ffffffff810cb599>] __alloc_pages_nodemask+0x549/0x650 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121563]  [<ffffffffa02bde3b>] ? __nf_ct_refresh_acct+0xab/0x110 [nf_conntrack] [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121572]  [<ffffffffa02a8337>] ? ipt_do_table+0x2f7/0x610 [ip_tables] [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121580]  [<ffffffff810fac18>] kmalloc_large_node+0x68/0xc0 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121585]  [<ffffffff810fe90a>] __kmalloc_node_track_caller+0x11a/0x180 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121592]  [<ffffffff813ebd42>] ? skb_copy+0x32/0xa0 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121596]  [<ffffffff813e9606>] __alloc_skb+0x76/0x180 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121600]  [<ffffffff813ebd42>] skb_copy+0x32/0xa0 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121615]  [<ffffffffa07dd33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121620]  [<ffffffff813f2512>] dev_hard_start_xmit+0x142/0x320 [kern.warning]
Are the number of failures at least reduced or are they occuring at the
same rate? Also, what was the last kernel that worked for you with this
configuration?
Thanks
> Oct 19 22:09:52 johan kernel: [11157.121632]  [<ffffffff8140a2c1>] __qdisc_run+0x1a1/0x230 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121637]  [<ffffffff813f41e0>] dev_queue_xmit+0x2b0/0x3a0 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121642]  [<ffffffff8142349b>] ip_finish_output+0x11b/0x2f0 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121646]  [<ffffffff814236f9>] ip_output+0x89/0xd0 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121650]  [<ffffffff81422710>] ip_local_out+0x20/0x30 [kern.warning]
> Oct 19 22:09:52 johan kernel: [11157.121654]  [<ffffffff81422ffb>] ip_queue_xmit+0x22b/0x3f0 [kern.warning]
> 
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 10:57                                           ` Mel Gorman
@ 2009-10-20 11:44                                             ` Tobias Oetiker
  2009-10-20 12:51                                               ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-20 11:44 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Mel Gorman wrote:
> On Mon, Oct 19, 2009 at 10:17:06PM +0200, Tobias Oetiker wrote:
> > Oct 19 22:09:52 johan kernel: [11157.121600]  [<ffffffff813ebd42>] skb_copy+0x32/0xa0 [kern.warning]
> > Oct 19 22:09:52 johan kernel: [11157.121615]  [<ffffffffa07dd33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
> > Oct 19 22:09:52 johan kernel: [11157.121620]  [<ffffffff813f2512>] dev_hard_start_xmit+0x142/0x320 [kern.warning]
>
> Are the number of failures at least reduced or are they occuring at the
> same rate?
not that it would have any statistical significance, but I had 5
failure (clusters) yesterday morning and 5 this morning ...
the failures often show up in groups I saved one on
http://tobi.oetiker.ch/cluster-2009-10-20-08-31.txt
> Also, what was the last kernel that worked for you with this
> configuration?
that would be 2.6.24 ... I have not upgraded in quite some time.
But since the io performance of 2.6.31 is about double in my tests
I thought it would be a good thing todo ...
cheers
tobi
> Thanks
>
> > Oct 19 22:09:52 johan kernel: [11157.121632]  [<ffffffff8140a2c1>] __qdisc_run+0x1a1/0x230 [kern.warning]
> > Oct 19 22:09:52 johan kernel: [11157.121637]  [<ffffffff813f41e0>] dev_queue_xmit+0x2b0/0x3a0 [kern.warning]
> > Oct 19 22:09:52 johan kernel: [11157.121642]  [<ffffffff8142349b>] ip_finish_output+0x11b/0x2f0 [kern.warning]
> > Oct 19 22:09:52 johan kernel: [11157.121646]  [<ffffffff814236f9>] ip_output+0x89/0xd0 [kern.warning]
> > Oct 19 22:09:52 johan kernel: [11157.121650]  [<ffffffff81422710>] ip_local_out+0x20/0x30 [kern.warning]
> > Oct 19 22:09:52 johan kernel: [11157.121654]  [<ffffffff81422ffb>] ip_queue_xmit+0x22b/0x3f0 [kern.warning]
> >
>
>
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 11:44                                             ` Tobias Oetiker
@ 2009-10-20 12:51                                               ` Mel Gorman
  2009-10-20 12:58                                                 ` Tobias Oetiker
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-20 12:51 UTC (permalink / raw)
  To: Tobias Oetiker
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
On Tue, Oct 20, 2009 at 01:44:50PM +0200, Tobias Oetiker wrote:
> Hi Mel,
> 
> Today Mel Gorman wrote:
> 
> > On Mon, Oct 19, 2009 at 10:17:06PM +0200, Tobias Oetiker wrote:
> 
> > > Oct 19 22:09:52 johan kernel: [11157.121600]  [<ffffffff813ebd42>] skb_copy+0x32/0xa0 [kern.warning]
> > > Oct 19 22:09:52 johan kernel: [11157.121615]  [<ffffffffa07dd33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
> > > Oct 19 22:09:52 johan kernel: [11157.121620]  [<ffffffff813f2512>] dev_hard_start_xmit+0x142/0x320 [kern.warning]
> >
> > Are the number of failures at least reduced or are they occuring at the
> > same rate?
> 
> not that it would have any statistical significance, but I had 5
> failure (clusters) yesterday morning and 5 this morning ...
> 
Before the patches were applied, how many failures were you seeing in
the morning?
> the failures often show up in groups I saved one on
> http://tobi.oetiker.ch/cluster-2009-10-20-08-31.txt
> 
> > Also, what was the last kernel that worked for you with this
> > configuration?
> 
> that would be 2.6.24 ... I have not upgraded in quite some time.
> But since the io performance of 2.6.31 is about double in my tests
> I thought it would be a good thing todo ...
> 
That significant a different in performance may explain differences in timing
as well. i.e. the allocator is being put under more pressure now than it
was previously as more processes make forward progress.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 12:51                                               ` Mel Gorman
@ 2009-10-20 12:58                                                 ` Tobias Oetiker
  2009-10-20 13:39                                                   ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-20 12:58 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Mel Gorman wrote:
> On Tue, Oct 20, 2009 at 01:44:50PM +0200, Tobias Oetiker wrote:
> > Hi Mel,
> >
> > Today Mel Gorman wrote:
> >
> > > On Mon, Oct 19, 2009 at 10:17:06PM +0200, Tobias Oetiker wrote:
> >
> > > > Oct 19 22:09:52 johan kernel: [11157.121600]  [<ffffffff813ebd42>] skb_copy+0x32/0xa0 [kern.warning]
> > > > Oct 19 22:09:52 johan kernel: [11157.121615]  [<ffffffffa07dd33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
> > > > Oct 19 22:09:52 johan kernel: [11157.121620]  [<ffffffff813f2512>] dev_hard_start_xmit+0x142/0x320 [kern.warning]
> > >
> > > Are the number of failures at least reduced or are they occuring at the
> > > same rate?
> >
> > not that it would have any statistical significance, but I had 5
> > failure (clusters) yesterday morning and 5 this morning ...
> >
>
> Before the patches were applied, how many failures were you seeing in
> the morning?
5 as well ... before an after ...
> > the failures often show up in groups I saved one on
> > http://tobi.oetiker.ch/cluster-2009-10-20-08-31.txt
> >
> > > Also, what was the last kernel that worked for you with this
> > > configuration?
> >
> > that would be 2.6.24 ... I have not upgraded in quite some time.
> > But since the io performance of 2.6.31 is about double in my tests
> > I thought it would be a good thing todo ...
> >
>
> That significant a different in performance may explain differences in timing
> as well. i.e. the allocator is being put under more pressure now than it
> was previously as more processes make forward progress.
you are saing that the problem might be even older ?
we do have 8GB ram and 16 GB swap, so it should not fail to allocate all that
often
top - 14:58:34 up 19:54,  6 users,  load average: 2.09, 1.94, 1.97
Tasks: 451 total,   1 running, 449 sleeping,   0 stopped,   1 zombie
Cpu(s):  3.5%us, 15.5%sy,  2.0%ni, 72.2%id,  6.5%wa,  0.1%hi,  0.3%si,  0.0%st
Mem:   8198504k total,  7599132k used,   599372k free,  1212636k buffers
Swap: 16777208k total,    83568k used, 16693640k free,   610136k cached
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 12:58                                                 ` Tobias Oetiker
@ 2009-10-20 13:39                                                   ` Mel Gorman
  2009-10-20 13:50                                                     ` Tobias Oetiker
  2009-10-22 10:27                                                     ` Tobias Oetiker
  0 siblings, 2 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-20 13:39 UTC (permalink / raw)
  To: Tobias Oetiker
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
On Tue, Oct 20, 2009 at 02:58:53PM +0200, Tobias Oetiker wrote:
> Hi Mel,
> 
> Today Mel Gorman wrote:
> 
> > On Tue, Oct 20, 2009 at 01:44:50PM +0200, Tobias Oetiker wrote:
> > > Hi Mel,
> > >
> > > Today Mel Gorman wrote:
> > >
> > > > On Mon, Oct 19, 2009 at 10:17:06PM +0200, Tobias Oetiker wrote:
> > >
> > > > > Oct 19 22:09:52 johan kernel: [11157.121600]  [<ffffffff813ebd42>] skb_copy+0x32/0xa0 [kern.warning]
> > > > > Oct 19 22:09:52 johan kernel: [11157.121615]  [<ffffffffa07dd33c>] vboxNetFltLinuxPacketHandler+0x5c/0xd0 [vboxnetflt] [kern.warning]
> > > > > Oct 19 22:09:52 johan kernel: [11157.121620]  [<ffffffff813f2512>] dev_hard_start_xmit+0x142/0x320 [kern.warning]
> > > >
> > > > Are the number of failures at least reduced or are they occuring at the
> > > > same rate?
> > >
> > > not that it would have any statistical significance, but I had 5
> > > failure (clusters) yesterday morning and 5 this morning ...
> > >
> >
> > Before the patches were applied, how many failures were you seeing in
> > the morning?
> 
> 5 as well ... before an after ...
> 
> > > the failures often show up in groups I saved one on
> > > http://tobi.oetiker.ch/cluster-2009-10-20-08-31.txt
> > >
> > > > Also, what was the last kernel that worked for you with this
> > > > configuration?
> > >
> > > that would be 2.6.24 ... I have not upgraded in quite some time.
> > > But since the io performance of 2.6.31 is about double in my tests
> > > I thought it would be a good thing todo ...
> > >
> >
> > That significant a different in performance may explain differences in timing
> > as well. i.e. the allocator is being put under more pressure now than it
> > was previously as more processes make forward progress.
> 
> you are saing that the problem might be even older ?
> 
> we do have 8GB ram and 16 GB swap, so it should not fail to allocate all that
> often
> 
> top - 14:58:34 up 19:54,  6 users,  load average: 2.09, 1.94, 1.97
> Tasks: 451 total,   1 running, 449 sleeping,   0 stopped,   1 zombie
> Cpu(s):  3.5%us, 15.5%sy,  2.0%ni, 72.2%id,  6.5%wa,  0.1%hi,  0.3%si,  0.0%st
> Mem:   8198504k total,  7599132k used,   599372k free,  1212636k buffers
> Swap: 16777208k total,    83568k used, 16693640k free,   610136k cached
> 
High-order atomic allocations of the type you are trying at that frequency
were always a very long shot. The most likely outcome is that something
has changed that means a burst of allocations trigger an allocation failure
where as before processes would delay long enough for the system not to notice.
1. Have MTU settings changed?
2. As order-5 allocations are required to succeed, I'm surprised in a
   sense that there are only 5 failures because it implies the machine is
   actually recovering and continueing on as normal. Can you think of what
   happens in the morning that causes a burst of allocations to occur?
3. Other than the failures, have you noticed any other problems with the
   machine or does it continue along happily?
4. Does the following patch help by any chance?
Thanks
==== CUT HERE ====
vmscan: Force kswapd to take notice faster when high-order watermarks are being hit
When a high-order allocation fails, kswapd is kicked so that it reclaims
at a higher-order to avoid direct reclaimers stall and to help GFP_ATOMIC
allocations. Something has changed in recent kernels that affect the timing
where high-order GFP_ATOMIC allocations are now failing with more frequency,
particularly under pressure. This patch forces kswapd to notice sooner that
high-order allocations are occuring by checking when watermarks are hit early
and by having kswapd restart quickly when the reclaim order is increased.
Not-signed-off-by-because-this-is-a-hatchet-job: Mel Gorman <mel@csn.ul.ie>
--- 
 mm/page_alloc.c |   14 ++++++++++++--
 mm/vmscan.c     |    9 +++++++++
 2 files changed, 21 insertions(+), 2 deletions(-)
diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index 2fd7b20..fdbf8c9 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -1907,6 +1906,17 @@ __alloc_pages_nodemask(gfp_t gfp_mask, unsigned int order,
 				zonelist, high_zoneidx, nodemask,
 				preferred_zone, migratetype);
 
+	/*
+	 * If after a high-order allocation we are now below watermarks,
+	 * pre-emptively kick kswapd rather than having the next allocation
+	 * fail and have to wake up kswapd, potentially failing GFP_ATOMIC
+	 * allocations or entering direct reclaim
+	 */
+	if (unlikely(order) && page && !zone_watermark_ok(preferred_zone, order,
+				preferred_zone->watermark[ALLOC_WMARK_LOW],
+				zone_idx(preferred_zone), ALLOC_WMARK_LOW))
+		wake_all_kswapd(order, zonelist, high_zoneidx);
+
 	return page;
 }
 EXPORT_SYMBOL(__alloc_pages_nodemask);
diff --git a/mm/vmscan.c b/mm/vmscan.c
index 9219beb..0e66a6b 100644
--- a/mm/vmscan.c
+++ b/mm/vmscan.c
@@ -1925,6 +1925,15 @@ loop_again:
 					priority != DEF_PRIORITY)
 				continue;
 
+			/*
+			 * Exit quickly to restart if it has been indicated
+			 * that higher orders are required
+			 */
+			if (pgdat->kswapd_max_order > order) {
+				all_zones_ok = 1;
+				goto out;
+			}
+
 			if (!zone_watermark_ok(zone, order,
 					high_wmark_pages(zone), end_zone, 0))
 				all_zones_ok = 0;
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 13:39                                                   ` Mel Gorman
@ 2009-10-20 13:50                                                     ` Tobias Oetiker
  2009-10-20 14:14                                                       ` Mel Gorman
  2009-10-22 10:27                                                     ` Tobias Oetiker
  1 sibling, 1 reply; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-20 13:50 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Mel Gorman wrote:
> On Tue, Oct 20, 2009 at 02:58:53PM +0200, Tobias Oetiker wrote:
> > you are saing that the problem might be even older ?
> >
> > we do have 8GB ram and 16 GB swap, so it should not fail to allocate all that
> > often
> >
> > top - 14:58:34 up 19:54,  6 users,  load average: 2.09, 1.94, 1.97
> > Tasks: 451 total,   1 running, 449 sleeping,   0 stopped,   1 zombie
> > Cpu(s):  3.5%us, 15.5%sy,  2.0%ni, 72.2%id,  6.5%wa,  0.1%hi,  0.3%si,  0.0%st
> > Mem:   8198504k total,  7599132k used,   599372k free,  1212636k buffers
> > Swap: 16777208k total,    83568k used, 16693640k free,   610136k cached
> >
>
> High-order atomic allocations of the type you are trying at that frequency
> were always a very long shot. The most likely outcome is that something
> has changed that means a burst of allocations trigger an allocation failure
> where as before processes would delay long enough for the system not to notice.
>
> 1. Have MTU settings changed?
no not to my knowledge
> 2. As order-5 allocations are required to succeed, I'm surprised in a
>    sense that there are only 5 failures because it implies the machine is
>    actually recovering and continueing on as normal. Can you think of what
>    happens in the morning that causes a burst of allocations to occur?
the burts occur all day while the machine is in use ... its just
that I was  writing this at noon so only the morning had passed. So
I compared things to the day before ...
> 3. Other than the failures, have you noticed any other problems with the
>    machine or does it continue along happily?
The machine seems to be fine.
> 4. Does the following patch help by any chance?
should I try this on vanilla 2.6.31.4 or ontop of your previous
patch?
we are running virtualbox 3.0.8 on this machine, virtualbox is using
the physical network interface in bridge mode access the network.
Could this have something todo with the problem ?
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 13:50                                                     ` Tobias Oetiker
@ 2009-10-20 14:14                                                       ` Mel Gorman
  2009-10-20 14:20                                                         ` Tobias Oetiker
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-20 14:14 UTC (permalink / raw)
  To: Tobias Oetiker
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
On Tue, Oct 20, 2009 at 03:50:12PM +0200, Tobias Oetiker wrote:
> Hi Mel,
> 
> Today Mel Gorman wrote:
> 
> > On Tue, Oct 20, 2009 at 02:58:53PM +0200, Tobias Oetiker wrote:
> > > you are saing that the problem might be even older ?
> > >
> > > we do have 8GB ram and 16 GB swap, so it should not fail to allocate all that
> > > often
> > >
> > > top - 14:58:34 up 19:54,  6 users,  load average: 2.09, 1.94, 1.97
> > > Tasks: 451 total,   1 running, 449 sleeping,   0 stopped,   1 zombie
> > > Cpu(s):  3.5%us, 15.5%sy,  2.0%ni, 72.2%id,  6.5%wa,  0.1%hi,  0.3%si,  0.0%st
> > > Mem:   8198504k total,  7599132k used,   599372k free,  1212636k buffers
> > > Swap: 16777208k total,    83568k used, 16693640k free,   610136k cached
> > >
> >
> > High-order atomic allocations of the type you are trying at that frequency
> > were always a very long shot. The most likely outcome is that something
> > has changed that means a burst of allocations trigger an allocation failure
> > where as before processes would delay long enough for the system not to notice.
> >
> > 1. Have MTU settings changed?
> 
> no not to my knowledge
> 
> > 2. As order-5 allocations are required to succeed, I'm surprised in a
> >    sense that there are only 5 failures because it implies the machine is
> >    actually recovering and continueing on as normal. Can you think of what
> >    happens in the morning that causes a burst of allocations to occur?
> 
> the burts occur all day while the machine is in use ... its just
> that I was  writing this at noon so only the morning had passed. So
> I compared things to the day before ...
> 
Over the course of a day, how many would you see? By and large, it seems
that the problem yourself and Frans are similar except his is a lot more
severe.
> > 3. Other than the failures, have you noticed any other problems with the
> >    machine or does it continue along happily?
> 
> The machine seems to be fine.
> 
> > 4. Does the following patch help by any chance?
> 
> should I try this on vanilla 2.6.31.4 or ontop of your previous
> patch?
> 
Try on top of vanilla 2.6.31.4 first plase and if failures still occur,
then on top of the previous patch.
> we are running virtualbox 3.0.8 on this machine, virtualbox is using
> the physical network interface in bridge mode access the network.
> Could this have something todo with the problem ?
> 
I do not know for sure. I'm assuming the configuration is the same on
both kernels so it's unlikely to be the issue.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 14:14                                                       ` Mel Gorman
@ 2009-10-20 14:20                                                         ` Tobias Oetiker
  0 siblings, 0 replies; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-20 14:20 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Today Mel Gorman wrote:
>
> Over the course of a day, how many would you see? By and large, it seems
> that the problem yourself and Frans are similar except his is a lot more
> severe.
yesterday it was 19 for 24 hours, today it is 9 for 16 hours (day
is not done yet).
> Try on top of vanilla 2.6.31.4 first plase and if failures still occur,
> then on top of the previous patch.
ok
> > we are running virtualbox 3.0.8 on this machine, virtualbox is using
> > the physical network interface in bridge mode access the network.
> > Could this have something todo with the problem ?
> >
>
> I do not know for sure. I'm assuming the configuration is the same on
> both kernels so it's unlikely to be the issue.
just to be on the sure side I created a tickt with the virtualbox
people ... http://www.virtualbox.org/ticket/5260
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* [PATCH] SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab() (was: Re: [Bug #14265] ifconfig: page allocation failure. order:5,ode:0x8020 w/ e100)
       [not found] ` <COE24pZSBH.A.rP.2MTxKB@chimera>
@ 2009-10-21 20:04   ` Karol Lewandowski
  2009-10-21 21:06     ` David Rientjes
  0 siblings, 1 reply; 88+ messages in thread
From: Karol Lewandowski @ 2009-10-21 20:04 UTC (permalink / raw)
  To: Rafael J. Wysocki
  Cc: Linux Kernel Mailing List, Kernel Testers List, Karol Lewandowski,
	Mel Gorman, Frans Pop, Pekka Enberg, David Rientjes,
	KOSAKI Motohiro, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe,
	Tobias Oetiker
On Thu, Oct 01, 2009 at 09:56:04PM +0200, Rafael J. Wysocki wrote:
> Bug-Entry	: http://bugzilla.kernel.org/show_bug.cgi?id=14265
> Subject		: ifconfig: page allocation failure. order:5, mode:0x8020 w/ e100
> Submitter	: Karol Lewandowski <karol.k.lewandowski@gmail.com>
> Date		: 2009-09-15 12:05 (17 days old)
> References	: http://marc.info/?l=linux-kernel&m=125301636509517&w=4
Guys, could anyone check if patch below helps?  I think I've finally
found culprit of all allocation failures (but I might be wrong
too... ;-)
Thanks.
commit d6849591e042bceb66f1b4513a1df6740d2ad762
Author: Karol Lewandowski <karol.k.lewandowski@gmail.com>
Date:   Wed Oct 21 21:01:20 2009 +0200
    SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab()
    
    Commit ba52270d18fb17ce2cf176b35419dab1e43fe4a3 unconditionally
    cleared __GFP_NOFAIL flag on all allocations.
    
    Preserve this flag on second attempt to allocate page (with possibly
    decreased order).
    
    This should help with bugs #14265, #14141 and similar.
    
    Signed-off-by: Karol Lewandowski <karol.k.lewandowski@gmail.com>
diff --git a/mm/slub.c b/mm/slub.c
index b627675..ac5db65 100644
--- a/mm/slub.c
+++ b/mm/slub.c
@@ -1084,7 +1084,7 @@ static struct page *allocate_slab(struct kmem_cache *s, gfp_t flags, int node)
 {
 	struct page *page;
 	struct kmem_cache_order_objects oo = s->oo;
-	gfp_t alloc_gfp;
+	gfp_t alloc_gfp, nofail;
 
 	flags |= s->allocflags;
 
@@ -1092,6 +1092,7 @@ static struct page *allocate_slab(struct kmem_cache *s, gfp_t flags, int node)
 	 * Let the initial higher-order allocation fail under memory pressure
 	 * so we fall-back to the minimum order allocation.
 	 */
+	nofail = flags & __GFP_NOFAIL;
 	alloc_gfp = (flags | __GFP_NOWARN | __GFP_NORETRY) & ~__GFP_NOFAIL;
 
 	page = alloc_slab_page(alloc_gfp, node, oo);
@@ -1100,8 +1101,10 @@ static struct page *allocate_slab(struct kmem_cache *s, gfp_t flags, int node)
 		/*
 		 * Allocation may have failed due to fragmentation.
 		 * Try a lower order alloc if possible
+		 *
+		 * Preserve __GFP_NOFAIL flag if previous allocation failed.
 		 */
-		page = alloc_slab_page(flags, node, oo);
+		page = alloc_slab_page(flags | nofail, node, oo);
 		if (!page)
 			return NULL;
 
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related	[flat|nested] 88+ messages in thread
* Re: [PATCH] SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab() (was: Re: [Bug #14265] ifconfig: page allocation failure. order:5,ode:0x8020 w/ e100)
  2009-10-21 20:04   ` [PATCH] SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab() (was: Re: [Bug #14265] ifconfig: page allocation failure. order:5,ode:0x8020 w/ e100) Karol Lewandowski
@ 2009-10-21 21:06     ` David Rientjes
  2009-10-21 21:20       ` Karol Lewandowski
  0 siblings, 1 reply; 88+ messages in thread
From: David Rientjes @ 2009-10-21 21:06 UTC (permalink / raw)
  To: Karol Lewandowski
  Cc: Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Mel Gorman, Frans Pop, Pekka Enberg, KOSAKI Motohiro,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe, Tobias Oetiker
On Wed, 21 Oct 2009, Karol Lewandowski wrote:
> commit d6849591e042bceb66f1b4513a1df6740d2ad762
> Author: Karol Lewandowski <karol.k.lewandowski@gmail.com>
> Date:   Wed Oct 21 21:01:20 2009 +0200
> 
>     SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab()
>     
>     Commit ba52270d18fb17ce2cf176b35419dab1e43fe4a3 unconditionally
>     cleared __GFP_NOFAIL flag on all allocations.
>     
No, it clears __GFP_NOFAIL from the first allocation of oo_order(s->oo).  
If that fails (and it's easy to fail, it has __GFP_NORETRY), another 
allocation is attempted with oo_order(s->min), for which __GFP_NOFAIL 
would be preserved if that's the slab cache's allocflags.
>     Preserve this flag on second attempt to allocate page (with possibly
>     decreased order).
>     
>     This should help with bugs #14265, #14141 and similar.
>     
>     Signed-off-by: Karol Lewandowski <karol.k.lewandowski@gmail.com>
> 
> diff --git a/mm/slub.c b/mm/slub.c
> index b627675..ac5db65 100644
> --- a/mm/slub.c
> +++ b/mm/slub.c
> @@ -1084,7 +1084,7 @@ static struct page *allocate_slab(struct kmem_cache *s, gfp_t flags, int node)
>  {
>  	struct page *page;
>  	struct kmem_cache_order_objects oo = s->oo;
> -	gfp_t alloc_gfp;
> +	gfp_t alloc_gfp, nofail;
>  
>  	flags |= s->allocflags;
>  
> @@ -1092,6 +1092,7 @@ static struct page *allocate_slab(struct kmem_cache *s, gfp_t flags, int node)
>  	 * Let the initial higher-order allocation fail under memory pressure
>  	 * so we fall-back to the minimum order allocation.
>  	 */
> +	nofail = flags & __GFP_NOFAIL;
>  	alloc_gfp = (flags | __GFP_NOWARN | __GFP_NORETRY) & ~__GFP_NOFAIL;
>  
>  	page = alloc_slab_page(alloc_gfp, node, oo);
> @@ -1100,8 +1101,10 @@ static struct page *allocate_slab(struct kmem_cache *s, gfp_t flags, int node)
>  		/*
>  		 * Allocation may have failed due to fragmentation.
>  		 * Try a lower order alloc if possible
> +		 *
> +		 * Preserve __GFP_NOFAIL flag if previous allocation failed.
>  		 */
> -		page = alloc_slab_page(flags, node, oo);
> +		page = alloc_slab_page(flags | nofail, node, oo);
>  		if (!page)
>  			return NULL;
>  
> 
This does nothing.  You may have missed that the lower order allocation is 
passing 'flags' (which is a union of the gfp flags passed to 
allocate_slab() based on the allocation context and the cache's 
allocflags), and not alloc_gfp where __GFP_NOFAIL is masked.
Nack.
Note: slub isn't going to be a culprit in order 5 allocation failures 
since they have kmalloc passthrough to the page allocator.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [PATCH] SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab() (was: Re: [Bug #14265] ifconfig: page allocation failure. order:5,ode:0x8020 w/ e100)
  2009-10-21 21:06     ` David Rientjes
@ 2009-10-21 21:20       ` Karol Lewandowski
  2009-10-22 10:20         ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Karol Lewandowski @ 2009-10-21 21:20 UTC (permalink / raw)
  To: David Rientjes
  Cc: Karol Lewandowski, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Mel Gorman, Frans Pop, Pekka Enberg,
	KOSAKI Motohiro, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Mohamed Abbas, John W. Linville, linux-mm, jens.axboe,
	Tobias Oetiker
On Wed, Oct 21, 2009 at 02:06:41PM -0700, David Rientjes wrote:
> On Wed, 21 Oct 2009, Karol Lewandowski wrote:
> 
> > commit d6849591e042bceb66f1b4513a1df6740d2ad762
> > Author: Karol Lewandowski <karol.k.lewandowski@gmail.com>
> > Date:   Wed Oct 21 21:01:20 2009 +0200
> > 
> >     SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab()
> >     
> >     Commit ba52270d18fb17ce2cf176b35419dab1e43fe4a3 unconditionally
> >     cleared __GFP_NOFAIL flag on all allocations.
> >     
> 
> No, it clears __GFP_NOFAIL from the first allocation of oo_order(s->oo).  
> If that fails (and it's easy to fail, it has __GFP_NORETRY), another 
> allocation is attempted with oo_order(s->min), for which __GFP_NOFAIL 
> would be preserved if that's the slab cache's allocflags.
Right, patch is junk.
However, I haven't been able to trigger failures since I've switched
to SLAB allocator.  That patch seemed related (and wrong), but it
wasn't.
> >  		 */
> > -		page = alloc_slab_page(flags, node, oo);
> > +		page = alloc_slab_page(flags | nofail, node, oo);
> >  		if (!page)
> >  			return NULL;
> >  
> > 
> 
> This does nothing.  You may have missed that the lower order allocation is 
> passing 'flags' (which is a union of the gfp flags passed to 
> allocate_slab() based on the allocation context and the cache's 
> allocflags), and not alloc_gfp where __GFP_NOFAIL is masked.
Right, I missed that.
> Nack.
> 
> Note: slub isn't going to be a culprit in order 5 allocation failures 
> since they have kmalloc passthrough to the page allocator.
However, it might change fragmentation somewhat I guess.  This might
make problem more/less visible.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [PATCH] SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab() (was: Re: [Bug #14265] ifconfig: page allocation failure. order:5,ode:0x8020 w/ e100)
  2009-10-21 21:20       ` Karol Lewandowski
@ 2009-10-22 10:20         ` Mel Gorman
  2009-10-22 21:33           ` Karol Lewandowski
  0 siblings, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-22 10:20 UTC (permalink / raw)
  To: Karol Lewandowski
  Cc: David Rientjes, Rafael J. Wysocki, Linux Kernel Mailing List,
	Kernel Testers List, Frans Pop, Pekka Enberg, KOSAKI Motohiro,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe, Tobias Oetiker
On Wed, Oct 21, 2009 at 11:20:34PM +0200, Karol Lewandowski wrote:
> On Wed, Oct 21, 2009 at 02:06:41PM -0700, David Rientjes wrote:
> > On Wed, 21 Oct 2009, Karol Lewandowski wrote:
> > 
> > > commit d6849591e042bceb66f1b4513a1df6740d2ad762
> > > Author: Karol Lewandowski <karol.k.lewandowski@gmail.com>
> > > Date:   Wed Oct 21 21:01:20 2009 +0200
> > > 
> > >     SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab()
> > >     
> > >     Commit ba52270d18fb17ce2cf176b35419dab1e43fe4a3 unconditionally
> > >     cleared __GFP_NOFAIL flag on all allocations.
> > >     
> > 
> > No, it clears __GFP_NOFAIL from the first allocation of oo_order(s->oo).  
> > If that fails (and it's easy to fail, it has __GFP_NORETRY), another 
> > allocation is attempted with oo_order(s->min), for which __GFP_NOFAIL 
> > would be preserved if that's the slab cache's allocflags.
> 
> Right, patch is junk.
> 
> However, I haven't been able to trigger failures since I've switched
> to SLAB allocator.  That patch seemed related (and wrong), but it
> wasn't.
> 
Interesting. Pekka, I looked for SLUB commits in the 2.6.30..2.6.31
range for patches that might affect what order of pages SLUB allocates
but didn't spot anything obvious. Can you think of any changes that
might have altered how SLUB uses memory?
> > >  		 */
> > > -		page = alloc_slab_page(flags, node, oo);
> > > +		page = alloc_slab_page(flags | nofail, node, oo);
> > >  		if (!page)
> > >  			return NULL;
> > >  
> > > 
> > 
> > This does nothing.  You may have missed that the lower order allocation is 
> > passing 'flags' (which is a union of the gfp flags passed to 
> > allocate_slab() based on the allocation context and the cache's 
> > allocflags), and not alloc_gfp where __GFP_NOFAIL is masked.
> 
> Right, I missed that.
> 
> > Nack.
> > 
> > Note: slub isn't going to be a culprit in order 5 allocation failures 
> > since they have kmalloc passthrough to the page allocator.
> 
> However, it might change fragmentation somewhat I guess.  This might
> make problem more/less visible.
> 
Did you have CONFIG_KMEMCHECK set by any chance?
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures (generic)
  2009-10-20 13:39                                                   ` Mel Gorman
  2009-10-20 13:50                                                     ` Tobias Oetiker
@ 2009-10-22 10:27                                                     ` Tobias Oetiker
  1 sibling, 0 replies; 88+ messages in thread
From: Tobias Oetiker @ 2009-10-22 10:27 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Pekka Enberg, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Mohamed Abbas,
	John W. Linville, linux-mm, jens.axboe
Hi Mel,
Tuesday Mel Gorman wrote:
> 4. Does the following patch help by any chance?
>
> Thanks
>
> ==== CUT HERE ====
> vmscan: Force kswapd to take notice faster when high-order watermarks are being hit
>
> When a high-order allocation fails, kswapd is kicked so that it reclaims
> at a higher-order to avoid direct reclaimers stall and to help GFP_ATOMIC
> allocations. Something has changed in recent kernels that affect the timing
> where high-order GFP_ATOMIC allocations are now failing with more frequency,
> particularly under pressure. This patch forces kswapd to notice sooner that
> high-order allocations are occuring by checking when watermarks are hit early
> and by having kswapd restart quickly when the reclaim order is increased.
>
> Not-signed-off-by-because-this-is-a-hatchet-job: Mel Gorman <mel@csn.ul.ie>
> ---
it does seem to help ... I have been running it from 6am to 12am on
our server now and have not yet seen any issues ...
will shout if I do ...
cheers
tobi
-- 
Tobi Oetiker, OETIKER+PARTNER AG, Aarweg 15 CH-4600 Olten, Switzerland
http://it.oetiker.ch tobi@oetiker.ch ++41 62 775 9902 / sb: -9900
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [PATCH] SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab() (was: Re: [Bug #14265] ifconfig: page allocation failure. order:5,ode:0x8020 w/ e100)
  2009-10-22 10:20         ` Mel Gorman
@ 2009-10-22 21:33           ` Karol Lewandowski
  0 siblings, 0 replies; 88+ messages in thread
From: Karol Lewandowski @ 2009-10-22 21:33 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Karol Lewandowski, David Rientjes, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Frans Pop,
	Pekka Enberg, KOSAKI Motohiro, Reinette Chatre,
	Bartlomiej Zolnierkiewicz, Mohamed Abbas, John W. Linville,
	linux-mm, jens.axboe, Tobias Oetiker
On Thu, Oct 22, 2009 at 11:20:14AM +0100, Mel Gorman wrote:
> On Wed, Oct 21, 2009 at 11:20:34PM +0200, Karol Lewandowski wrote:
> > > Note: slub isn't going to be a culprit in order 5 allocation failures 
> > > since they have kmalloc passthrough to the page allocator.
> > 
> > However, it might change fragmentation somewhat I guess.  This might
> > make problem more/less visible.
> > 
> 
> Did you have CONFIG_KMEMCHECK set by any chance?
No, kmemcheck (and kmemleak) was always disabled.
It's likely that's possible to trigger allocation failures with slab,
I just haven't been successful at it.  Lack of good testcase is really
problem here -- even if I can't trigger failures I can never be sure
that these wont appear in some strange moment.
BTW I'll test your patches (from another thread) shortly.
Thanks.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-19 16:18                         ` Chris Mason
                                             ` (3 preceding siblings ...)
  2009-10-20 10:48                           ` Mel Gorman
@ 2009-10-25 18:54                           ` Frans Pop
  4 siblings, 0 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-25 18:54 UTC (permalink / raw)
  To: Chris Mason
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
Sorry for the delayed reply.
On Monday 19 October 2009, Chris Mason wrote:
> On Mon, Oct 19, 2009 at 03:01:52PM +0100, Mel Gorman wrote:
> > > During the 2nd phase I see the first SKB allocation errors with a
> > > music skip between reading commits 95.000 and 110.000.
> > > About commit 115.000 there is a very long pause during which the
> > > counter does not increase, music stops and the desktop freezes
> > > completely. The first 30 seconds of that freeze there is only very
> > > low disk activity (which seems strange);
> >
> > I'm just going to have to depend on Jens here. Jens, the
> > congestion_wait() is on BLK_RW_ASYNC after the commit. Reclaim usually
> > writes pages asynchronously but lumpy reclaim actually waits of pages
> > to write out synchronously so it's not always async.
>
> Waiting doesn't make it synchronous from the elevator point of view ;)
> If you're using WB_SYNC_NONE, it's a async write.  WB_SYNC_ALL makes it
> a sync write.  I only see WB_SYNC_NONE in vmscan.c, so we should be
> using the async congestion wait.  (the exception is xfs which always
> does async writes).
>
> But I'm honestly not 100% sure.  Looking back through the emails, the
> test case is doing IO on top of a whole lot of things on top of
> dm-crypt?  I just tried to figure out if dm-crypt is turning the async
> IO into sync IOs, but didn't quite make sense of it.
>
> Could you also please include which filesystems were being abused during
> the test and how?  Reading through the emails, I think you've got:
>
> gitk being run 3 times on some FS (NFS?)
gitk is run on an ext3 logical volume in a volume group that's on a LUKS 
encrypted partition of the local hard disk.
So it's: SATA harddisk -> dm-crypt (dmsetup) -> LVM (lvm2) -> ext3
> streaming reads on NFS
Correct. My music share is a remote (nfs4) read-only mounted ext3 
partition.
> swap on dm-crypt
Correct. Swap is another logical volume in the same volume group as 
mentioned above.
So kcrypt gets to (de)encrypt both the gitk data *and* any swapping caused 
by that [1].
> If other filesystems are being used, please correct me.  Also please
> include if they are on crypto or straight block device.
All my file systems are ext3. Nothing newfangled or exotic ;-)
There are some bind mounts involved, but I expect that's transparent.
Cheers,
FJP
[1] I've plans to move some of my data outside the encrypted volume, but 
currently everything except /boot is in the encrypted VG.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-20 10:48                           ` Mel Gorman
@ 2009-10-26 21:06                             ` Frans Pop
  2009-10-27 14:54                               ` Mel Gorman
  2009-11-05 20:14                               ` Frans Pop
  0 siblings, 2 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-26 21:06 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Chris Mason, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
[-- Attachment #1: Type: text/plain, Size: 2324 bytes --]
On Tuesday 20 October 2009, Mel Gorman wrote:
> I've attached a patch below that should allow us to cheat. When it's
> applied, it outputs who called congestion_wait(), how long the timeout
> was and how long it waited for. By comparing before and after sleep
> times, we should be able to see which of the callers has significantly
> changed and if it's something easily addressable.
The results from this look fairly interesting (although I may be a bad 
judge as I don't really know what I'm looking at ;-).
I've tested with two kernels:
1) 2.6.31.1: 1 test run
2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
The 1st kernel had the expected "freeze" while reading commits in gitk; 
reading commits with the 2nd kernel was more fluent.
I did 2 runs with the 2nd kernel as the first run had a fairly long music 
skip and more SKB errors than expected. The second run was fairly normal 
with no music skips at all even though it had a few SKB errors.
Data for the tests:
				1st kernel	2nd kernel 1	2nd kernel 2
end reading commits		1:15		1:00		0:55
  "freeze"			yes		no		no
branch data shown		1:55		1:15		1:10
system quiet			2:25		1:50		1:45
# SKB allocation errors		10		53		5
Note that the test is substantially faster with the 2nd kernel and that the 
SKB errors don't really affect the duration of the test.
Attached a tarball with the kernel logs, both the full logs and a stripped 
version with only the lines generated during the actual test.
Something like this will extract the debug data from the logs:
$ grep "delay " <file> | sed "s/^.*\] //"
Also attached a ODF spreadsheet with a summary of the data for all 3 tests.
I've dropped the congestion_wait and sync/rw= columns as they were always 
the same (rw=1 for 1st kernel and sync=0 for 2nd kernel).
I've added a column "weighed delay" and totals for that column and the 
count column.
My layman's observations are:
- without the revert 'background_writeout' is called a lot less frequently,
  but when it's called it gets long delays
- without the revert you have 'wb_kupdate', which is relatively expensive
- with the revert 'shrink_list' is relatively expensive, although not
  really in absolute terms
You people may want to look at exactly what happens directly around the SKB 
allocation errors. I've only looked at totals.
Cheers,
FJP
[-- Attachment #2: logs.tgz --]
[-- Type: application/x-tgz, Size: 151463 bytes --]
[-- Attachment #3: results.ods --]
[-- Type: application/vnd.oasis.opendocument.spreadsheet, Size: 20051 bytes --]
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-17  5:42                               ` reinette chatre
@ 2009-10-27 11:10                                 ` Frans Pop
  2009-10-27 16:15                                   ` reinette chatre
  0 siblings, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-10-27 11:10 UTC (permalink / raw)
  To: reinette chatre
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
Sorry for the delay in replying.
On Saturday 17 October 2009, reinette chatre wrote:
> Prompted by this thread we are in process of moving allocation to paged
> skb. This will definitely reduce the allocation size (from order 2 to
> order 1) and hopefully help with this problem also. Could you please try
> with the attached two patches? They are based on 2.6.32-rc4.
Looks very good! With these patches I no longer get any SKB allocation 
errors, even during the heaviest freezes while gitk is loading. I do still 
get (long) music skips during the freezes, but that's not unexpected.
AFAICT the wireless connection is stable.
Tested on top of current mainline git: v2.6.32-rc5-81-g964fe08.
Please add, if you feel it's appropriate, my:
Reported-and-tested-by: Frans Pop <elendil@planet.nl>
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-26 21:06                             ` Frans Pop
@ 2009-10-27 14:54                               ` Mel Gorman
  2009-10-27 15:16                                 ` KOSAKI Motohiro
  2009-10-27 15:52                                 ` Mel Gorman
  2009-11-05 20:14                               ` Frans Pop
  1 sibling, 2 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-27 14:54 UTC (permalink / raw)
  To: Frans Pop
  Cc: Chris Mason, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Mon, Oct 26, 2009 at 10:06:09PM +0100, Frans Pop wrote:
> On Tuesday 20 October 2009, Mel Gorman wrote:
> > I've attached a patch below that should allow us to cheat. When it's
> > applied, it outputs who called congestion_wait(), how long the timeout
> > was and how long it waited for. By comparing before and after sleep
> > times, we should be able to see which of the callers has significantly
> > changed and if it's something easily addressable.
> 
> The results from this look fairly interesting (although I may be a bad 
> judge as I don't really know what I'm looking at ;-).
> 
> I've tested with two kernels:
> 1) 2.6.31.1: 1 test run
> 2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
> 
> The 1st kernel had the expected "freeze" while reading commits in gitk; 
> reading commits with the 2nd kernel was more fluent.
> I did 2 runs with the 2nd kernel as the first run had a fairly long music 
> skip and more SKB errors than expected. The second run was fairly normal 
> with no music skips at all even though it had a few SKB errors.
> 
> Data for the tests:
> 				1st kernel	2nd kernel 1	2nd kernel 2
> end reading commits		1:15		1:00		0:55
>   "freeze"			yes		no		no
> branch data shown		1:55		1:15		1:10
> system quiet			2:25		1:50		1:45
> # SKB allocation errors		10		53		5
> 
> Note that the test is substantially faster with the 2nd kernel and that the 
> SKB errors don't really affect the duration of the test.
> 
Ok. I think that despite expectations, the writeback changes have
changed the timing significantly enough to be worth examining closer.
> 
> - without the revert 'background_writeout' is called a lot less frequently,
>   but when it's called it gets long delays
> - without the revert you have 'wb_kupdate', which is relatively expensive
> - with the revert 'shrink_list' is relatively expensive, although not
>   really in absolute terms
> 
Lets look at the callers that waited in congestion_wait() for at least
25 jiffies.
2.6.31.1-async-sync-congestion-wait i.e. vanilla kernel
generated with: cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
     24  background_writeout  congestion_wait sync=0 delay 25 timeout 25
    203  kswapd               congestion_wait sync=0 delay 25 timeout 25
      5  shrink_list          congestion_wait sync=0 delay 25 timeout 25
    155  try_to_free_pages    congestion_wait sync=0 delay 25 timeout 25
    145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
      2  kswapd               congestion_wait sync=0 delay 26 timeout 25
      8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
      1  try_to_free_pages    congestion_wait sync=0 delay 54 timeout 25
2.6.31.1-write-congestion-wait i.e. kernel with patch reverted
generated with: cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
      2  background_writeout  congestion_wait rw=1 delay 25 timeout 25
    188  kswapd               congestion_wait rw=1 delay 25 timeout 25
     14  shrink_list          congestion_wait rw=1 delay 25 timeout 25
    181  try_to_free_pages    congestion_wait rw=1 delay 25 timeout 25
      5  kswapd               congestion_wait rw=1 delay 26 timeout 25
     10  try_to_free_pages    congestion_wait rw=1 delay 26 timeout 25
      3  try_to_free_pages    congestion_wait rw=1 delay 27 timeout 25
      1  kswapd               congestion_wait rw=1 delay 29 timeout 25
      1  __alloc_pages_nodemask congestion_wait rw=1 delay 30 timeout 5
      1  try_to_free_pages    congestion_wait rw=1 delay 31 timeout 25
      1  try_to_free_pages    congestion_wait rw=1 delay 35 timeout 25
      1  kswapd               congestion_wait rw=1 delay 51 timeout 25
      1  try_to_free_pages    congestion_wait rw=1 delay 56 timeout 25
So, wb_kupdate and background_writeout are the big movers in terms of waiting,
not the direct reclaimers which is what we were expecting. Of those big
movers, wb_kupdate is the most interested because compare the following
$ cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
[ no output ]
$ $ cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
      1  wb_kupdate           congestion_wait sync=0 delay 15 timeout 25
      1  wb_kupdate           congestion_wait sync=0 delay 23 timeout 25
    145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
      8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
The vanilla kernel is not waiting in wb_kupdate at all.
Jens, before the congestion_wait() changes, wb_kupdate was waiting on
congestion and afterwards it's not. Furthermore, look at the number of pages
that are queued for writeback in the two page allocation failure reports.
without-revert:	writeback:65653
with-revert:	writeback:21713
So, after the move to async/sync, a lot more pages are getting queued
for writeback - more than three times the number of pages are queued for
writeback with the vanilla kernel. This amount of congestion might be why
direct reclaimers and kswapd's timings have changed so much.
Chris Mason hinted at this but I didn't quite "get it" at the time but is it
possible that writeback_inodes() is converting what is expected to be async
IO into sync IO? One way of checking this is if Frans could test the patch
below that makes wb_kupdate wait on sync instead of async.
If this makes a difference, I think the three main areas of trouble we
are now seeing are
	1. page allocator regressions - mostly fixed hopefully
	2. page writeback change in timing - theory yet to be confirmed
	3. drivers using more atomics - iwlagn specific, being dealt with
Of course, the big problem is if the changes are due to major timing
differences in page writeback, then mainline is a totally different
shape of problem as pdflush has been replaced there.
==== 
Have wb_kupdate wait on sync IO congestion instead of async
wb_kupdate is expected to only have queued up pages for async IO.
However, something screwy is happening because it never appears to go to
sleep. Frans, can you test with this patch instead of the revert please?
Preferably, keep the verbose-congestion_wait patch applied so we can
still get an idea who is going to sleep and for how long when calling
congestion_wait. thanks
Not-signed-off-hacket-job: Mel Gorman <mel@csn.ul.ie>
--- 
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 81627eb..cb646dd 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -787,7 +787,7 @@ static void wb_kupdate(unsigned long arg)
 		writeback_inodes(&wbc);
 		if (wbc.nr_to_write > 0) {
 			if (wbc.encountered_congestion || wbc.more_io)
-				congestion_wait(BLK_RW_ASYNC, HZ/10);
+				congestion_wait(BLK_RW_SYNC, HZ/10);
 			else
 				break;	/* All the old data is written */
 		}
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-27 14:54                               ` Mel Gorman
@ 2009-10-27 15:16                                 ` KOSAKI Motohiro
  2009-10-27 15:21                                   ` Mel Gorman
  2009-10-27 15:52                                 ` Mel Gorman
  1 sibling, 1 reply; 88+ messages in thread
From: KOSAKI Motohiro @ 2009-10-27 15:16 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, Chris Mason, David Rientjes, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
2009/10/27 Mel Gorman <mel@csn.ul.ie>:
> On Mon, Oct 26, 2009 at 10:06:09PM +0100, Frans Pop wrote:
>> On Tuesday 20 October 2009, Mel Gorman wrote:
>> > I've attached a patch below that should allow us to cheat. When it's
>> > applied, it outputs who called congestion_wait(), how long the timeout
>> > was and how long it waited for. By comparing before and after sleep
>> > times, we should be able to see which of the callers has significantly
>> > changed and if it's something easily addressable.
>>
>> The results from this look fairly interesting (although I may be a bad
>> judge as I don't really know what I'm looking at ;-).
>>
>> I've tested with two kernels:
>> 1) 2.6.31.1: 1 test run
>> 2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
>>
>> The 1st kernel had the expected "freeze" while reading commits in gitk;
>> reading commits with the 2nd kernel was more fluent.
>> I did 2 runs with the 2nd kernel as the first run had a fairly long music
>> skip and more SKB errors than expected. The second run was fairly normal
>> with no music skips at all even though it had a few SKB errors.
>>
>> Data for the tests:
>>                               1st kernel      2nd kernel 1    2nd kernel 2
>> end reading commits           1:15            1:00            0:55
>>   "freeze"                    yes             no              no
>> branch data shown             1:55            1:15            1:10
>> system quiet                  2:25            1:50            1:45
>> # SKB allocation errors               10              53              5
>>
>> Note that the test is substantially faster with the 2nd kernel and that the
>> SKB errors don't really affect the duration of the test.
>>
>
> Ok. I think that despite expectations, the writeback changes have
> changed the timing significantly enough to be worth examining closer.
>
>>
>> - without the revert 'background_writeout' is called a lot less frequently,
>>   but when it's called it gets long delays
>> - without the revert you have 'wb_kupdate', which is relatively expensive
>> - with the revert 'shrink_list' is relatively expensive, although not
>>   really in absolute terms
>>
>
> Lets look at the callers that waited in congestion_wait() for at least
> 25 jiffies.
>
> 2.6.31.1-async-sync-congestion-wait i.e. vanilla kernel
> generated with: cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
>     24  background_writeout  congestion_wait sync=0 delay 25 timeout 25
>    203  kswapd               congestion_wait sync=0 delay 25 timeout 25
>      5  shrink_list          congestion_wait sync=0 delay 25 timeout 25
>    155  try_to_free_pages    congestion_wait sync=0 delay 25 timeout 25
>    145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
>      2  kswapd               congestion_wait sync=0 delay 26 timeout 25
>      8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
>      1  try_to_free_pages    congestion_wait sync=0 delay 54 timeout 25
>
> 2.6.31.1-write-congestion-wait i.e. kernel with patch reverted
> generated with: cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
>      2  background_writeout  congestion_wait rw=1 delay 25 timeout 25
>    188  kswapd               congestion_wait rw=1 delay 25 timeout 25
>     14  shrink_list          congestion_wait rw=1 delay 25 timeout 25
>    181  try_to_free_pages    congestion_wait rw=1 delay 25 timeout 25
>      5  kswapd               congestion_wait rw=1 delay 26 timeout 25
>     10  try_to_free_pages    congestion_wait rw=1 delay 26 timeout 25
>      3  try_to_free_pages    congestion_wait rw=1 delay 27 timeout 25
>      1  kswapd               congestion_wait rw=1 delay 29 timeout 25
>      1  __alloc_pages_nodemask congestion_wait rw=1 delay 30 timeout 5
>      1  try_to_free_pages    congestion_wait rw=1 delay 31 timeout 25
>      1  try_to_free_pages    congestion_wait rw=1 delay 35 timeout 25
>      1  kswapd               congestion_wait rw=1 delay 51 timeout 25
>      1  try_to_free_pages    congestion_wait rw=1 delay 56 timeout 25
>
> So, wb_kupdate and background_writeout are the big movers in terms of waiting,
> not the direct reclaimers which is what we were expecting. Of those big
> movers, wb_kupdate is the most interested because compare the following
>
> $ cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
> [ no output ]
> $ $ cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
>      1  wb_kupdate           congestion_wait sync=0 delay 15 timeout 25
>      1  wb_kupdate           congestion_wait sync=0 delay 23 timeout 25
>    145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
>      8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
>
> The vanilla kernel is not waiting in wb_kupdate at all.
>
> Jens, before the congestion_wait() changes, wb_kupdate was waiting on
> congestion and afterwards it's not. Furthermore, look at the number of pages
> that are queued for writeback in the two page allocation failure reports.
>
> without-revert: writeback:65653
> with-revert:    writeback:21713
>
> So, after the move to async/sync, a lot more pages are getting queued
> for writeback - more than three times the number of pages are queued for
> writeback with the vanilla kernel. This amount of congestion might be why
> direct reclaimers and kswapd's timings have changed so much.
>
> Chris Mason hinted at this but I didn't quite "get it" at the time but is it
> possible that writeback_inodes() is converting what is expected to be async
> IO into sync IO? One way of checking this is if Frans could test the patch
> below that makes wb_kupdate wait on sync instead of async.
>
> If this makes a difference, I think the three main areas of trouble we
> are now seeing are
>
>        1. page allocator regressions - mostly fixed hopefully
>        2. page writeback change in timing - theory yet to be confirmed
>        3. drivers using more atomics - iwlagn specific, being dealt with
>
> Of course, the big problem is if the changes are due to major timing
> differences in page writeback, then mainline is a totally different
> shape of problem as pdflush has been replaced there.
>
> ====
> Have wb_kupdate wait on sync IO congestion instead of async
>
> wb_kupdate is expected to only have queued up pages for async IO.
> However, something screwy is happening because it never appears to go to
> sleep. Frans, can you test with this patch instead of the revert please?
> Preferably, keep the verbose-congestion_wait patch applied so we can
> still get an idea who is going to sleep and for how long when calling
> congestion_wait. thanks
>
> Not-signed-off-hacket-job: Mel Gorman <mel@csn.ul.ie>
> ---
>
> diff --git a/mm/page-writeback.c b/mm/page-writeback.c
> index 81627eb..cb646dd 100644
> --- a/mm/page-writeback.c
> +++ b/mm/page-writeback.c
> @@ -787,7 +787,7 @@ static void wb_kupdate(unsigned long arg)
>                writeback_inodes(&wbc);
>                if (wbc.nr_to_write > 0) {
>                        if (wbc.encountered_congestion || wbc.more_io)
> -                               congestion_wait(BLK_RW_ASYNC, HZ/10);
> +                               congestion_wait(BLK_RW_SYNC, HZ/10);
>                        else
>                                break;  /* All the old data is written */
>                }
Hmm, This doesn't looks correct to me.
BLK_RW_ASYNC mean async write.
BLK_RW_SYNC  mean read and sync-write.
wb_kupdate use WB_SYNC_NONE. it's async write.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-27 15:16                                 ` KOSAKI Motohiro
@ 2009-10-27 15:21                                   ` Mel Gorman
  0 siblings, 0 replies; 88+ messages in thread
From: Mel Gorman @ 2009-10-27 15:21 UTC (permalink / raw)
  To: KOSAKI Motohiro
  Cc: Frans Pop, Chris Mason, David Rientjes, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Wed, Oct 28, 2009 at 12:16:30AM +0900, KOSAKI Motohiro wrote:
> 2009/10/27 Mel Gorman <mel@csn.ul.ie>:
> > On Mon, Oct 26, 2009 at 10:06:09PM +0100, Frans Pop wrote:
> >> On Tuesday 20 October 2009, Mel Gorman wrote:
> >> > I've attached a patch below that should allow us to cheat. When it's
> >> > applied, it outputs who called congestion_wait(), how long the timeout
> >> > was and how long it waited for. By comparing before and after sleep
> >> > times, we should be able to see which of the callers has significantly
> >> > changed and if it's something easily addressable.
> >>
> >> The results from this look fairly interesting (although I may be a bad
> >> judge as I don't really know what I'm looking at ;-).
> >>
> >> I've tested with two kernels:
> >> 1) 2.6.31.1: 1 test run
> >> 2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
> >>
> >> The 1st kernel had the expected "freeze" while reading commits in gitk;
> >> reading commits with the 2nd kernel was more fluent.
> >> I did 2 runs with the 2nd kernel as the first run had a fairly long music
> >> skip and more SKB errors than expected. The second run was fairly normal
> >> with no music skips at all even though it had a few SKB errors.
> >>
> >> Data for the tests:
> >>                               1st kernel      2nd kernel 1    2nd kernel 2
> >> end reading commits           1:15            1:00            0:55
> >>   "freeze"                    yes             no              no
> >> branch data shown             1:55            1:15            1:10
> >> system quiet                  2:25            1:50            1:45
> >> # SKB allocation errors               10              53              5
> >>
> >> Note that the test is substantially faster with the 2nd kernel and that the
> >> SKB errors don't really affect the duration of the test.
> >>
> >
> > Ok. I think that despite expectations, the writeback changes have
> > changed the timing significantly enough to be worth examining closer.
> >
> >>
> >> - without the revert 'background_writeout' is called a lot less frequently,
> >>   but when it's called it gets long delays
> >> - without the revert you have 'wb_kupdate', which is relatively expensive
> >> - with the revert 'shrink_list' is relatively expensive, although not
> >>   really in absolute terms
> >>
> >
> > Lets look at the callers that waited in congestion_wait() for at least
> > 25 jiffies.
> >
> > 2.6.31.1-async-sync-congestion-wait i.e. vanilla kernel
> > generated with: cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
> >     24  background_writeout  congestion_wait sync=0 delay 25 timeout 25
> >    203  kswapd               congestion_wait sync=0 delay 25 timeout 25
> >      5  shrink_list          congestion_wait sync=0 delay 25 timeout 25
> >    155  try_to_free_pages    congestion_wait sync=0 delay 25 timeout 25
> >    145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
> >      2  kswapd               congestion_wait sync=0 delay 26 timeout 25
> >      8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
> >      1  try_to_free_pages    congestion_wait sync=0 delay 54 timeout 25
> >
> > 2.6.31.1-write-congestion-wait i.e. kernel with patch reverted
> > generated with: cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
> >      2  background_writeout  congestion_wait rw=1 delay 25 timeout 25
> >    188  kswapd               congestion_wait rw=1 delay 25 timeout 25
> >     14  shrink_list          congestion_wait rw=1 delay 25 timeout 25
> >    181  try_to_free_pages    congestion_wait rw=1 delay 25 timeout 25
> >      5  kswapd               congestion_wait rw=1 delay 26 timeout 25
> >     10  try_to_free_pages    congestion_wait rw=1 delay 26 timeout 25
> >      3  try_to_free_pages    congestion_wait rw=1 delay 27 timeout 25
> >      1  kswapd               congestion_wait rw=1 delay 29 timeout 25
> >      1  __alloc_pages_nodemask congestion_wait rw=1 delay 30 timeout 5
> >      1  try_to_free_pages    congestion_wait rw=1 delay 31 timeout 25
> >      1  try_to_free_pages    congestion_wait rw=1 delay 35 timeout 25
> >      1  kswapd               congestion_wait rw=1 delay 51 timeout 25
> >      1  try_to_free_pages    congestion_wait rw=1 delay 56 timeout 25
> >
> > So, wb_kupdate and background_writeout are the big movers in terms of waiting,
> > not the direct reclaimers which is what we were expecting. Of those big
> > movers, wb_kupdate is the most interested because compare the following
> >
> > $ cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
> > [ no output ]
> > $ $ cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
> >      1  wb_kupdate           congestion_wait sync=0 delay 15 timeout 25
> >      1  wb_kupdate           congestion_wait sync=0 delay 23 timeout 25
> >    145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
> >      8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
> >
> > The vanilla kernel is not waiting in wb_kupdate at all.
> >
> > Jens, before the congestion_wait() changes, wb_kupdate was waiting on
> > congestion and afterwards it's not. Furthermore, look at the number of pages
> > that are queued for writeback in the two page allocation failure reports.
> >
> > without-revert: writeback:65653
> > with-revert:    writeback:21713
> >
> > So, after the move to async/sync, a lot more pages are getting queued
> > for writeback - more than three times the number of pages are queued for
> > writeback with the vanilla kernel. This amount of congestion might be why
> > direct reclaimers and kswapd's timings have changed so much.
> >
> > Chris Mason hinted at this but I didn't quite "get it" at the time but is it
> > possible that writeback_inodes() is converting what is expected to be async
> > IO into sync IO? One way of checking this is if Frans could test the patch
> > below that makes wb_kupdate wait on sync instead of async.
> >
> > If this makes a difference, I think the three main areas of trouble we
> > are now seeing are
> >
> >        1. page allocator regressions - mostly fixed hopefully
> >        2. page writeback change in timing - theory yet to be confirmed
> >        3. drivers using more atomics - iwlagn specific, being dealt with
> >
> > Of course, the big problem is if the changes are due to major timing
> > differences in page writeback, then mainline is a totally different
> > shape of problem as pdflush has been replaced there.
> >
> > ====
> > Have wb_kupdate wait on sync IO congestion instead of async
> >
> > wb_kupdate is expected to only have queued up pages for async IO.
> > However, something screwy is happening because it never appears to go to
> > sleep. Frans, can you test with this patch instead of the revert please?
> > Preferably, keep the verbose-congestion_wait patch applied so we can
> > still get an idea who is going to sleep and for how long when calling
> > congestion_wait. thanks
> >
> > Not-signed-off-hacket-job: Mel Gorman <mel@csn.ul.ie>
> > ---
> >
> > diff --git a/mm/page-writeback.c b/mm/page-writeback.c
> > index 81627eb..cb646dd 100644
> > --- a/mm/page-writeback.c
> > +++ b/mm/page-writeback.c
> > @@ -787,7 +787,7 @@ static void wb_kupdate(unsigned long arg)
> >                writeback_inodes(&wbc);
> >                if (wbc.nr_to_write > 0) {
> >                        if (wbc.encountered_congestion || wbc.more_io)
> > -                               congestion_wait(BLK_RW_ASYNC, HZ/10);
> > +                               congestion_wait(BLK_RW_SYNC, HZ/10);
> >                        else
> >                                break;  /* All the old data is written */
> >                }
> 
> Hmm, This doesn't looks correct to me.
> 
> BLK_RW_ASYNC mean async write.
> BLK_RW_SYNC  mean read and sync-write.
> 
> wb_kupdate use WB_SYNC_NONE. it's async write.
> 
I don't think it's correct either which is why I described it as
"something screwy is happening because it never appears to go to sleep".
This is despite there being a whole lot of pages queued for writeback
according to the page allocation failure reports. 
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-27 14:54                               ` Mel Gorman
  2009-10-27 15:16                                 ` KOSAKI Motohiro
@ 2009-10-27 15:52                                 ` Mel Gorman
  2009-10-27 16:03                                   ` Chris Mason
  1 sibling, 1 reply; 88+ messages in thread
From: Mel Gorman @ 2009-10-27 15:52 UTC (permalink / raw)
  To: Frans Pop
  Cc: Chris Mason, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Tue, Oct 27, 2009 at 02:54:35PM +0000, Mel Gorman wrote:
> On Mon, Oct 26, 2009 at 10:06:09PM +0100, Frans Pop wrote:
> > On Tuesday 20 October 2009, Mel Gorman wrote:
> > > I've attached a patch below that should allow us to cheat. When it's
> > > applied, it outputs who called congestion_wait(), how long the timeout
> > > was and how long it waited for. By comparing before and after sleep
> > > times, we should be able to see which of the callers has significantly
> > > changed and if it's something easily addressable.
> > 
> > The results from this look fairly interesting (although I may be a bad 
> > judge as I don't really know what I'm looking at ;-).
> > 
> > I've tested with two kernels:
> > 1) 2.6.31.1: 1 test run
> > 2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
> > 
> > The 1st kernel had the expected "freeze" while reading commits in gitk; 
> > reading commits with the 2nd kernel was more fluent.
> > I did 2 runs with the 2nd kernel as the first run had a fairly long music 
> > skip and more SKB errors than expected. The second run was fairly normal 
> > with no music skips at all even though it had a few SKB errors.
> > 
> > Data for the tests:
> > 				1st kernel	2nd kernel 1	2nd kernel 2
> > end reading commits		1:15		1:00		0:55
> >   "freeze"			yes		no		no
> > branch data shown		1:55		1:15		1:10
> > system quiet			2:25		1:50		1:45
> > # SKB allocation errors		10		53		5
> > 
> > Note that the test is substantially faster with the 2nd kernel and that the 
> > SKB errors don't really affect the duration of the test.
> > 
> 
> Ok. I think that despite expectations, the writeback changes have
> changed the timing significantly enough to be worth examining closer.
> 
> > 
> > - without the revert 'background_writeout' is called a lot less frequently,
> >   but when it's called it gets long delays
> > - without the revert you have 'wb_kupdate', which is relatively expensive
> > - with the revert 'shrink_list' is relatively expensive, although not
> >   really in absolute terms
> > 
> 
> Lets look at the callers that waited in congestion_wait() for at least
> 25 jiffies.
> 
> 2.6.31.1-async-sync-congestion-wait i.e. vanilla kernel
> generated with: cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
>      24  background_writeout  congestion_wait sync=0 delay 25 timeout 25
>     203  kswapd               congestion_wait sync=0 delay 25 timeout 25
>       5  shrink_list          congestion_wait sync=0 delay 25 timeout 25
>     155  try_to_free_pages    congestion_wait sync=0 delay 25 timeout 25
>     145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
>       2  kswapd               congestion_wait sync=0 delay 26 timeout 25
>       8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
>       1  try_to_free_pages    congestion_wait sync=0 delay 54 timeout 25
> 
> 2.6.31.1-write-congestion-wait i.e. kernel with patch reverted
> generated with: cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c
>       2  background_writeout  congestion_wait rw=1 delay 25 timeout 25
>     188  kswapd               congestion_wait rw=1 delay 25 timeout 25
>      14  shrink_list          congestion_wait rw=1 delay 25 timeout 25
>     181  try_to_free_pages    congestion_wait rw=1 delay 25 timeout 25
>       5  kswapd               congestion_wait rw=1 delay 26 timeout 25
>      10  try_to_free_pages    congestion_wait rw=1 delay 26 timeout 25
>       3  try_to_free_pages    congestion_wait rw=1 delay 27 timeout 25
>       1  kswapd               congestion_wait rw=1 delay 29 timeout 25
>       1  __alloc_pages_nodemask congestion_wait rw=1 delay 30 timeout 5
>       1  try_to_free_pages    congestion_wait rw=1 delay 31 timeout 25
>       1  try_to_free_pages    congestion_wait rw=1 delay 35 timeout 25
>       1  kswapd               congestion_wait rw=1 delay 51 timeout 25
>       1  try_to_free_pages    congestion_wait rw=1 delay 56 timeout 25
> 
> So, wb_kupdate and background_writeout are the big movers in terms of waiting,
> not the direct reclaimers which is what we were expecting. Of those big
> movers, wb_kupdate is the most interested because compare the following
> 
Bah, this part is right, but I got the next section the wrong way
around. I should have renamed the damn things instead of remember what
was 1 and what was 2.
1 == vanilla
2 == with-revert
> $ cat kern.log_2.1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
> [ no output ]
> $ $ cat kern.log_1_test | awk -F ] '{print $2}' | sort -k 5 -n | uniq -c | grep wb_kup
>       1  wb_kupdate           congestion_wait sync=0 delay 15 timeout 25
>       1  wb_kupdate           congestion_wait sync=0 delay 23 timeout 25
>     145  wb_kupdate           congestion_wait sync=0 delay 25 timeout 25
>       8  wb_kupdate           congestion_wait sync=0 delay 26 timeout 25
> 
> The vanilla kernel is not waiting in wb_kupdate at all.
> 
The vanilla kernel *is* waiting. The reverted kernel is not. If my patch
makes any difference, it's not for the right reasons.
> Jens, before the congestion_wait() changes, wb_kupdate was waiting on
> congestion and afterwards it's not. Furthermore, look at the number of pages
> that are queued for writeback in the two page allocation failure reports.
> 
> without-revert:	writeback:65653
> with-revert:	writeback:21713
> 
and got it back right again.
kernel 1 == vanilla kernel == without-revert	writeback:65653
kernel 2 == revert  kernel == with-revert	writeback:21713
> So, after the move to async/sync, a lot more pages are getting queued
> for writeback - more than three times the number of pages are queued for
> writeback with the vanilla kernel. This amount of congestion might be why
> direct reclaimers and kswapd's timings have changed so much.
> 
Or more accurately, the vanilla kernel has queued up a lot more pages for
IO than when the patch is reverted. I'm not seeing yet why this is.
> Chris Mason hinted at this but I didn't quite "get it" at the time but is it
> possible that writeback_inodes() is converting what is expected to be async
> IO into sync IO? One way of checking this is if Frans could test the patch
> below that makes wb_kupdate wait on sync instead of async.
> 
This reasoning is rubbish. If the patch makes any difference, it's because
it changes timing. It's probably more important to figure out if a) if the
different number of pages for writeback is relevant and if so b) why has
it changed.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-27 15:52                                 ` Mel Gorman
@ 2009-10-27 16:03                                   ` Chris Mason
  2009-10-27 17:21                                     ` Frans Pop
  2009-10-27 17:21                                     ` Frans Pop
  0 siblings, 2 replies; 88+ messages in thread
From: Chris Mason @ 2009-10-27 16:03 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Frans Pop, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Tue, Oct 27, 2009 at 03:52:24PM +0000, Mel Gorman wrote:
> 
> > So, after the move to async/sync, a lot more pages are getting queued
> > for writeback - more than three times the number of pages are queued for
> > writeback with the vanilla kernel. This amount of congestion might be why
> > direct reclaimers and kswapd's timings have changed so much.
> > 
> 
> Or more accurately, the vanilla kernel has queued up a lot more pages for
> IO than when the patch is reverted. I'm not seeing yet why this is.
[ sympathies over confusion about congestion...lots of variables here ]
If wb_kupdate has been able to queue more writes it is because the
congestion logic isn't stopping it.  We have congestion_wait(), but
before calling that in the writeback paths it says: are you congested?
and then backs off if the answer is yes.
Ideally, direct reclaim will never do writeback.  We want it to be able
to find clean pages that kupdate and friends have already processed.
Waiting for congestion is a funny thing, it only tells us the device has
managed to finish some IO or that a timeout has passed.  Neither event has
any relation to figuring out if the IO for reclaimable pages has
finished.
One option is to have the VM remember the hashed waitqueue for one of
the pages it direct reclaims and then wait on it.
-chris
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-27 11:10                                 ` Frans Pop
@ 2009-10-27 16:15                                   ` reinette chatre
  0 siblings, 0 replies; 88+ messages in thread
From: reinette chatre @ 2009-10-27 16:15 UTC (permalink / raw)
  To: Frans Pop
  Cc: Mel Gorman, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Bartlomiej Zolnierkiewicz, Karol Lewandowski, Abbas, Mohamed,
	John W. Linville, linux-mm@kvack.org
Hi Frans,
On Tue, 2009-10-27 at 04:10 -0700, Frans Pop wrote:
> Sorry for the delay in replying.
> 
> On Saturday 17 October 2009, reinette chatre wrote:
> > Prompted by this thread we are in process of moving allocation to paged
> > skb. This will definitely reduce the allocation size (from order 2 to
> > order 1) and hopefully help with this problem also. Could you please try
> > with the attached two patches? They are based on 2.6.32-rc4.
> 
> Looks very good! With these patches I no longer get any SKB allocation 
> errors, even during the heaviest freezes while gitk is loading. I do still 
> get (long) music skips during the freezes, but that's not unexpected.
> AFAICT the wireless connection is stable.
> 
> Tested on top of current mainline git: v2.6.32-rc5-81-g964fe08.
> 
> Please add, if you feel it's appropriate, my:
> Reported-and-tested-by: Frans Pop <elendil@planet.nl>
Thank you very much for testing these patches so thoroughly. They are
both on their way upstream already so I am not able to add your
signature at this time. Since these are pretty big changes these patches
will be in 2.6.33.
Reinette
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-27 16:03                                   ` Chris Mason
@ 2009-10-27 17:21                                     ` Frans Pop
  2009-10-27 17:21                                     ` Frans Pop
  1 sibling, 0 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-27 17:21 UTC (permalink / raw)
  To: Chris Mason, Mel Gorman, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki, Linux Kernel Mailing List, Kernel Testers List,
	Pekka Enberg, Reinette Chatre, Bartlomiej Zolnierkiewicz,
	Karol Lewandowski, Mohamed Abbas, Jens Axboe, John W. Linville,
	linux-mm
On Tuesday 27 October 2009, Chris Mason wrote:
> On Tue, Oct 27, 2009 at 03:52:24PM +0000, Mel Gorman wrote:
> > > So, after the move to async/sync, a lot more pages are getting
> > > queued for writeback - more than three times the number of pages are
> > > queued for writeback with the vanilla kernel. This amount of
> > > congestion might be why direct reclaimers and kswapd's timings have
> > > changed so much.
> >
> > Or more accurately, the vanilla kernel has queued up a lot more pages
> > for IO than when the patch is reverted. I'm not seeing yet why this
> > is.
>
> [ sympathies over confusion about congestion...lots of variables here ]
>
> If wb_kupdate has been able to queue more writes it is because the
> congestion logic isn't stopping it.  We have congestion_wait(), but
> before calling that in the writeback paths it says: are you congested?
> and then backs off if the answer is yes.
>
> Ideally, direct reclaim will never do writeback.  We want it to be able
> to find clean pages that kupdate and friends have already processed.
>
> Waiting for congestion is a funny thing, it only tells us the device has
> managed to finish some IO or that a timeout has passed.  Neither event
> has any relation to figuring out if the IO for reclaimable pages has
> finished.
>
> One option is to have the VM remember the hashed waitqueue for one of
> the pages it direct reclaims and then wait on it.
What people should be aware of is the behavior of the system I see at this 
point. I've already mentioned this in other mails, but it's probably good 
to repeat it here.
While gitk is reading commits with vanilla .31 and .32 kernels there is at 
some point a fairly long period (10-20 seconds) where I see:
- a completely frozen desktop, including frozen mouse cursor
- really very little disk activity (HD led flashes very briefly less than
  once per second)
- reading commits stops completely during this period
- no music.
After that there is a period (another 5-15 seconds) with a huge amount of 
disk activity during which the system gradually becomes responsive again 
and in gitk the count of commits that have been read starts increasing 
again (without a jump in the counter which confirms that no commits were 
read during the freeze).
I cannot really tell what the system is doing during those freezes. Because 
of the frozen desktop I cannot for example see CPU usage. I suspect that, 
as there is hardly any disk activity, the system must be reorganizing RAM 
or something. But it seems quite bad that that gets "bunched up" instead 
of happening more gradually.
With the congestion_wait() change reverted I never see these freezes, only 
much more normal minor latencies (< 2 seconds; mostly < 0.5 seconds), 
which is probably unavoidable during heavy swapping.
Hth,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-27 16:03                                   ` Chris Mason
  2009-10-27 17:21                                     ` Frans Pop
@ 2009-10-27 17:21                                     ` Frans Pop
  1 sibling, 0 replies; 88+ messages in thread
From: Frans Pop @ 2009-10-27 17:21 UTC (permalink / raw)
  To: Chris Mason, Mel Gorman, David Rientjes, KOSAKI Motohiro,
	Rafael J. Wysocki
On Tuesday 27 October 2009, Chris Mason wrote:
> On Tue, Oct 27, 2009 at 03:52:24PM +0000, Mel Gorman wrote:
> > > So, after the move to async/sync, a lot more pages are getting
> > > queued for writeback - more than three times the number of pages are
> > > queued for writeback with the vanilla kernel. This amount of
> > > congestion might be why direct reclaimers and kswapd's timings have
> > > changed so much.
> >
> > Or more accurately, the vanilla kernel has queued up a lot more pages
> > for IO than when the patch is reverted. I'm not seeing yet why this
> > is.
>
> [ sympathies over confusion about congestion...lots of variables here ]
>
> If wb_kupdate has been able to queue more writes it is because the
> congestion logic isn't stopping it.  We have congestion_wait(), but
> before calling that in the writeback paths it says: are you congested?
> and then backs off if the answer is yes.
>
> Ideally, direct reclaim will never do writeback.  We want it to be able
> to find clean pages that kupdate and friends have already processed.
>
> Waiting for congestion is a funny thing, it only tells us the device has
> managed to finish some IO or that a timeout has passed.  Neither event
> has any relation to figuring out if the IO for reclaimable pages has
> finished.
>
> One option is to have the VM remember the hashed waitqueue for one of
> the pages it direct reclaims and then wait on it.
What people should be aware of is the behavior of the system I see at this 
point. I've already mentioned this in other mails, but it's probably good 
to repeat it here.
While gitk is reading commits with vanilla .31 and .32 kernels there is at 
some point a fairly long period (10-20 seconds) where I see:
- a completely frozen desktop, including frozen mouse cursor
- really very little disk activity (HD led flashes very briefly less than
  once per second)
- reading commits stops completely during this period
- no music.
After that there is a period (another 5-15 seconds) with a huge amount of 
disk activity during which the system gradually becomes responsive again 
and in gitk the count of commits that have been read starts increasing 
again (without a jump in the counter which confirms that no commits were 
read during the freeze).
I cannot really tell what the system is doing during those freezes. Because 
of the frozen desktop I cannot for example see CPU usage. I suspect that, 
as there is hardly any disk activity, the system must be reorganizing RAM 
or something. But it seems quite bad that that gets "bunched up" instead 
of happening more gradually.
With the congestion_wait() change reverted I never see these freezes, only 
much more normal minor latencies (< 2 seconds; mostly < 0.5 seconds), 
which is probably unavoidable during heavy swapping.
Hth,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-10-26 21:06                             ` Frans Pop
  2009-10-27 14:54                               ` Mel Gorman
@ 2009-11-05 20:14                               ` Frans Pop
  2009-11-06  9:51                                 ` Frans Pop
  1 sibling, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-11-05 20:14 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Chris Mason, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Monday 26 October 2009, Frans Pop wrote:
> On Tuesday 20 October 2009, Mel Gorman wrote:
> > I've attached a patch below that should allow us to cheat. When it's
> > applied, it outputs who called congestion_wait(), how long the timeout
> > was and how long it waited for. By comparing before and after sleep
> > times, we should be able to see which of the callers has significantly
> > changed and if it's something easily addressable.
>
> The results from this look fairly interesting (although I may be a bad
> judge as I don't really know what I'm looking at ;-).
>
> I've tested with two kernels:
> 1) 2.6.31.1: 1 test run
> 2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
I've taken another look at the data from this debug patch, resulting in 
these graphs: http://people.debian.org/~fjp/tmp/kernel/congestion.pdf
I think the graph may show the reason for the congestion_wait() regression. 
Horizontal axis shows time, vertical axis shows number of logged 
congestion_wait calls per type.
The top chart is without the revert, the bottom one after the revert.
Note how before the revert the graph shows distinct steps: first you get 
almost exclusively kwapd, followed by almost exclusively alloc_pages and 
try_to_free. I suspect the periods where kswapd is almost horizontal 
correspond to the freezes.
With the revert the lines for the different functions are almost straight 
and everything happens much better interspersed.
Cheers,
FJP
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-11-05 20:14                               ` Frans Pop
@ 2009-11-06  9:51                                 ` Frans Pop
  2009-11-09 19:00                                   ` Mel Gorman
  0 siblings, 1 reply; 88+ messages in thread
From: Frans Pop @ 2009-11-06  9:51 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Chris Mason, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Thursday 05 November 2009, Frans Pop wrote:
> On Monday 26 October 2009, Frans Pop wrote:
> > On Tuesday 20 October 2009, Mel Gorman wrote:
> > > I've attached a patch below that should allow us to cheat. When it's
> > > applied, it outputs who called congestion_wait(), how long the
> > > timeout was and how long it waited for. By comparing before and
> > > after sleep times, we should be able to see which of the callers has
> > > significantly changed and if it's something easily addressable.
> >
> > The results from this look fairly interesting (although I may be a bad
> > judge as I don't really know what I'm looking at ;-).
> >
> > I've tested with two kernels:
> > 1) 2.6.31.1: 1 test run
> > 2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
>
> I've taken another look at the data from this debug patch, resulting in
> these graphs: http://people.debian.org/~fjp/tmp/kernel/congestion.pdf
>
> I think the graph may show the reason for the congestion_wait()
> regression. Horizontal axis shows time, vertical axis shows number of
> logged congestion_wait calls per type.
I'm sorry. My initial version had a skewed time axis (showed occurrences 
instead of actual time). I've now uploaded a corrected version:
http://people.debian.org/~fjp/tmp/kernel/congestion.pdf
I've also uploaded a second version that shows cumulative delay per type, 
which probably gives a better insight:
http://people.debian.org/~fjp/tmp/kernel/congestion2.pdf
For both the top chart is without the revert, the bottom one after the 
revert.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
* Re: [Bug #14141] order 2 page allocation failures in iwlagn
  2009-11-06  9:51                                 ` Frans Pop
@ 2009-11-09 19:00                                   ` Mel Gorman
  0 siblings, 0 replies; 88+ messages in thread
From: Mel Gorman @ 2009-11-09 19:00 UTC (permalink / raw)
  To: Frans Pop
  Cc: Chris Mason, David Rientjes, KOSAKI Motohiro, Rafael J. Wysocki,
	Linux Kernel Mailing List, Kernel Testers List, Pekka Enberg,
	Reinette Chatre, Bartlomiej Zolnierkiewicz, Karol Lewandowski,
	Mohamed Abbas, Jens Axboe, John W. Linville, linux-mm
On Fri, Nov 06, 2009 at 10:51:37AM +0100, Frans Pop wrote:
> On Thursday 05 November 2009, Frans Pop wrote:
> > On Monday 26 October 2009, Frans Pop wrote:
> > > On Tuesday 20 October 2009, Mel Gorman wrote:
> > > > I've attached a patch below that should allow us to cheat. When it's
> > > > applied, it outputs who called congestion_wait(), how long the
> > > > timeout was and how long it waited for. By comparing before and
> > > > after sleep times, we should be able to see which of the callers has
> > > > significantly changed and if it's something easily addressable.
> > >
> > > The results from this look fairly interesting (although I may be a bad
> > > judge as I don't really know what I'm looking at ;-).
> > >
> > > I've tested with two kernels:
> > > 1) 2.6.31.1: 1 test run
> > > 2) 2.6.31.1 + congestion_wait() reverts: 2 test runs
> >
> > I've taken another look at the data from this debug patch, resulting in
> > these graphs: http://people.debian.org/~fjp/tmp/kernel/congestion.pdf
> >
> > I think the graph may show the reason for the congestion_wait()
> > regression. Horizontal axis shows time, vertical axis shows number of
> > logged congestion_wait calls per type.
> 
> I'm sorry. My initial version had a skewed time axis (showed occurrences 
> instead of actual time). I've now uploaded a corrected version:
> http://people.debian.org/~fjp/tmp/kernel/congestion.pdf
> 
> I've also uploaded a second version that shows cumulative delay per type, 
> which probably gives a better insight:
> http://people.debian.org/~fjp/tmp/kernel/congestion2.pdf
> 
> For both the top chart is without the revert, the bottom one after the 
> revert.
> 
I'm looking into this at the moment. There are some definite
differences not only in the length congestion_wait() is waiting but in
what the callers are doing. I've more or less reproduced your results
locally and am slowly plodding through each caller to see what has
changed of significance. No patches yet though.
-- 
Mel Gorman
Part-time Phd Student                          Linux Technology Center
University of Limerick                         IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply	[flat|nested] 88+ messages in thread
end of thread, other threads:[~2009-11-09 19:01 UTC | newest]
Thread overview: 88+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <3onW63eFtRF.A.xXH.oMTxKB@chimera>
     [not found] ` <COE24pZSBH.A.k2B.ZNTxKB@chimera>
     [not found]   ` <200910021111.55749.elendil@planet.nl>
2009-10-05  5:13     ` [Bug #14141] order 2 page allocation failures in iwlagn Frans Pop
2009-10-05  6:50       ` Frans Pop
2009-10-05  8:54         ` Frans Pop
2009-10-05  8:57         ` Mel Gorman
2009-10-05 21:34           ` Frans Pop
2009-10-06  0:04             ` David Rientjes
2009-10-06  1:25               ` KOSAKI Motohiro
2009-10-06  8:53               ` Mel Gorman
2009-10-06  9:14                 ` David Rientjes
2009-10-06  9:22                   ` Mel Gorman
2009-10-06 10:23               ` Frans Pop
2009-10-11 23:10         ` Frans Pop
2009-10-11 23:36           ` Frans Pop
2009-10-12 13:43           ` Mel Gorman
2009-10-12 17:32             ` Frans Pop
2009-10-12 18:43               ` Mel Gorman
2009-10-13 20:38               ` Frans Pop
2009-10-14 10:30                 ` Mel Gorman
2009-10-14 13:10                   ` Frans Pop
2009-10-14 15:40                     ` Mel Gorman
2009-10-14 16:13                       ` Frans Pop
2009-10-14 18:34                       ` Frans Pop
2009-10-14 23:56                         ` Mel Gorman
2009-10-15 20:15                           ` Frans Pop
2009-10-16  9:39                             ` Mel Gorman
2009-10-14 16:30                     ` reinette chatre
2009-10-18 23:33                     ` Frans Pop
2009-10-19  0:36                       ` Pekka Enberg
2009-10-19  2:44                         ` Frans Pop
2009-10-19  9:49                           ` [Bug #14141] order 2 page allocation failures (generic) Tobi Oetiker
2009-10-19  9:54                             ` Pekka Enberg
2009-10-19 14:01                               ` Karol Lewandowski
2009-10-19 14:06                                 ` Mel Gorman
2009-10-19 17:09                                   ` Karol Lewandowski
2009-10-20  1:47                                     ` Karol Lewandowski
2009-10-19 13:31                             ` Mel Gorman
2009-10-19 13:40                               ` Tobias Oetiker
2009-10-19 14:09                                 ` Mel Gorman
2009-10-19 14:16                                   ` Tobias Oetiker
2009-10-19 14:59                                     ` Mel Gorman
2009-10-19 20:12                                       ` Tobias Oetiker
2009-10-19 20:17                                         ` Tobias Oetiker
2009-10-20 10:57                                           ` Mel Gorman
2009-10-20 11:44                                             ` Tobias Oetiker
2009-10-20 12:51                                               ` Mel Gorman
2009-10-20 12:58                                                 ` Tobias Oetiker
2009-10-20 13:39                                                   ` Mel Gorman
2009-10-20 13:50                                                     ` Tobias Oetiker
2009-10-20 14:14                                                       ` Mel Gorman
2009-10-20 14:20                                                         ` Tobias Oetiker
2009-10-22 10:27                                                     ` Tobias Oetiker
2009-10-19  2:52                         ` [Bug #14141] order 2 page allocation failures in iwlagn Jens Axboe
2009-10-19 14:01                       ` Mel Gorman
2009-10-19 16:18                         ` Chris Mason
2009-10-19 17:01                           ` Christoph Hellwig
2009-10-19 17:01                           ` Christoph Hellwig
2009-10-19 21:57                             ` Chris Mason
2009-10-20 10:48                           ` Mel Gorman
2009-10-20 10:48                           ` Mel Gorman
2009-10-26 21:06                             ` Frans Pop
2009-10-27 14:54                               ` Mel Gorman
2009-10-27 15:16                                 ` KOSAKI Motohiro
2009-10-27 15:21                                   ` Mel Gorman
2009-10-27 15:52                                 ` Mel Gorman
2009-10-27 16:03                                   ` Chris Mason
2009-10-27 17:21                                     ` Frans Pop
2009-10-27 17:21                                     ` Frans Pop
2009-11-05 20:14                               ` Frans Pop
2009-11-06  9:51                                 ` Frans Pop
2009-11-09 19:00                                   ` Mel Gorman
2009-10-25 18:54                           ` Frans Pop
2009-10-14 16:28                   ` reinette chatre
2009-10-14 16:50                     ` Mel Gorman
2009-10-14 20:41                       ` reinette chatre
2009-10-14 21:33                         ` Frans Pop
2009-10-14 21:55                           ` reinette chatre
2009-10-15  2:02                         ` Frans Pop
2009-10-15 15:29                           ` reinette chatre
2009-10-15 19:41                             ` Frans Pop
2009-10-16 17:21                               ` reinette chatre
2009-10-17  5:42                               ` reinette chatre
2009-10-27 11:10                                 ` Frans Pop
2009-10-27 16:15                                   ` reinette chatre
     [not found] ` <COE24pZSBH.A.rP.2MTxKB@chimera>
2009-10-21 20:04   ` [PATCH] SLUB: Don't drop __GFP_NOFAIL completely from allocate_slab() (was: Re: [Bug #14265] ifconfig: page allocation failure. order:5,ode:0x8020 w/ e100) Karol Lewandowski
2009-10-21 21:06     ` David Rientjes
2009-10-21 21:20       ` Karol Lewandowski
2009-10-22 10:20         ` Mel Gorman
2009-10-22 21:33           ` Karol Lewandowski
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).