* 2.6.19-rc1-mm1+ memory problem
@ 2006-11-20 6:26 Michael Raskin
2006-11-20 18:18 ` Michael Raskin
` (3 more replies)
0 siblings, 4 replies; 10+ messages in thread
From: Michael Raskin @ 2006-11-20 6:26 UTC (permalink / raw)
To: linux-kernel
Short description: when X is loaded (maybe any heavy application is
sufficient, but I don't use anything heavy in console), 'free' says used
memory is growing.
Keywords: memory.
Kernel: built locally, gcc 4.0.3
I have a strange problem with 2.6.19-rc-mm kernels. After I load X, I
notice that memory is marked used at rate of tens of KB/s. Then it
starts to swap very heavily, when physical memory is all used. I tried
to verify it - it is so with all -mm kernels after 2.6.19-rc1-mm1,
including 2.6.19-rc5-mm2. At the meantime everything works OK with
kernels 2.6.18-mm3 and 2.6.19-rc1 through 2.6.19-rc6. I do not see any
options that should be memory eating in my .config . Module list is
short enough to include inline.
When I just run some things like periodical suck, oops proxy server etc
with X shut down, I do not notice "leak" from console because of small
fluctuations of memory use. When I run X and shut it down, used memory
count goes up a few megs (consistent with speed of eating it by X).
I didn't find exactly this problem in lkml or www, though the problem
with OOM on 2.6.19-rc-mm seems similar.
What should I check to fix problem or produce a useful bug report?
/etc/sysconfig/modules:
ehci-hcd, usb-storage, usbhid, ipaq, i915
Now loaded in 2.6.19-rc6:
i915, drm, ipaq, usbserial, usbhid, usb_storage, libusual, ehci_hcd,
usbcore
Main configuration options:
http://bigtip.narod.ru/temp/xorg.conf.txt
http://bigtip.narod.ru/temp/config-2.6.19-rc2-mm5-swsusp-my-1.txt
http://bigtip.narod.ru/temp/lspci.txt
Drivers:
http://bigtip.narod.ru/temp/ioports.txt
http://bigtip.narod.ru/temp/iomem.txt
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
2006-11-20 6:26 2.6.19-rc1-mm1+ memory problem Michael Raskin
@ 2006-11-20 18:18 ` Michael Raskin
2006-11-21 8:37 ` Andrew Morton
` (2 subsequent siblings)
3 siblings, 0 replies; 10+ messages in thread
From: Michael Raskin @ 2006-11-20 18:18 UTC (permalink / raw)
To: linux-kernel
Michael Raskin wrote:
> Short description: when X is loaded (maybe any heavy application is
> sufficient, but I don't use anything heavy in console), 'free' says used
> memory is growing.
>
Tried driver vesa. Leak still exists.
About leak size: with dri, xscreensaver, and nothing loaded while true;
do free >>free.log; sleep 1; done
shows ~100KB/s.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
2006-11-20 6:26 2.6.19-rc1-mm1+ memory problem Michael Raskin
2006-11-20 18:18 ` Michael Raskin
@ 2006-11-21 8:37 ` Andrew Morton
[not found] ` <4563485B.3050801@mail.ru>
2006-11-24 13:23 ` Michael Raskin
2006-11-29 4:29 ` 2.6.19-rc6-mm2 is ok (2.6.19-rc1-mm1+ memory problem) Michael Raskin
3 siblings, 1 reply; 10+ messages in thread
From: Andrew Morton @ 2006-11-21 8:37 UTC (permalink / raw)
To: Michael Raskin; +Cc: linux-kernel
On Mon, 20 Nov 2006 09:26:29 +0300
Michael Raskin <a1d23ab4@mail.ru> wrote:
> Short description: when X is loaded (maybe any heavy application is
> sufficient, but I don't use anything heavy in console), 'free' says used
> memory is growing.
>
> Keywords: memory.
>
> Kernel: built locally, gcc 4.0.3
>
> I have a strange problem with 2.6.19-rc-mm kernels. After I load X, I
> notice that memory is marked used at rate of tens of KB/s. Then it
> starts to swap very heavily, when physical memory is all used. I tried
> to verify it - it is so with all -mm kernels after 2.6.19-rc1-mm1,
> including 2.6.19-rc5-mm2. At the meantime everything works OK with
> kernels 2.6.18-mm3 and 2.6.19-rc1 through 2.6.19-rc6. I do not see any
> options that should be memory eating in my .config . Module list is
> short enough to include inline.
>
> When I just run some things like periodical suck, oops proxy server etc
> with X shut down, I do not notice "leak" from console because of small
> fluctuations of memory use. When I run X and shut it down, used memory
> count goes up a few megs (consistent with speed of eating it by X).
>
> I didn't find exactly this problem in lkml or www, though the problem
> with OOM on 2.6.19-rc-mm seems similar.
>
> What should I check to fix problem or produce a useful bug report?
Monitor /proc/meminfo
If the leak is slab, monitor /proc/slabinfo and /proc/slab_allocators.
/proc/slab_allocators needs CONFIG_DEBUG_SLAB_LEAK.
Thanks.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
[not found] ` <4563485B.3050801@mail.ru>
@ 2006-11-21 19:45 ` Andrew Morton
2006-11-21 21:18 ` Michael Raskin
0 siblings, 1 reply; 10+ messages in thread
From: Andrew Morton @ 2006-11-21 19:45 UTC (permalink / raw)
To: Michael Raskin; +Cc: linux-kernel
On Tue, 21 Nov 2006 21:41:31 +0300
Michael Raskin <a1d23ab4@mail.ru> wrote:
> Andrew Morton wrote:
> > On Mon, 20 Nov 2006 09:26:29 +0300
> > Michael Raskin <a1d23ab4@mail.ru> wrote:
> >
> >> Short description: when X is loaded (maybe any heavy application is
> >> sufficient, but I don't use anything heavy in console), 'free' says used
> >> memory is growing.
> >>
> Thank you for reply.
>
> > Monitor /proc/meminfo
> Thanks for advice. I didn't think of it. I should be ashamed.
>
> Result: mysterious. All fields that grow can not account for even a
> third part.
>
> In top I found a situation when 2MB (~half a minute) go to nowhere and
> no of first 50 processes changes resident memory usage at all. The rest
> have less than a MB each.
>
> > If the leak is slab, monitor /proc/slabinfo and /proc/slab_allocators.
> I hope no.
> > /proc/slab_allocators needs CONFIG_DEBUG_SLAB_LEAK.
> >
> > Thanks.
> >
> I did a few cat /proc/meminfo. Two of them are here:
>
> MemTotal: 763532 kB 763532 kB
> MemFree: 445956 kB 430932 kB
> Buffers: 20908 kB 21048 kB
> Cached: 77008 kB 77212 kB
> SwapCached: 0 kB 0 kB
> Active: 65916 kB 66120 kB
> Inactive: 54748 kB 54884 kB
> SwapTotal: 1052216 kB 1052216 kB
> SwapFree: 1052216 kB 1052216 kB
> Dirty: 264 kB 324 kB
> Writeback: 0 kB 0 kB
> AnonPages: 22752 kB 22748 kB
> Mapped: 14616 kB 14628 kB
> Slab: 23108 kB 23088 kB
> SReclaimable: 15360 kB 15364 kB
> SUnreclaim: 7748 kB 7724 kB
> PageTables: 1216 kB 1216 kB
> NFS_Unstable: 0 kB 0 kB
> Bounce: 0 kB 0 kB
> CommitLimit: 1433980 kB 1433980 kB
> Committed_AS: 281456 kB 281448 kB
> VmallocTotal: 262104 kB 262104 kB
> VmallocUsed: 2876 kB 2876 kB
> VmallocChunk: 259028 kB 259028 kB
You lost 15MB and they didn't even turn up on the page LRU.
Can you try to determine exactly which activity causes this to happen? In
particular, is it due to the X server? If so, does any particular client
cause it to happen? Things which use 3d?
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
2006-11-21 19:45 ` Andrew Morton
@ 2006-11-21 21:18 ` Michael Raskin
0 siblings, 0 replies; 10+ messages in thread
From: Michael Raskin @ 2006-11-21 21:18 UTC (permalink / raw)
To: Andrew Morton, linux-kernel
Andrew Morton wrote:
> On Tue, 21 Nov 2006 21:41:31 +0300
> Michael Raskin <a1d23ab4@mail.ru> wrote:
Sorry for leaving lkml out of "To: " in previous post.
> Can you try to determine exactly which activity causes this to happen? In
> particular, is it due to the X server? If so, does any particular client
> cause it to happen? Things which use 3d?
You were right, it's not because of personally X, but because of
environment I use.
Simplest example of reproducing code:
while true; do free | cat &>/dev/null; done
Looks like minimum (except of &>/dev/null not to involve console/xterm
output - leaks well without it too).
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
2006-11-20 6:26 2.6.19-rc1-mm1+ memory problem Michael Raskin
2006-11-20 18:18 ` Michael Raskin
2006-11-21 8:37 ` Andrew Morton
@ 2006-11-24 13:23 ` Michael Raskin
2006-11-24 23:07 ` Michael Raskin
2006-11-29 4:29 ` 2.6.19-rc6-mm2 is ok (2.6.19-rc1-mm1+ memory problem) Michael Raskin
3 siblings, 1 reply; 10+ messages in thread
From: Michael Raskin @ 2006-11-24 13:23 UTC (permalink / raw)
To: linux-kernel
Michael Raskin wrote:
Strange thing: when run from xterm,
while true; do free | cat &>/dev/null; done
causes leak. While X is not loaded - no.
Also I have uploaded contents of /proc/page_owner after loosing more
than 100M. (220M used, 29M - on page_owner, lessthan 50M - for
processes). I will study it also.
http://bigtip.narod.ru/temp/page_owner.bz2
http://bigtip.narod.ru/temp/page_owner.gz
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
2006-11-24 13:23 ` Michael Raskin
@ 2006-11-24 23:07 ` Michael Raskin
2006-11-25 19:03 ` Andrew Morton
0 siblings, 1 reply; 10+ messages in thread
From: Michael Raskin @ 2006-11-24 23:07 UTC (permalink / raw)
To: linux-kernel
Michael Raskin wrote:
> Also I have uploaded contents of /proc/page_owner after loosing more
> than 100M. (220M used, 29M - on page_owner, lessthan 50M - for
> processes).
Top 3 entries:
89361 times:
Page allocated via order 0, mask 0x280d2
[0xc0159f31] __handle_mm_fault+1809
[0xc011318a] do_page_fault+314
[0xc04111c4] error_code+116
Can be anything. But if I understand anything, this memory is used
because someone has requested a page that is swapped out. So the memory
must be used, but not reflected in meminfo, and not by a process?
35560 times:
Page allocated via order 0, mask 0x201d2
[0xc0152ec2] __do_page_cache_readahead+450
[0xc015309a] do_page_cache_readahead+74
[0xc014d7b5] filemap_nopage+325
[0xc0159919] __handle_mm_fault+249
[0xc011318a] do_page_fault+314
[0xc04111c4] error_code+116
- is reflected in cache usage statistics, I guess..
6185 times:
Page allocated via order 0, mask 0x200d2
[0xc014e069] generic_file_buffered_write+329
[0xc014e814] __generic_file_aio_write_nolock+612
[0xc014eb85] generic_file_aio_write+85
[0xc01b26ff] ext3_file_write+63
[0xc016b23c] do_sync_write+204
[0xc016b9a7] vfs_write+167
[0xc016c2a7] sys_write+71
[0xc010303a] sysenter_past_esp+95
- negligible, really..
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
2006-11-24 23:07 ` Michael Raskin
@ 2006-11-25 19:03 ` Andrew Morton
2006-11-25 21:53 ` Michael Raskin
0 siblings, 1 reply; 10+ messages in thread
From: Andrew Morton @ 2006-11-25 19:03 UTC (permalink / raw)
To: Michael Raskin; +Cc: linux-kernel
On Sat, 25 Nov 2006 02:07:43 +0300
Michael Raskin <a1d23ab4@mail.ru> wrote:
> Michael Raskin wrote:
> > Also I have uploaded contents of /proc/page_owner after loosing more
> > than 100M. (220M used, 29M - on page_owner, lessthan 50M - for
> > processes).
>
> Top 3 entries:
>
> 89361 times:
> Page allocated via order 0, mask 0x280d2
> [0xc0159f31] __handle_mm_fault+1809
> [0xc011318a] do_page_fault+314
> [0xc04111c4] error_code+116
> Can be anything. But if I understand anything, this memory is used
> because someone has requested a page that is swapped out. So the memory
> must be used, but not reflected in meminfo, and not by a process?
>
>
> 35560 times:
> Page allocated via order 0, mask 0x201d2
> [0xc0152ec2] __do_page_cache_readahead+450
> [0xc015309a] do_page_cache_readahead+74
> [0xc014d7b5] filemap_nopage+325
> [0xc0159919] __handle_mm_fault+249
> [0xc011318a] do_page_fault+314
> [0xc04111c4] error_code+116
> - is reflected in cache usage statistics, I guess..
>
> 6185 times:
> Page allocated via order 0, mask 0x200d2
> [0xc014e069] generic_file_buffered_write+329
> [0xc014e814] __generic_file_aio_write_nolock+612
> [0xc014eb85] generic_file_aio_write+85
> [0xc01b26ff] ext3_file_write+63
> [0xc016b23c] do_sync_write+204
> [0xc016b9a7] vfs_write+167
> [0xc016c2a7] sys_write+71
> [0xc010303a] sysenter_past_esp+95
> - negligible, really..
What you should do is to cause the system to free as many pages as possible
before looking ad /proc/page_owner. For example, build `usemem' from
http://www.zip.com.au/~akpm/linux/patches/stuff/ext3-tools.tar.gz, run
usemem -m N (where N is the number of megabytes which the machine has)
a couple of times. Then check /proc/meminfo, and look to see which pages
are left over in /proc/page_owner.
Thanks.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc1-mm1+ memory problem
2006-11-25 19:03 ` Andrew Morton
@ 2006-11-25 21:53 ` Michael Raskin
0 siblings, 0 replies; 10+ messages in thread
From: Michael Raskin @ 2006-11-25 21:53 UTC (permalink / raw)
To: linux-kernel; +Cc: Andrew Morton
Andrew Morton wrote:
>> 89361 times:
>> Page allocated via order 0, mask 0x280d2
>> [0xc0159f31] __handle_mm_fault+1809
>> [0xc011318a] do_page_fault+314
>> [0xc04111c4] error_code+116
>> Can be anything. But if I understand anything, this memory is used
>> because someone has requested a page that is swapped out. So the memory
>> must be used, but not reflected in meminfo, and not by a process?
> What you should do is to cause the system to free as many pages as possible
> before looking ad /proc/page_owner. For example, build `usemem' from
> http://www.zip.com.au/~akpm/linux/patches/stuff/ext3-tools.tar.gz, run
>
> usemem -m N (where N is the number of megabytes which the machine has)
>
> a couple of times. Then check /proc/meminfo, and look to see which pages
> are left over in /proc/page_owner.
Well, I was too lazy to get this utility, used my own to allocate and
fill enough memory as to go some 50MB to deep swap (Did I understand
correctly what usemem does?). Top 3 did not change, except for exact
numbers.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: 2.6.19-rc6-mm2 is ok (2.6.19-rc1-mm1+ memory problem)
2006-11-20 6:26 2.6.19-rc1-mm1+ memory problem Michael Raskin
` (2 preceding siblings ...)
2006-11-24 13:23 ` Michael Raskin
@ 2006-11-29 4:29 ` Michael Raskin
3 siblings, 0 replies; 10+ messages in thread
From: Michael Raskin @ 2006-11-29 4:29 UTC (permalink / raw)
To: linux-kernel; +Cc: Andrew Morton
Michael Raskin wrote:
> I have a strange problem with 2.6.19-rc-mm kernels. After I load X, I
> notice that memory is marked used at rate of tens of KB/s. Then it
Tried 2.6.19-rc6-mm2. Now the problem is gone. Sometimes memory is
getting maked used as before, but when the loss reaches a few MB's it is
all freed. After 3 hours of X+all those scripts that cause leak +
ThunderBird I can still shut down everything except a few processes and
have only 50MB used. Script that demonstrated leak is now working
without problems and without eating memory.
Thanks.
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2006-11-29 4:30 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-11-20 6:26 2.6.19-rc1-mm1+ memory problem Michael Raskin
2006-11-20 18:18 ` Michael Raskin
2006-11-21 8:37 ` Andrew Morton
[not found] ` <4563485B.3050801@mail.ru>
2006-11-21 19:45 ` Andrew Morton
2006-11-21 21:18 ` Michael Raskin
2006-11-24 13:23 ` Michael Raskin
2006-11-24 23:07 ` Michael Raskin
2006-11-25 19:03 ` Andrew Morton
2006-11-25 21:53 ` Michael Raskin
2006-11-29 4:29 ` 2.6.19-rc6-mm2 is ok (2.6.19-rc1-mm1+ memory problem) Michael Raskin
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox