All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Chris Friesen" <cfriesen@nortel.com>
To: balbir@linux.vnet.ibm.com
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>,
	Rik van Riel <riel@redhat.com>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	linux-mm@kvack.org
Subject: Re: tracking memory usage/leak in "inactive" field in /proc/meminfo?
Date: Mon, 15 Feb 2010 10:02:13 -0600	[thread overview]
Message-ID: <4B797005.6030308@nortel.com> (raw)
In-Reply-To: <20100213062905.GF11364@balbir.in.ibm.com>

On 02/13/2010 12:29 AM, Balbir Singh wrote:

> OK, I did not find the OOM kill output, dmesg. Is the OOM killer doing
> the right thing? If it kills the process we suspect is leaking memory,
> then it is working correctly :) If the leak is in kernel space, we
> need to examine the changes more closely.

I didn't include the oom killer message because it didn't seem important
in this case.  The oom killer took out the process with by far the
largest memory consumption, but as far as I know that process was not
the source of the leak.

It appears that the leak is in kernel space, given the unexplained pages
that are part of the active/inactive list but not in
buffers/cache/anon/swapcached.

> kernel modifications that we are unaware of make the problem harder to
> debug, since we have no way of knowing if they are the source of the
> problem.

Yes, I realize this.  I'm not expecting miracles, just hoping for some
guidance.


>> Committed_AS	12666508	12745200	7700484
> 
> Comitted_AS shows a large change, does the process that gets killed
> use a lot of virtual memory (total_vm)? Please see my first question
> as well. Can you try to set
> 
> vm.overcommit_memory=2
> 
> and run the tests to see if you still get OOM killed.

As mentioned above, the process that was killed did indeed consume a lot
of memory.  I could try running with strict memory accounting, but would
you agree that that given the gradual but constant increase in the
unexplained pages described above, currently that looks like a more
likely culprit?

Chris

WARNING: multiple messages have this Message-ID (diff)
From: "Chris Friesen" <cfriesen@nortel.com>
To: balbir@linux.vnet.ibm.com
Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>,
	Rik van Riel <riel@redhat.com>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	linux-mm@kvack.org
Subject: Re: tracking memory usage/leak in "inactive" field in /proc/meminfo?
Date: Mon, 15 Feb 2010 10:02:13 -0600	[thread overview]
Message-ID: <4B797005.6030308@nortel.com> (raw)
In-Reply-To: <20100213062905.GF11364@balbir.in.ibm.com>

On 02/13/2010 12:29 AM, Balbir Singh wrote:

> OK, I did not find the OOM kill output, dmesg. Is the OOM killer doing
> the right thing? If it kills the process we suspect is leaking memory,
> then it is working correctly :) If the leak is in kernel space, we
> need to examine the changes more closely.

I didn't include the oom killer message because it didn't seem important
in this case.  The oom killer took out the process with by far the
largest memory consumption, but as far as I know that process was not
the source of the leak.

It appears that the leak is in kernel space, given the unexplained pages
that are part of the active/inactive list but not in
buffers/cache/anon/swapcached.

> kernel modifications that we are unaware of make the problem harder to
> debug, since we have no way of knowing if they are the source of the
> problem.

Yes, I realize this.  I'm not expecting miracles, just hoping for some
guidance.


>> Committed_AS	12666508	12745200	7700484
> 
> Comitted_AS shows a large change, does the process that gets killed
> use a lot of virtual memory (total_vm)? Please see my first question
> as well. Can you try to set
> 
> vm.overcommit_memory=2
> 
> and run the tests to see if you still get OOM killed.

As mentioned above, the process that was killed did indeed consume a lot
of memory.  I could try running with strict memory accounting, but would
you agree that that given the gradual but constant increase in the
unexplained pages described above, currently that looks like a more
likely culprit?

Chris

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2010-02-15 16:08 UTC|newest]

Thread overview: 41+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-02-09 16:51 tracking memory usage/leak in "inactive" field in /proc/meminfo? Chris Friesen
2010-02-09 16:51 ` Chris Friesen
2010-02-10  0:32 ` KOSAKI Motohiro
2010-02-10  0:32   ` KOSAKI Motohiro
2010-02-10  3:50   ` Balbir Singh
2010-02-10  3:50     ` Balbir Singh
2010-02-10  4:09     ` KOSAKI Motohiro
2010-02-10  4:09       ` KOSAKI Motohiro
2010-02-10 17:05   ` Chris Friesen
2010-02-11  0:45     ` Minchan Kim
2010-02-11  0:45       ` Minchan Kim
2010-02-11 18:54       ` Chris Friesen
2010-02-11 18:54         ` Chris Friesen
2010-02-11 19:04         ` Rik van Riel
2010-02-11 19:04           ` Rik van Riel
2010-02-12  2:38         ` Minchan Kim
2010-02-12  2:38           ` Minchan Kim
2010-02-12  7:35           ` Chris Friesen
2010-02-12  7:35             ` Chris Friesen
2010-02-12  8:04             ` KOSAKI Motohiro
2010-02-12  8:04               ` KOSAKI Motohiro
2010-02-15 15:50             ` Chris Friesen
2010-02-15 15:50               ` Chris Friesen
2010-02-15 17:00               ` Rik van Riel
2010-02-15 17:00                 ` Rik van Riel
2010-02-16 16:52                 ` Chris Friesen
2010-02-16 16:52                   ` Chris Friesen
2010-02-16 17:12                   ` Rik van Riel
2010-02-16 17:12                     ` Rik van Riel
2010-02-16 21:26                     ` Chris Friesen
2010-02-16 21:26                       ` Chris Friesen
2010-02-16 22:22                       ` Rik van Riel
2010-02-16 22:22                         ` Rik van Riel
2010-02-18 15:39                         ` tracking memory usage/leak in "inactive" field in /proc/meminfo? -- solved Chris Friesen
2010-02-18 15:39                           ` Chris Friesen
2010-02-12 17:50           ` tracking memory usage/leak in "inactive" field in /proc/meminfo? Catalin Marinas
2010-02-12 17:50             ` Catalin Marinas
2010-02-13  6:29     ` Balbir Singh
2010-02-13  6:29       ` Balbir Singh
2010-02-15 16:02       ` Chris Friesen [this message]
2010-02-15 16:02         ` Chris Friesen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4B797005.6030308@nortel.com \
    --to=cfriesen@nortel.com \
    --cc=balbir@linux.vnet.ibm.com \
    --cc=kosaki.motohiro@jp.fujitsu.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=riel@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.