From: Minchan Kim <minchan-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>
To: Anton Vorontsov
<anton.vorontsov-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org>
Cc: cgroups-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
Tejun Heo <tj-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>,
David Rientjes <rientjes-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>,
Pekka Enberg <penberg-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org>,
Mel Gorman <mgorman-l3A5Bk7waGM@public.gmane.org>,
Glauber Costa <glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org>,
Michal Hocko <mhocko-AlSwsSmVLrQ@public.gmane.org>,
"Kirill A. Shutemov"
<kirill-oKw7cIdHH8eLwutG50LtGA@public.gmane.org>,
Kamezawa Hiroyuki
<kamezawa.hiroyu-+CUm20s59erQFUHtdCDX3A@public.gmane.org>,
Luiz Capitulino
<lcapitulino-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
Andrew Morton
<akpm-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org>,
Greg Thelen <gthelen-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>,
Leonid Moiseichuk
<leonid.moiseichuk-xNZwKgViW5gAvxtiuMwx3w@public.gmane.org>,
KOSAKI Motohiro
<kosaki.motohiro-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
Bartlomiej Zolnierkiewicz
<b.zolnierkie-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org>,
John Stultz <john.stultz-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org>,
linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org,
linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
linaro-kernel-cunTk1MwBs8s++Sfvej+rw@public.gmane.org,
patches-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org,
kernel-team-z5hGa2qSFaRBDgjK7y7TUQ@public.gmane.org
Subject: Re: [PATCH v2] memcg: Add memory.pressure_level events
Date: Wed, 20 Feb 2013 09:17:43 +0900 [thread overview]
Message-ID: <20130220001743.GE16950@blaptop> (raw)
In-Reply-To: <20130219044012.GA23356-SAfYLu58TvvqjT3d3NRMk4C2/QZ1MJnMKwcig+XE9tjR7s880joybQ@public.gmane.org>
Hi Anton,
On Mon, Feb 18, 2013 at 08:40:12PM -0800, Anton Vorontsov wrote:
> With this patch userland applications that want to maintain the
> interactivity/memory allocation cost can use the pressure level
> notifications. The levels are defined like this:
>
> The "low" level means that the system is reclaiming memory for new
> allocations. Monitoring this reclaiming activity might be useful for
> maintaining cache level. Upon notification, the program (typically
> "Activity Manager") might analyze vmstat and act in advance (i.e.
> prematurely shutdown unimportant services).
>
> The "medium" level means that the system is experiencing medium memory
> pressure, the system might be making swap, paging out active file caches,
> etc. Upon this event applications may decide to further analyze
> vmstat/zoneinfo/memcg or internal memory usage statistics and free any
> resources that can be easily reconstructed or re-read from a disk.
>
> The "critical" level means that the system is actively thrashing, it is
> about to out of memory (OOM) or even the in-kernel OOM killer is on its
> way to trigger. Applications should do whatever they can to help the
> system. It might be too late to consult with vmstat or any other
> statistics, so it's advisable to take an immediate action.
>
> The events are propagated upward until the event is handled, i.e. the
> events are not pass-through. Here is what this means: for example you have
> three cgroups: A->B->C. Now you set up an event listener on cgroups A, B
> and C, and suppose group C experiences some pressure. In this situation,
> only group C will receive the notification, i.e. groups A and B will not
> receive it. This is done to avoid excessive "broadcasting" of messages,
> which disturbs the system and which is especially bad if we are low on
> memory or thrashing. So, organize the cgroups wisely, or propagate the
> events manually (or, ask us to implement the pass-through events,
> explaining why would you need them.)
>
> Signed-off-by: Anton Vorontsov <anton.vorontsov-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org>
> Acked-by: Kirill A. Shutemov <kirill-oKw7cIdHH8eLwutG50LtGA@public.gmane.org>
> ---
>
> Hi all,
>
> Many thanks for the previous reviews! In this revision:
>
> - Addressed Glauber Costa's comments:
> o Use parent_mem_cgroup() instead of own parent function (also suggested
> by Kamezawa). This change also affected events distribution logic, so
> it became more like memory thresholds notifications, i.e. we deliver
> the event to the cgroup where the event originated, not to the parent
> cgroup; (This also addreses Kamezawa's remark regarding which cgroup
> receives which event.)
> o Register vmpressure cgroup file directly in memcontrol.c.
>
> - Addressed Greg Thelen's comments:
> o Fixed bool/int inconsistency in the code;
> o Fixed nr_scanned accounting;
> o Don't use cryptic 's', 'r' abbreviations; get rid of confusing
> 'window' argument.
>
> - Addressed Kamezawa Hiroyuki's comments:
> o Moved declarations from mm/internal.h into linux/vmpressue.h;
> o Removed Kconfig symbol. Vmpressure is pretty lightweight (especially
> comparing to the memcg accounting). If it ever causes any measurable
> performance effect, we want to fix it, not paper it over with a
> Kconfig option. :-)
> o Removed read operation on pressure_level cgroup file. In apps, we only
> use notifications, we don't need the content of the file, so let's
> keep things simple for now. Plus this resolves questions like what
> should we return there when the system is not reclaiming;
> o Reworded documentation;
> o Improved comments for vmpressure_prio().
Should we really enable memcg for just pressure notificaion in embedded side?
I didn't check the size(cgroup + memcg) and performance penalty but I don't want
to add unnecessary overhead if it is possible.
Do you have a plan to support it via global knob(ie, /proc/mempressure), NOT memcg?
--
Kind regards,
Minchan Kim
WARNING: multiple messages have this Message-ID (diff)
From: Minchan Kim <minchan@kernel.org>
To: Anton Vorontsov <anton.vorontsov@linaro.org>
Cc: cgroups@vger.kernel.org, Tejun Heo <tj@kernel.org>,
David Rientjes <rientjes@google.com>,
Pekka Enberg <penberg@kernel.org>, Mel Gorman <mgorman@suse.de>,
Glauber Costa <glommer@parallels.com>,
Michal Hocko <mhocko@suse.cz>,
"Kirill A. Shutemov" <kirill@shutemov.name>,
Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
Luiz Capitulino <lcapitulino@redhat.com>,
Andrew Morton <akpm@linux-foundation.org>,
Greg Thelen <gthelen@google.com>,
Leonid Moiseichuk <leonid.moiseichuk@nokia.com>,
KOSAKI Motohiro <kosaki.motohiro@gmail.com>,
Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
John Stultz <john.stultz@linaro.org>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org,
linaro-kernel@lists.linaro.org, patches@linaro.org,
kernel-team@android.com
Subject: Re: [PATCH v2] memcg: Add memory.pressure_level events
Date: Wed, 20 Feb 2013 09:17:43 +0900 [thread overview]
Message-ID: <20130220001743.GE16950@blaptop> (raw)
In-Reply-To: <20130219044012.GA23356@lizard.sbx00618.mountca.wayport.net>
Hi Anton,
On Mon, Feb 18, 2013 at 08:40:12PM -0800, Anton Vorontsov wrote:
> With this patch userland applications that want to maintain the
> interactivity/memory allocation cost can use the pressure level
> notifications. The levels are defined like this:
>
> The "low" level means that the system is reclaiming memory for new
> allocations. Monitoring this reclaiming activity might be useful for
> maintaining cache level. Upon notification, the program (typically
> "Activity Manager") might analyze vmstat and act in advance (i.e.
> prematurely shutdown unimportant services).
>
> The "medium" level means that the system is experiencing medium memory
> pressure, the system might be making swap, paging out active file caches,
> etc. Upon this event applications may decide to further analyze
> vmstat/zoneinfo/memcg or internal memory usage statistics and free any
> resources that can be easily reconstructed or re-read from a disk.
>
> The "critical" level means that the system is actively thrashing, it is
> about to out of memory (OOM) or even the in-kernel OOM killer is on its
> way to trigger. Applications should do whatever they can to help the
> system. It might be too late to consult with vmstat or any other
> statistics, so it's advisable to take an immediate action.
>
> The events are propagated upward until the event is handled, i.e. the
> events are not pass-through. Here is what this means: for example you have
> three cgroups: A->B->C. Now you set up an event listener on cgroups A, B
> and C, and suppose group C experiences some pressure. In this situation,
> only group C will receive the notification, i.e. groups A and B will not
> receive it. This is done to avoid excessive "broadcasting" of messages,
> which disturbs the system and which is especially bad if we are low on
> memory or thrashing. So, organize the cgroups wisely, or propagate the
> events manually (or, ask us to implement the pass-through events,
> explaining why would you need them.)
>
> Signed-off-by: Anton Vorontsov <anton.vorontsov@linaro.org>
> Acked-by: Kirill A. Shutemov <kirill@shutemov.name>
> ---
>
> Hi all,
>
> Many thanks for the previous reviews! In this revision:
>
> - Addressed Glauber Costa's comments:
> o Use parent_mem_cgroup() instead of own parent function (also suggested
> by Kamezawa). This change also affected events distribution logic, so
> it became more like memory thresholds notifications, i.e. we deliver
> the event to the cgroup where the event originated, not to the parent
> cgroup; (This also addreses Kamezawa's remark regarding which cgroup
> receives which event.)
> o Register vmpressure cgroup file directly in memcontrol.c.
>
> - Addressed Greg Thelen's comments:
> o Fixed bool/int inconsistency in the code;
> o Fixed nr_scanned accounting;
> o Don't use cryptic 's', 'r' abbreviations; get rid of confusing
> 'window' argument.
>
> - Addressed Kamezawa Hiroyuki's comments:
> o Moved declarations from mm/internal.h into linux/vmpressue.h;
> o Removed Kconfig symbol. Vmpressure is pretty lightweight (especially
> comparing to the memcg accounting). If it ever causes any measurable
> performance effect, we want to fix it, not paper it over with a
> Kconfig option. :-)
> o Removed read operation on pressure_level cgroup file. In apps, we only
> use notifications, we don't need the content of the file, so let's
> keep things simple for now. Plus this resolves questions like what
> should we return there when the system is not reclaiming;
> o Reworded documentation;
> o Improved comments for vmpressure_prio().
Should we really enable memcg for just pressure notificaion in embedded side?
I didn't check the size(cgroup + memcg) and performance penalty but I don't want
to add unnecessary overhead if it is possible.
Do you have a plan to support it via global knob(ie, /proc/mempressure), NOT memcg?
--
Kind regards,
Minchan Kim
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
WARNING: multiple messages have this Message-ID (diff)
From: Minchan Kim <minchan@kernel.org>
To: Anton Vorontsov <anton.vorontsov@linaro.org>
Cc: cgroups@vger.kernel.org, Tejun Heo <tj@kernel.org>,
David Rientjes <rientjes@google.com>,
Pekka Enberg <penberg@kernel.org>, Mel Gorman <mgorman@suse.de>,
Glauber Costa <glommer@parallels.com>,
Michal Hocko <mhocko@suse.cz>,
"Kirill A. Shutemov" <kirill@shutemov.name>,
Kamezawa Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
Luiz Capitulino <lcapitulino@redhat.com>,
Andrew Morton <akpm@linux-foundation.org>,
Greg Thelen <gthelen@google.com>,
Leonid Moiseichuk <leonid.moiseichuk@nokia.com>,
KOSAKI Motohiro <kosaki.motohiro@gmail.com>,
Bartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>,
John Stultz <john.stultz@linaro.org>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org,
linaro-kernel@lists.linaro.org, patches@linaro.org,
kernel-team@android.com
Subject: Re: [PATCH v2] memcg: Add memory.pressure_level events
Date: Wed, 20 Feb 2013 09:17:43 +0900 [thread overview]
Message-ID: <20130220001743.GE16950@blaptop> (raw)
In-Reply-To: <20130219044012.GA23356@lizard.sbx00618.mountca.wayport.net>
Hi Anton,
On Mon, Feb 18, 2013 at 08:40:12PM -0800, Anton Vorontsov wrote:
> With this patch userland applications that want to maintain the
> interactivity/memory allocation cost can use the pressure level
> notifications. The levels are defined like this:
>
> The "low" level means that the system is reclaiming memory for new
> allocations. Monitoring this reclaiming activity might be useful for
> maintaining cache level. Upon notification, the program (typically
> "Activity Manager") might analyze vmstat and act in advance (i.e.
> prematurely shutdown unimportant services).
>
> The "medium" level means that the system is experiencing medium memory
> pressure, the system might be making swap, paging out active file caches,
> etc. Upon this event applications may decide to further analyze
> vmstat/zoneinfo/memcg or internal memory usage statistics and free any
> resources that can be easily reconstructed or re-read from a disk.
>
> The "critical" level means that the system is actively thrashing, it is
> about to out of memory (OOM) or even the in-kernel OOM killer is on its
> way to trigger. Applications should do whatever they can to help the
> system. It might be too late to consult with vmstat or any other
> statistics, so it's advisable to take an immediate action.
>
> The events are propagated upward until the event is handled, i.e. the
> events are not pass-through. Here is what this means: for example you have
> three cgroups: A->B->C. Now you set up an event listener on cgroups A, B
> and C, and suppose group C experiences some pressure. In this situation,
> only group C will receive the notification, i.e. groups A and B will not
> receive it. This is done to avoid excessive "broadcasting" of messages,
> which disturbs the system and which is especially bad if we are low on
> memory or thrashing. So, organize the cgroups wisely, or propagate the
> events manually (or, ask us to implement the pass-through events,
> explaining why would you need them.)
>
> Signed-off-by: Anton Vorontsov <anton.vorontsov@linaro.org>
> Acked-by: Kirill A. Shutemov <kirill@shutemov.name>
> ---
>
> Hi all,
>
> Many thanks for the previous reviews! In this revision:
>
> - Addressed Glauber Costa's comments:
> o Use parent_mem_cgroup() instead of own parent function (also suggested
> by Kamezawa). This change also affected events distribution logic, so
> it became more like memory thresholds notifications, i.e. we deliver
> the event to the cgroup where the event originated, not to the parent
> cgroup; (This also addreses Kamezawa's remark regarding which cgroup
> receives which event.)
> o Register vmpressure cgroup file directly in memcontrol.c.
>
> - Addressed Greg Thelen's comments:
> o Fixed bool/int inconsistency in the code;
> o Fixed nr_scanned accounting;
> o Don't use cryptic 's', 'r' abbreviations; get rid of confusing
> 'window' argument.
>
> - Addressed Kamezawa Hiroyuki's comments:
> o Moved declarations from mm/internal.h into linux/vmpressue.h;
> o Removed Kconfig symbol. Vmpressure is pretty lightweight (especially
> comparing to the memcg accounting). If it ever causes any measurable
> performance effect, we want to fix it, not paper it over with a
> Kconfig option. :-)
> o Removed read operation on pressure_level cgroup file. In apps, we only
> use notifications, we don't need the content of the file, so let's
> keep things simple for now. Plus this resolves questions like what
> should we return there when the system is not reclaiming;
> o Reworded documentation;
> o Improved comments for vmpressure_prio().
Should we really enable memcg for just pressure notificaion in embedded side?
I didn't check the size(cgroup + memcg) and performance penalty but I don't want
to add unnecessary overhead if it is possible.
Do you have a plan to support it via global knob(ie, /proc/mempressure), NOT memcg?
--
Kind regards,
Minchan Kim
next prev parent reply other threads:[~2013-02-20 0:17 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-02-19 4:40 [PATCH v2] memcg: Add memory.pressure_level events Anton Vorontsov
2013-02-19 4:40 ` Anton Vorontsov
2013-02-19 4:40 ` Anton Vorontsov
[not found] ` <20130219044012.GA23356-SAfYLu58TvvqjT3d3NRMk4C2/QZ1MJnMKwcig+XE9tjR7s880joybQ@public.gmane.org>
2013-02-20 0:17 ` Minchan Kim [this message]
2013-02-20 0:17 ` Minchan Kim
2013-02-20 0:17 ` Minchan Kim
2013-02-20 0:21 ` Tejun Heo
2013-02-20 0:21 ` Tejun Heo
2013-02-20 0:21 ` Tejun Heo
[not found] ` <CAOS58YOPjcH_pFFhPLJEAdaLkm5FoOy9mG2i-PkWAcb7O6V_Kw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-02-21 23:04 ` Anton Vorontsov
2013-02-21 23:04 ` Anton Vorontsov
2013-02-21 23:04 ` Anton Vorontsov
2013-02-21 23:56 ` Minchan Kim
2013-02-21 23:56 ` Minchan Kim
2013-02-22 6:55 ` Anton Vorontsov
2013-02-22 6:55 ` Anton Vorontsov
2013-02-22 7:26 ` Minchan Kim
2013-02-22 7:26 ` Minchan Kim
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20130220001743.GE16950@blaptop \
--to=minchan-dgejt+ai2ygdnm+yrofe0a@public.gmane.org \
--cc=akpm-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org \
--cc=anton.vorontsov-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org \
--cc=b.zolnierkie-Sze3O3UU22JBDgjK7y7TUQ@public.gmane.org \
--cc=cgroups-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=glommer-bzQdu9zFT3WakBO8gow8eQ@public.gmane.org \
--cc=gthelen-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org \
--cc=john.stultz-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org \
--cc=kamezawa.hiroyu-+CUm20s59erQFUHtdCDX3A@public.gmane.org \
--cc=kernel-team-z5hGa2qSFaRBDgjK7y7TUQ@public.gmane.org \
--cc=kirill-oKw7cIdHH8eLwutG50LtGA@public.gmane.org \
--cc=kosaki.motohiro-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org \
--cc=lcapitulino-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org \
--cc=leonid.moiseichuk-xNZwKgViW5gAvxtiuMwx3w@public.gmane.org \
--cc=linaro-kernel-cunTk1MwBs8s++Sfvej+rw@public.gmane.org \
--cc=linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
--cc=linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org \
--cc=mgorman-l3A5Bk7waGM@public.gmane.org \
--cc=mhocko-AlSwsSmVLrQ@public.gmane.org \
--cc=patches-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org \
--cc=penberg-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org \
--cc=rientjes-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org \
--cc=tj-DgEjT+Ai2ygdnm+yROfE0A@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.