linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [LSF/MM TOPIC] Few things I would like to discuss
@ 2013-02-05 12:35 Michal Hocko
  2013-02-05 14:13 ` Glauber Costa
  2013-02-13  0:39 ` Kamezawa Hiroyuki
  0 siblings, 2 replies; 5+ messages in thread
From: Michal Hocko @ 2013-02-05 12:35 UTC (permalink / raw)
  To: lsf-pc; +Cc: linux-mm

Hi,
I would like to discuss the following topics:
* memcg oom should be more sensitive to locked contexts because now
  it is possible that a task is sitting in mem_cgroup_handle_oom holding
  some other lock (e.g. i_mutex or mmap_sem) up the chain which might
  block other task to terminate on OOM so we basically end up in a
  deadlock. Almost all memcg charges happen from the page fault path
  where we can retry but one class of them happen from
  add_to_page_cache_locked and that is a bit more problematic.
* memcg doesn't use PF_MEMALLOC for the targeted reclaim code paths
  which asks for stack overflows (and we have already seen those -
  e.g. from the xfs pageout paths). The primary problem to use the flag
  is that there is no dirty pages throttling and writeback kicked out
  for memcg so if we didn't writeback from the reclaim the caller could
  be blocked for ever. Memcg dirty accounting is shaping slowly so we
  should start thinking about the writeback as well.
* While we are at the memcg dirty pages accounting 
  (https://lkml.org/lkml/2012/12/25/95). It turned out that the locking
  is really nasty (https://lkml.org/lkml/2013/1/2/48). The locking
  should be reworked without incurring any penalty on the fast path.
  This sounds really challenging.
* I would really like to finally settle down on something wrt. soft
  limit reclaim. I am pretty sure Ying would like to discuss this topic
  as well so I will not go into details about it. I will post what I
  have before the conference so that we can discuss her approach and
  what was the primary disagreement the last time. I can go into more
  ditails as a follow up if people are interested of course.
* Finally I would like to collect feedback for the mm git tree.

-- 
Michal Hocko
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [LSF/MM TOPIC] Few things I would like to discuss
  2013-02-05 12:35 [LSF/MM TOPIC] Few things I would like to discuss Michal Hocko
@ 2013-02-05 14:13 ` Glauber Costa
  2013-02-13  0:39 ` Kamezawa Hiroyuki
  1 sibling, 0 replies; 5+ messages in thread
From: Glauber Costa @ 2013-02-05 14:13 UTC (permalink / raw)
  To: Michal Hocko; +Cc: lsf-pc, linux-mm

On 02/05/2013 04:35 PM, Michal Hocko wrote:
> Hi,
> I would like to discuss the following topics:
> * memcg oom should be more sensitive to locked contexts because now
>   it is possible that a task is sitting in mem_cgroup_handle_oom holding
>   some other lock (e.g. i_mutex or mmap_sem) up the chain which might
>   block other task to terminate on OOM so we basically end up in a
>   deadlock. Almost all memcg charges happen from the page fault path
>   where we can retry but one class of them happen from
>   add_to_page_cache_locked and that is a bit more problematic.

This is not the case with kmemcg on. Those charges will usually happen
from the slab/slub grow_cache mechanism, or during fork. This is not to
invalidate your reasoning - since those are usually tricky in terms of
context as well, and would benefit just as much - but to complete it.

> * I would really like to finally settle down on something wrt. soft
>   limit reclaim. I am pretty sure Ying would like to discuss this topic
>   as well so I will not go into details about it. I will post what I
>   have before the conference so that we can discuss her approach and
>   what was the primary disagreement the last time. I can go into more
>   ditails as a follow up if people are interested of course.

This interests me very much as well.



--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [LSF/MM TOPIC] Few things I would like to discuss
  2013-02-05 12:35 [LSF/MM TOPIC] Few things I would like to discuss Michal Hocko
  2013-02-05 14:13 ` Glauber Costa
@ 2013-02-13  0:39 ` Kamezawa Hiroyuki
  2013-02-13  3:53   ` [Lsf-pc] " James Bottomley
  2013-02-13  8:20   ` Glauber Costa
  1 sibling, 2 replies; 5+ messages in thread
From: Kamezawa Hiroyuki @ 2013-02-13  0:39 UTC (permalink / raw)
  To: Michal Hocko; +Cc: lsf-pc, linux-mm

(2013/02/05 21:35), Michal Hocko wrote:
> Hi,
> I would like to discuss the following topics:

I missed the deadline :(


> * memcg oom should be more sensitive to locked contexts because now
>    it is possible that a task is sitting in mem_cgroup_handle_oom holding
>    some other lock (e.g. i_mutex or mmap_sem) up the chain which might
>    block other task to terminate on OOM so we basically end up in a
>    deadlock. Almost all memcg charges happen from the page fault path
>    where we can retry but one class of them happen from
>    add_to_page_cache_locked and that is a bit more problematic.

Yes, this is a topic should be discussed.

> * memcg doesn't use PF_MEMALLOC for the targeted reclaim code paths
>    which asks for stack overflows (and we have already seen those -
>    e.g. from the xfs pageout paths). The primary problem to use the flag
>    is that there is no dirty pages throttling and writeback kicked out
>    for memcg so if we didn't writeback from the reclaim the caller could
>    be blocked for ever. Memcg dirty accounting is shaping slowly so we
>    should start thinking about the writeback as well.

Sure.

> * While we are at the memcg dirty pages accounting
>    (https://lkml.org/lkml/2012/12/25/95). It turned out that the locking
>    is really nasty (https://lkml.org/lkml/2013/1/2/48). The locking
>    should be reworked without incurring any penalty on the fast path.
>    This sounds really challenging.

I'd like to fix the locking problem.

> * I would really like to finally settle down on something wrt. soft
>    limit reclaim. I am pretty sure Ying would like to discuss this topic
>    as well so I will not go into details about it. I will post what I
>    have before the conference so that we can discuss her approach and
>    what was the primary disagreement the last time. I can go into more
>    ditails as a follow up if people are interested of course.
> * Finally I would like to collect feedback for the mm git tree.
>

Other points related to memcg is ...

+ kernel memory accounting + per-zone-per-memcg inode/dentry caching.
   Glaubler tries to account inode/dentry in kmem controller. To do that,
   I think inode and dentry should be hanldled per zone, at first. IIUC, there are
   ongoing work but not merged yet.

+ overheads by memcg
   Mel explained memcg's big overheads last year's MM summit. AFAIK, we have not
   made any progress with that. If someone have detailed data, please share again...

Thanks,
-Kame



--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [Lsf-pc] [LSF/MM TOPIC] Few things I would like to discuss
  2013-02-13  0:39 ` Kamezawa Hiroyuki
@ 2013-02-13  3:53   ` James Bottomley
  2013-02-13  8:20   ` Glauber Costa
  1 sibling, 0 replies; 5+ messages in thread
From: James Bottomley @ 2013-02-13  3:53 UTC (permalink / raw)
  To: Kamezawa Hiroyuki; +Cc: Michal Hocko, linux-mm, lsf-pc

On Wed, 2013-02-13 at 09:39 +0900, Kamezawa Hiroyuki wrote:
> (2013/02/05 21:35), Michal Hocko wrote:
> > Hi,
> > I would like to discuss the following topics:
> 
> I missed the deadline :(

I wouldn't call it a deadline, more a sort of guideline ... we only call
it a deadline because if we didn't some of the procrastinators out there
wouldn't submit anything until the day before the actual summit. We do
do the first cut of invites and topics on 6 Feb, but we keep some slots
open all the way up to the summit just in case something important comes
along.

James


--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [LSF/MM TOPIC] Few things I would like to discuss
  2013-02-13  0:39 ` Kamezawa Hiroyuki
  2013-02-13  3:53   ` [Lsf-pc] " James Bottomley
@ 2013-02-13  8:20   ` Glauber Costa
  1 sibling, 0 replies; 5+ messages in thread
From: Glauber Costa @ 2013-02-13  8:20 UTC (permalink / raw)
  To: Kamezawa Hiroyuki
  Cc: Michal Hocko, lsf-pc, linux-mm, Johannes Weiner, Dave Chinner

> 
> Other points related to memcg is ...
> 
> + kernel memory accounting + per-zone-per-memcg inode/dentry caching.
>   Glaubler tries to account inode/dentry in kmem controller. To do that,
>   I think inode and dentry should be hanldled per zone, at first. IIUC,
> there are
>   ongoing work but not merged yet.
> 

Yes, I've already managed to post an initial version - comments appreciated.

Actually, Johannes correctly pointed out to me once that memcg pressure
is never per-zone, so there is no reason for us to keep per-zone
information. The logic behind this is that if there is per-zone
pressure, it is always global pressure; memcg can only provide go/no-go
signals, and knows nothing about zones.

The only reason I am actually keeping per-zone information, is to avoid
keeping the inodes/dentries in two lists. Without per-zone, we would
have to keep it in a nodeless memcg list, and then in a per-zone (it is
actually per-node) list, and then when global pressure kicks in, follow
the zone lists. This means extra 16 bytes per objects, which adds up
quickly to a large memory overhead.


> + overheads by memcg
>   Mel explained memcg's big overheads last year's MM summit. AFAIK, we
> have not
>   made any progress with that. If someone have detailed data, please
> share again...
> 

I had a patch for that, but didn't manage to go back to it again. Jeff
Liu did some extra work to handle lazy swap enablement as well, that
would go all right with it.

I can probably find the time to resuscitate it before the summit. We
could focus on what is still missing.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2013-02-13  8:19 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-02-05 12:35 [LSF/MM TOPIC] Few things I would like to discuss Michal Hocko
2013-02-05 14:13 ` Glauber Costa
2013-02-13  0:39 ` Kamezawa Hiroyuki
2013-02-13  3:53   ` [Lsf-pc] " James Bottomley
2013-02-13  8:20   ` Glauber Costa

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).