* [PATCH] Restrict stack space reservation to rlimit [not found] ` <10125.1265451732@neuling.org> @ 2010-02-08 0:07 ` Michael Neuling 2010-02-08 0:28 ` Michael Neuling 2010-02-08 5:06 ` KOSAKI Motohiro 0 siblings, 2 replies; 23+ messages in thread From: Michael Neuling @ 2010-02-08 0:07 UTC (permalink / raw) To: Andrew Morton, Linus Torvalds Cc: linux-kernel, aeb, James Morris, miltonm, Oleg Nesterov, linuxppc-dev, Paul Mackerras, Alexander Viro, WANG Cong, linux-fsdevel, Serge Hallyn, Ingo Molnar, stable, Anton Blanchard apkm, linus: this or something like it needs to go into 2.6.33 (& 32) to fix 'ulimit -s'. Mikey [PATCH] Restrict stack space reservation to rlimit When reserving stack space for a new process, make sure we're not attempting to allocate more than rlimit allows. Also, reserve the same stack size independent of page size. This fixes a bug unmasked by fc63cf237078c86214abcb2ee9926d8ad289da9b Signed-off-by: Michael Neuling <mikey@neuling.org> Cc: Anton Blanchard <anton@samba.org> Cc: stable@kernel.org --- fs/exec.c | 9 ++++++--- 1 file changed, 6 insertions(+), 3 deletions(-) Index: clone1/fs/exec.c =================================================================== --- clone1.orig/fs/exec.c +++ clone1/fs/exec.c @@ -554,7 +554,7 @@ static int shift_arg_pages(struct vm_are return 0; } -#define EXTRA_STACK_VM_PAGES 20 /* random */ +#define EXTRA_STACK_VM_SIZE 81920UL /* randomly 20 4K pages */ /* * Finalizes the stack vm_area_struct. The flags and permissions are updated, @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm goto out_unlock; } + stack_base = min(EXTRA_STACK_VM_SIZE, + current->signal->rlim[RLIMIT_STACK].rlim_cur) - + PAGE_SIZE; #ifdef CONFIG_STACK_GROWSUP - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_end + stack_base; #else - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_start - stack_base; #endif ret = expand_stack(vma, stack_base); if (ret) ^ permalink raw reply [flat|nested] 23+ messages in thread
* [PATCH] Restrict stack space reservation to rlimit 2010-02-08 0:07 ` [PATCH] Restrict stack space reservation to rlimit Michael Neuling @ 2010-02-08 0:28 ` Michael Neuling 2010-02-08 5:06 ` KOSAKI Motohiro 1 sibling, 0 replies; 23+ messages in thread From: Michael Neuling @ 2010-02-08 0:28 UTC (permalink / raw) To: Andrew Morton, Linus Torvalds Cc: Ollie Wild, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, Anton Blanchard, stable, linux-kernel, linuxppc-dev, Serge Hallyn, WANG Cong, Paul Mackerras, benh, miltonm, aeb When reserving stack space for a new process, make sure we're not attempting to allocate more than rlimit allows. Also, reserve the same stack size independent of page size. This fixes a bug cause by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba "mm: variable length argument support" and unmasked by fc63cf237078c86214abcb2ee9926d8ad289da9b "exec: setup_arg_pages() fails to return errors". Signed-off-by: Michael Neuling <mikey@neuling.org> Cc: Anton Blanchard <anton@samba.org> Cc: stable@kernel.org --- Update commit message to include patch name and SHA1 of related patches. fs/exec.c | 9 ++++++--- 1 file changed, 6 insertions(+), 3 deletions(-) Index: clone1/fs/exec.c =================================================================== --- clone1.orig/fs/exec.c +++ clone1/fs/exec.c @@ -554,7 +554,7 @@ static int shift_arg_pages(struct vm_are return 0; } -#define EXTRA_STACK_VM_PAGES 20 /* random */ +#define EXTRA_STACK_VM_SIZE 81920UL /* randomly 20 4K pages */ /* * Finalizes the stack vm_area_struct. The flags and permissions are updated, @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm goto out_unlock; } + stack_base = min(EXTRA_STACK_VM_SIZE, + current->signal->rlim[RLIMIT_STACK].rlim_cur) - + PAGE_SIZE; #ifdef CONFIG_STACK_GROWSUP - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_end + stack_base; #else - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_start - stack_base; #endif ret = expand_stack(vma, stack_base); if (ret) ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 0:07 ` [PATCH] Restrict stack space reservation to rlimit Michael Neuling 2010-02-08 0:28 ` Michael Neuling @ 2010-02-08 5:06 ` KOSAKI Motohiro 2010-02-08 5:11 ` Anton Blanchard 1 sibling, 1 reply; 23+ messages in thread From: KOSAKI Motohiro @ 2010-02-08 5:06 UTC (permalink / raw) To: Michael Neuling Cc: kosaki.motohiro, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, Anton Blanchard, stable, linux-kernel, linuxppc-dev, Serge Hallyn, WANG Cong, Paul Mackerras, benh, miltonm, aeb Hi > apkm, linus: this or something like it needs to go into 2.6.33 (& 32) to > fix 'ulimit -s'. "fix ulimit -s" is too cool explanation ;-) we are not ESPer. please consider to provide what bug is exist. > Mikey > > [PATCH] Restrict stack space reservation to rlimit > > When reserving stack space for a new process, make sure we're not > attempting to allocate more than rlimit allows. > > Also, reserve the same stack size independent of page size. Why do we need page size independent stack size? It seems to have compatibility breaking risk. > > This fixes a bug unmasked by fc63cf237078c86214abcb2ee9926d8ad289da9b > > Signed-off-by: Michael Neuling <mikey@neuling.org> > Cc: Anton Blanchard <anton@samba.org> > Cc: stable@kernel.org > --- > fs/exec.c | 9 ++++++--- > 1 file changed, 6 insertions(+), 3 deletions(-) > > Index: clone1/fs/exec.c > =================================================================== > --- clone1.orig/fs/exec.c > +++ clone1/fs/exec.c > @@ -554,7 +554,7 @@ static int shift_arg_pages(struct vm_are > return 0; > } > > -#define EXTRA_STACK_VM_PAGES 20 /* random */ > +#define EXTRA_STACK_VM_SIZE 81920UL /* randomly 20 4K pages */ > > /* > * Finalizes the stack vm_area_struct. The flags and permissions are updated, > @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm > goto out_unlock; > } > > + stack_base = min(EXTRA_STACK_VM_SIZE, > + current->signal->rlim[RLIMIT_STACK].rlim_cur) - > + PAGE_SIZE; > #ifdef CONFIG_STACK_GROWSUP > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_base = vma->vm_end + stack_base; > #else > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_base = vma->vm_start - stack_base; > #endif > ret = expand_stack(vma, stack_base); > if (ret) > -- > To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 5:06 ` KOSAKI Motohiro @ 2010-02-08 5:11 ` Anton Blanchard 2010-02-08 5:22 ` KOSAKI Motohiro 0 siblings, 1 reply; 23+ messages in thread From: Anton Blanchard @ 2010-02-08 5:11 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Michael Neuling, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, WANG Cong, Paul Mackerras, benh, miltonm, aeb Hi, > Why do we need page size independent stack size? It seems to have > compatibility breaking risk. I don't think so. The current behaviour is clearly wrong, we dont need a 16x larger stack just because you went from a 4kB to a 64kB base page size. The user application stack usage is the same in both cases. Anton ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 5:11 ` Anton Blanchard @ 2010-02-08 5:22 ` KOSAKI Motohiro 2010-02-08 5:31 ` Anton Blanchard 2010-02-08 5:37 ` Michael Neuling 0 siblings, 2 replies; 23+ messages in thread From: KOSAKI Motohiro @ 2010-02-08 5:22 UTC (permalink / raw) To: Anton Blanchard Cc: kosaki.motohiro, Michael Neuling, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, WANG Cong, Paul Mackerras, benh, miltonm, aeb > > Hi, > > > Why do we need page size independent stack size? It seems to have > > compatibility breaking risk. > > I don't think so. The current behaviour is clearly wrong, we dont need a > 16x larger stack just because you went from a 4kB to a 64kB base page > size. The user application stack usage is the same in both cases. I didn't discuss which behavior is better. Michael said he want to apply his patch to 2.6.32 & 2.6.33. stable tree never accept the breaking compatibility patch. Your answer doesn't explain why can't we wait it until next merge window. btw, personally, I like page size indepent stack size. but I'm not sure why making stack size independency is related to bug fix. ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 5:22 ` KOSAKI Motohiro @ 2010-02-08 5:31 ` Anton Blanchard 2010-02-08 6:11 ` KOSAKI Motohiro 2010-02-08 5:37 ` Michael Neuling 1 sibling, 1 reply; 23+ messages in thread From: Anton Blanchard @ 2010-02-08 5:31 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Michael Neuling, stable, aeb, James Morris, miltonm, Oleg Nesterov, linuxppc-dev, Paul Mackerras, Alexander Viro, WANG Cong, linux-fsdevel, Serge Hallyn, Andrew Morton, Linus Torvalds, Ingo Molnar, linux-kernel Hi, > I didn't discuss which behavior is better. Michael said he want to apply > his patch to 2.6.32 & 2.6.33. stable tree never accept the breaking > compatibility patch. > > Your answer doesn't explain why can't we wait it until next merge window. > > > btw, personally, I like page size indepent stack size. but I'm not sure > why making stack size independency is related to bug fix. OK sorry, I misunderstood your initial mail. I agree fixing the bit that regressed in 2.6.32 is the most important thing. The difference in page size is clearly wrong but since it isn't a regression we could probably live with it until 2.6.34 Anton ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 5:31 ` Anton Blanchard @ 2010-02-08 6:11 ` KOSAKI Motohiro 0 siblings, 0 replies; 23+ messages in thread From: KOSAKI Motohiro @ 2010-02-08 6:11 UTC (permalink / raw) To: Anton Blanchard Cc: Michael Neuling, stable, aeb, Oleg Nesterov, miltonm, James Morris, linuxppc-dev, Paul Mackerras, Alexander Viro, kosaki.motohiro, WANG Cong, linux-fsdevel, Serge Hallyn, Andrew Morton, Linus Torvalds, Ingo Molnar, linux-kernel > > Hi, > > > I didn't discuss which behavior is better. Michael said he want to apply > > his patch to 2.6.32 & 2.6.33. stable tree never accept the breaking > > compatibility patch. > > > > Your answer doesn't explain why can't we wait it until next merge window. > > > > > > btw, personally, I like page size indepent stack size. but I'm not sure > > why making stack size independency is related to bug fix. > > OK sorry, I misunderstood your initial mail. I agree fixing the bit that > regressed in 2.6.32 is the most important thing. The difference in page size is > clearly wrong but since it isn't a regression we could probably live with it > until 2.6.34 thanks! ^ permalink raw reply [flat|nested] 23+ messages in thread
* [PATCH] Restrict stack space reservation to rlimit 2010-02-08 5:22 ` KOSAKI Motohiro 2010-02-08 5:31 ` Anton Blanchard @ 2010-02-08 5:37 ` Michael Neuling 2010-02-08 6:05 ` KOSAKI Motohiro 1 sibling, 1 reply; 23+ messages in thread From: Michael Neuling @ 2010-02-08 5:37 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, WANG Cong, Paul Mackerras, benh, miltonm, aeb > > > > Hi, > > > > > Why do we need page size independent stack size? It seems to have > > > compatibility breaking risk. > > > > I don't think so. The current behaviour is clearly wrong, we dont need a > > 16x larger stack just because you went from a 4kB to a 64kB base page > > size. The user application stack usage is the same in both cases. > > I didn't discuss which behavior is better. Michael said he want to apply > his patch to 2.6.32 & 2.6.33. stable tree never accept the breaking > compatibility patch. > > Your answer doesn't explain why can't we wait it until next merge window. > > btw, personally, I like page size indepent stack size. but I'm not sure > why making stack size independency is related to bug fix. I tend to agree. Below is just the bug fix to limit the reservation size based rlimit. We still reserve different stack sizes based on the page size as before (unless we hit rlimit of course). Mikey Restrict stack space reservation to rlimit When reserving stack space for a new process, make sure we're not attempting to allocate more than rlimit allows. This fixes a bug cause by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba "mm: variable length argument support" and unmasked by fc63cf237078c86214abcb2ee9926d8ad289da9b "exec: setup_arg_pages() fails to return errors". Signed-off-by: Michael Neuling <mikey@neuling.org> Cc: Anton Blanchard <anton@samba.org> Cc: stable@kernel.org --- fs/exec.c | 7 +++++-- 1 file changed, 5 insertions(+), 2 deletions(-) Index: linux-2.6-ozlabs/fs/exec.c =================================================================== --- linux-2.6-ozlabs.orig/fs/exec.c +++ linux-2.6-ozlabs/fs/exec.c @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm goto out_unlock; } + stack_base = min(EXTRA_STACK_VM_PAGES * PAGE_SIZE, + current->signal->rlim[RLIMIT_STACK].rlim_cur - + PAGE_SIZE); #ifdef CONFIG_STACK_GROWSUP - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_end + stack_base; #else - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_start - stack_base; #endif ret = expand_stack(vma, stack_base); if (ret) ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 5:37 ` Michael Neuling @ 2010-02-08 6:05 ` KOSAKI Motohiro 2010-02-08 7:07 ` Américo Wang 2010-02-08 10:45 ` [PATCH] Restrict stack space reservation " Michael Neuling 0 siblings, 2 replies; 23+ messages in thread From: KOSAKI Motohiro @ 2010-02-08 6:05 UTC (permalink / raw) To: Michael Neuling Cc: kosaki.motohiro, Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, WANG Cong, Paul Mackerras, benh, miltonm, aeb > > > > > > Hi, > > > > > > > Why do we need page size independent stack size? It seems to have > > > > compatibility breaking risk. > > > > > > I don't think so. The current behaviour is clearly wrong, we dont need a > > > 16x larger stack just because you went from a 4kB to a 64kB base page > > > size. The user application stack usage is the same in both cases. > > > > I didn't discuss which behavior is better. Michael said he want to apply > > his patch to 2.6.32 & 2.6.33. stable tree never accept the breaking > > compatibility patch. > > > > Your answer doesn't explain why can't we wait it until next merge window. > > > > btw, personally, I like page size indepent stack size. but I'm not sure > > why making stack size independency is related to bug fix. > > I tend to agree. > > Below is just the bug fix to limit the reservation size based rlimit. > We still reserve different stack sizes based on the page size as > before (unless we hit rlimit of course). Thanks. I agree your patch in almost part. but I have very few requests. > Mikey > > Restrict stack space reservation to rlimit > > When reserving stack space for a new process, make sure we're not > attempting to allocate more than rlimit allows. > > This fixes a bug cause by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba > "mm: variable length argument support" and unmasked by > fc63cf237078c86214abcb2ee9926d8ad289da9b > "exec: setup_arg_pages() fails to return errors". Your initial mail have following problem use-case. please append it into the patch description. On recent ppc64 kernels, limiting the stack (using 'ulimit -s blah') is now more restrictive than it was before. On 2.6.31 with 4k pages I could run 'ulimit -s 16; /usr/bin/test' without a problem. Now with mainline, even 'ulimit -s 64; /usr/bin/test' gets killed. > > Signed-off-by: Michael Neuling <mikey@neuling.org> > Cc: Anton Blanchard <anton@samba.org> > Cc: stable@kernel.org > --- > fs/exec.c | 7 +++++-- > 1 file changed, 5 insertions(+), 2 deletions(-) > > Index: linux-2.6-ozlabs/fs/exec.c > =================================================================== > --- linux-2.6-ozlabs.orig/fs/exec.c > +++ linux-2.6-ozlabs/fs/exec.c > @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm > goto out_unlock; > } > > + stack_base = min(EXTRA_STACK_VM_PAGES * PAGE_SIZE, > + current->signal->rlim[RLIMIT_STACK].rlim_cur - > + PAGE_SIZE); This line is a bit unclear why "- PAGE_SIZE" is necessary. personally, I like following likes explicit comments. stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; stack_lim = ACCESS_ONCE(rlim[RLIMIT_STACK].rlim_cur); /* Initial stack must not cause stack overflow. */ if (stack_expand + PAGE_SIZE > stack_lim) stack_expand = stack_lim - PAGE_SIZE; note: accessing rlim_cur require ACCESS_ONCE. Thought? > #ifdef CONFIG_STACK_GROWSUP > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_base = vma->vm_end + stack_base; > #else > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_base = vma->vm_start - stack_base; > #endif > ret = expand_stack(vma, stack_base); > if (ret) > ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 6:05 ` KOSAKI Motohiro @ 2010-02-08 7:07 ` Américo Wang 2010-02-08 7:11 ` KOSAKI Motohiro 2010-02-08 10:45 ` [PATCH] Restrict stack space reservation " Michael Neuling 1 sibling, 1 reply; 23+ messages in thread From: Américo Wang @ 2010-02-08 7:07 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Michael Neuling, Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb On Mon, Feb 8, 2010 at 2:05 PM, KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> wrote: >> --- linux-2.6-ozlabs.orig/fs/exec.c >> +++ linux-2.6-ozlabs/fs/exec.c >> @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm >> goto out_unlock; >> } >> >> + stack_base = min(EXTRA_STACK_VM_PAGES * PAGE_SIZE, >> + current->signal->rlim[RLIMIT_STACK].rlim_cur - >> + PAGE_SIZE); > > This line is a bit unclear why "- PAGE_SIZE" is necessary. > personally, I like following likes explicit comments. > > stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; > stack_lim = ACCESS_ONCE(rlim[RLIMIT_STACK].rlim_cur); > > /* Initial stack must not cause stack overflow. */ > if (stack_expand + PAGE_SIZE > stack_lim) > stack_expand = stack_lim - PAGE_SIZE; > > note: accessing rlim_cur require ACCESS_ONCE. > > > Thought? It's better to use the helper function: rlimit(). -- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 7:07 ` Américo Wang @ 2010-02-08 7:11 ` KOSAKI Motohiro 2010-02-09 6:11 ` [PATCH] Restrict initial stack space expansion " Michael Neuling 0 siblings, 1 reply; 23+ messages in thread From: KOSAKI Motohiro @ 2010-02-08 7:11 UTC (permalink / raw) To: Americo Wang Cc: kosaki.motohiro, Michael Neuling, Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb > On Mon, Feb 8, 2010 at 2:05 PM, KOSAKI Motohiro > <kosaki.motohiro@jp.fujitsu.com> wrote: > >> --- linux-2.6-ozlabs.orig/fs/exec.c > >> +++ linux-2.6-ozlabs/fs/exec.c > >> @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm > >> goto out_unlock; > >> } > >> > >> + stack_base = min(EXTRA_STACK_VM_PAGES * PAGE_SIZE, > >> + current->signal->rlim[RLIMIT_STACK].rlim_cur - > >> + PAGE_SIZE); > > > > This line is a bit unclear why "- PAGE_SIZE" is necessary. > > personally, I like following likes explicit comments. > > > > stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > stack_lim = ACCESS_ONCE(rlim[RLIMIT_STACK].rlim_cur); > > > > /* Initial stack must not cause stack overflow. */ > > if (stack_expand + PAGE_SIZE > stack_lim) > > stack_expand = stack_lim - PAGE_SIZE; > > > > note: accessing rlim_cur require ACCESS_ONCE. > > > > > > Thought? > > It's better to use the helper function: rlimit(). AFAIK, stable tree doesn't have rlimit(). but yes, making two patch (for mainline and for stable) is good opinion. ^ permalink raw reply [flat|nested] 23+ messages in thread
* [PATCH] Restrict initial stack space expansion to rlimit 2010-02-08 7:11 ` KOSAKI Motohiro @ 2010-02-09 6:11 ` Michael Neuling 2010-02-09 6:46 ` KOSAKI Motohiro 0 siblings, 1 reply; 23+ messages in thread From: Michael Neuling @ 2010-02-09 6:11 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Americo Wang, Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb When reserving stack space for a new process, make sure we're not attempting to expand the stack by more than rlimit allows. This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba "mm: variable length argument support" and unmasked by fc63cf237078c86214abcb2ee9926d8ad289da9b "exec: setup_arg_pages() fails to return errors". This bug means when limiting the stack to less the 20*PAGE_SIZE (eg. 80K on 4K pages or 'ulimit -s 79') all processes will be killed before they start. This is particularly bad with 64K pages, where a ulimit below 1280K will kill every process. Signed-off-by: Michael Neuling <mikey@neuling.org> Cc: stable@kernel.org --- Attempts to answer comments from Kosaki Motohiro. Tested on PPC only, hence !CONFIG_STACK_GROWSUP. Someone should probably ACK for an arch with CONFIG_STACK_GROWSUP. As noted, stable needs the same patch, but 2.6.32 doesn't have the rlimit() helper. fs/exec.c | 21 ++++++++++++++++++--- 1 file changed, 18 insertions(+), 3 deletions(-) Index: linux-2.6-ozlabs/fs/exec.c =================================================================== --- linux-2.6-ozlabs.orig/fs/exec.c +++ linux-2.6-ozlabs/fs/exec.c @@ -555,6 +555,7 @@ static int shift_arg_pages(struct vm_are } #define EXTRA_STACK_VM_PAGES 20 /* random */ +#define ALIGN_DOWN(addr,size) ((addr)&(~((size)-1))) /* * Finalizes the stack vm_area_struct. The flags and permissions are updated, @@ -570,7 +571,7 @@ int setup_arg_pages(struct linux_binprm struct vm_area_struct *vma = bprm->vma; struct vm_area_struct *prev = NULL; unsigned long vm_flags; - unsigned long stack_base; + unsigned long stack_base, stack_expand, stack_expand_lim, stack_size; #ifdef CONFIG_STACK_GROWSUP /* Limit stack size to 1GB */ @@ -627,10 +628,24 @@ int setup_arg_pages(struct linux_binprm goto out_unlock; } + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_size = vma->vm_end - vma->vm_start; + if (rlimit(RLIMIT_STACK) < stack_size) + stack_expand_lim = 0; /* don't shrick the stack */ + else + /* + * Align this down to a page boundary as expand_stack + * will align it up. + */ + stack_expand_lim = ALIGN_DOWN(rlimit(RLIMIT_STACK) - stack_size, + PAGE_SIZE); + /* Initial stack must not cause stack overflow. */ + if (stack_expand > stack_expand_lim) + stack_expand = stack_expand_lim; #ifdef CONFIG_STACK_GROWSUP - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_end + stack_expand; #else - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_base = vma->vm_start - stack_expand; #endif ret = expand_stack(vma, stack_base); if (ret) ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-09 6:11 ` [PATCH] Restrict initial stack space expansion " Michael Neuling @ 2010-02-09 6:46 ` KOSAKI Motohiro 2010-02-09 8:59 ` Michael Neuling 0 siblings, 1 reply; 23+ messages in thread From: KOSAKI Motohiro @ 2010-02-09 6:46 UTC (permalink / raw) To: Michael Neuling Cc: kosaki.motohiro, Americo Wang, Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb > When reserving stack space for a new process, make sure we're not > attempting to expand the stack by more than rlimit allows. > > This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba "mm: > variable length argument support" and unmasked by > fc63cf237078c86214abcb2ee9926d8ad289da9b "exec: setup_arg_pages() fails > to return errors". This bug means when limiting the stack to less the > 20*PAGE_SIZE (eg. 80K on 4K pages or 'ulimit -s 79') all processes will > be killed before they start. This is particularly bad with 64K pages, > where a ulimit below 1280K will kill every process. > > Signed-off-by: Michael Neuling <mikey@neuling.org> > Cc: stable@kernel.org > --- > Attempts to answer comments from Kosaki Motohiro. > > Tested on PPC only, hence !CONFIG_STACK_GROWSUP. Someone should > probably ACK for an arch with CONFIG_STACK_GROWSUP. > > As noted, stable needs the same patch, but 2.6.32 doesn't have the > rlimit() helper. > > fs/exec.c | 21 ++++++++++++++++++--- > 1 file changed, 18 insertions(+), 3 deletions(-) > > Index: linux-2.6-ozlabs/fs/exec.c > =================================================================== > --- linux-2.6-ozlabs.orig/fs/exec.c > +++ linux-2.6-ozlabs/fs/exec.c > @@ -555,6 +555,7 @@ static int shift_arg_pages(struct vm_are > } > > #define EXTRA_STACK_VM_PAGES 20 /* random */ > +#define ALIGN_DOWN(addr,size) ((addr)&(~((size)-1))) > > /* > * Finalizes the stack vm_area_struct. The flags and permissions are updated, > @@ -570,7 +571,7 @@ int setup_arg_pages(struct linux_binprm > struct vm_area_struct *vma = bprm->vma; > struct vm_area_struct *prev = NULL; > unsigned long vm_flags; > - unsigned long stack_base; > + unsigned long stack_base, stack_expand, stack_expand_lim, stack_size; > > #ifdef CONFIG_STACK_GROWSUP > /* Limit stack size to 1GB */ > @@ -627,10 +628,24 @@ int setup_arg_pages(struct linux_binprm > goto out_unlock; > } > > + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_size = vma->vm_end - vma->vm_start; > + if (rlimit(RLIMIT_STACK) < stack_size) > + stack_expand_lim = 0; /* don't shrick the stack */ > + else > + /* > + * Align this down to a page boundary as expand_stack > + * will align it up. > + */ > + stack_expand_lim = ALIGN_DOWN(rlimit(RLIMIT_STACK) - stack_size, > + PAGE_SIZE); > + /* Initial stack must not cause stack overflow. */ > + if (stack_expand > stack_expand_lim) > + stack_expand = stack_expand_lim; > #ifdef CONFIG_STACK_GROWSUP > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_base = vma->vm_end + stack_expand; > #else > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_base = vma->vm_start - stack_expand; > #endif > ret = expand_stack(vma, stack_base); > if (ret) Umm.. It looks correct. but the nested complex if statement seems a bit ugly. Instead, How about following? note: it's untested. =============== From: Michael Neuling <mikey@neuling.org> Subject: Restrict initial stack space expansion to rlimit When reserving stack space for a new process, make sure we're not attempting to expand the stack by more than rlimit allows. This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba "mm: variable length argument support" and unmasked by fc63cf237078c86214abcb2ee9926d8ad289da9b "exec: setup_arg_pages() fails to return errors". This bug means when limiting the stack to less the 20*PAGE_SIZE (eg. 80K on 4K pages or 'ulimit -s 79') all processes will be killed before they start. This is particularly bad with 64K pages, where a ulimit below 1280K will kill every process. [kosaki.motohiro@jp.fujitsu.com: cleanups] Signed-off-by: Michael Neuling <mikey@neuling.org> Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> Cc: stable@kernel.org --- Attempts to answer comments from Kosaki Motohiro. Tested on PPC only, hence !CONFIG_STACK_GROWSUP. Someone should probably ACK for an arch with CONFIG_STACK_GROWSUP. As noted, stable needs the same patch, but 2.6.32 doesn't have the rlimit() helper. diff --git a/fs/exec.c b/fs/exec.c index 6f7fb0c..325bad4 100644 --- a/fs/exec.c +++ b/fs/exec.c @@ -573,6 +573,9 @@ int setup_arg_pages(struct linux_binprm *bprm, struct vm_area_struct *prev = NULL; unsigned long vm_flags; unsigned long stack_base; + unsigned long stack_size; + unsigned long stack_expand; + unsigned long rlim_stack; #ifdef CONFIG_STACK_GROWSUP /* Limit stack size to 1GB */ @@ -629,10 +632,27 @@ int setup_arg_pages(struct linux_binprm *bprm, goto out_unlock; } + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_size = vma->vm_end - vma->vm_start; + /* + * Align this down to a page boundary as expand_stack + * will align it up. + */ + rlim_stack = rlimit(RLIMIT_STACK) & PAGE_MASK; + if (rlim_stack < stack_size) + rlim_stack = stack_size; #ifdef CONFIG_STACK_GROWSUP - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; + if (stack_size + stack_expand > rlim_stack) { + stack_base = vma->vm_start + rlim_stack; + } else { + stack_base = vma->vm_end + stack_expand; + } #else - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; + if (stack_size + stack_expand > rlim_stack) { + stack_base = vma->vm_end - rlim_stack; + } else { + stack_base = vma->vm_start - stack_expand; + } #endif ret = expand_stack(vma, stack_base); if (ret) ^ permalink raw reply related [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-09 6:46 ` KOSAKI Motohiro @ 2010-02-09 8:59 ` Michael Neuling 2010-02-09 21:25 ` Andrew Morton 0 siblings, 1 reply; 23+ messages in thread From: Michael Neuling @ 2010-02-09 8:59 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Americo Wang, Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb In message <20100209154141.03F0.A69D9226@jp.fujitsu.com> you wrote: > > When reserving stack space for a new process, make sure we're not > > attempting to expand the stack by more than rlimit allows. > > > > This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba "mm: > > variable length argument support" and unmasked by > > fc63cf237078c86214abcb2ee9926d8ad289da9b "exec: setup_arg_pages() fails > > to return errors". This bug means when limiting the stack to less the > > 20*PAGE_SIZE (eg. 80K on 4K pages or 'ulimit -s 79') all processes will > > be killed before they start. This is particularly bad with 64K pages, > > where a ulimit below 1280K will kill every process. > > > > Signed-off-by: Michael Neuling <mikey@neuling.org> > > Cc: stable@kernel.org > > --- > > Attempts to answer comments from Kosaki Motohiro. > > > > Tested on PPC only, hence !CONFIG_STACK_GROWSUP. Someone should > > probably ACK for an arch with CONFIG_STACK_GROWSUP. > > > > As noted, stable needs the same patch, but 2.6.32 doesn't have the > > rlimit() helper. > > > > fs/exec.c | 21 ++++++++++++++++++--- > > 1 file changed, 18 insertions(+), 3 deletions(-) > > > > Index: linux-2.6-ozlabs/fs/exec.c > > =================================================================== > > --- linux-2.6-ozlabs.orig/fs/exec.c > > +++ linux-2.6-ozlabs/fs/exec.c > > @@ -555,6 +555,7 @@ static int shift_arg_pages(struct vm_are > > } > > > > #define EXTRA_STACK_VM_PAGES 20 /* random */ > > +#define ALIGN_DOWN(addr,size) ((addr)&(~((size)-1))) > > > > /* > > * Finalizes the stack vm_area_struct. The flags and permissions are updat ed, > > @@ -570,7 +571,7 @@ int setup_arg_pages(struct linux_binprm > > struct vm_area_struct *vma = bprm->vma; > > struct vm_area_struct *prev = NULL; > > unsigned long vm_flags; > > - unsigned long stack_base; > > + unsigned long stack_base, stack_expand, stack_expand_lim, stack_size; > > > > #ifdef CONFIG_STACK_GROWSUP > > /* Limit stack size to 1GB */ > > @@ -627,10 +628,24 @@ int setup_arg_pages(struct linux_binprm > > goto out_unlock; > > } > > > > + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > + stack_size = vma->vm_end - vma->vm_start; > > + if (rlimit(RLIMIT_STACK) < stack_size) > > + stack_expand_lim = 0; /* don't shrick the stack */ > > + else > > + /* > > + * Align this down to a page boundary as expand_stack > > + * will align it up. > > + */ > > + stack_expand_lim = ALIGN_DOWN(rlimit(RLIMIT_STACK) - stack_size , > > + PAGE_SIZE); > > + /* Initial stack must not cause stack overflow. */ > > + if (stack_expand > stack_expand_lim) > > + stack_expand = stack_expand_lim; > > #ifdef CONFIG_STACK_GROWSUP > > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > + stack_base = vma->vm_end + stack_expand; > > #else > > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > + stack_base = vma->vm_start - stack_expand; > > #endif > > ret = expand_stack(vma, stack_base); > > if (ret) > > Umm.. It looks correct. but the nested complex if statement seems a bit ugly. > Instead, How about following? I don't like the duplicated code in the #ifdef/else but I can live with it. > note: it's untested. Works for me on ppc64 with 4k and 64k pages. Thanks! I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it as well. Mikey > > > > =============== > From: Michael Neuling <mikey@neuling.org> > Subject: Restrict initial stack space expansion to rlimit > > When reserving stack space for a new process, make sure we're not > attempting to expand the stack by more than rlimit allows. > > This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba "mm: > variable length argument support" and unmasked by > fc63cf237078c86214abcb2ee9926d8ad289da9b "exec: setup_arg_pages() fails > to return errors". This bug means when limiting the stack to less the > 20*PAGE_SIZE (eg. 80K on 4K pages or 'ulimit -s 79') all processes will > be killed before they start. This is particularly bad with 64K pages, > where a ulimit below 1280K will kill every process. > > [kosaki.motohiro@jp.fujitsu.com: cleanups] > Signed-off-by: Michael Neuling <mikey@neuling.org> > Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> > Cc: stable@kernel.org > --- > Attempts to answer comments from Kosaki Motohiro. > > Tested on PPC only, hence !CONFIG_STACK_GROWSUP. Someone should > probably ACK for an arch with CONFIG_STACK_GROWSUP. > > As noted, stable needs the same patch, but 2.6.32 doesn't have the > rlimit() helper. > > diff --git a/fs/exec.c b/fs/exec.c > index 6f7fb0c..325bad4 100644 > --- a/fs/exec.c > +++ b/fs/exec.c > @@ -573,6 +573,9 @@ int setup_arg_pages(struct linux_binprm *bprm, > struct vm_area_struct *prev = NULL; > unsigned long vm_flags; > unsigned long stack_base; > + unsigned long stack_size; > + unsigned long stack_expand; > + unsigned long rlim_stack; > > #ifdef CONFIG_STACK_GROWSUP > /* Limit stack size to 1GB */ > @@ -629,10 +632,27 @@ int setup_arg_pages(struct linux_binprm *bprm, > goto out_unlock; > } > > + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_size = vma->vm_end - vma->vm_start; > + /* > + * Align this down to a page boundary as expand_stack > + * will align it up. > + */ > + rlim_stack = rlimit(RLIMIT_STACK) & PAGE_MASK; > + if (rlim_stack < stack_size) > + rlim_stack = stack_size; > #ifdef CONFIG_STACK_GROWSUP > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + if (stack_size + stack_expand > rlim_stack) { > + stack_base = vma->vm_start + rlim_stack; > + } else { > + stack_base = vma->vm_end + stack_expand; > + } > #else > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + if (stack_size + stack_expand > rlim_stack) { > + stack_base = vma->vm_end - rlim_stack; > + } else { > + stack_base = vma->vm_start - stack_expand; > + } > #endif > ret = expand_stack(vma, stack_base); > if (ret) > > > ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-09 8:59 ` Michael Neuling @ 2010-02-09 21:25 ` Andrew Morton 2010-02-09 21:51 ` Michael Neuling 0 siblings, 1 reply; 23+ messages in thread From: Andrew Morton @ 2010-02-09 21:25 UTC (permalink / raw) To: Michael Neuling Cc: linux-parisc, linux-kernel, aeb, Oleg Nesterov, miltonm, James Morris, linuxppc-dev, Paul Mackerras, Anton Blanchard, KOSAKI Motohiro, Serge Hallyn, linux-fsdevel, Americo Wang, Ingo Molnar, Linus Torvalds, stable, Alexander Viro On Tue, 09 Feb 2010 19:59:27 +1100 Michael Neuling <mikey@neuling.org> wrote: > > > + /* Initial stack must not cause stack overflow. */ > > > + if (stack_expand > stack_expand_lim) > > > + stack_expand = stack_expand_lim; > > > #ifdef CONFIG_STACK_GROWSUP > > > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > > + stack_base = vma->vm_end + stack_expand; > > > #else > > > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > > + stack_base = vma->vm_start - stack_expand; > > > #endif > > > ret = expand_stack(vma, stack_base); > > > if (ret) > > > > Umm.. It looks correct. but the nested complex if statement seems a bit ugly. > > Instead, How about following? > > I don't like the duplicated code in the #ifdef/else but I can live with it. cleanup the cleanup: --- a/fs/exec.c~fs-execc-restrict-initial-stack-space-expansion-to-rlimit-cleanup-cleanup +++ a/fs/exec.c @@ -637,20 +637,17 @@ int setup_arg_pages(struct linux_binprm * will align it up. */ rlim_stack = rlimit(RLIMIT_STACK) & PAGE_MASK; - if (rlim_stack < stack_size) - rlim_stack = stack_size; + rlim_stack = min(rlim_stack, stack_size); #ifdef CONFIG_STACK_GROWSUP - if (stack_size + stack_expand > rlim_stack) { + if (stack_size + stack_expand > rlim_stack) stack_base = vma->vm_start + rlim_stack; - } else { + else stack_base = vma->vm_end + stack_expand; - } #else - if (stack_size + stack_expand > rlim_stack) { + if (stack_size + stack_expand > rlim_stack) stack_base = vma->vm_end - rlim_stack; - } else { + else stack_base = vma->vm_start - stack_expand; - } #endif ret = expand_stack(vma, stack_base); if (ret) _ > > note: it's untested. > > Works for me on ppc64 with 4k and 64k pages. Thanks! > > I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it > as well. There's only one CONFIG_GROWSUP arch - parisc. Guys, here's the rolled-up patch. Could someone please test it on parisc? err, I'm not sure what one needs to do to test it, actually. Presumably it involves setting an unusual `ulimit -s'. Can someone please suggest a test plan? From: Michael Neuling <mikey@neuling.org> When reserving stack space for a new process, make sure we're not attempting to expand the stack by more than rlimit allows. This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba ("mm: variable length argument support") and unmasked by fc63cf237078c86214abcb2ee9926d8ad289da9b ("exec: setup_arg_pages() fails to return errors"). This bug means that when limiting the stack to less the 20*PAGE_SIZE (eg. 80K on 4K pages or 'ulimit -s 79') all processes will be killed before they start. This is particularly bad with 64K pages, where a ulimit below 1280K will kill every process. Signed-off-by: Michael Neuling <mikey@neuling.org> Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> Cc: Americo Wang <xiyou.wangcong@gmail.com> Cc: Anton Blanchard <anton@samba.org> Cc: Oleg Nesterov <oleg@redhat.com> Cc: James Morris <jmorris@namei.org> Cc: Ingo Molnar <mingo@elte.hu> Cc: Serge Hallyn <serue@us.ibm.com> Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org> Cc: <stable@kernel.org> fs/exec.c | 21 +++++++++++++++++++-- 1 file changed, 19 insertions(+), 2 deletions(-) diff -puN fs/exec.c~fs-execc-restrict-initial-stack-space-expansion-to-rlimit fs/exec.c --- a/fs/exec.c~fs-execc-restrict-initial-stack-space-expansion-to-rlimit +++ a/fs/exec.c @@ -571,6 +571,9 @@ int setup_arg_pages(struct linux_binprm struct vm_area_struct *prev = NULL; unsigned long vm_flags; unsigned long stack_base; + unsigned long stack_size; + unsigned long stack_expand; + unsigned long rlim_stack; #ifdef CONFIG_STACK_GROWSUP /* Limit stack size to 1GB */ @@ -627,10 +630,24 @@ int setup_arg_pages(struct linux_binprm goto out_unlock; } + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; + stack_size = vma->vm_end - vma->vm_start; + /* + * Align this down to a page boundary as expand_stack + * will align it up. + */ + rlim_stack = rlimit(RLIMIT_STACK) & PAGE_MASK; + rlim_stack = min(rlim_stack, stack_size); #ifdef CONFIG_STACK_GROWSUP - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; + if (stack_size + stack_expand > rlim_stack) + stack_base = vma->vm_start + rlim_stack; + else + stack_base = vma->vm_end + stack_expand; #else - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; + if (stack_size + stack_expand > rlim_stack) + stack_base = vma->vm_end - rlim_stack; + else + stack_base = vma->vm_start - stack_expand; #endif ret = expand_stack(vma, stack_base); if (ret) _ ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-09 21:25 ` Andrew Morton @ 2010-02-09 21:51 ` Michael Neuling 2010-02-09 22:27 ` Helge Deller 0 siblings, 1 reply; 23+ messages in thread From: Michael Neuling @ 2010-02-09 21:51 UTC (permalink / raw) To: Andrew Morton Cc: KOSAKI Motohiro, Americo Wang, Anton Blanchard, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb, linux-parisc > > > note: it's untested. > > > > Works for me on ppc64 with 4k and 64k pages. Thanks! > > > > I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it > > as well. > > There's only one CONFIG_GROWSUP arch - parisc. > > Guys, here's the rolled-up patch. FYI the rolled up patch still works fine on PPC64. Thanks. > Could someone please test it on parisc? > > err, I'm not sure what one needs to do to test it, actually. > Presumably it involves setting an unusual `ulimit -s'. Can someone > please suggest a test plan? How about doing: 'ulimit -s 15; ls' before and after the patch is applied. Before it's applied, 'ls' should be killed. After the patch is applied, 'ls' should no longer be killed. I'm suggesting a stack limit of 15KB since it's small enough to trigger 20*PAGE_SIZE. Also 15KB not a multiple of PAGE_SIZE, which is a trickier case to handle correctly with this code. 4K pages on parisc should be fine to test with. Mikey > > From: Michael Neuling <mikey@neuling.org> > > When reserving stack space for a new process, make sure we're not > attempting to expand the stack by more than rlimit allows. > > This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba ("mm: > variable length argument support") and unmasked by > fc63cf237078c86214abcb2ee9926d8ad289da9b ("exec: setup_arg_pages() fails > to return errors"). > > This bug means that when limiting the stack to less the 20*PAGE_SIZE (eg. > 80K on 4K pages or 'ulimit -s 79') all processes will be killed before > they start. This is particularly bad with 64K pages, where a ulimit below > 1280K will kill every process. > > Signed-off-by: Michael Neuling <mikey@neuling.org> > Cc: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> > Cc: Americo Wang <xiyou.wangcong@gmail.com> > Cc: Anton Blanchard <anton@samba.org> > Cc: Oleg Nesterov <oleg@redhat.com> > Cc: James Morris <jmorris@namei.org> > Cc: Ingo Molnar <mingo@elte.hu> > Cc: Serge Hallyn <serue@us.ibm.com> > Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org> > Cc: <stable@kernel.org> > > fs/exec.c | 21 +++++++++++++++++++-- > 1 file changed, 19 insertions(+), 2 deletions(-) > > diff -puN fs/exec.c~fs-execc-restrict-initial-stack-space-expansion-to-rlimit fs/exec.c > --- a/fs/exec.c~fs-execc-restrict-initial-stack-space-expansion-to-rlimit > +++ a/fs/exec.c > @@ -571,6 +571,9 @@ int setup_arg_pages(struct linux_binprm > struct vm_area_struct *prev = NULL; > unsigned long vm_flags; > unsigned long stack_base; > + unsigned long stack_size; > + unsigned long stack_expand; > + unsigned long rlim_stack; > > #ifdef CONFIG_STACK_GROWSUP > /* Limit stack size to 1GB */ > @@ -627,10 +630,24 @@ int setup_arg_pages(struct linux_binprm > goto out_unlock; > } > > + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + stack_size = vma->vm_end - vma->vm_start; > + /* > + * Align this down to a page boundary as expand_stack > + * will align it up. > + */ > + rlim_stack = rlimit(RLIMIT_STACK) & PAGE_MASK; > + rlim_stack = min(rlim_stack, stack_size); > #ifdef CONFIG_STACK_GROWSUP > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + if (stack_size + stack_expand > rlim_stack) > + stack_base = vma->vm_start + rlim_stack; > + else > + stack_base = vma->vm_end + stack_expand; > #else > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > + if (stack_size + stack_expand > rlim_stack) > + stack_base = vma->vm_end - rlim_stack; > + else > + stack_base = vma->vm_start - stack_expand; > #endif > ret = expand_stack(vma, stack_base); > if (ret) > _ > ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-09 21:51 ` Michael Neuling @ 2010-02-09 22:27 ` Helge Deller 2010-02-10 5:12 ` KOSAKI Motohiro 0 siblings, 1 reply; 23+ messages in thread From: Helge Deller @ 2010-02-09 22:27 UTC (permalink / raw) To: Michael Neuling Cc: Andrew Morton, KOSAKI Motohiro, Americo Wang, Anton Blanchard, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb, linux-parisc On 02/09/2010 10:51 PM, Michael Neuling wrote: >>> I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it >>> as well. >> >> There's only one CONFIG_GROWSUP arch - parisc. >> Could someone please test it on parisc? I did. > How about doing: > 'ulimit -s 15; ls' > before and after the patch is applied. Before it's applied, 'ls' should > be killed. After the patch is applied, 'ls' should no longer be killed. > > I'm suggesting a stack limit of 15KB since it's small enough to trigger > 20*PAGE_SIZE. Also 15KB not a multiple of PAGE_SIZE, which is a trickier > case to handle correctly with this code. > > 4K pages on parisc should be fine to test with. Mikey, thanks for the suggested test plan. I'm not sure if your patch does it correct for parisc/stack-grows-up-case. I tested your patch on a 4k pages kernel: root@c3000:~# uname -a Linux c3000 2.6.33-rc7-32bit #221 Tue Feb 9 23:17:06 CET 2010 parisc GNU/Linux Without your patch: root@c3000:~# ulimit -s 15; ls Killed -> correct. With your patch: root@c3000:~# ulimit -s 15; ls Killed _or_: root@c3000:~# ulimit -s 15; ls Segmentation fault -> ?? Any idea? Helge >> From: Michael Neuling<mikey@neuling.org> >> >> When reserving stack space for a new process, make sure we're not >> attempting to expand the stack by more than rlimit allows. >> >> This fixes a bug caused by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba ("mm: >> variable length argument support") and unmasked by >> fc63cf237078c86214abcb2ee9926d8ad289da9b ("exec: setup_arg_pages() fails >> to return errors"). >> >> This bug means that when limiting the stack to less the 20*PAGE_SIZE (eg. >> 80K on 4K pages or 'ulimit -s 79') all processes will be killed before >> they start. This is particularly bad with 64K pages, where a ulimit below >> 1280K will kill every process. >> >> Signed-off-by: Michael Neuling<mikey@neuling.org> >> Cc: KOSAKI Motohiro<kosaki.motohiro@jp.fujitsu.com> >> Cc: Americo Wang<xiyou.wangcong@gmail.com> >> Cc: Anton Blanchard<anton@samba.org> >> Cc: Oleg Nesterov<oleg@redhat.com> >> Cc: James Morris<jmorris@namei.org> >> Cc: Ingo Molnar<mingo@elte.hu> >> Cc: Serge Hallyn<serue@us.ibm.com> >> Cc: Benjamin Herrenschmidt<benh@kernel.crashing.org> >> Cc:<stable@kernel.org> >> >> fs/exec.c | 21 +++++++++++++++++++-- >> 1 file changed, 19 insertions(+), 2 deletions(-) >> >> diff -puN fs/exec.c~fs-execc-restrict-initial-stack-space-expansion-to-rlimit > fs/exec.c >> --- a/fs/exec.c~fs-execc-restrict-initial-stack-space-expansion-to-rlimit >> +++ a/fs/exec.c >> @@ -571,6 +571,9 @@ int setup_arg_pages(struct linux_binprm >> struct vm_area_struct *prev = NULL; >> unsigned long vm_flags; >> unsigned long stack_base; >> + unsigned long stack_size; >> + unsigned long stack_expand; >> + unsigned long rlim_stack; >> >> #ifdef CONFIG_STACK_GROWSUP >> /* Limit stack size to 1GB */ >> @@ -627,10 +630,24 @@ int setup_arg_pages(struct linux_binprm >> goto out_unlock; >> } >> >> + stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; >> + stack_size = vma->vm_end - vma->vm_start; >> + /* >> + * Align this down to a page boundary as expand_stack >> + * will align it up. >> + */ >> + rlim_stack = rlimit(RLIMIT_STACK)& PAGE_MASK; >> + rlim_stack = min(rlim_stack, stack_size); >> #ifdef CONFIG_STACK_GROWSUP >> - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; >> + if (stack_size + stack_expand> rlim_stack) >> + stack_base = vma->vm_start + rlim_stack; >> + else >> + stack_base = vma->vm_end + stack_expand; >> #else >> - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; >> + if (stack_size + stack_expand> rlim_stack) >> + stack_base = vma->vm_end - rlim_stack; >> + else >> + stack_base = vma->vm_start - stack_expand; >> #endif >> ret = expand_stack(vma, stack_base); >> if (ret) ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-09 22:27 ` Helge Deller @ 2010-02-10 5:12 ` KOSAKI Motohiro 2010-02-10 5:30 ` Michael Neuling 2010-02-10 5:31 ` Michael Neuling 0 siblings, 2 replies; 23+ messages in thread From: KOSAKI Motohiro @ 2010-02-10 5:12 UTC (permalink / raw) To: Helge Deller Cc: Michael Neuling, linux-parisc, stable, aeb, Oleg Nesterov, miltonm, James Morris, linuxppc-dev, Paul Mackerras, Anton Blanchard, kosaki.motohiro, Serge Hallyn, linux-fsdevel, Americo Wang, Andrew Morton, Linus Torvalds, Ingo Molnar, linux-kernel, Alexander Viro > On 02/09/2010 10:51 PM, Michael Neuling wrote: > >>> I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it > >>> as well. > >> > >> There's only one CONFIG_GROWSUP arch - parisc. > >> Could someone please test it on parisc? > > I did. > > > How about doing: > > 'ulimit -s 15; ls' > > before and after the patch is applied. Before it's applied, 'ls' should > > be killed. After the patch is applied, 'ls' should no longer be killed. > > > > I'm suggesting a stack limit of 15KB since it's small enough to trigger > > 20*PAGE_SIZE. Also 15KB not a multiple of PAGE_SIZE, which is a trickier > > case to handle correctly with this code. > > > > 4K pages on parisc should be fine to test with. > > Mikey, thanks for the suggested test plan. > > I'm not sure if your patch does it correct for parisc/stack-grows-up-case. > > I tested your patch on a 4k pages kernel: > root@c3000:~# uname -a > Linux c3000 2.6.33-rc7-32bit #221 Tue Feb 9 23:17:06 CET 2010 parisc GNU/Linux > > Without your patch: > root@c3000:~# ulimit -s 15; ls > Killed > -> correct. > > With your patch: > root@c3000:~# ulimit -s 15; ls > Killed > _or_: > root@c3000:~# ulimit -s 15; ls > Segmentation fault > -> ?? > > Any idea? My x86_64 box also makes segmentation fault. I think "ulimit -s 15" is too small stack for ls. "ulimit -s 27; ls " wroks perfectly fine. ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-10 5:12 ` KOSAKI Motohiro @ 2010-02-10 5:30 ` Michael Neuling 2010-02-10 5:31 ` Michael Neuling 1 sibling, 0 replies; 23+ messages in thread From: Michael Neuling @ 2010-02-10 5:30 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Helge Deller, Andrew Morton, Americo Wang, Anton Blanchard, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb, linux-parisc In message <20100210141016.4D18.A69D9226@jp.fujitsu.com> you wrote: > > On 02/09/2010 10:51 PM, Michael Neuling wrote: > > >>> I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it > > >>> as well. > > >> > > >> There's only one CONFIG_GROWSUP arch - parisc. > > >> Could someone please test it on parisc? > > > > I did. > > > > > How about doing: > > > 'ulimit -s 15; ls' > > > before and after the patch is applied. Before it's applied, 'ls' should > > > be killed. After the patch is applied, 'ls' should no longer be killed. > > > > > > I'm suggesting a stack limit of 15KB since it's small enough to trigger > > > 20*PAGE_SIZE. Also 15KB not a multiple of PAGE_SIZE, which is a trickier > > > case to handle correctly with this code. > > > > > > 4K pages on parisc should be fine to test with. > > > > Mikey, thanks for the suggested test plan. > > > > I'm not sure if your patch does it correct for parisc/stack-grows-up-case. > > > > I tested your patch on a 4k pages kernel: > > root@c3000:~# uname -a > > Linux c3000 2.6.33-rc7-32bit #221 Tue Feb 9 23:17:06 CET 2010 parisc GNU/Li nux > > > > Without your patch: > > root@c3000:~# ulimit -s 15; ls > > Killed > > -> correct. > > > > With your patch: > > root@c3000:~# ulimit -s 15; ls > > Killed > > _or_: > > root@c3000:~# ulimit -s 15; ls > > Segmentation fault > > -> ?? > > > > Any idea? > > My x86_64 box also makes segmentation fault. I think "ulimit -s 15" is too sm all stack for ls. > "ulimit -s 27; ls " wroks perfectly fine. Arrh. I asked Helge offline earlier to check what use to work on parisc on 2.6.31. I guess PPC has a nice clean non-bloated ABI :-D Mikey ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-10 5:12 ` KOSAKI Motohiro 2010-02-10 5:30 ` Michael Neuling @ 2010-02-10 5:31 ` Michael Neuling 2010-02-11 22:16 ` Helge Deller 1 sibling, 1 reply; 23+ messages in thread From: Michael Neuling @ 2010-02-10 5:31 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Helge Deller, Andrew Morton, Americo Wang, Anton Blanchard, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb, linux-parisc In message <20100210141016.4D18.A69D9226@jp.fujitsu.com> you wrote: > > On 02/09/2010 10:51 PM, Michael Neuling wrote: > > >>> I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it > > >>> as well. > > >> > > >> There's only one CONFIG_GROWSUP arch - parisc. > > >> Could someone please test it on parisc? > > > > I did. > > > > > How about doing: > > > 'ulimit -s 15; ls' > > > before and after the patch is applied. Before it's applied, 'ls' should > > > be killed. After the patch is applied, 'ls' should no longer be killed. > > > > > > I'm suggesting a stack limit of 15KB since it's small enough to trigger > > > 20*PAGE_SIZE. Also 15KB not a multiple of PAGE_SIZE, which is a trickier > > > case to handle correctly with this code. > > > > > > 4K pages on parisc should be fine to test with. > > > > Mikey, thanks for the suggested test plan. > > > > I'm not sure if your patch does it correct for parisc/stack-grows-up-case. > > > > I tested your patch on a 4k pages kernel: > > root@c3000:~# uname -a > > Linux c3000 2.6.33-rc7-32bit #221 Tue Feb 9 23:17:06 CET 2010 parisc GNU/Li nux > > > > Without your patch: > > root@c3000:~# ulimit -s 15; ls > > Killed > > -> correct. > > > > With your patch: > > root@c3000:~# ulimit -s 15; ls > > Killed > > _or_: > > root@c3000:~# ulimit -s 15; ls > > Segmentation fault > > -> ?? > > > > Any idea? > > My x86_64 box also makes segmentation fault. I think "ulimit -s 15" is too sm all stack for ls. > "ulimit -s 27; ls " wroks perfectly fine. Arrh. I asked Helge offline earlier to check what use to work on parisc on 2.6.31. I guess PPC has a nice clean non-bloated ABI :-D Mikey ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-10 5:31 ` Michael Neuling @ 2010-02-11 22:16 ` Helge Deller 2010-02-11 22:22 ` Michael Neuling 0 siblings, 1 reply; 23+ messages in thread From: Helge Deller @ 2010-02-11 22:16 UTC (permalink / raw) To: Michael Neuling Cc: KOSAKI Motohiro, Andrew Morton, Americo Wang, Anton Blanchard, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb, linux-parisc On 02/10/2010 06:31 AM, Michael Neuling wrote: > In message<20100210141016.4D18.A69D9226@jp.fujitsu.com> you wrote: >>> On 02/09/2010 10:51 PM, Michael Neuling wrote: >>>>>> I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it >>>>>> as well. >>>>> >>>>> There's only one CONFIG_GROWSUP arch - parisc. >>>>> Could someone please test it on parisc? >>> >>> I did. >>> >>>> How about doing: >>>> 'ulimit -s 15; ls' >>>> before and after the patch is applied. Before it's applied, 'ls' should >>>> be killed. After the patch is applied, 'ls' should no longer be killed. >>>> >>>> I'm suggesting a stack limit of 15KB since it's small enough to trigger >>>> 20*PAGE_SIZE. Also 15KB not a multiple of PAGE_SIZE, which is a trickier >>>> case to handle correctly with this code. >>>> >>>> 4K pages on parisc should be fine to test with. >>> >>> Mikey, thanks for the suggested test plan. >>> >>> I'm not sure if your patch does it correct for parisc/stack-grows-up-case. >>> >>> I tested your patch on a 4k pages kernel: >>> root@c3000:~# uname -a >>> Linux c3000 2.6.33-rc7-32bit #221 Tue Feb 9 23:17:06 CET 2010 parisc GNU/Li > nux >>> >>> Without your patch: >>> root@c3000:~# ulimit -s 15; ls >>> Killed >>> -> correct. >>> >>> With your patch: >>> root@c3000:~# ulimit -s 15; ls >>> Killed >>> _or_: >>> root@c3000:~# ulimit -s 15; ls >>> Segmentation fault >>> -> ?? >>> >>> Any idea? >> >> My x86_64 box also makes segmentation fault. I think "ulimit -s 15" is too sm > all stack for ls. >> "ulimit -s 27; ls " wroks perfectly fine. > > Arrh. I asked Helge offline earlier to check what use to work on parisc > on 2.6.31. > > I guess PPC has a nice clean non-bloated ABI :-D Hi Mikey, I tested again, and it works for me with "ulimit -s 27" as well (on a 4k, 32bit kernel). Still, I'm not 100% sure if your patch is correct. Anyway, it seems to work. But what makes me wonder is, why EXTRA_STACK_VM_PAGES is defined in pages at all. You wrote in your patch description: > This bug means that when limiting the stack to less the 20*PAGE_SIZE (eg. > 80K on 4K pages or 'ulimit -s 79') all processes will be killed before > they start. This is particularly bad with 64K pages, where a ulimit below > 1280K will kill every process. Wouldn't it make sense to define and use EXTRA_STACK_VM_SIZE instead (e.g. as 20*4096 = 80k)? This extra stack reservation should IMHO be independend of the actual kernel page size. Helge ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict initial stack space expansion to rlimit 2010-02-11 22:16 ` Helge Deller @ 2010-02-11 22:22 ` Michael Neuling 0 siblings, 0 replies; 23+ messages in thread From: Michael Neuling @ 2010-02-11 22:22 UTC (permalink / raw) To: Helge Deller Cc: KOSAKI Motohiro, Andrew Morton, Americo Wang, Anton Blanchard, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, Paul Mackerras, benh, miltonm, aeb, linux-parisc In message <4B7481A6.7080300@gmx.de> you wrote: > On 02/10/2010 06:31 AM, Michael Neuling wrote: > > In message<20100210141016.4D18.A69D9226@jp.fujitsu.com> you wrote: > >>> On 02/09/2010 10:51 PM, Michael Neuling wrote: > >>>>>> I'd still like someone with a CONFIG_STACK_GROWSUP arch to test/ACK it > >>>>>> as well. > >>>>> > >>>>> There's only one CONFIG_GROWSUP arch - parisc. > >>>>> Could someone please test it on parisc? > >>> > >>> I did. > >>> > >>>> How about doing: > >>>> 'ulimit -s 15; ls' > >>>> before and after the patch is applied. Before it's applied, 'ls' should > >>>> be killed. After the patch is applied, 'ls' should no longer be killed. > >>>> > >>>> I'm suggesting a stack limit of 15KB since it's small enough to trigger > >>>> 20*PAGE_SIZE. Also 15KB not a multiple of PAGE_SIZE, which is a trickie r > >>>> case to handle correctly with this code. > >>>> > >>>> 4K pages on parisc should be fine to test with. > >>> > >>> Mikey, thanks for the suggested test plan. > >>> > >>> I'm not sure if your patch does it correct for parisc/stack-grows-up-case . > >>> > >>> I tested your patch on a 4k pages kernel: > >>> root@c3000:~# uname -a > >>> Linux c3000 2.6.33-rc7-32bit #221 Tue Feb 9 23:17:06 CET 2010 parisc GNU/ Li > > nux > >>> > >>> Without your patch: > >>> root@c3000:~# ulimit -s 15; ls > >>> Killed > >>> -> correct. > >>> > >>> With your patch: > >>> root@c3000:~# ulimit -s 15; ls > >>> Killed > >>> _or_: > >>> root@c3000:~# ulimit -s 15; ls > >>> Segmentation fault > >>> -> ?? > >>> > >>> Any idea? > >> > >> My x86_64 box also makes segmentation fault. I think "ulimit -s 15" is too sm > > all stack for ls. > >> "ulimit -s 27; ls " wroks perfectly fine. > > > > Arrh. I asked Helge offline earlier to check what use to work on parisc > > on 2.6.31. > > > > I guess PPC has a nice clean non-bloated ABI :-D > > Hi Mikey, > > I tested again, and it works for me with "ulimit -s 27" as well (on a > 4k, 32bit kernel). > Still, I'm not 100% sure if your patch is correct. Thanks for retesting Did "ulimit -s 27" fail before you applied? > Anyway, it seems to work. > > But what makes me wonder is, why EXTRA_STACK_VM_PAGES is defined in pages at all. > You wrote in your patch description: > > This bug means that when limiting the stack to less the 20*PAGE_SIZE (eg. > > 80K on 4K pages or 'ulimit -s 79') all processes will be killed before > > they start. This is particularly bad with 64K pages, where a ulimit below > > 1280K will kill every process. > > Wouldn't it make sense to define and use EXTRA_STACK_VM_SIZE instead > (e.g. as 20*4096 = 80k)? This extra stack reservation should IMHO be > independend of the actual kernel page size. If you look back through this thread, that has already been noted but it's a separate issue to this bug, so that change will be deferred till 2.6.34. Mikey ^ permalink raw reply [flat|nested] 23+ messages in thread
* Re: [PATCH] Restrict stack space reservation to rlimit 2010-02-08 6:05 ` KOSAKI Motohiro 2010-02-08 7:07 ` Américo Wang @ 2010-02-08 10:45 ` Michael Neuling 1 sibling, 0 replies; 23+ messages in thread From: Michael Neuling @ 2010-02-08 10:45 UTC (permalink / raw) To: KOSAKI Motohiro Cc: Anton Blanchard, Andrew Morton, Linus Torvalds, Alexander Viro, Oleg Nesterov, James Morris, Ingo Molnar, linux-fsdevel, stable, linux-kernel, linuxppc-dev, Serge Hallyn, WANG Cong, Paul Mackerras, benh, miltonm, aeb In message <20100208145240.FB58.A69D9226@jp.fujitsu.com> you wrote: > > > > > > > > Hi, > > > > > > > > > Why do we need page size independent stack size? It seems to have > > > > > compatibility breaking risk. > > > > > > > > I don't think so. The current behaviour is clearly wrong, we dont need a > > > > 16x larger stack just because you went from a 4kB to a 64kB base page > > > > size. The user application stack usage is the same in both cases. > > > > > > I didn't discuss which behavior is better. Michael said he want to apply > > > his patch to 2.6.32 & 2.6.33. stable tree never accept the breaking > > > compatibility patch. > > > > > > Your answer doesn't explain why can't we wait it until next merge window. > > > > > > btw, personally, I like page size indepent stack size. but I'm not sure > > > why making stack size independency is related to bug fix. > > > > I tend to agree. > > > > Below is just the bug fix to limit the reservation size based rlimit. > > We still reserve different stack sizes based on the page size as > > before (unless we hit rlimit of course). > > Thanks. > > I agree your patch in almost part. but I have very few requests. > > > > Mikey > > > > Restrict stack space reservation to rlimit > > > > When reserving stack space for a new process, make sure we're not > > attempting to allocate more than rlimit allows. > > > > This fixes a bug cause by b6a2fea39318e43fee84fa7b0b90d68bed92d2ba > > "mm: variable length argument support" and unmasked by > > fc63cf237078c86214abcb2ee9926d8ad289da9b > > "exec: setup_arg_pages() fails to return errors". > > Your initial mail have following problem use-case. please append it > into the patch description. > > On recent ppc64 kernels, limiting the stack (using 'ulimit -s blah') is > now more restrictive than it was before. On 2.6.31 with 4k pages I > could run 'ulimit -s 16; /usr/bin/test' without a problem. Now with > mainline, even 'ulimit -s 64; /usr/bin/test' gets killed. Ok, I'll add this info in. > > > > > Signed-off-by: Michael Neuling <mikey@neuling.org> > > Cc: Anton Blanchard <anton@samba.org> > > Cc: stable@kernel.org > > --- > > fs/exec.c | 7 +++++-- > > 1 file changed, 5 insertions(+), 2 deletions(-) > > > > Index: linux-2.6-ozlabs/fs/exec.c > > =================================================================== > > --- linux-2.6-ozlabs.orig/fs/exec.c > > +++ linux-2.6-ozlabs/fs/exec.c > > @@ -627,10 +627,13 @@ int setup_arg_pages(struct linux_binprm > > goto out_unlock; > > } > > > > + stack_base = min(EXTRA_STACK_VM_PAGES * PAGE_SIZE, > > + current->signal->rlim[RLIMIT_STACK].rlim_cur - > > + PAGE_SIZE); > > This line is a bit unclear why "- PAGE_SIZE" is necessary. This is because the stack is already 1 page in size. I'm going to change that code to make it clearer... hopefully :-) > personally, I like following likes explicit comments. > > stack_expand = EXTRA_STACK_VM_PAGES * PAGE_SIZE; > stack_lim = ACCESS_ONCE(rlim[RLIMIT_STACK].rlim_cur); > > /* Initial stack must not cause stack overflow. */ > if (stack_expand + PAGE_SIZE > stack_lim) > stack_expand = stack_lim - PAGE_SIZE; > > note: accessing rlim_cur require ACCESS_ONCE. > > > Thought? Thanks, looks better/clearer to me too. I'll change, new patch coming.... Mikey > > > > #ifdef CONFIG_STACK_GROWSUP > > - stack_base = vma->vm_end + EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > + stack_base = vma->vm_end + stack_base; > > #else > > - stack_base = vma->vm_start - EXTRA_STACK_VM_PAGES * PAGE_SIZE; > > + stack_base = vma->vm_start - stack_base; > > #endif > > ret = expand_stack(vma, stack_base); > > if (ret) > > > > > ^ permalink raw reply [flat|nested] 23+ messages in thread
end of thread, other threads:[~2010-02-11 22:22 UTC | newest] Thread overview: 23+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- [not found] <3984.1265416993@neuling.org> [not found] ` <20100206042038.GB32246@kryten> [not found] ` <10125.1265451732@neuling.org> 2010-02-08 0:07 ` [PATCH] Restrict stack space reservation to rlimit Michael Neuling 2010-02-08 0:28 ` Michael Neuling 2010-02-08 5:06 ` KOSAKI Motohiro 2010-02-08 5:11 ` Anton Blanchard 2010-02-08 5:22 ` KOSAKI Motohiro 2010-02-08 5:31 ` Anton Blanchard 2010-02-08 6:11 ` KOSAKI Motohiro 2010-02-08 5:37 ` Michael Neuling 2010-02-08 6:05 ` KOSAKI Motohiro 2010-02-08 7:07 ` Américo Wang 2010-02-08 7:11 ` KOSAKI Motohiro 2010-02-09 6:11 ` [PATCH] Restrict initial stack space expansion " Michael Neuling 2010-02-09 6:46 ` KOSAKI Motohiro 2010-02-09 8:59 ` Michael Neuling 2010-02-09 21:25 ` Andrew Morton 2010-02-09 21:51 ` Michael Neuling 2010-02-09 22:27 ` Helge Deller 2010-02-10 5:12 ` KOSAKI Motohiro 2010-02-10 5:30 ` Michael Neuling 2010-02-10 5:31 ` Michael Neuling 2010-02-11 22:16 ` Helge Deller 2010-02-11 22:22 ` Michael Neuling 2010-02-08 10:45 ` [PATCH] Restrict stack space reservation " Michael Neuling
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).