public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH 1/2] x86, xsave: fix non-lazy allocation of the xsave area
       [not found] <2be7032a-e4b6-4ce7-af53-532d77dbfb48@email.android.com>
@ 2011-04-27 14:26 ` Hans Rosenfeld
  2011-04-28  5:55   ` [tip:x86/xsave] " tip-bot for Hans Rosenfeld
  2011-04-27 14:26 ` [PATCH 2/2] fork: avoid weak function arch_dup_task_struct Hans Rosenfeld
  1 sibling, 1 reply; 7+ messages in thread
From: Hans Rosenfeld @ 2011-04-27 14:26 UTC (permalink / raw)
  To: hpa
  Cc: brgerst, tglx, mingo, suresh.b.siddha, eranian, robert.richter,
	Andreas.Herrmann3, x86, linux-kernel, Hans Rosenfeld

A single static xsave area just for init is not enough, since there are
more user processes that are directly executed by kernel threads. Add a
call to a new arch-specific function to flush_old_exec(), which will in
turn call fpu_alloc() to allocate a xsave area if necessary.

Signed-off-by: Hans Rosenfeld <hans.rosenfeld@amd.com>
---
 arch/x86/include/asm/i387.h        |    6 ------
 arch/x86/include/asm/thread_info.h |    2 ++
 arch/x86/kernel/process.c          |    7 +++++++
 fs/exec.c                          |    8 ++++++++
 4 files changed, 17 insertions(+), 6 deletions(-)

diff --git a/arch/x86/include/asm/i387.h b/arch/x86/include/asm/i387.h
index 989c0ac..0448f45 100644
--- a/arch/x86/include/asm/i387.h
+++ b/arch/x86/include/asm/i387.h
@@ -329,15 +329,9 @@ static inline void fpu_copy(struct fpu *dst, struct fpu *src)
 }
 
 extern void fpu_finit(struct fpu *fpu);
-static union thread_xstate __init_xstate, *init_xstate = &__init_xstate;
 
 static inline void fpu_clear(struct fpu *fpu)
 {
-	if (!fpu_allocated(fpu)) {
-		BUG_ON(init_xstate == NULL);
-		fpu->state = init_xstate;
-		init_xstate = NULL;
-	}
 	memset(fpu->state, 0, xstate_size);
 	fpu_finit(fpu);
 	set_used_math();
diff --git a/arch/x86/include/asm/thread_info.h b/arch/x86/include/asm/thread_info.h
index 0e691c6..07c8cfe 100644
--- a/arch/x86/include/asm/thread_info.h
+++ b/arch/x86/include/asm/thread_info.h
@@ -265,6 +265,8 @@ static inline void set_restore_sigmask(void)
 extern void arch_task_cache_init(void);
 extern void free_thread_info(struct thread_info *ti);
 extern int arch_dup_task_struct(struct task_struct *dst, struct task_struct *src);
+extern int arch_prealloc_fpu(struct task_struct *tsk);
+#define arch_prealloc_fpu arch_prealloc_fpu
 #define arch_task_cache_init arch_task_cache_init
 #endif
 #endif /* _ASM_X86_THREAD_INFO_H */
diff --git a/arch/x86/kernel/process.c b/arch/x86/kernel/process.c
index 0382f98..3edfbf2 100644
--- a/arch/x86/kernel/process.c
+++ b/arch/x86/kernel/process.c
@@ -26,6 +26,13 @@
 struct kmem_cache *task_xstate_cachep;
 EXPORT_SYMBOL_GPL(task_xstate_cachep);
 
+int arch_prealloc_fpu(struct task_struct *tsk)
+{
+	if (!fpu_allocated(&tsk->thread.fpu))
+		return fpu_alloc(&tsk->thread.fpu);
+	return 0;
+}
+
 int arch_dup_task_struct(struct task_struct *dst, struct task_struct *src)
 {
 	int ret;
diff --git a/fs/exec.c b/fs/exec.c
index 5e62d26..932b9ad 100644
--- a/fs/exec.c
+++ b/fs/exec.c
@@ -1022,10 +1022,18 @@ void set_task_comm(struct task_struct *tsk, char *buf)
 	perf_event_comm(tsk);
 }
 
+#if !defined(arch_prealloc_fpu)
+#define arch_prealloc_fpu(tsk) (0)
+#endif
+
 int flush_old_exec(struct linux_binprm * bprm)
 {
 	int retval;
 
+	retval = arch_prealloc_fpu(current);
+	if (retval)
+		goto out;
+
 	/*
 	 * Make sure we have a private signal table and that
 	 * we are unassociated from the previous thread group.
-- 
1.5.6.5



^ permalink raw reply related	[flat|nested] 7+ messages in thread

* [PATCH 2/2] fork: avoid weak function arch_dup_task_struct
       [not found] <2be7032a-e4b6-4ce7-af53-532d77dbfb48@email.android.com>
  2011-04-27 14:26 ` [PATCH 1/2] x86, xsave: fix non-lazy allocation of the xsave area Hans Rosenfeld
@ 2011-04-27 14:26 ` Hans Rosenfeld
  2011-05-01 19:25   ` Andrew Morton
  1 sibling, 1 reply; 7+ messages in thread
From: Hans Rosenfeld @ 2011-04-27 14:26 UTC (permalink / raw)
  To: hpa
  Cc: brgerst, tglx, mingo, suresh.b.siddha, eranian, robert.richter,
	Andreas.Herrmann3, x86, linux-kernel, Hans Rosenfeld

Avoid potential gcc bug by not using a weak function for
arch_dup_task_struct. Use an #ifdef'ed static function for
archs that don't have a special arch_dup_task_struct implementation.

Signed-off-by: Hans Rosenfeld <hans.rosenfeld@amd.com>
---
 arch/sh/include/asm/thread_info.h  |    1 +
 arch/x86/include/asm/thread_info.h |    1 +
 kernel/fork.c                      |    5 +++--
 3 files changed, 5 insertions(+), 2 deletions(-)

diff --git a/arch/sh/include/asm/thread_info.h b/arch/sh/include/asm/thread_info.h
index ea2d508..2d27e1c 100644
--- a/arch/sh/include/asm/thread_info.h
+++ b/arch/sh/include/asm/thread_info.h
@@ -100,6 +100,7 @@ extern void free_thread_info(struct thread_info *ti);
 extern void arch_task_cache_init(void);
 #define arch_task_cache_init arch_task_cache_init
 extern int arch_dup_task_struct(struct task_struct *dst, struct task_struct *src);
+#define arch_dup_task_struct arch_dup_task_struct
 extern void init_thread_xstate(void);
 
 #define __HAVE_ARCH_THREAD_INFO_ALLOCATOR
diff --git a/arch/x86/include/asm/thread_info.h b/arch/x86/include/asm/thread_info.h
index 07c8cfe..ce97947 100644
--- a/arch/x86/include/asm/thread_info.h
+++ b/arch/x86/include/asm/thread_info.h
@@ -266,6 +266,7 @@ extern void arch_task_cache_init(void);
 extern void free_thread_info(struct thread_info *ti);
 extern int arch_dup_task_struct(struct task_struct *dst, struct task_struct *src);
 extern int arch_prealloc_fpu(struct task_struct *tsk);
+#define arch_dup_task_struct arch_dup_task_struct
 #define arch_prealloc_fpu arch_prealloc_fpu
 #define arch_task_cache_init arch_task_cache_init
 #endif
diff --git a/kernel/fork.c b/kernel/fork.c
index e7548de..6d929d0 100644
--- a/kernel/fork.c
+++ b/kernel/fork.c
@@ -243,12 +243,13 @@ void __init fork_init(unsigned long mempages)
 		init_task.signal->rlim[RLIMIT_NPROC];
 }
 
-int __attribute__((weak)) arch_dup_task_struct(struct task_struct *dst,
-					       struct task_struct *src)
+#ifndef arch_dup_task_struct
+static int arch_dup_task_struct(struct task_struct *dst, struct task_struct *src)
 {
 	*dst = *src;
 	return 0;
 }
+#endif
 
 static struct task_struct *dup_task_struct(struct task_struct *orig)
 {
-- 
1.5.6.5



^ permalink raw reply related	[flat|nested] 7+ messages in thread

* [tip:x86/xsave] x86, xsave: fix non-lazy allocation of the xsave area
  2011-04-27 14:26 ` [PATCH 1/2] x86, xsave: fix non-lazy allocation of the xsave area Hans Rosenfeld
@ 2011-04-28  5:55   ` tip-bot for Hans Rosenfeld
  0 siblings, 0 replies; 7+ messages in thread
From: tip-bot for Hans Rosenfeld @ 2011-04-28  5:55 UTC (permalink / raw)
  To: linux-tip-commits; +Cc: linux-kernel, hans.rosenfeld, hpa, mingo, tglx

Commit-ID:  300c6120b4653043e8a80b24e0483c0d223c5aac
Gitweb:     http://git.kernel.org/tip/300c6120b4653043e8a80b24e0483c0d223c5aac
Author:     Hans Rosenfeld <hans.rosenfeld@amd.com>
AuthorDate: Wed, 27 Apr 2011 16:26:34 +0200
Committer:  H. Peter Anvin <hpa@zytor.com>
CommitDate: Wed, 27 Apr 2011 18:03:45 -0700

x86, xsave: fix non-lazy allocation of the xsave area

A single static xsave area just for init is not enough, since there are
more user processes that are directly executed by kernel threads. Add a
call to a new arch-specific function to flush_old_exec(), which will in
turn call fpu_alloc() to allocate a xsave area if necessary.

Signed-off-by: Hans Rosenfeld <hans.rosenfeld@amd.com>
Link: http://lkml.kernel.org/r/1303914395-841373-1-git-send-email-hans.rosenfeld@amd.com
Signed-off-by: H. Peter Anvin <hpa@zytor.com>
---
 arch/x86/include/asm/i387.h        |    6 ------
 arch/x86/include/asm/thread_info.h |    2 ++
 arch/x86/kernel/process.c          |    7 +++++++
 fs/exec.c                          |    8 ++++++++
 4 files changed, 17 insertions(+), 6 deletions(-)

diff --git a/arch/x86/include/asm/i387.h b/arch/x86/include/asm/i387.h
index f410d16..41f1d7e 100644
--- a/arch/x86/include/asm/i387.h
+++ b/arch/x86/include/asm/i387.h
@@ -327,15 +327,9 @@ static inline void fpu_copy(struct fpu *dst, struct fpu *src)
 }
 
 extern void fpu_finit(struct fpu *fpu);
-static union thread_xstate __init_xstate, *init_xstate = &__init_xstate;
 
 static inline void fpu_clear(struct fpu *fpu)
 {
-	if (!fpu_allocated(fpu)) {
-		BUG_ON(init_xstate == NULL);
-		fpu->state = init_xstate;
-		init_xstate = NULL;
-	}
 	memset(fpu->state, 0, xstate_size);
 	fpu_finit(fpu);
 	set_used_math();
diff --git a/arch/x86/include/asm/thread_info.h b/arch/x86/include/asm/thread_info.h
index 0e691c6..07c8cfe 100644
--- a/arch/x86/include/asm/thread_info.h
+++ b/arch/x86/include/asm/thread_info.h
@@ -265,6 +265,8 @@ static inline void set_restore_sigmask(void)
 extern void arch_task_cache_init(void);
 extern void free_thread_info(struct thread_info *ti);
 extern int arch_dup_task_struct(struct task_struct *dst, struct task_struct *src);
+extern int arch_prealloc_fpu(struct task_struct *tsk);
+#define arch_prealloc_fpu arch_prealloc_fpu
 #define arch_task_cache_init arch_task_cache_init
 #endif
 #endif /* _ASM_X86_THREAD_INFO_H */
diff --git a/arch/x86/kernel/process.c b/arch/x86/kernel/process.c
index d46cbe4..a70f620 100644
--- a/arch/x86/kernel/process.c
+++ b/arch/x86/kernel/process.c
@@ -26,6 +26,13 @@
 struct kmem_cache *task_xstate_cachep;
 EXPORT_SYMBOL_GPL(task_xstate_cachep);
 
+int arch_prealloc_fpu(struct task_struct *tsk)
+{
+	if (!fpu_allocated(&tsk->thread.fpu))
+		return fpu_alloc(&tsk->thread.fpu);
+	return 0;
+}
+
 int arch_dup_task_struct(struct task_struct *dst, struct task_struct *src)
 {
 	int ret;
diff --git a/fs/exec.c b/fs/exec.c
index 5e62d26..932b9ad 100644
--- a/fs/exec.c
+++ b/fs/exec.c
@@ -1022,10 +1022,18 @@ void set_task_comm(struct task_struct *tsk, char *buf)
 	perf_event_comm(tsk);
 }
 
+#if !defined(arch_prealloc_fpu)
+#define arch_prealloc_fpu(tsk) (0)
+#endif
+
 int flush_old_exec(struct linux_binprm * bprm)
 {
 	int retval;
 
+	retval = arch_prealloc_fpu(current);
+	if (retval)
+		goto out;
+
 	/*
 	 * Make sure we have a private signal table and that
 	 * we are unassociated from the previous thread group.

^ permalink raw reply related	[flat|nested] 7+ messages in thread

* Re: [PATCH 2/2] fork: avoid weak function arch_dup_task_struct
  2011-04-27 14:26 ` [PATCH 2/2] fork: avoid weak function arch_dup_task_struct Hans Rosenfeld
@ 2011-05-01 19:25   ` Andrew Morton
  2011-05-01 22:59     ` H. Peter Anvin
  0 siblings, 1 reply; 7+ messages in thread
From: Andrew Morton @ 2011-05-01 19:25 UTC (permalink / raw)
  To: Hans Rosenfeld
  Cc: hpa, brgerst, tglx, mingo, suresh.b.siddha, eranian,
	robert.richter, Andreas.Herrmann3, x86, linux-kernel

On Wed, 27 Apr 2011 16:26:35 +0200 Hans Rosenfeld <hans.rosenfeld@amd.com> wrote:

> Avoid potential gcc bug by not using a weak function for
> arch_dup_task_struct. Use an #ifdef'ed static function for
> archs that don't have a special arch_dup_task_struct implementation.

The patch is unreviewable (and hence unusable) if you don't describe
this "potential gcc bug".


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH 2/2] fork: avoid weak function arch_dup_task_struct
  2011-05-01 19:25   ` Andrew Morton
@ 2011-05-01 22:59     ` H. Peter Anvin
  2011-05-02  0:11       ` Andrew Morton
  0 siblings, 1 reply; 7+ messages in thread
From: H. Peter Anvin @ 2011-05-01 22:59 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Hans Rosenfeld, brgerst, tglx, mingo, suresh.b.siddha, eranian,
	robert.richter, Andreas.Herrmann3, x86, linux-kernel

On 05/01/2011 12:25 PM, Andrew Morton wrote:
> On Wed, 27 Apr 2011 16:26:35 +0200 Hans Rosenfeld <hans.rosenfeld@amd.com> wrote:
> 
>> Avoid potential gcc bug by not using a weak function for
>> arch_dup_task_struct. Use an #ifdef'ed static function for
>> archs that don't have a special arch_dup_task_struct implementation.
> 
> The patch is unreviewable (and hence unusable) if you don't describe
> this "potential gcc bug".
>

http://lkml.indiana.edu/hypermail/linux/kernel/0804.3/3202.html

	-hpa

-- 
H. Peter Anvin, Intel Open Source Technology Center
I work for Intel.  I don't speak on their behalf.


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH 2/2] fork: avoid weak function arch_dup_task_struct
  2011-05-01 22:59     ` H. Peter Anvin
@ 2011-05-02  0:11       ` Andrew Morton
  2011-05-02 15:36         ` H. Peter Anvin
  0 siblings, 1 reply; 7+ messages in thread
From: Andrew Morton @ 2011-05-02  0:11 UTC (permalink / raw)
  To: H. Peter Anvin
  Cc: Hans Rosenfeld, brgerst, tglx, mingo, suresh.b.siddha, eranian,
	robert.richter, Andreas.Herrmann3, x86, linux-kernel

On Sun, 01 May 2011 15:59:12 -0700 "H. Peter Anvin" <hpa@zytor.com> wrote:

> On 05/01/2011 12:25 PM, Andrew Morton wrote:
> > On Wed, 27 Apr 2011 16:26:35 +0200 Hans Rosenfeld <hans.rosenfeld@amd.com> wrote:
> > 
> >> Avoid potential gcc bug by not using a weak function for
> >> arch_dup_task_struct. Use an #ifdef'ed static function for
> >> archs that don't have a special arch_dup_task_struct implementation.
> > 
> > The patch is unreviewable (and hence unusable) if you don't describe
> > this "potential gcc bug".
> >
> 
> http://lkml.indiana.edu/hypermail/linux/kernel/0804.3/3202.html
> 

gcc-4.1.0 and 4.1.1 were explicitly banned via a test in
include/linux/compiler-gcc4.h for this reason.


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH 2/2] fork: avoid weak function arch_dup_task_struct
  2011-05-02  0:11       ` Andrew Morton
@ 2011-05-02 15:36         ` H. Peter Anvin
  0 siblings, 0 replies; 7+ messages in thread
From: H. Peter Anvin @ 2011-05-02 15:36 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Hans Rosenfeld, brgerst, tglx, mingo, suresh.b.siddha, eranian,
	robert.richter, Andreas.Herrmann3, x86, linux-kernel

On 05/01/2011 05:11 PM, Andrew Morton wrote:
> On Sun, 01 May 2011 15:59:12 -0700 "H. Peter Anvin"<hpa@zytor.com>  wrote:
>
>> On 05/01/2011 12:25 PM, Andrew Morton wrote:
>>> On Wed, 27 Apr 2011 16:26:35 +0200 Hans Rosenfeld<hans.rosenfeld@amd.com>  wrote:
>>>
>>>> Avoid potential gcc bug by not using a weak function for
>>>> arch_dup_task_struct. Use an #ifdef'ed static function for
>>>> archs that don't have a special arch_dup_task_struct implementation.
>>>
>>> The patch is unreviewable (and hence unusable) if you don't describe
>>> this "potential gcc bug".
>>
>> http://lkml.indiana.edu/hypermail/linux/kernel/0804.3/3202.html
>>
>
> gcc-4.1.0 and 4.1.1 were explicitly banned via a test in
> include/linux/compiler-gcc4.h for this reason.
>

OK, no issue then.  Sorry for the noise.

	-hpa

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2011-05-02 15:37 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <2be7032a-e4b6-4ce7-af53-532d77dbfb48@email.android.com>
2011-04-27 14:26 ` [PATCH 1/2] x86, xsave: fix non-lazy allocation of the xsave area Hans Rosenfeld
2011-04-28  5:55   ` [tip:x86/xsave] " tip-bot for Hans Rosenfeld
2011-04-27 14:26 ` [PATCH 2/2] fork: avoid weak function arch_dup_task_struct Hans Rosenfeld
2011-05-01 19:25   ` Andrew Morton
2011-05-01 22:59     ` H. Peter Anvin
2011-05-02  0:11       ` Andrew Morton
2011-05-02 15:36         ` H. Peter Anvin

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox