public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
@ 2009-04-21 15:16 Stefan Roscher
  2009-04-21 17:34 ` Roland Dreier
                   ` (2 more replies)
  0 siblings, 3 replies; 12+ messages in thread
From: Stefan Roscher @ 2009-04-21 15:16 UTC (permalink / raw)
  To: LinuxPPC-Dev, LKML, OF-EWG, Roland Dreier
  Cc: fenkes, raisch, alexschm, stefan.roscher, hnguyen

From: Anton Blanchard <antonb at au1.ibm.com>

To improve performance of driver ressource allocation,
replace the vmalloc() call with kmalloc().

Signed-off-by: Stefan Roscher <stefan.roscher at de.ibm.com>
---
 drivers/infiniband/hw/ehca/ipz_pt_fn.c |    6 +++---
 1 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/drivers/infiniband/hw/ehca/ipz_pt_fn.c b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
index c3a3284..a260559 100644
--- a/drivers/infiniband/hw/ehca/ipz_pt_fn.c
+++ b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
@@ -220,7 +220,7 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
 	queue->small_page = NULL;
 
 	/* allocate queue page pointers */
-	queue->queue_pages = vmalloc(nr_of_pages * sizeof(void *));
+	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);
 	if (!queue->queue_pages) {
 		ehca_gen_err("Couldn't allocate queue page list");
 		return 0;
@@ -240,7 +240,7 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
 ipz_queue_ctor_exit0:
 	ehca_gen_err("Couldn't alloc pages queue=%p "
 		 "nr_of_pages=%x",  queue, nr_of_pages);
-	vfree(queue->queue_pages);
+	kfree(queue->queue_pages);
 
 	return 0;
 }
@@ -262,7 +262,7 @@ int ipz_queue_dtor(struct ehca_pd *pd, struct ipz_queue *queue)
 			free_page((unsigned long)queue->queue_pages[i]);
 	}
 
-	vfree(queue->queue_pages);
+	kfree(queue->queue_pages);
 
 	return 1;
 }
-- 
1.5.5


^ permalink raw reply related	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-21 15:16 [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc Stefan Roscher
@ 2009-04-21 17:34 ` Roland Dreier
  2009-04-22 14:02   ` Stefan Roscher
  2009-04-28 15:12 ` Dave Hansen
  2009-04-28 16:45 ` Roland Dreier
  2 siblings, 1 reply; 12+ messages in thread
From: Roland Dreier @ 2009-04-21 17:34 UTC (permalink / raw)
  To: Stefan Roscher
  Cc: LinuxPPC-Dev, LKML, OF-EWG, Roland Dreier, fenkes, raisch,
	alexschm, stefan.roscher, hnguyen

 > +	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);

How big might this buffer be?  Any chance of allocation failure due to
memory fragmentation?

 - R.

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-21 17:34 ` Roland Dreier
@ 2009-04-22 14:02   ` Stefan Roscher
  2009-04-22 14:10     ` michael
  2009-04-28 13:07     ` [ewg] " Alexander Schmidt
  0 siblings, 2 replies; 12+ messages in thread
From: Stefan Roscher @ 2009-04-22 14:02 UTC (permalink / raw)
  To: Roland Dreier
  Cc: LinuxPPC-Dev, LKML, OF-EWG, Roland Dreier, fenkes, raisch,
	alexschm, stefan.roscher, hnguyen

In case of large queue pairs there is the possibillity of allocation failures 
due to memory fragmentationo with kmalloc().To ensure the memory is allocated even
if kmalloc() can not find chunks which are big enough, we try to allocate the memory
with vmalloc().

Signed-off-by: Stefan Roscher <stefan.roscher@de.ibm.com>
---

On Tuesday 21 April 2009 07:34:30 pm Roland Dreier wrote:
>  > +	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);
> 
> How big might this buffer be?  Any chance of allocation failure due to
> memory fragmentation?
> 
>  - R.
Hey Roland, 
yes you are right and here is the patch to circumvent the described problem.
It will apply on top of the patchset.
regards Stefan


 
 drivers/infiniband/hw/ehca/ipz_pt_fn.c |   17 +++++++++++++----
 1 files changed, 13 insertions(+), 4 deletions(-)

diff --git a/drivers/infiniband/hw/ehca/ipz_pt_fn.c b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
index a260559..1227c59 100644
--- a/drivers/infiniband/hw/ehca/ipz_pt_fn.c
+++ b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
@@ -222,8 +222,11 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
 	/* allocate queue page pointers */
 	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);
 	if (!queue->queue_pages) {
-		ehca_gen_err("Couldn't allocate queue page list");
-		return 0;
+		queue->queue_pages = vmalloc(nr_of_pages * sizeof(void *));
+		if (!queue->queue_pages) {
+			ehca_gen_err("Couldn't allocate queue page list");
+			return 0;
+		}
 	}
 	memset(queue->queue_pages, 0, nr_of_pages * sizeof(void *));
 
@@ -240,7 +243,10 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
 ipz_queue_ctor_exit0:
 	ehca_gen_err("Couldn't alloc pages queue=%p "
 		 "nr_of_pages=%x",  queue, nr_of_pages);
-	kfree(queue->queue_pages);
+	if (is_vmalloc_addr(queue->queue_pages))
+		vfree(queue->queue_pages);
+	else
+		kfree(queue->queue_pages);
 
 	return 0;
 }
@@ -262,7 +268,10 @@ int ipz_queue_dtor(struct ehca_pd *pd, struct ipz_queue *queue)
 			free_page((unsigned long)queue->queue_pages[i]);
 	}
 
-	kfree(queue->queue_pages);
+	if (is_vmalloc_addr(queue->queue_pages))
+		vfree(queue->queue_pages);
+	else
+		kfree(queue->queue_pages);
 
 	return 1;
 }
-- 
1.5.5





^ permalink raw reply related	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-22 14:02   ` Stefan Roscher
@ 2009-04-22 14:10     ` michael
  2009-04-22 16:00       ` Stefan Roscher
  2009-04-28 13:07     ` [ewg] " Alexander Schmidt
  1 sibling, 1 reply; 12+ messages in thread
From: michael @ 2009-04-22 14:10 UTC (permalink / raw)
  To: Stefan Roscher
  Cc: Roland Dreier, fenkes, LKML, OF-EWG, LinuxPPC-Dev, raisch,
	alexschm, stefan.roscher

Hi,

Stefan Roscher wrote:
> In case of large queue pairs there is the possibillity of allocation failures 
> due to memory fragmentationo with kmalloc().To ensure the memory is allocated even
> if kmalloc() can not find chunks which are big enough, we try to allocate the memory
> with vmalloc().
>
> Signed-off-by: Stefan Roscher <stefan.roscher@de.ibm.com>
> ---
>
> On Tuesday 21 April 2009 07:34:30 pm Roland Dreier wrote:
>   
>>  > +	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);
>>
>> How big might this buffer be?  Any chance of allocation failure due to
>> memory fragmentation?
>>
>>  - R.
>>     
> Hey Roland, 
> yes you are right and here is the patch to circumvent the described problem.
> It will apply on top of the patchset.
> regards Stefan
>
>   
I don't take the point, if it is not import use the vmalloc. Why you try 
with a kmalloc
alloc first? and why do not use kzalloc?
>  
>  drivers/infiniband/hw/ehca/ipz_pt_fn.c |   17 +++++++++++++----
>  1 files changed, 13 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/infiniband/hw/ehca/ipz_pt_fn.c b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
> index a260559..1227c59 100644
> --- a/drivers/infiniband/hw/ehca/ipz_pt_fn.c
> +++ b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
> @@ -222,8 +222,11 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
>  	/* allocate queue page pointers */
>  	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);
>  	if (!queue->queue_pages) {
> -		ehca_gen_err("Couldn't allocate queue page list");
> -		return 0;
> +		queue->queue_pages = vmalloc(nr_of_pages * sizeof(void *));
> +		if (!queue->queue_pages) {
> +			ehca_gen_err("Couldn't allocate queue page list");
> +			return 0;
> +		}
>  	}
>  	memset(queue->queue_pages, 0, nr_of_pages * sizeof(void *));
>  
> @@ -240,7 +243,10 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
>  ipz_queue_ctor_exit0:
>  	ehca_gen_err("Couldn't alloc pages queue=%p "
>  		 "nr_of_pages=%x",  queue, nr_of_pages);
> -	kfree(queue->queue_pages);
> +	if (is_vmalloc_addr(queue->queue_pages))
> +		vfree(queue->queue_pages);
> +	else
> +		kfree(queue->queue_pages);
>  
>  	return 0;
>  }
> @@ -262,7 +268,10 @@ int ipz_queue_dtor(struct ehca_pd *pd, struct ipz_queue *queue)
>  			free_page((unsigned long)queue->queue_pages[i]);
>  	}
>  
> -	kfree(queue->queue_pages);
> +	if (is_vmalloc_addr(queue->queue_pages))
> +		vfree(queue->queue_pages);
> +	else
> +		kfree(queue->queue_pages);
>  
>  	return 1;
>  }
>   

Regards Michael

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-22 14:10     ` michael
@ 2009-04-22 16:00       ` Stefan Roscher
  2009-04-22 16:12         ` michael
  0 siblings, 1 reply; 12+ messages in thread
From: Stefan Roscher @ 2009-04-22 16:00 UTC (permalink / raw)
  To: michael
  Cc: Roland Dreier, fenkes, LKML, OF-EWG, LinuxPPC-Dev, raisch,
	alexschm, stefan.roscher

On Wednesday 22 April 2009 04:10:18 pm michael wrote:
> Hi,
> 

> I don't take the point, if it is not import use the vmalloc. Why you try 
> with a kmalloc
> alloc first? and why do not use kzalloc?

Because kmalloc() is faster than vmalloc() causing a huge performance win
when someone allocates a large number of queue pairs. We fall back to
vmalloc() only if kmalloc() can't deliver the memory chunk.
We don't need kzalloc because we fill the list right after the alloc.

regards Stefan


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-22 16:00       ` Stefan Roscher
@ 2009-04-22 16:12         ` michael
  0 siblings, 0 replies; 12+ messages in thread
From: michael @ 2009-04-22 16:12 UTC (permalink / raw)
  To: Stefan Roscher
  Cc: Roland Dreier, fenkes, LKML, OF-EWG, LinuxPPC-Dev, raisch,
	alexschm, stefan.roscher

Hi,

Stefan Roscher wrote:
> On Wednesday 22 April 2009 04:10:18 pm michael wrote:
>   
>> Hi,
>>
>>     
>
>   
>> I don't take the point, if it is not import use the vmalloc. Why you try 
>> with a kmalloc
>> alloc first? and why do not use kzalloc?
>>     
>
> Because kmalloc() is faster than vmalloc() causing a huge performance win
> when someone allocates a large number of queue pairs. We fall back to
> vmalloc() only if kmalloc() can't deliver the memory chunk.
>   
Sorry I catch later the performace issue.
> We don't need kzalloc because we fill the list right after the alloc.
>
> regards Stefan
>   
Regards Michael
> _______________________________________________
> Linuxppc-dev mailing list
> Linuxppc-dev@ozlabs.org
> https://ozlabs.org/mailman/listinfo/linuxppc-dev
>
>   


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [ewg] Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-22 14:02   ` Stefan Roscher
  2009-04-22 14:10     ` michael
@ 2009-04-28 13:07     ` Alexander Schmidt
  2009-04-28 14:01       ` Roland Dreier
  1 sibling, 1 reply; 12+ messages in thread
From: Alexander Schmidt @ 2009-04-28 13:07 UTC (permalink / raw)
  To: Stefan Roscher, Roland Dreier
  Cc: fenkes, LKML, OF-EWG, LinuxPPC-Dev, raisch, alexschm,
	stefan.roscher

Hi Roland,

did you have a chance to take a look at the patchset and will you apply it, or
are there any outstanding issues we need to address?

Regards,
Alex

On Wed, 22 Apr 2009 16:02:28 +0200
Stefan Roscher <ossrosch@linux.vnet.ibm.com> wrote:

> In case of large queue pairs there is the possibillity of allocation failures 
> due to memory fragmentationo with kmalloc().To ensure the memory is allocated even
> if kmalloc() can not find chunks which are big enough, we try to allocate the memory
> with vmalloc().
> 
> Signed-off-by: Stefan Roscher <stefan.roscher@de.ibm.com>
> ---
> 
> On Tuesday 21 April 2009 07:34:30 pm Roland Dreier wrote:
> >  > +	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);
> > 
> > How big might this buffer be?  Any chance of allocation failure due to
> > memory fragmentation?
> > 
> >  - R.
> Hey Roland, 
> yes you are right and here is the patch to circumvent the described problem.
> It will apply on top of the patchset.
> regards Stefan
> 
> 
> 
>  drivers/infiniband/hw/ehca/ipz_pt_fn.c |   17 +++++++++++++----
>  1 files changed, 13 insertions(+), 4 deletions(-)
> 
> diff --git a/drivers/infiniband/hw/ehca/ipz_pt_fn.c b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
> index a260559..1227c59 100644
> --- a/drivers/infiniband/hw/ehca/ipz_pt_fn.c
> +++ b/drivers/infiniband/hw/ehca/ipz_pt_fn.c
> @@ -222,8 +222,11 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
>  	/* allocate queue page pointers */
>  	queue->queue_pages = kmalloc(nr_of_pages * sizeof(void *), GFP_KERNEL);
>  	if (!queue->queue_pages) {
> -		ehca_gen_err("Couldn't allocate queue page list");
> -		return 0;
> +		queue->queue_pages = vmalloc(nr_of_pages * sizeof(void *));
> +		if (!queue->queue_pages) {
> +			ehca_gen_err("Couldn't allocate queue page list");
> +			return 0;
> +		}
>  	}
>  	memset(queue->queue_pages, 0, nr_of_pages * sizeof(void *));
> 
> @@ -240,7 +243,10 @@ int ipz_queue_ctor(struct ehca_pd *pd, struct ipz_queue *queue,
>  ipz_queue_ctor_exit0:
>  	ehca_gen_err("Couldn't alloc pages queue=%p "
>  		 "nr_of_pages=%x",  queue, nr_of_pages);
> -	kfree(queue->queue_pages);
> +	if (is_vmalloc_addr(queue->queue_pages))
> +		vfree(queue->queue_pages);
> +	else
> +		kfree(queue->queue_pages);
> 
>  	return 0;
>  }
> @@ -262,7 +268,10 @@ int ipz_queue_dtor(struct ehca_pd *pd, struct ipz_queue *queue)
>  			free_page((unsigned long)queue->queue_pages[i]);
>  	}
> 
> -	kfree(queue->queue_pages);
> +	if (is_vmalloc_addr(queue->queue_pages))
> +		vfree(queue->queue_pages);
> +	else
> +		kfree(queue->queue_pages);
> 
>  	return 1;
>  }

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [ewg] Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-28 13:07     ` [ewg] " Alexander Schmidt
@ 2009-04-28 14:01       ` Roland Dreier
  2009-04-28 14:13         ` Alexander Schmidt
  0 siblings, 1 reply; 12+ messages in thread
From: Roland Dreier @ 2009-04-28 14:01 UTC (permalink / raw)
  To: Alexander Schmidt
  Cc: Stefan Roscher, fenkes, LKML, OF-EWG, LinuxPPC-Dev, raisch,
	alexschm, stefan.roscher

 > did you have a chance to take a look at the patchset and will you apply it, or
 > are there any outstanding issues we need to address?

I guess it's OK, but definitely 2.6.31 material.  I guess I'll stick it
linux-next soon.

 - R.

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [ewg] Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-28 14:01       ` Roland Dreier
@ 2009-04-28 14:13         ` Alexander Schmidt
  0 siblings, 0 replies; 12+ messages in thread
From: Alexander Schmidt @ 2009-04-28 14:13 UTC (permalink / raw)
  To: Roland Dreier
  Cc: Stefan Roscher, fenkes, LKML, OF-EWG, LinuxPPC-Dev, raisch,
	alexschm, stefan.roscher

On Tue, 28 Apr 2009 07:01:32 -0700
Roland Dreier <rdreier@cisco.com> wrote:

>  > did you have a chance to take a look at the patchset and will you apply it, or
>  > are there any outstanding issues we need to address?
> 
> I guess it's OK, but definitely 2.6.31 material.  I guess I'll stick it
> linux-next soon.
> 
>  - R.

Okay with us, thank you very much!

Alex

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-21 15:16 [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc Stefan Roscher
  2009-04-21 17:34 ` Roland Dreier
@ 2009-04-28 15:12 ` Dave Hansen
  2009-04-28 16:02   ` Stefan Roscher
  2009-04-28 16:45 ` Roland Dreier
  2 siblings, 1 reply; 12+ messages in thread
From: Dave Hansen @ 2009-04-28 15:12 UTC (permalink / raw)
  To: Stefan Roscher
  Cc: LinuxPPC-Dev, LKML, OF-EWG, Roland Dreier, fenkes, raisch,
	alexschm, stefan.roscher, hnguyen

On Tue, 2009-04-21 at 17:16 +0200, Stefan Roscher wrote:
> From: Anton Blanchard <antonb at au1.ibm.com>
> 
> To improve performance of driver ressource allocation,
> replace the vmalloc() call with kmalloc().

Just curious, but how big are these allocations?  Why was vmalloc() even
ever used if we know they'll be small?

-- Dave


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-28 15:12 ` Dave Hansen
@ 2009-04-28 16:02   ` Stefan Roscher
  0 siblings, 0 replies; 12+ messages in thread
From: Stefan Roscher @ 2009-04-28 16:02 UTC (permalink / raw)
  To: Dave Hansen
  Cc: LinuxPPC-Dev, LKML, Roland Dreier, fenkes, raisch, alexschm,
	stefan.roscher, hnguyen

On Tuesday 28 April 2009 05:12:51 pm Dave Hansen wrote:
> On Tue, 2009-04-21 at 17:16 +0200, Stefan Roscher wrote:
> > From: Anton Blanchard <antonb at au1.ibm.com>
> > 
> > To improve performance of driver ressource allocation,
> > replace the vmalloc() call with kmalloc().
> 
> Just curious, but how big are these allocations?  Why was vmalloc() even
> ever used if we know they'll be small?
> 
> -- Dave
> 
> 

The theoretical maximum size can be 512k, but for common queue pairs 
less than 128k is used.Because of the theoretical maximum we implemented
vmalloc() first, but recognized a huge performance impact.

-- Stefan 


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc
  2009-04-21 15:16 [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc Stefan Roscher
  2009-04-21 17:34 ` Roland Dreier
  2009-04-28 15:12 ` Dave Hansen
@ 2009-04-28 16:45 ` Roland Dreier
  2 siblings, 0 replies; 12+ messages in thread
From: Roland Dreier @ 2009-04-28 16:45 UTC (permalink / raw)
  To: Stefan Roscher
  Cc: LinuxPPC-Dev, LKML, OF-EWG, Roland Dreier, fenkes, raisch,
	alexschm, stefan.roscher, hnguyen

thanks, applied.

 > From: Anton Blanchard <antonb at au1.ibm.com>
 > Signed-off-by: Stefan Roscher <stefan.roscher at de.ibm.com>

please use '@' signs so these are real email addresses.

 - R.

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2009-04-28 16:45 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-04-21 15:16 [PATCH 1/3] IB/ehca: Replace vmalloc with kmalloc Stefan Roscher
2009-04-21 17:34 ` Roland Dreier
2009-04-22 14:02   ` Stefan Roscher
2009-04-22 14:10     ` michael
2009-04-22 16:00       ` Stefan Roscher
2009-04-22 16:12         ` michael
2009-04-28 13:07     ` [ewg] " Alexander Schmidt
2009-04-28 14:01       ` Roland Dreier
2009-04-28 14:13         ` Alexander Schmidt
2009-04-28 15:12 ` Dave Hansen
2009-04-28 16:02   ` Stefan Roscher
2009-04-28 16:45 ` Roland Dreier

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox