All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] 2.6 aacraid: Fix for controller load based timeouts
@ 2005-07-08 17:36 Mark Haverkamp
  2005-07-08 17:41 ` Ryan Anderson
  0 siblings, 1 reply; 7+ messages in thread
From: Mark Haverkamp @ 2005-07-08 17:36 UTC (permalink / raw)
  To: James Bottomley; +Cc: Mark Salyzyn, linux-scsi, Martin Drab

Martin Drab found that he could get aacraid timeouts with high load on
his controller / disk drive combinations.  After some experimentation
Mark Salyzyn has come up with a patch to reduce the default max_sectors
to something that will keep the controller from being overloaded and
will eliminate the timeout issues.

Patch against scsi-misc-2.6 git tree.

Signed-off-by: Mark Haverkamp <markh@osdl.org>

Index: scsi-misc-2.6/drivers/scsi/aacraid/aacraid.h
===================================================================
--- scsi-misc-2.6.orig/drivers/scsi/aacraid/aacraid.h	2005-07-08 09:22:20.000000000 -0700
+++ scsi-misc-2.6/drivers/scsi/aacraid/aacraid.h	2005-07-08 09:23:52.000000000 -0700
@@ -15,11 +15,7 @@
 #define AAC_MAX_LUN		(8)
 
 #define AAC_MAX_HOSTPHYSMEMPAGES (0xfffff)
-/*
- *  max_sectors is an unsigned short, otherwise limit is 0x100000000 / 512
- * Linux has starvation problems if we permit larger than 4MB I/O ...
- */
-#define AAC_MAX_32BIT_SGBCOUNT	((unsigned short)8192)
+#define AAC_MAX_32BIT_SGBCOUNT	((unsigned short)512)
 
 /*
  * These macros convert from physical channels to virtual channels
Index: scsi-misc-2.6/drivers/scsi/aacraid/linit.c
===================================================================
--- scsi-misc-2.6.orig/drivers/scsi/aacraid/linit.c	2005-06-27 09:57:38.000000000 -0700
+++ scsi-misc-2.6/drivers/scsi/aacraid/linit.c	2005-07-08 09:23:52.000000000 -0700
@@ -374,7 +374,8 @@
 	else
 		scsi_adjust_queue_depth(sdev, 0, 1);
 
-	if (host->max_sectors < AAC_MAX_32BIT_SGBCOUNT)
+	if (!(((struct aac_dev *)host->hostdata)->adapter_info.options
+	  & AAC_OPT_NEW_COMM))
 		blk_queue_max_segment_size(sdev->request_queue, 65536);
 
 	return 0;

-- 
Mark Haverkamp <markh@osdl.org>


^ permalink raw reply	[flat|nested] 7+ messages in thread
* RE: [PATCH] 2.6 aacraid: Fix for controller load based timeouts
@ 2005-07-08 18:00 Salyzyn, Mark
  2005-07-08 18:22 ` Martin Drab
  0 siblings, 1 reply; 7+ messages in thread
From: Salyzyn, Mark @ 2005-07-08 18:00 UTC (permalink / raw)
  To: ryan, Mark Haverkamp; +Cc: James Bottomley, linux-scsi, Martin Drab

Yes containers will go offline, but this fix is for a recent change to
the driver; if it was 6 months ago, it was a totally different problem.
You probably can resolve your problems by making sure you have the
latest Firmware. I don't believe there are any changes in the driver in
the past 6 months that would have worked around any
Firmware/Hardware/Compatibility issues.

Sadly, anything that goes wrong (including card, power supply, drives)
can cause containers to go offline; it is a pretty generic symptom to a
multitude of possible problems. Martin's initial problems were
associated with using the WD JD drives, which are not compatible with
RAID cards because their internal error recovery paths.

Sincerely -- Mark Salyzyn

-----Original Message-----
From: Ryan Anderson [mailto:ryan@autoweb.net] 
Sent: Friday, July 08, 2005 1:42 PM
To: Mark Haverkamp
Cc: James Bottomley; Salyzyn, Mark; linux-scsi; Martin Drab
Subject: Re: [PATCH] 2.6 aacraid: Fix for controller load based timeouts

On Fri, 2005-07-08 at 10:36 -0700, Mark Haverkamp wrote:
> Martin Drab found that he could get aacraid timeouts with high load on
> his controller / disk drive combinations.  After some experimentation
> Mark Salyzyn has come up with a patch to reduce the default
max_sectors
> to something that will keep the controller from being overloaded and
> will eliminate the timeout issues.

Would hitting this timeout issue cause the container to go offline?

If so, I think this may fix the issues I was having 6 months ago.  (We
ended up taking the aacraid controller out of our production
environment, in frustration.)

I'll try to get some testing time in on this next week, though, the
problems I've run into were very hard to reproduce on demand.

-- 
Ryan Anderson
AutoWeb Communications, Inc.
email: ryan@autoweb.net


^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2005-07-09 10:03 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-07-08 17:36 [PATCH] 2.6 aacraid: Fix for controller load based timeouts Mark Haverkamp
2005-07-08 17:41 ` Ryan Anderson
2005-07-08 17:59   ` Mark Haverkamp
2005-07-08 18:17   ` Martin Drab
2005-07-09 10:03     ` Mark Overmeer
  -- strict thread matches above, loose matches on Subject: below --
2005-07-08 18:00 Salyzyn, Mark
2005-07-08 18:22 ` Martin Drab

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.