public inbox for linux-scsi@vger.kernel.org
* Kernel panic - Ththththaats all folks
@ 2003-09-05 14:01 Sulamita Garcia
  2003-09-05 14:37 ` Alan Cox
  0 siblings, 1 reply; 7+ messages in thread
From: Sulamita Garcia @ 2003-09-05 14:01 UTC (permalink / raw)
  To: linux-scsi

Hi

I have a RocketRAID 404 (HPT374) controller on a Red Hat 9 box.
I compiled a fresh 2.4.22 kernel because of errors I was getting
with the default 2.4.20 from the installation, and rebuilt the
module from source because the stock kernel module didn't work.
Now I have a hardware RAID 0, which is an 800 GB partition. I
formatted it as ext3, and for a while it was fine.
But I am doing this for a high-availability cluster with drbd
(mirroring), and I can't sync the machines. I get this kernel
panic:

Incorrect number of segments after building list
nr_segments is 10
counted segments is 1
Flags 0 0
Kernel Panic: Ththththaats all folks. To dangerous to
continue
In interrupt handler - not syncing.

I found this message in scsi_merge, and I don't know
what it means. Is it a kernel problem, a SCSI problem,
a chipset problem, or what?

Thanks

Sulamita Garcia


_______________________________________________________________________
Desafio AntiZona: participe do jogo de perguntas e respostas que vai
dar um Renault Clio, computadores, câmeras digitais, videogames e muito
mais! www.cade.com.br/antizona
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


* Re: Kernel panic - Ththththaats all folks
  2003-09-05 14:01 Sulamita Garcia
@ 2003-09-05 14:37 ` Alan Cox
  2003-09-05 14:52   ` Sulamita Garcia
  2003-09-06 20:00   ` Sulamita Garcia
  0 siblings, 2 replies; 7+ messages in thread
From: Alan Cox @ 2003-09-05 14:37 UTC (permalink / raw)
  To: Sulamita Garcia; +Cc: linux-scsi

On Fri, 2003-09-05 at 15:01, Sulamita Garcia wrote:
> installation default, and rebuild module from source
> because kernel module didn't work. Now I have a Raid 0
> by hardware

Problems with HPT's partially binary module should be directed to
HighPoint, not to this list. Looking at the trace, it's almost certainly
a bug in their alternative software RAID driver.

Alan


* Re: Kernel panic - Ththththaats all folks
  2003-09-05 14:37 ` Alan Cox
@ 2003-09-05 14:52   ` Sulamita Garcia
  2003-09-05 21:38     ` Alan Cox
  2003-09-06 20:00   ` Sulamita Garcia
  1 sibling, 1 reply; 7+ messages in thread
From: Sulamita Garcia @ 2003-09-05 14:52 UTC (permalink / raw)
  To: linux-scsi

 --- Alan Cox <alan@lxorguk.ukuu.org.uk> wrote:
> On Fri, 2003-09-05 at 15:01, Sulamita Garcia wrote:
> > installation default, and rebuild module from source
> > because kernel module didn't work. Now I have a Raid 0
> > by hardware
>
> Problems with HPT's partially binary module should be directed to
> HighPoint, not to this list. Looking at the trace, it's almost certainly
> a bug in their alternative software RAID driver.

But the module that raised the panic was scsi (the panic
message is in scsi_merge). I just want to know why.

> 
> Alan 



* Re: Kernel panic - Ththththaats all folks
@ 2003-09-05 20:47 Martin Peschke3
  0 siblings, 0 replies; 7+ messages in thread
From: Martin Peschke3 @ 2003-09-05 20:47 UTC (permalink / raw)
  To: Sulamita Garcia; +Cc: linux-scsi


We have also seen that panic sporadically with the zfcp LLDD
on zSeries. We tracked it down to a situation where a data buffer
is only partially transferred and the request is requeued to
retrieve the remaining data. In that case the segments need to be
recounted before the request is added to the block layer queue again.
The following patch moves the segment recounting accordingly.
Could you give it a try?

regards,
Martin


--- scsi_lib.c  26 Oct 2001 18:00:38 -0000      1.13
+++ scsi_lib.c  17 Apr 2002 15:58:05 -0000      1.14
@@ -262,6 +262,15 @@
                 * the bad sector.
                 */
                SCpnt->request.special = (void *) SCpnt;
+                /*
+                 * We need to recount the number of
+                 * scatter-gather segments here - the
+                 * normal case code assumes this to be
+                 * correct, as it would be a performance
+                 * loss to always recount.  Handling
+                 * errors is always unusual, of course.
+                 */
+                recount_segments(SCpnt);
                list_add(&SCpnt->request.queue, &q->queue_head);
        }

@@ -968,15 +977,6 @@
                         */
                        if( req->special != NULL ) {
                                SCpnt = (Scsi_Cmnd *) req->special;
-                               /*
-                                * We need to recount the number of
-                                * scatter-gather segments here - the
-                                * normal case code assumes this to be
-                                * correct, as it would be a performance
-                                * lose to always recount.  Handling
-                                * errors is always unusual, of course.
-                                */
-                               recount_segments(SCpnt);
                        } else {
                                SCpnt = scsi_allocate_device(SDpnt, FALSE, FALSE);
                        }
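
The reasoning behind the patch can be illustrated with a small user-space model. This is only a sketch under stated assumptions, not the kernel code: `struct buf`, `struct request_model`, `recount_segments_model()`, `complete_partial()` and `requeue()` are hypothetical names invented for the illustration, and the segment rule used here (physically adjacent buffers share one segment) is a simplification of what the real block layer checks. It shows why a cached segment count goes stale after a partial transfer and why recounting at requeue time restores consistency.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for one data buffer of a request. */
struct buf {
    unsigned long phys;   /* physical start address */
    unsigned long len;    /* length in bytes */
};

/* Hypothetical, heavily reduced model of a queued request. */
struct request_model {
    const struct buf *bufs;   /* buffers still to be transferred */
    size_t nbufs;             /* how many of them remain */
    int nr_segments;          /* cached scatter-gather segment count */
};

/* Walk the remaining buffers; physically adjacent ones share a segment. */
static int recount_segments_model(const struct request_model *req)
{
    int count = 0;

    for (size_t i = 0; i < req->nbufs; i++) {
        if (i == 0 ||
            req->bufs[i - 1].phys + req->bufs[i - 1].len != req->bufs[i].phys)
            count++;
    }
    return count;
}

/*
 * Partial completion: drop the buffers that were already transferred.
 * The cached nr_segments is deliberately left untouched, so it is
 * stale once this function returns.
 */
static void complete_partial(struct request_model *req, size_t done)
{
    assert(done <= req->nbufs);
    req->bufs += done;
    req->nbufs -= done;
}

/*
 * Requeue in the spirit of the patch: refresh the cached count
 * before the request would go back on the queue.
 */
static void requeue(struct request_model *req)
{
    req->nr_segments = recount_segments_model(req);
    /* a real mid-layer would now put the request back on its queue */
}
```

In this model, a request whose cached count was correct at submission carries a wrong count after `complete_partial()`, and any consistency check that runs before `requeue()` refreshes it would see a mismatch of the kind reported in the panic.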





* Re: Kernel panic - Ththththaats all folks
  2003-09-05 14:52   ` Sulamita Garcia
@ 2003-09-05 21:38     ` Alan Cox
  0 siblings, 0 replies; 7+ messages in thread
From: Alan Cox @ 2003-09-05 21:38 UTC (permalink / raw)
  To: Sulamita Garcia; +Cc: linux-scsi

On Fri, 2003-09-05 at 15:52, Sulamita Garcia wrote:
> > Problems with HPT's partially binary module should be directed to
> > HighPoint, not to this list. Looking at the trace, it's almost certainly
> > a bug in their alternative software RAID driver.
>
> But the module that raised the panic was scsi (the panic
> message is in scsi_merge). I just want to know why.

Ask HPT. Only they have the code for both parts. The panic
indicates which bit of code spotted a problem, not where the bug is.
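
The distinction between the code that detects a problem and the code that causes it can be sketched in a few lines of C. This is an illustrative user-space model only: `buggy_driver_fill()`, `check_segments()` and the structures are hypothetical, and the sketch makes no claim about what the HPT driver actually does. One component records a wrong segment count, and a separate consistency check is merely the place where the mismatch becomes visible.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for one data buffer of a request. */
struct buf {
    unsigned long phys;   /* physical start address */
    unsigned long len;    /* length in bytes */
};

/* Hypothetical, heavily reduced model of a request. */
struct request_model {
    const struct buf *bufs;
    size_t nbufs;
    int nr_segments;      /* count recorded by whoever built the request */
};

/* Count segments: physically adjacent buffers merge into one segment. */
static int count_segments(const struct buf *bufs, size_t n)
{
    int count = 0;

    for (size_t i = 0; i < n; i++) {
        if (i == 0 || bufs[i - 1].phys + bufs[i - 1].len != bufs[i].phys)
            count++;
    }
    return count;
}

/*
 * Hypothetical component containing the actual bug: it records one
 * segment per buffer and ignores that adjacent buffers merge.
 */
static void buggy_driver_fill(struct request_model *req,
                              const struct buf *bufs, size_t n)
{
    req->bufs = bufs;
    req->nbufs = n;
    req->nr_segments = (int)n;   /* wrong whenever buffers are adjacent */
}

/*
 * Separate consistency check, standing in for the code that prints the
 * panic. It returns 0 when the counts agree and -1 when they do not.
 * It can say that something is wrong, not who wrote the bad value.
 */
static int check_segments(const struct request_model *req)
{
    assert(req->bufs != NULL || req->nbufs == 0);
    return count_segments(req->bufs, req->nbufs) == req->nr_segments ? 0 : -1;
}
```

With ten physically contiguous buffers, this model records 10 segments while the check counts 1, and the failure surfaces inside `check_segments()` even though the faulty logic lives in `buggy_driver_fill()`.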



* Re: Kernel panic - Ththththaats all folks
  2003-09-05 14:37 ` Alan Cox
  2003-09-05 14:52   ` Sulamita Garcia
@ 2003-09-06 20:00   ` Sulamita Garcia
  1 sibling, 0 replies; 7+ messages in thread
From: Sulamita Garcia @ 2003-09-06 20:00 UTC (permalink / raw)
  To: linux-scsi

On Friday 05 September 2003 11:37, Alan Cox wrote:
> On Fri, 2003-09-05 at 15:01, Sulamita Garcia wrote:
> > installation default, and rebuild module from source
> > because kernel module didn't work. Now I have a Raid 0
> > by hardware
>
> Problems with HPT's partially binary module should be directed to
> HighPoint, not to this list. Looking at the trace, it's almost certainly
> a bug in their alternative software RAID driver.
>
> Alan

I found the error. It was the alternative Linux distribution that I used.
I deleted Red Hat and installed Slackware 9.0, and everything is working...
Cheers,

-- 

  °v°  Sulamita Garcia
 /(_)\ High Availability
  ^ ^  http://ha.underlinux.com.br
 Slackware - Because It Works       Linuxchix-BR - http://br.linuxchix.org/
 Linux Registered User 240300         "Good girls go to heaven.
 BSD050867                                   Bad girls go out and fight."



* Re: Kernel panic - Ththththaats all folks
@ 2003-09-11 20:03 Pete Zaitcev
  0 siblings, 0 replies; 7+ messages in thread
From: Pete Zaitcev @ 2003-09-11 20:03 UTC (permalink / raw)
  To: sulagarcia; +Cc: linux-scsi

Hi, guys,

What was the outcome of testing Martin's patch,
which moves recount_segments()? I would like to know:
 a) Did it help Sulamita?
 b) Did anyone try it on a previously good x86 system to check for regressions?

Thanks,
-- Pete


end of thread

Thread overview: 7+ messages
2003-09-11 20:03 Kernel panic - Ththththaats all folks Pete Zaitcev
  -- strict thread matches above, loose matches on Subject: below --
2003-09-05 20:47 Martin Peschke3
2003-09-05 14:01 Sulamita Garcia
2003-09-05 14:37 ` Alan Cox
2003-09-05 14:52   ` Sulamita Garcia
2003-09-05 21:38     ` Alan Cox
2003-09-06 20:00   ` Sulamita Garcia
