* Kernel panic - Ththththaats all folks
@ 2003-09-05 14:01 Sulamita Garcia
2003-09-05 14:37 ` Alan Cox
0 siblings, 1 reply; 7+ messages in thread
From: Sulamita Garcia @ 2003-09-05 14:01 UTC (permalink / raw)
To: linux-scsi
Hi
I have a RocketRAID 404 (HPT374) controller adapter on
a RedHat 9.0 box. I compiled a new fresh kernel
2.4.22, because errors I get with 2.4.20 from
instalation default, and rebuild module from source
because kernel module didn't work. Now I have a Raid 0
by hardware, which is a 800Gb partition. I formated
this as ext3, and for while, was good.
But I doing this for a high availabity cluster, with
drbd (mirroring). And I can't sync machines, I get
this kernel panic:
Incorrect number of segments after building list
nr_segments is 10
counted segments is 1
Flags 0 0
Kernel Panic: Ththththaats all folks. To dangerous to
continue
In interrupt handler - not syncing.
I found this message in scsi_merge, and I don't know
that did it mean. It's kernel problem, scsi problem,
chipset problem, or what?
Thanks
Sulamita Garcia
_______________________________________________________________________
Desafio AntiZona: participe do jogo de perguntas e respostas que vai
dar um Renault Clio, computadores, câmeras digitais, videogames e muito
mais! www.cade.com.br/antizona
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Kernel panic - Ththththaats all folks
2003-09-05 14:01 Sulamita Garcia
@ 2003-09-05 14:37 ` Alan Cox
2003-09-05 14:52 ` Sulamita Garcia
2003-09-06 20:00 ` Sulamita Garcia
0 siblings, 2 replies; 7+ messages in thread
From: Alan Cox @ 2003-09-05 14:37 UTC (permalink / raw)
To: Sulamita Garcia; +Cc: linux-scsi
On Gwe, 2003-09-05 at 15:01, Sulamita Garcia wrote:
> instalation default, and rebuild module from source
> because kernel module didn't work. Now I have a Raid 0
> by hardware
Problems with HPT's partially binary module should be directed to
highpoint not to this list. Looking at the trace its almost certainly
a bug in their alternative software raid driver.
Alan
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Kernel panic - Ththththaats all folks
2003-09-05 14:37 ` Alan Cox
@ 2003-09-05 14:52 ` Sulamita Garcia
2003-09-05 21:38 ` Alan Cox
2003-09-06 20:00 ` Sulamita Garcia
1 sibling, 1 reply; 7+ messages in thread
From: Sulamita Garcia @ 2003-09-05 14:52 UTC (permalink / raw)
To: linux-scsi
--- Alan Cox <alan@lxorguk.ukuu.org.uk> escreveu: >
On Gwe, 2003-09-05 at 15:01, Sulamita Garcia wrote:
> > instalation default, and rebuild module from
> source
> > because kernel module didn't work. Now I have a
> Raid 0
> > by hardware
>
> Problems with HPT's partially binary module should
> be directed to
> highpoint not to this list. Looking at the trace its
> almost certainly
> a bug in their alternative software raid driver.
But the module who did the panic was scsi(the panic
error is in scsi_merge). I just want know why.
>
> Alan
_______________________________________________________________________
Desafio AntiZona: participe do jogo de perguntas e respostas que vai
dar um Renault Clio, computadores, câmeras digitais, videogames e muito
mais! www.cade.com.br/antizona
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Kernel panic - Ththththaats all folks
@ 2003-09-05 20:47 Martin Peschke3
0 siblings, 0 replies; 7+ messages in thread
From: Martin Peschke3 @ 2003-09-05 20:47 UTC (permalink / raw)
To: Sulamita Garcia; +Cc: linux-scsi
We have also seen that panic sporadically with the zfcp lldd
on zSeries. We tracked it down as a situation when a data buffer
is only partially transferred and the request is being requeued
to retrieve the remaining data. Then segment counting needs to
be done before adding a request to the block layer queue again.
The following patch moves segment recounting.
Could you give it a try?
regards,
Martin
--- scsi_lib.c 26 Oct 2001 18:00:38 -0000 1.13
+++ scsi_lib.c 17 Apr 2002 15:58:05 -0000 1.14
@@ -262,6 +262,15 @@
* the bad sector.
*/
SCpnt->request.special = (void *) SCpnt;
+ /*
+ * We need to recount the number of
+ * scatter-gather segments here - the
+ * normal case code assumes this to be
+ * correct, as it would be a performance
+ * loss to always recount. Handling
+ * errors is always unusual, of course.
+ */
+ recount_segments(SCpnt);
list_add(&SCpnt->request.queue, &q->queue_head);
}
@@ -968,15 +977,6 @@
*/
if( req->special != NULL ) {
SCpnt = (Scsi_Cmnd *) req->special;
- /*
- * We need to recount the number of
- * scatter-gather segments here - the
- * normal case code assumes this to be
- * correct, as it would be a performance
- * lose to always recount. Handling
- * errors is always unusual, of course.
- */
- recount_segments(SCpnt);
} else {
SCpnt = scsi_allocate_device(SDpnt, FALSE,
FALSE);
}
Sulamita Garcia <sulagarcia@yahoo.com.br>@vger.kernel.org on 05/09/2003
11:01:00
Sent by: linux-scsi-owner@vger.kernel.org
To: linux-scsi@vger.kernel.org
cc:
Subject: Kernel panic - Ththththaats all folks
Hi
I have a RocketRAID 404 (HPT374) controller adapter on
a RedHat 9.0 box. I compiled a new fresh kernel
2.4.22, because errors I get with 2.4.20 from
instalation default, and rebuild module from source
because kernel module didn't work. Now I have a Raid 0
by hardware, which is a 800Gb partition. I formated
this as ext3, and for while, was good.
But I doing this for a high availabity cluster, with
drbd (mirroring). And I can't sync machines, I get
this kernel panic:
Incorrect number of segments after building list
nr_segments is 10
counted segments is 1
Flags 0 0
Kernel Panic: Ththththaats all folks. To dangerous to
continue
In interrupt handler - not syncing.
I found this message in scsi_merge, and I don't know
that did it mean. It's kernel problem, scsi problem,
chipset problem, or what?
Thanks
Sulamita Garcia
_______________________________________________________________________
Desafio AntiZona: participe do jogo de perguntas e respostas que vai
dar um Renault Clio, computadores, câmeras digitais, videogames e muito
mais! www.cade.com.br/antizona
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Kernel panic - Ththththaats all folks
2003-09-05 14:52 ` Sulamita Garcia
@ 2003-09-05 21:38 ` Alan Cox
0 siblings, 0 replies; 7+ messages in thread
From: Alan Cox @ 2003-09-05 21:38 UTC (permalink / raw)
To: Sulamita Garcia; +Cc: linux-scsi
On Gwe, 2003-09-05 at 15:52, Sulamita Garcia wrote:
> > Problems with HPT's partially binary module should
> > be directed to
> > highpoint not to this list. Looking at the trace its
> > almost certainly
> > a bug in their alternative software raid driver.
>
> But the module who did the panic was scsi(the panic
> error is in scsi_merge). I just want know why.
Ask hpt. Only they have the code for both parts. The panic
indicates which bit of code spotted a problem not where the bug is
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Kernel panic - Ththththaats all folks
2003-09-05 14:37 ` Alan Cox
2003-09-05 14:52 ` Sulamita Garcia
@ 2003-09-06 20:00 ` Sulamita Garcia
1 sibling, 0 replies; 7+ messages in thread
From: Sulamita Garcia @ 2003-09-06 20:00 UTC (permalink / raw)
To: linux-scsi
Em Friday 05 September 2003 11:37, Alan Cox escreveu:
> On Gwe, 2003-09-05 at 15:01, Sulamita Garcia wrote:
> > instalation default, and rebuild module from source
> > because kernel module didn't work. Now I have a Raid 0
> > by hardware
>
> Problems with HPT's partially binary module should be directed to
> highpoint not to this list. Looking at the trace its almost certainly
> a bug in their alternative software raid driver.
>
> Alan
I found the error. Is the arternative Linux distribution that I used.
I delete RedHat and install Slackware 9.0, and everything it's working...
[]'s
--
°v° Sulamita Garcia
/(_)\ Alta Disponibilidade
^ ^ http://ha.underlinux.com.br
Slackware - Because It Works Linuxchix-BR - http://br.linuxchix.org/
Linux Registered User 240300 "Meninas boazinhas vão para o céu.
BSD050867 As más vão à luta."
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Kernel panic - Ththththaats all folks
@ 2003-09-11 20:03 Pete Zaitcev
0 siblings, 0 replies; 7+ messages in thread
From: Pete Zaitcev @ 2003-09-11 20:03 UTC (permalink / raw)
To: sulagarcia; +Cc: linux-scsi
Hi, guys,
what is the outcome of the testing of the Martin's patch,
which moves recount_segments()? I would like to know
a) Did it help Sulamita?
b) Did anyone try it on previously good x86 for regressions?
Thanks,
-- Pete
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2003-09-11 20:04 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-09-11 20:03 Kernel panic - Ththththaats all folks Pete Zaitcev
-- strict thread matches above, loose matches on Subject: below --
2003-09-05 20:47 Martin Peschke3
2003-09-05 14:01 Sulamita Garcia
2003-09-05 14:37 ` Alan Cox
2003-09-05 14:52 ` Sulamita Garcia
2003-09-05 21:38 ` Alan Cox
2003-09-06 20:00 ` Sulamita Garcia
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox