* [parisc-linux] k-2.6.10-rc1-pa3 & c110: high data rate => Kernel panic - ...
@ 2004-10-30 19:58 Joel Soete
2004-10-31 21:11 ` Joel Soete
0 siblings, 1 reply; 2+ messages in thread
From: Joel Soete @ 2004-10-30 19:58 UTC (permalink / raw)
To: James Bottomley, parisc-linux
Hello James,
The serial console pb being fixed, I can now boot again my c110 with recent 2.6.10-rc1-pa3 :)
I do first a 'apt-get dist-upgrade' of the chroot disk which use lasi & 53c700 new driver and I unfortunately encounter numerous
messages (error/warning?):
Oct 30 19:57:33 hpalin kernel: scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG IN
Oct 30 19:57:33 hpalin kernel: scsi1: Bus Reset detected, executing command 2d22a360, slot 2fd6064c, dsp 001d81e8[01e8]
Oct 30 19:57:33 hpalin kernel: failing command because of reset, slot 2fd6064c, cmnd 2d22a360
Oct 30 20:03:53 hpalin kernel: scsi1: (3:0), UNEXPECTED PHASE after command phase (CD BSY REQ CMD_OUT)
Oct 30 20:03:53 hpalin kernel: len = 10, cmd =scsi1 : destination target 3, lun 0
Oct 30 20:03:53 hpalin kernel: command = 0x2a 00 00 0c 09 98 00 04 00 00
Oct 30 20:03:53 hpalin kernel: scsi1: Bus Reset detected, executing command 2d22a8e0, slot 2fd608a4, dsp 001d8210[0210]
Oct 30 20:03:53 hpalin kernel: failing command because of reset, slot 2fd60520, cmnd 2d22ae60
Oct 30 20:03:53 hpalin kernel: failing command because of reset, slot 2fd6064c, cmnd 2d22a780
Oct 30 20:03:53 hpalin kernel: failing command because of reset, slot 2fd60778, cmnd 2fd14b60
Oct 30 20:03:53 hpalin kernel: failing command because of reset, slot 2fd608a4, cmnd 2d22a8e0
Oct 30 20:03:53 hpalin kernel: Incorrect number of segments after building list
Oct 30 20:03:53 hpalin kernel: counted 7, received 6
Oct 30 20:03:53 hpalin kernel: req nr_sec 544, cur_nr_sec 8
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17105
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17106
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17107
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17108
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17109
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17110
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17111
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17112
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17113
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:03:53 hpalin kernel: Buffer I/O error on device sdc5, logical block 17114
Oct 30 20:03:53 hpalin kernel: lost page write due to I/O error on sdc5
Oct 30 20:05:17 hpalin kernel: scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG IN
Oct 30 20:05:17 hpalin kernel: scsi1: Bus Reset detected, executing command 2fd14740, slot 2fd6064c, dsp 001d81e8[01e8]
Oct 30 20:05:17 hpalin kernel: failing command because of reset, slot 2fd6064c, cmnd 2fd14740
Oct 30 20:07:34 hpalin kernel: end_request: I/O error, dev sdc, sector 4139844
Oct 30 20:07:34 hpalin kernel: printk: 58 messages suppressed.
Oct 30 20:07:34 hpalin kernel: Buffer I/O error on device sdc6, logical block 24883
Oct 30 20:07:34 hpalin kernel: lost page write due to I/O error on sdc6
Oct 30 20:07:34 hpalin kernel: Buffer I/O error on device sdc6, logical block 24884
Oct 30 20:07:34 hpalin kernel: lost page write due to I/O error on sdc6
Oct 30 20:07:34 hpalin kernel: Buffer I/O error on device sdc6, logical block 24885
Oct 30 20:07:34 hpalin kernel: lost page write due to I/O error on sdc6
Oct 30 20:07:34 hpalin kernel: Buffer I/O error on device sdc6, logical block 24886
Oct 30 20:07:34 hpalin kernel: lost page write due to I/O error on sdc6
Oct 30 20:07:34 hpalin kernel: Buffer I/O error on device sdc6, logical block 24887
Oct 30 20:07:34 hpalin kernel: lost page write due to I/O error on sdc6
Oct 30 20:12:42 hpalin kernel: scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG IN
Oct 30 20:12:42 hpalin kernel: scsi1: Bus Reset detected, executing command 2fd148a0, slot 2fd60520, dsp 001d81e8[01e8]
Oct 30 20:12:42 hpalin kernel: failing command because of reset, slot 2fd60520, cmnd 2fd148a0
Oct 30 20:12:42 hpalin kernel: failing command because of reset, slot 2fd6064c, cmnd 2fd14060
Oct 30 20:12:42 hpalin kernel: failing command because of reset, slot 2fd60778, cmnd 2d22aba0
Oct 30 20:12:42 hpalin kernel: failing command because of reset, slot 2fd608a4, cmnd 2d22a8e0
Oct 30 20:32:41 hpalin kernel: scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG IN
Oct 30 20:32:41 hpalin kernel: scsi1: Bus Reset detected, executing command 2fd14b60, slot 2fd60778, dsp 001d81e8[01e8]
Oct 30 20:32:41 hpalin kernel: failing command because of reset, slot 2fd60520, cmnd 17cb2340
Oct 30 20:32:41 hpalin kernel: failing command because of reset, slot 2fd60778, cmnd 2fd14b60
Oct 30 20:45:08 hpalin kernel: scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG IN
Oct 30 20:45:08 hpalin kernel: scsi1: Bus Reset detected, executing command 2bdb0340, slot 2fd60778, dsp 001d81e8[01e8]
Oct 30 20:45:08 hpalin kernel: failing command because of reset, slot 2fd60778, cmnd 2bdb0340
Oct 30 20:45:08 hpalin kernel: failing command because of reset, slot 2fd608a4, cmnd 2fd14060
The apt-get operation was near complete:
just a pakage pakage failed to install, a quick check make me appear that postinst and postrm scripts were corrupted :(
But according to previous messages, should I doubt of the intergrity of this chroot disk or only those few files were corrupted?
Secondly the worst thing occured when I try a 'tar cslpf /chroot/Develop/linux-2.6.9-pa1.tar linux-2.6.9-pa1'
(the src linux tree standing on the internal disk using ncr driver and the target file standing on the external disk using 53c700
driver):
numerous messages of style
len = 6, cmd =scsi1 : destination target 3, lun 0
command = 0x2a 00 00 53 3c 66 00 04 00 00
scsi1: Bus Reset detected, executing command 2bc09340, slot 2fd608a4, dsp 001d8] failing command because of reset, slot 2fd60520,
cmnd 2fd14b60
failing command because of reset, slot 2fd6064c, cmnd 2bc094a0
failing command because of reset, slot 2fd60778, cmnd 2bc09b80
failing command because of reset, slot 2fd608a4, cmnd 2bc09340
ending by the fatal:
Kernel panic - not syncing: drivers/parisc/ccio-dma.c: ccio_alloc_range() I/O M.
And at the reboot weird kernel announce:
[...]
53c700: Version 2.8 By James.Bottomley@HansenPartnership.com
scsi1: 53c710 rev 2
scsi1 : LASI SCSI 53c700
scsi1 (0:0) New error handler wants to abort command
scsi1 : destination target 0, lun 0
command = 0x12 00 00 00 24 00
scsi1 (0:0) New error handler wants device reset
scsi1 : destination target 0, lun 0
command = 0x12 00 00 00 24 00
scsi1 (0:0) New error handler wants BUS reset, cmd 2fd2ab60
scsi1 : destination target 0, lun 0
command = 0x12 00 00 00 24 00
scsi1: Bus Reset detected, executing command 2fd2ab60, slot 2fd70520, dsp 001d8] failing command because of reset, slot 2fd70520,
cmnd 2fd2ab60
1:0:0:0: Illegal state transition created->quiesce
Badness in scsi_device_set_state at drivers/scsi/scsi_lib.c:1713
Backtrace:
[<1025de58>] scsi_device_set_state+0xf8/0x1a8
[<1025df1c>] scsi_device_quiesce+0x14/0x64
[<102630ac>] spi_dv_device+0x70/0x1a8
[<102631fc>] spi_dv_device_work_wrapper+0x18/0x3c
[<10139514>] worker_thread+0x1ac/0x278
[<1013e04c>] kthread+0xdc/0xe4
[<1010dc5c>] ret_from_kernel_thread+0x1c/0x24
scsi1 (0:0) New error handler wants HOST reset
scsi1 : destination target 0, lun 0
command = 0x12 00 00 00 24 00
scsi: Device offlined - not ready after error recovery: host 1 channel 0 id 0 l0
Vendor: SEAGATE Model: ST34573W Rev: HP11
Type: Direct-Access ANSI SCSI revision: 02
target1:0:3: Beginning Domain Validation
scsi1: (3:0) Asynchronous
scsi1: (3:0) Enabling Tag Command Queuing
scsi1: (3:0) Synchronous at offset 8, period 100ns
target1:0:3: Domain Validation skipping write tests
target1:0:3: Ending Domain Validation
st: Version 20040403, fixed bufsize 32768, s/g segs 256
[...]
hmm, I try to reproduce the previous panic but I don't reach; this time only one:
scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG IN
scsi1: Bus Reset detected, executing command 2fd2aa00, slot 2fd70778, dsp 001d81e8[01e8]
failing command because of reset, slot 2fd70778, cmnd 2fd2aa00
Thanks in advance for your advise,
Joel
_______________________________________________
parisc-linux mailing list
parisc-linux@lists.parisc-linux.org
http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: [parisc-linux] k-2.6.10-rc1-pa3 & c110: high data rate => Kernel panic - ...
2004-10-30 19:58 [parisc-linux] k-2.6.10-rc1-pa3 & c110: high data rate => Kernel panic - Joel Soete
@ 2004-10-31 21:11 ` Joel Soete
0 siblings, 0 replies; 2+ messages in thread
From: Joel Soete @ 2004-10-31 21:11 UTC (permalink / raw)
To: Joel Soete; +Cc: James Bottomley, parisc-linux
Hello James,
>
> The serial console pb being fixed, I can now boot again my c110 with
> recent 2.6.10-rc1-pa3 :)
>
[...]
>
> ending by the fatal:
> Kernel panic - not syncing: drivers/parisc/ccio-dma.c:
> ccio_alloc_range() I/O M.
>
> And at the reboot weird kernel announce:
> [...]
> 53c700: Version 2.8 By James.Bottomley@HansenPartnership.com
> scsi1: 53c710 rev 2
> scsi1 : LASI SCSI 53c700
> scsi1 (0:0) New error handler wants to abort command
> scsi1 : destination target 0, lun 0
> command = 0x12 00 00 00 24 00
> scsi1 (0:0) New error handler wants device reset
> scsi1 : destination target 0, lun 0
> command = 0x12 00 00 00 24 00
> scsi1 (0:0) New error handler wants BUS reset, cmd 2fd2ab60
> scsi1 : destination target 0, lun 0
> command = 0x12 00 00 00 24 00
> scsi1: Bus Reset detected, executing command 2fd2ab60, slot 2fd70520,
> dsp 001d8] failing command because of reset, slot 2fd70520, cmnd 2fd2ab60
> 1:0:0:0: Illegal state transition created->quiesce
> Badness in scsi_device_set_state at drivers/scsi/scsi_lib.c:1713
> Backtrace:
> [<1025de58>] scsi_device_set_state+0xf8/0x1a8
> [<1025df1c>] scsi_device_quiesce+0x14/0x64
> [<102630ac>] spi_dv_device+0x70/0x1a8
> [<102631fc>] spi_dv_device_work_wrapper+0x18/0x3c
> [<10139514>] worker_thread+0x1ac/0x278
> [<1013e04c>] kthread+0xdc/0xe4
> [<1010dc5c>] ret_from_kernel_thread+0x1c/0x24
>
> scsi1 (0:0) New error handler wants HOST reset
> scsi1 : destination target 0, lun 0
> command = 0x12 00 00 00 24 00
> scsi: Device offlined - not ready after error recovery: host 1 channel 0
> id 0 l0
> Vendor: SEAGATE Model: ST34573W Rev: HP11
> Type: Direct-Access ANSI SCSI revision: 02
> target1:0:3: Beginning Domain Validation
> scsi1: (3:0) Asynchronous
> scsi1: (3:0) Enabling Tag Command Queuing
> scsi1: (3:0) Synchronous at offset 8, period 100ns
> target1:0:3: Domain Validation skipping write tests
> target1:0:3: Ending Domain Validation
> st: Version 20040403, fixed bufsize 32768, s/g segs 256
> [...]
>
> hmm, I try to reproduce the previous panic but I don't reach; this time
> only one:
> scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG IN
> scsi1: Bus Reset detected, executing command 2fd2aa00, slot 2fd70778,
> dsp 001d81e8[01e8]
> failing command because of reset, slot 2fd70778, cmnd 2fd2aa00
>
Just a small update:
test 2.6.10-rc1-pa5 boot fine, but didn't solve lasi scsi ctrl:
scsi1: (3:0) phase mismatch at 01e8, phase IO CD MSG BSY REQ MSG N
scsi1: Bus Reset detected, executing command 26fb04c0, slot 2fd6064c, dsp 001d8]
failing command because of reset, slot 2fd60520, cmnd 2fd19740
failing command because of reset, slot 2fd6064c, cmnd 26fb04c0
failing command because of reset, slot 2fd60778, cmnd 2fd19b60
failing command because of reset, slot 2fd608a4, cmnd 26fb0620
Incorrect number of segments after building list
counted 29, received 28
req nr_sec 1024, cur_nr_sec 8
Buffer I/O error on device sdc9, logical block 44761
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44762
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44763
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44764
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44765
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44766
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44767
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44768
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44769
lost page write due to I/O error on sdc9
Buffer I/O error on device sdc9, logical block 44770
lost page write due to I/O error on sdc9
Incorrect number of segments after building list
counted 22, received 21
req nr_sec 1024, cur_nr_sec 8
Just the ggg work-around (ccio_mem_ratio = 2;) seems to help.
Hth,
Joel
_______________________________________________
parisc-linux mailing list
parisc-linux@lists.parisc-linux.org
http://lists.parisc-linux.org/mailman/listinfo/parisc-linux
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2004-10-31 21:11 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-10-30 19:58 [parisc-linux] k-2.6.10-rc1-pa3 & c110: high data rate => Kernel panic - Joel Soete
2004-10-31 21:11 ` Joel Soete
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.