public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Mike Christie <michaelc@cs.wisc.edu>
To: Ashutosh Naik <ashutosh.naik@gmail.com>
Cc: linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org,
	open-iscsi@googlegroups.com
Subject: Re: Kernel Crash when using the open-iscsi initiator on 2.6.25.6
Date: Wed, 25 Jun 2008 11:54:41 -0500	[thread overview]
Message-ID: <48627851.9010804@cs.wisc.edu> (raw)
In-Reply-To: <81083a450806242236m62754185t3099c06f9f77676@mail.gmail.com>

Ashutosh Naik wrote:
> Please find the kernel log attached. I was using the open-iscsi
> initiator on kernel 2.6.25.6 with a chelsio iSCSI target and the crash
> happened on the initiator machine.
> 
>  connection5:0: ping timeout of 5 secs expired, last rx 4309640121,
> last ping 4309645121, now 4309650121
>  connection5:0: detected conn error (1011)

This happens when we cannot reach the target for the noop timout and 
interval seconds, which can happen if a cable is unplugged or the 
network is not reach able or is dropping packets.


>  connection5:0: ping timeout of 5 secs expired, last rx 4309652882,
> last ping 4309657882, now 4309662882


However, once it happens we should not report it again like is done 
here. There is something weird there. Do you have the iscsid output? 
Between these two reports of pings timing out is there any messages from 
iscsid about reconnecting?

>  connection5:0: detected conn error (1011)
>  connection5:0: detected conn error (1011)
>  session5: host reset succeeded


And we should not get here. The iscsi driver's scsi command timeout 
handler should prevent the command from firing the scsi eh, because in 
this case we think it is a transport problem.

What version of the iscsi tools are you using? Are they from a distro or 
open-iscsi.org?

Are you running with the iscsi kernel modules from 2.6.25.6, or are you 
using the iscsi modules from the open-iscsi.org website that come with 
the tarball?

Is the kernel a unmodified 2.6.25.6 or does it have some distro patches 
or patches that you have created?


> INFO: task fdisk:5226 blocked for more than 120 seconds.

I think you get this message and what follows, is a result of the above 
problem. While the iscsi initiator is trying to reconnect, IO is queued 
by the scsi layer so fdisk is going to be waiting around until we 
recover or give up.

  reply	other threads:[~2008-06-25 17:19 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-06-25  5:36 Kernel Crash when using the open-iscsi initiator on 2.6.25.6 Ashutosh Naik
2008-06-25 16:54 ` Mike Christie [this message]
2008-06-25 17:35   ` Ashutosh Naik
2008-06-25 17:47     ` Mike Christie

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=48627851.9010804@cs.wisc.edu \
    --to=michaelc@cs.wisc.edu \
    --cc=ashutosh.naik@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-scsi@vger.kernel.org \
    --cc=open-iscsi@googlegroups.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox