public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Jens Axboe <axboe@suse.de>
To: "Peter T. Breuer" <ptb@it.uc3m.es>
Cc: linux kernel <linux-kernel@vger.kernel.org>
Subject: Re: end_request error procedure in 2.5?
Date: Mon, 16 Sep 2002 16:55:16 +0200	[thread overview]
Message-ID: <20020916145516.GO12364@suse.de> (raw)
In-Reply-To: <200209161419.g8GEJXF09937@oboe.it.uc3m.es>

On Mon, Sep 16 2002, Peter T. Breuer wrote:
> "A month of sundays ago Jens Axboe wrote:"
> > >  end_request( req, (req->errors == 0) ? 1 : 0 );
> > >  ..
> > > 
> > >  static void end_request(struct request *req, int uptodate) {
> > >  struct bio *bio;
> > >  while ((bio = req->bio) != NULL) {
> > >              blk_finished_io(bio_sectors(bio));
> > >              req->bio = bio->bi_next;
> > >              bio->bi_next = NULL;
> > >              bio_endio(bio, uptodate);
> > >      }
> > >      blk_put_request(req);
> > >  }
> > > 
> > > 
> > > It works fine except on error.  Kernel 2.5.31.  I understand that
> > > put_request adds the request back to a free list (if gotten from there
> > > via get_request).  The request is ordinary, except out of range ...
> > > it's produced by an e2fsck of the device when the device itself is
> > > unformatted, and the out of range request gets passed to the driver and
> > > is errored there, and "kapow" ..
> > 
> > The error is most likely in the driver calling end_that_request_first(),
> 
> Hmmm ... it's not called. The above is exactly all that is called
> and LOCAL_END_REQUEST is set. OK. I see what you are saying. Yes, I
> will direct my attention to that function instead ...
> 
>  ... and yes, I see a possible path in which the queue spinlock may be
> taken twice. OK!
> 
> Thanks!
> 
> > not the function itself. Maybe you can try to do at least some
> > debugging, I hope you are not expecting anyone to be able to help you
> > from the above report.
> 
> !! :-)
> 
> Thanks, yes I know! However, it's taken me about 4 days to get it this
> far. As you know, complete lockups are hard to debug! There's a race
> condition between the printk appearing on the console and the machine
> stopping :-(. 

NMI watchdog is invaluable for this sort of thing, beats printk by a
wide margin :-). Of course that only works of you have the hardware it
works on, but that's what test machines are for.

-- 
Jens Axboe


      reply	other threads:[~2002-09-16 14:50 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-09-16 13:49 end_request error procedure in 2.5? Peter T. Breuer
2002-09-16 13:54 ` Jens Axboe
2002-09-16 14:19   ` Peter T. Breuer
2002-09-16 14:55     ` Jens Axboe [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20020916145516.GO12364@suse.de \
    --to=axboe@suse.de \
    --cc=linux-kernel@vger.kernel.org \
    --cc=ptb@it.uc3m.es \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox