public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Andrew Morton <akpm@linux-foundation.org>
To: Alasdair G Kergon <agk@redhat.com>,
	dm-devel@redhat.com, linux-kernel@vger.kernel.org,
	Heinz Mauelshagen <mauelshagen@redhat.com>,
	Christoph Lameter <clameter@sgi.com>
Subject: Re: [2.6.22 PATCH 23/26] dm delay:
Date: Thu, 10 May 2007 12:02:23 -0700	[thread overview]
Message-ID: <20070510120223.bb4c22df.akpm@linux-foundation.org> (raw)
In-Reply-To: <20070508175410.d5b7e91f.akpm@linux-foundation.org>

On Tue, 8 May 2007 17:54:10 -0700 Andrew Morton <akpm@linux-foundation.org> wrote:

> On Tue, 8 May 2007 20:48:59 +0100
> Alasdair G Kergon <agk@redhat.com> wrote:
> 
> > From: Heinz Mauelshagen <mauelshagen@redhat.com>
> > 
> > New device-mapper target that can delay I/O (for testing).
> > Reads can be separated from writes, redirected to different underlying
> > devices and delayed by differing amounts of time.
> > 
> > ..
> >
> > +/*
> > + * Copyright (C) 2005-2007 Red Hat GmbH
> > + *
> > + * A target that delays reads and/or writes and can send
> > + * them to different devices.
> > + *
> > + * This file is released under the GPL.
> > + */
> > +
> > +#include <linux/module.h>
> > +#include <linux/init.h>
> > +#include <linux/blkdev.h>
> > +#include <linux/bio.h>
> > +#include <linux/slab.h>
> > +
> > +#include "dm.h"
> > +#include "dm-bio-list.h"
> > +
> > +#define DM_MSG_PREFIX "delay"
> > +
> > +struct delay_c {
> > +	struct timer_list delay_timer;
> > +	struct semaphore timer_lock;
> 
> Can we get this converted to a mutex asap, please?
> 
> It's only used in a single place and perhaps we don't need this lock at
> all?
> 
> > +	struct work_struct flush_expired_bios;
> > +	struct list_head delayed_bios;
> > +	atomic_t may_delay;
> > +	mempool_t *delayed_pool;
> > +
> > +	struct dm_dev *dev_read;
> > +	sector_t start_read;
> > +	unsigned read_delay;
> > +	unsigned reads;
> > +
> > +	struct dm_dev *dev_write;
> > +	sector_t start_write;
> > +	unsigned write_delay;
> > +	unsigned writes;
> > +};
> > +
> > +struct delay_info {
> > +	struct delay_c *context;
> > +	struct list_head list;
> > +	struct bio *bio;
> > +	unsigned long expires;
> > +};
> > +
> > +static DEFINE_MUTEX(delayed_bios_lock);
> > +
> > +static struct workqueue_struct *kdelayd_wq;
> > +static struct kmem_cache *delayed_cache;
> > +
> > +static void handle_delayed_timer(unsigned long data)
> > +{
> > +	struct delay_c *dc = (struct delay_c *)data;
> > +
> > +	queue_work(kdelayd_wq, &dc->flush_expired_bios);
> > +}
> > +
> > +static void queue_timeout(struct delay_c *dc, unsigned long expires)
> > +{
> > +	down(&dc->timer_lock);
> > +
> > +	if (!timer_pending(&dc->delay_timer) || expires < dc->delay_timer.expires)
> > +		mod_timer(&dc->delay_timer, expires);
> > +
> > +	up(&dc->timer_lock);
> > +}
> >
> > ...
> >
> > +
> > +	init_timer(&dc->delay_timer);
> > +	dc->delay_timer.function = handle_delayed_timer;
> > +	dc->delay_timer.data = (unsigned long)dc;
> 
> setup_timer() could be used here.
> 
> > +	INIT_WORK(&dc->flush_expired_bios, flush_expired_bios);
> > +	INIT_LIST_HEAD(&dc->delayed_bios);
> > +	init_MUTEX(&dc->timer_lock);
> > +	atomic_set(&dc->may_delay, 1);
> > +
> > +	ti->private = dc;
> > +	return 0;
> > +
> > +bad:
> > +	kfree(dc);
> > +	return -EINVAL;
> > +}
> > +
> >
> > ...
> >
> > +
> > +static int __init dm_delay_init(void)
> > +{
> > +	int r = -ENOMEM;
> > +
> > +	kdelayd_wq = create_workqueue("kdelayd");
> > +	if (!kdelayd_wq) {
> > +		DMERR("Couldn't start kdelayd");
> > +		goto bad_queue;
> > +	}
> 
> Do we really really need one instance of kdelayd on each CPU?
> 
> I suspect a single-threaded workqueue would suffice here.  We have a big
> global lock in this driver anyway....
> 
> Probably create_workqueue() should have defaulted to single-threaded on day
> one.  Oh well.
> 
> 
> > +	delayed_cache = kmem_cache_create("dm-delay",
> > +					  sizeof(struct delay_info),
> > +					  __alignof__(struct delay_info),
> > +					  0, NULL, NULL);
> 
> We have an ugly^Wnice new KMEM_CACHE macro for this now.
> 
> What's that __alignof__ doing in there?
> 
> > +	if (!delayed_cache) {
> > +		DMERR("Couldn't create delayed bio cache.");
> > +		goto bad_memcache;
> > +	}
> > +
> > +	r = dm_register_target(&delay_target);
> > +	if (r < 0) {
> > +		DMERR("register failed %d", r);
> > +		goto bad_register;
> > +	}
> > +
> > +	return 0;
> > +
> > +bad_register:
> > +	kmem_cache_destroy(delayed_cache);
> > +bad_memcache:
> > +	destroy_workqueue(kdelayd_wq);
> > +bad_queue:
> > +	return r;
> > +}
> > +
> 

Guys, could I please get some response to this before I forget about it? 
The new per-cpu kernel thread problem is particularly important.  We already
fail to boot on 4096-CPU due to running out of pids.

Thanks.


  reply	other threads:[~2007-05-10 19:02 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-05-08 19:48 [2.6.22 PATCH 23/26] dm delay: Alasdair G Kergon
2007-05-09  0:54 ` Andrew Morton
2007-05-10 19:02   ` Andrew Morton [this message]
2007-05-10 19:04     ` Christoph Lameter
2007-05-10 19:37       ` Andrew Morton

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070510120223.bb4c22df.akpm@linux-foundation.org \
    --to=akpm@linux-foundation.org \
    --cc=agk@redhat.com \
    --cc=clameter@sgi.com \
    --cc=dm-devel@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mauelshagen@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox