Re: [PATCH] squashfs: enhance parallel I/O

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Minchan Kim <minchan@kernel.org>
To: Phillip Lougher <phillip@lougher.demon.co.uk>
Cc: linux-kernel@vger.kernel.org, ch0.han@lge.com, gunho.lee@lge.com
Subject: Re: [PATCH] squashfs: enhance parallel I/O
Date: Fri, 25 Oct 2013 19:07:44 +0900	[thread overview]
Message-ID: <20131025100744.GA6612@gmail.com> (raw)
In-Reply-To: <52676AF3.8050503@lougher.demon.co.uk>

Hello Phillip,

Sorry for late response. I'm still in Edinburgh.

On Wed, Oct 23, 2013 at 07:21:39AM +0100, Phillip Lougher wrote:
> 
> Hi Minchan,
> 
> Apologies for the lateness of this review, I had forgotten I'd not
> send it.
> 
> Minchan Kim <minchan@kernel.org> wrote:
> >Now squashfs have used for only one stream buffer for decompression
> >so it hurts concurrent read performance so this patch supprts
> >multiple decompressor to enhance performance concurrent I/O.
> >
> >Four 1G file dd read on KVM machine which has 2 CPU and 4G memory.
> >
> >dd if=test/test1.dat of=/dev/null &
> >dd if=test/test2.dat of=/dev/null &
> >dd if=test/test3.dat of=/dev/null &
> >dd if=test/test4.dat of=/dev/null &
> >
> >old : 1m39s -> new : 9s
> >
> >This patch is based on [1].
> >
> >[1] Squashfs: Refactor decompressor interface and code
> >
> >Cc: Phillip Lougher <phillip@squashfs.org.uk>
> >Signed-off-by: Minchan Kim <minchan@kernel.org>
> >
> >---
> >fs/squashfs/Kconfig              |   12 +++
> >fs/squashfs/Makefile             |    9 +-
> >fs/squashfs/decompressor_multi.c |  194 ++++++++++++++++++++++++++++++++++++++
> >3 files changed, 214 insertions(+), 1 deletion(-)
> >create mode 100644 fs/squashfs/decompressor_multi.c
> >
> >diff --git a/fs/squashfs/Kconfig b/fs/squashfs/Kconfig
> >index c70111e..7a580d0 100644
> >--- a/fs/squashfs/Kconfig
> >+++ b/fs/squashfs/Kconfig
> >@@ -63,6 +63,18 @@ config SQUASHFS_LZO
> >
> >	  If unsure, say N.
> >
> >+config SQUASHFS_MULTI_DECOMPRESSOR
> 
> Small alterations to the English, I don't like doing this because
> English is probably a foreign language to you, and my Korean is non-existent!
> but, this is user visible and you did say you wanted review !

Yeah, I'm really welcome to fix my English and it's really power of open source
community where native and non-native people could help each other.

> 
> >+	bool "Use multiple decompressor for handling concurrent I/O"
> 
> bool "Use multiple decompressors for handling parallel I/O"
> 
> Concurrent is subtly different to parallel, and you use parallel in
> the following description.
> 
> 
> >+	depends on SQUASHFS
> >+	help
> >+	  By default Squashfs uses a compressor for data but it gives
> >+	  poor performance on parallel I/O workload of multiple CPU
> >+	  mahchine due to waitting a compressor available.
> >+
> 
> By default Squashfs uses a single decompressor but it gives
> poor performance on parallel I/O workloads when using multiple CPU
> machines due to waiting on decompressor availability.
> 
> >+	  If workload has parallel I/O and your system has enough memory,
> >+	  this option may improve overall I/O performance.
> >+	  If unsure, say N.
> >+
> 
> If you have a parallel I/O workload and your system has enough memory,
> using this option may improve overall I/O performance.
> 
> If unsure, say N.

Will correct.

> >+
> >config SQUASHFS_XZ
> >	bool "Include support for XZ compressed file systems"
> >	depends on SQUASHFS
> >diff --git a/fs/squashfs/Makefile b/fs/squashfs/Makefile
> >index c223c84..dfebc3b 100644
> >--- a/fs/squashfs/Makefile
> >+++ b/fs/squashfs/Makefile
> >@@ -4,8 +4,15 @@
> >
> >obj-$(CONFIG_SQUASHFS) += squashfs.o
> >squashfs-y += block.o cache.o dir.o export.o file.o fragment.o id.o inode.o
> >-squashfs-y += namei.o super.o symlink.o decompressor.o decompressor_single.o
> >+squashfs-y += namei.o super.o symlink.o decompressor.o
> >+
> >squashfs-$(CONFIG_SQUASHFS_XATTR) += xattr.o xattr_id.o
> >squashfs-$(CONFIG_SQUASHFS_LZO) += lzo_wrapper.o
> >squashfs-$(CONFIG_SQUASHFS_XZ) += xz_wrapper.o
> >squashfs-$(CONFIG_SQUASHFS_ZLIB) += zlib_wrapper.o
> >+
> >+ifdef CONFIG_SQUASHFS_MULTI_DECOMPRESSOR
> >+	squashfs-y		+= decompressor_multi.o
> >+else
> >+	squashfs-y		+= decompressor_single.o
> >+endif
> >diff --git a/fs/squashfs/decompressor_multi.c b/fs/squashfs/decompressor_multi.c
> >new file mode 100644
> >index 0000000..23f1e94
> >--- /dev/null
> >+++ b/fs/squashfs/decompressor_multi.c
> >@@ -0,0 +1,194 @@
> >+/*
> >+ *  Copyright (c) 2013
> >+ *  Minchan Kim <minchan@kernel.org>
> >+ *
> >+ *  This work is licensed under the terms of the GNU GPL, version 2. See
> >+ *  the COPYING file in the top-level directory.
> >+ */
> >+#include <linux/types.h>
> >+#include <linux/mutex.h>
> >+#include <linux/slab.h>
> >+#include <linux/buffer_head.h>
> >+#include <linux/sched.h>
> >+#include <linux/wait.h>
> >+#include <linux/cpumask.h>
> >+
> >+#include "squashfs_fs.h"
> >+#include "squashfs_fs_sb.h"
> >+#include "decompressor.h"
> >+#include "squashfs.h"
> >+
> >+/*
> >+ * This file implements multi-threaded decompression in the
> >+ * decompressor framework
> >+ */
> >+
> >+
> >+/*
> >+ * The reason that multiply two is that a CPU can request new I/O
> >+ * while it is waitting previous request.
> 
> s/waitting/waiting/

Oops.

> 
> The English in the comments is understandable, and not user-visible, and
> so I won't change it, except for spelling mistakes ...

I understand you respect my work although it couldn't satisfy your high bar
so I don't care if you correct this. :)

> 
> >+ */
> >+#define MAX_DECOMPRESSOR	(num_online_cpus() * 2)
> >+
> >+
> >+int squashfs_max_decompressors(void)
> >+{
> >+	return MAX_DECOMPRESSOR;
> >+}
> >+
> >+
> >+struct squashfs_stream {
> >+	void			*comp_opts;
> >+	struct list_head	strm_list;
> >+	struct mutex		mutex;
> >+	int			avail_comp;
> >+	wait_queue_head_t	wait;
> >+};
> >+
> >+
> >+struct comp_stream {
> >+	void *stream;
> >+	struct list_head list;
> >+};
> >+
> 
> One general small nitpick, you use "comp" to name things, comp_stream,
> avail_comp etc.  But, as this is a decompressor, it should more correctly
> be decomp...

Will change.

> 
> (note comp_opts is not decomp_opts because it is the compression options
> selected by the user at compression time) ...

True.

> 
> >+
> >+static void put_comp_stream(struct comp_stream *comp_strm,
> >+				struct squashfs_stream *stream)
> >+{
> >+	mutex_lock(&stream->mutex);
> >+	list_add(&comp_strm->list, &stream->strm_list);
> >+	mutex_unlock(&stream->mutex);
> >+	wake_up(&stream->wait);
> >+}
> >+
> >+void *squashfs_decompressor_create(struct squashfs_sb_info *msblk,
> >+				void *comp_opts)
> >+{
> >+	struct squashfs_stream *stream;
> >+	struct comp_stream *comp_strm = NULL;
> >+	int err = -ENOMEM;
> >+
> >+	stream = kzalloc(sizeof(*stream), GFP_KERNEL);
> >+	if (!stream)
> >+		goto out;
> >+
> >+	stream->comp_opts = comp_opts;
> >+	mutex_init(&stream->mutex);
> >+	INIT_LIST_HEAD(&stream->strm_list);
> >+	init_waitqueue_head(&stream->wait);
> >+
> >+	comp_strm = kmalloc(sizeof(*comp_strm), GFP_KERNEL);
> >+	if (!comp_strm)
> >+		goto out;
> >+
> >+	comp_strm->stream = msblk->decompressor->init(msblk,
> >+						stream->comp_opts);
> >+	if (IS_ERR(comp_strm->stream)) {
> >+		err = PTR_ERR(comp_strm->stream);
> >+		goto out;
> >+	}
> >+
> >+	list_add(&comp_strm->list, &stream->strm_list);
> >+	stream->avail_comp = 1;
> 
> I assume you're always creating a decompressor here (rather than
> setting it to 0, and allocating the first one in get_comp_stream) to
> ensure there's always at least one decompressor available going into
> get_comp_stream()?  So if decompressor creation fails in
> get_comp_stream() we can always fall back to using the decompressors
> already allocated, because we know there will always be at least one?

Right.

> 
> Maybe a comment to that effect? To show creating a decompressor here
> and setting this to one is deliberate....  This will prevent others
> from thinking they can "optimise" the code by having the first decompressor
> created in get_comp_stream()!

Indeed. I will add a comment about that. Of course you could feel free to
fix English if it doesn't meet your bar and I will learn English from native
people without any charge. I's really nice thing for me.

> 
> /* ensure we have at least one decompressor in get_comp_stream() */ ?
> 
> >+	return stream;
> >+
> >+out:
> >+	kfree(comp_strm);
> >+	kfree(stream);
> >+	return ERR_PTR(err);
> >+}
> >+
> >+
> >+void squashfs_decompressor_destroy(struct squashfs_sb_info *msblk)
> >+{
> >+	struct squashfs_stream *stream = msblk->stream;
> >+	if (stream) {
> >+		struct comp_stream *comp_strm;
> >+
> >+		while (!list_empty(&stream->strm_list)) {
> >+			comp_strm = list_entry(stream->strm_list.prev,
> >+						struct comp_stream, list);
> >+			list_del(&comp_strm->list);
> >+			msblk->decompressor->free(comp_strm->stream);
> >+			kfree(comp_strm);
> >+			stream->avail_comp--;
> >+		}
> >+	}
> >+
> >+	WARN_ON(stream->avail_comp);
> >+	kfree(stream->comp_opts);
> >+	kfree(stream);
> >+}
> >+
> >+
> >+static struct comp_stream *get_comp_stream(struct squashfs_sb_info *msblk,
> >+					struct squashfs_stream *stream)
> >+{
> >+	struct comp_stream *comp_strm;
> >+
> >+	while (1) {
> >+		mutex_lock(&stream->mutex);
> >+
> >+		/* There is available comp_stream */
> >+		if (!list_empty(&stream->strm_list)) {
> >+			comp_strm = list_entry(stream->strm_list.prev,
> >+				struct comp_stream, list);
> >+			list_del(&comp_strm->list);
> >+			mutex_unlock(&stream->mutex);
> >+			break;
> >+		}
> >+
> >+		/*
> >+		 * If there is no available comp and already full,
> >+		 * let's wait for releasing comp from other users.
> >+		 */
> >+		if (stream->avail_comp >= MAX_DECOMPRESSOR)
> >+			goto wait;
> >+
> >+		/* Let's allocate new comp */
> >+		comp_strm = kmalloc(sizeof(*comp_strm), GFP_KERNEL);
> >+		if (!comp_strm)
> >+			goto wait;
> >+
> >+		comp_strm->stream = msblk->decompressor->init(msblk,
> >+						stream->comp_opts);
> >+		if (IS_ERR(comp_strm->stream)) {
> >+			kfree(comp_strm);
> >+			goto wait;
> >+		}
> >+
> >+		stream->avail_comp++;
> >+		WARN_ON(stream->avail_comp > MAX_DECOMPRESSOR);
> >+
> >+		mutex_unlock(&stream->mutex);
> >+		break;
> >+wait:
> >+		/*
> >+		 * If system memory is tough, let's for other's
> >+		 * releasing instead of hurting VM because it could
> >+		 * make page cache thrashing.
> >+		 */
> 
> 
> >+		mutex_unlock(&stream->mutex);
> >+		wait_event(stream->wait,
> >+			!list_empty(&stream->strm_list));
> >+	}
> >+
> >+	return comp_strm;
> >+}
> >+
> >+
> >+int squashfs_decompress(struct squashfs_sb_info *msblk,
> >+	void **buffer, struct buffer_head **bh, int b, int offset, int length,
> >+	int srclength, int pages)
> >+{
> 
> The interface here is changed by the introduction of the page
> handler abstraction...
> 
> I can apply your patch after the refactoring patch and before the
> "Squashfs: Directly decompress into the page cache for file" patch-set
> and fix up the code here....  Or you can fix up the code?  I'm
> happy to do either, whatever you prefer.

I think this stuff is more simpler than "directly decompress" patch
so I hope let's merge this first before "directly deompress" because
it would make git-bisect/revert more simple if "directly decompress"
patch might make a problem.

I will send fixup patch as soon as I return to office in Korea.

Thanks for the your help!


> 
> Thanks
> 
> Phillip
> 
> 
> >+	int res;
> >+	struct squashfs_stream *stream = msblk->stream;
> >+	struct comp_stream *comp_stream = get_comp_stream(msblk, stream);
> >+	res = msblk->decompressor->decompress(msblk, comp_stream->stream,
> >+		buffer, bh, b, offset, length, srclength, pages);
> >+	put_comp_stream(comp_stream, stream);
> >+	if (res < 0)
> >+		ERROR("%s decompression failed, data probably corrupt\n",
> >+			msblk->decompressor->name);
> >+	return res;
> >+}

-- 
Kind regards,
Minchan Kim

next prev parent reply	other threads:[~2013-10-25 10:07 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-10-23  6:21 [PATCH] squashfs: enhance parallel I/O Phillip Lougher
2013-10-25 10:07 ` Minchan Kim [this message]
  -- strict thread matches above, loose matches on Subject: below --
2013-10-11  0:48 Minchan Kim

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20131025100744.GA6612@gmail.com \
    --to=minchan@kernel.org \
    --cc=ch0.han@lge.com \
    --cc=gunho.lee@lge.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=phillip@lougher.demon.co.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.