All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ivan Zahariev <famzah@icdsoft.com>
To: linux-ext4@vger.kernel.org
Subject: Ext4 speedup by storing metadata and data on separate devices
Date: Tue, 20 Nov 2012 13:04:20 +0200	[thread overview]
Message-ID: <50AB63B4.1050506@icdsoft.com> (raw)

Hello all,

This suggestion is not about storing the journal on a separate device.

Many of the tasks on an Ext4 file-system require a full or massive scan 
of the metadata. A few examples:
- backup: you need to get a list with all "mtime" or "size" changed 
files since last backup
- reporting: you need to get a list with all files of a particular 
"group" owner ID
- delete: deleting the "/home/$user" of someone with lots of data and files

I know many efforts have been made to make the (meta)data operations 
"local" -- this speeds up spinning disks operations a lot, also SSD 
ones. However, having the whole metadata on an SSD disk (or a RAID1 of 
two such disks) could speed up many common tasks a lot. And the hardware 
price for such a benefit is really affordable now.

I see two possible implementations:

1. Re-work the Ext4 metadata operations (that work with inodes, etc) to 
read/write on a separate block device.

or

2. Add an option to the "data locality" algorithm to force it to store 
all metadata only at the beginning of a device (we can pre-allocate 
enough space). We can then transparently map in the DM those blocks to a 
separate faster block device, thus making the changes to Ext4 minimal.

Does all this make sense, or I'm missing something obvious?

Thank you for your time.

-- 
Best regards.
Ivan Zahariev | System Administrator | ICDSoft Ltd.


             reply	other threads:[~2012-11-20 11:11 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-11-20 11:04 Ivan Zahariev [this message]
2012-11-20 20:56 ` Ext4 speedup by storing metadata and data on separate devices Andreas Dilger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=50AB63B4.1050506@icdsoft.com \
    --to=famzah@icdsoft.com \
    --cc=linux-ext4@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.