From: Ivan Zahariev <famzah@icdsoft.com>
To: linux-ext4@vger.kernel.org
Subject: Ext4 speedup by storing metadata and data on separate devices
Date: Tue, 20 Nov 2012 13:04:20 +0200 [thread overview]
Message-ID: <50AB63B4.1050506@icdsoft.com> (raw)
Hello all,
This suggestion is not about storing the journal on a separate device.
Many of the tasks on an Ext4 file-system require a full or massive scan
of the metadata. A few examples:
- backup: you need to get a list with all "mtime" or "size" changed
files since last backup
- reporting: you need to get a list with all files of a particular
"group" owner ID
- delete: deleting the "/home/$user" of someone with lots of data and files
I know many efforts have been made to make the (meta)data operations
"local" -- this speeds up spinning disks operations a lot, also SSD
ones. However, having the whole metadata on an SSD disk (or a RAID1 of
two such disks) could speed up many common tasks a lot. And the hardware
price for such a benefit is really affordable now.
I see two possible implementations:
1. Re-work the Ext4 metadata operations (that work with inodes, etc) to
read/write on a separate block device.
or
2. Add an option to the "data locality" algorithm to force it to store
all metadata only at the beginning of a device (we can pre-allocate
enough space). We can then transparently map in the DM those blocks to a
separate faster block device, thus making the changes to Ext4 minimal.
Does all this make sense, or I'm missing something obvious?
Thank you for your time.
--
Best regards.
Ivan Zahariev | System Administrator | ICDSoft Ltd.
next reply other threads:[~2012-11-20 11:11 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-11-20 11:04 Ivan Zahariev [this message]
2012-11-20 20:56 ` Ext4 speedup by storing metadata and data on separate devices Andreas Dilger
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50AB63B4.1050506@icdsoft.com \
--to=famzah@icdsoft.com \
--cc=linux-ext4@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.