From: Ivan Zahariev <famzah@icdsoft.com>
To: linux-ext4@vger.kernel.org
Subject: Ext4 speedup by storing metadata and data on separate devices
Date: Tue, 20 Nov 2012 13:04:20 +0200 [thread overview]
Message-ID: <50AB63B4.1050506@icdsoft.com> (raw)
Hello all,
This suggestion is not about storing the journal on a separate device.
Many of the tasks on an Ext4 file-system require a full or massive scan
of the metadata. A few examples:
- backup: you need to get a list with all "mtime" or "size" changed
files since last backup
- reporting: you need to get a list with all files of a particular
"group" owner ID
- delete: deleting the "/home/$user" of someone with lots of data and files
I know many efforts have been made to make the (meta)data operations
"local" -- this speeds up spinning disks operations a lot, also SSD
ones. However, having the whole metadata on an SSD disk (or a RAID1 of
two such disks) could speed up many common tasks a lot. And the hardware
price for such a benefit is really affordable now.
I see two possible implementations:
1. Re-work the Ext4 metadata operations (that work with inodes, etc) to
read/write on a separate block device.
or
2. Add an option to the "data locality" algorithm to force it to store
all metadata only at the beginning of a device (we can pre-allocate
enough space). We can then transparently map in the DM those blocks to a
separate faster block device, thus making the changes to Ext4 minimal.
Does all this make sense, or I'm missing something obvious?
Thank you for your time.
--
Best regards.
Ivan Zahariev | System Administrator | ICDSoft Ltd.
next reply other threads:[~2012-11-20 11:11 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-11-20 11:04 Ivan Zahariev [this message]
2012-11-20 20:56 ` Ext4 speedup by storing metadata and data on separate devices Andreas Dilger
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50AB63B4.1050506@icdsoft.com \
--to=famzah@icdsoft.com \
--cc=linux-ext4@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).