From: Sunil Mushran <Sunil.Mushran@oracle.com>
To: ocfs2-devel@oss.oracle.com
Subject: [Ocfs2-devel] [PATCH 0/3] Add inode stealing for ocfs2.V1
Date: Fri Feb 22 10:30:01 2008 [thread overview]
Message-ID: <47BF1497.40507@oracle.com> (raw)
In-Reply-To: <47BE89FE.1090204@oracle.com>
True... however, in 1.2 (or anything before 2.6.23) only extent_alloc:0000
is used by all nodes. This was done to avoid deadlocks during truncate.
2.6.23 or 24 onwards Mark added code to allow use of all extent_alloc after
adding code to prevent deadlocks during truncate.
In general, allocations from extent_alloc are not that common as we have
fairly flat trees. If this does become an issue, we will handle it
similarly.
wengang wang wrote:
> not know it clearly, but I remember when extending a file, meta is
> allocated in extent_alloc instead of inode_alloc if necessary(correct
> me if i am wrong).
> if so, do we need to take extent_alloc into consideration as well?
>
> thanks,
> wengang.
>
> Tao Ma wrote:
>> Hi all,
>> This patch set improve the method for inode allocation. Now they
>> are divided into 3 small patches, but I think maybe they can be merged
>> together as one. Any comments are welcomed.
>>
>> In OCFS2, we allocate the inodes from slot specific inode_alloc to avoid
>> inode creation congestion. The local alloc file grows in a large
>> contiguous
>> chunk. As for a 4K bs, it grows 4M every time. So 1024 inodes will be
>> allocated at a time.
>>
>> Over time, if the fs gets fragmented enough(e.g, the user has created
>> many
>> small files and also delete some of them), we can end up in a situation,
>> whereby we cannot extend the inode_alloc as we don't have a large chunk
>> free in the global_bitmap even if df shows few gigs free. More
>> annoying is
>> that this situation will invariably mean that while one cannot create
>> inodes
>> on one node but can from another node. Still more annoying is that an
>> unused
>> slot may have space for plenty of inodes but is unusable as the user
>> may not
>> be mounting as many nodes anymore.
>>
>> This patch series implement a solution which is to steal inodes from
>> another
>> slot. Now the whole inode allocation process looks like this:
>> 1. Allocate from its own inode_alloc:000X
>> 1) If we can reserve, OK.
>> 2) If fails, try to allocate a large chunk and reserve once again.
>> 2. If 1 fails, try to allocate from the last node's inode_alloc. This
>> time,
>> Just try to reserve, we don't go for global_bitmap if this inode also
>> can't allocate the inode.
>> 3. If 2 fails, try the node before it until we reach inode_alloc:0000.
>> In the process, we will skip its own inode_alloc.
>> 4. If 3 fails, try to allocate from its own inode_alloc:000X once
>> again. Here
>> is a chance that the global_bitmap may has a large enough chunk
>> now during
>> the inode iteration process.
>>
>> _______________________________________________
>> Ocfs2-devel mailing list
>> Ocfs2-devel@oss.oracle.com
>> http://oss.oracle.com/mailman/listinfo/ocfs2-devel
>>
>
next prev parent reply other threads:[~2008-02-22 10:30 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-02-22 0:42 [Ocfs2-devel] [PATCH 0/3] Add inode stealing for ocfs2.V1 Tao Ma
2008-02-22 0:48 ` [Ocfs2-devel] [PATCH 1/3] Add a new parameter for ocfs2_reserve_suballoc_bits.V1 Tao Ma
2008-02-22 0:49 ` [Ocfs2-devel] [PATCH 2/3] Add ac_alloc_slot in ocfs2_alloc_context.V1 Tao Ma
2008-02-22 0:49 ` [Ocfs2-devel] [PATCH 3/3] Add inode stealing for ocfs2_reserve_new_inode.V1 Tao Ma
2008-02-22 0:57 ` [Ocfs2-devel] [PATCH 0/3] Add inode stealing for ocfs2.V1 wengang wang
2008-02-22 1:03 ` tao.ma
2008-02-22 1:17 ` wengang wang
2008-02-22 1:26 ` tao.ma
2008-02-22 10:30 ` Sunil Mushran [this message]
2008-02-22 15:09 ` Mark Fasheh
2008-02-22 16:11 ` Tao Ma
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=47BF1497.40507@oracle.com \
--to=sunil.mushran@oracle.com \
--cc=ocfs2-devel@oss.oracle.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.