From: Eric Sandeen <sandeen@redhat.com>
To: "Theodore Ts'o" <tytso@mit.edu>
Cc: Andreas Dilger <adilger@dilger.ca>,
ext4 development <linux-ext4@vger.kernel.org>
Subject: Re: Journal under-reservation bug on first >2G file
Date: Wed, 01 Oct 2014 15:37:17 -0500 [thread overview]
Message-ID: <542C65FD.5040405@redhat.com> (raw)
In-Reply-To: <20141001195954.GD2903@thunk.org>
On 10/1/14 2:59 PM, Theodore Ts'o wrote:
> On Wed, Oct 01, 2014 at 09:43:32AM -0500, Eric Sandeen wrote:
>>> That sounds like a plan. If we only enable it automatically at mount
>>> time (iff we mounted the file system read/write) if any of the ext3 or
>>> ext4 specific features are enabled, that should be completely safe.
>>
>> Ok, so do that, and don't bump the reservations? I suppose
>> the size test & superblock write can be removed, then...
>>
>> This does bug me a little; at one point we were very carefully not
>> enabling any new features by mounting with a new kernel; that was
>> specific to mounting-ext2-with-ext4 etc, but it still feels slightly
>> inconsistent. Although I guess we enable it today by mounting-and-
>> writing-a-big-enough-file.
>
> Yeah, this behaviour was one that dates back a *long* time, before we
> established the rule that we don't enable any new features
> automatically. If this was a new feature, I wouldn't be advocating
> this. But if we change this now, we could introduce a regression, or
> at least a surprising breakage.
>
>> Something like this should fix it too, though, with less unexpected
>> behind-your-back behavior:
>>
>> diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c
>> index 3aa26e9..2f94cd6 100644
>> --- a/fs/ext4/inode.c
>> +++ b/fs/ext4/inode.c
>> @@ -2563,9 +2563,15 @@ retry_grab:
>> * if there is delayed block allocation. But we still need
>> * to journalling the i_disksize update if writes to the end
>> * of file which has an already mapped buffer.
>> + * If this write might need to update the superblock due to the
>> + * filesize adding a new superblock feature flag, add that too.
>> */
>> retry_journal:
>> - handle = ext4_journal_start(inode, EXT4_HT_WRITE_PAGE, 1);
>> + handle = ext4_journal_start(inode, EXT4_HT_WRITE_PAGE,
>> + EXT4_HAS_RO_COMPAT_FEATURE(inode->i_sb,
>> + EXT4_FEATURE_RO_COMPAT_LARGE_FILE) ?
>> + 1 : 2);
>> +
>
> Yes, I suppose that would work as well. It means that file systems
> which don't have LARGE_FILE will waste a bit more space in the
> journal, causing the journal to potentially close prematurely.
>
> The code would be a bit simpler if we removed "set only if i_size has
> gotten too big", and replaced it with a "set it unconditionally at
> mount time". So there are tradeoffs with either approach. At this
> point I'm slightly in favor of enabling it by default if ext4 features
> are enabled, either in the kernel or in the e2fsck. And if we're
> going to do that, doing it in the kernel is more foolproof, and it
> will have the same net result.
Ok. I guess this is only an issue for ext4 - well, at least this specific
issue. Delalloc makes it much different than ext2 & ext3, which reserve quite a
lot more. Whether there's a corner case over there which breaks, I dunno...
So it seems like the simplest test is simply: Are we RW mounted with delalloc?
And if so, update the feature. Seems simpler than mucking with "which features
are unique to ext4"
(because we could be mounting ext3-with-ext4, having no ext4-specific features,
and still hit the problem right? ... test test test ... right.)
I'll whip that up.
Thanks,
-Eric
> - Ted
>
next prev parent reply other threads:[~2014-10-01 20:37 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-09-30 21:10 Journal under-reservation bug on first >2G file Eric Sandeen
2014-09-30 21:22 ` Eric Sandeen
2014-09-30 21:36 ` Andreas Dilger
2014-09-30 22:10 ` Darrick J. Wong
2014-10-01 11:53 ` Theodore Ts'o
2014-10-01 14:43 ` Eric Sandeen
2014-10-01 19:59 ` Theodore Ts'o
2014-10-01 20:37 ` Eric Sandeen [this message]
2014-10-01 22:43 ` Theodore Ts'o
2014-10-02 5:49 ` Eric Sandeen
2014-10-02 11:26 ` Jan Kara
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=542C65FD.5040405@redhat.com \
--to=sandeen@redhat.com \
--cc=adilger@dilger.ca \
--cc=linux-ext4@vger.kernel.org \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.