From: Howard Chu <hyc@symas.com>
To: Ashish Sangwan <ashishsangwan2@gmail.com>
Cc: "Lukáš Czerner" <lczerner@redhat.com>,
linux-fsdevel <linux-fsdevel@vger.kernel.org>
Subject: Re: truncate head of file?
Date: Wed, 20 Aug 2014 15:02:29 -0700 [thread overview]
Message-ID: <53F51AF5.1050509@symas.com> (raw)
In-Reply-To: <CAOiN93nTXiOJc50fJwRxRZhOR9GbU1th0wAqovnp64Hpc61K8Q@mail.gmail.com>
Ashish Sangwan wrote:
> On Wed, Aug 20, 2014 at 2:03 PM, Howard Chu <hyc@symas.com> wrote:
>> Lukáš Czerner wrote:
>>>
>>> On Wed, 20 Aug 2014, Howard Chu wrote:
>>>
>>>> Date: Wed, 20 Aug 2014 00:00:56 -0700
>>>> From: Howard Chu <hyc@symas.com>
>>>> To: Lukáš Czerner <lczerner@redhat.com>
>>>> Cc: linux-fsdevel <linux-fsdevel@vger.kernel.org>
>>>> Subject: Re: truncate head of file?
>>>>
>>>> Lukáš Czerner wrote:
>>>>>
>>>>> On Tue, 19 Aug 2014, Howard Chu wrote:
>>>>>
>>>>>> Date: Tue, 19 Aug 2014 22:45:16 -0700
>>>>>> From: Howard Chu <hyc@symas.com>
>>>>>> To: linux-fsdevel <linux-fsdevel@vger.kernel.org>
>>>>>> Subject: truncate head of file?
>>>>>>
>>>>>> Was thinking it would be very handy to have a truncate() variant that
>>>>>> deletes
>>>>>> pages from the head of a file. This could be leveraged to make logfiles
>>>>>> easier
>>>>>> to maintain, as an example. Anyone else interested, think this would be
>>>>>> nice
>>>>>> to have?
>>>>>>
>>>>>> (Note - not the same as just punching holes in the beginning of the
>>>>>> file -
>>>>>> we
>>>>>> want the beginning of the file to advance as well, past the deleted
>>>>>> pages.)
>>>>>
>>>>>
>>>>> I am not really sure I understand the behaviour you'd like to see.
>>>>> Can you please explain the behaviour including more concrete use
>>>>> case ?
>>>>
>>>>
>>>> For example - we have a logfile (opened O_APPEND) that grows
>>>> continuously. We
>>>> want to delete some old log info from the head of the file. We could use
>>>> "hole
>>>> punching" to cause a specific range of data to be freed, but that just
>>>> leaves
>>>> a sparse file. If we were to cat this file the read() would have to
>>>> advance
>>>> thru all of that empty space before arriving at actual log data. We want
>>>> both
>>>> the data to be freed and for the logical beginning of the file to be
>>>> moved
>>>> forward, to match the location of where the remaining data begins.
>>>>
>>>> Freeing the space would be simplest if we just deallocate X pages from
>>>> the
>>>> file, and then the beginning of the file becomes the beginning of the
>>>> first
>>>> remaining page of the file.
>>>
>>>
>>> Ok, now I understand. It is exactly what FALLOC_FL_COLLAPSE_RANGE is
>>> for. It has been merged into mainline with commit v3.14-rc1-1-g00f5e61
>>
>>
>> Ah, thanks for the tip.
>>
>>
>>> However it will not work with O_APPEND nor does any other fallocate
>>> flag except pure fallocate, so not even punch hole would have
>>> worked.
>>
>>
>> Hm, that's a little bit inconvenient. What if the process calling
> There seems to be some confusion between per fd flag O_APPEND and
> inode flag S_APPEND.
> Collapse range can work well with O_APPEND flag but it will not work
> if S_APPEND is set.
Ah ok, that's much better.
>> fallocate() is different from the one appending to the log (i.e., using two
>> separate file descriptors)? Also, when a range is collapsed, are all
>> processes with descriptors opened on the file updated - are their seek
>> positions correctly decremented? I dug up the patches and didn't see any
>> code to handle this but may have missed it.
> You are right. Currently seek positions are not updated.
> But this will not be of any problem (at least not on XFS) for your use
> case if file is open with O_APPEND.
> Refer to this discussion:
> https://lkml.org/lkml/2014/2/26/701
Thanks, looks good.
I find it somewhat surprising that such a useful feature was implemented for
the benefit of video editing, when it has massive applicability in databases
(particularly for simplified truncation of write-ahead logs, as well as for
append-only database designs). This is such an obviously beneficial feature
but I haven't seen it talked up very much. Of course, being so specific to
very recent Linux releases will certainly impede its adoption.
--
-- Howard Chu
CTO, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc/
Chief Architect, OpenLDAP http://www.openldap.org/project/
--
To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2014-08-20 22:02 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-08-20 5:45 truncate head of file? Howard Chu
2014-08-20 6:26 ` Randy Dunlap
2014-08-20 6:39 ` Lukáš Czerner
2014-08-20 7:00 ` Howard Chu
2014-08-20 7:31 ` Lukáš Czerner
2014-08-20 8:33 ` Howard Chu
2014-08-20 11:44 ` Ashish Sangwan
2014-08-20 22:02 ` Howard Chu [this message]
2014-08-21 18:17 ` Andreas Dilger
2014-08-21 18:59 ` Howard Chu
2014-08-22 23:22 ` Randy Dunlap
2014-08-29 5:06 ` Howard Chu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=53F51AF5.1050509@symas.com \
--to=hyc@symas.com \
--cc=ashishsangwan2@gmail.com \
--cc=lczerner@redhat.com \
--cc=linux-fsdevel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.