* Barriers
@ 2013-05-13 7:54 Sidorov, Andrei
2013-05-14 21:41 ` Barriers Jan Kara
0 siblings, 1 reply; 3+ messages in thread
From: Sidorov, Andrei @ 2013-05-13 7:54 UTC (permalink / raw)
To: ext4 development
Hi,
I'm curious if anyone ever tried fua-only jbd? If done with fua's, there
will be no cache flushes at all, thus no occasional spikes. NCQ fua
journalling is potentially more efficient than cache flush.
I know, stale data will be unavoidable (however unlikely) in fua-based
implementation. It is a compromise between ordered,nobarrier (fs
corruption is likely to happen upon power loss) and ordered,barrier (no
fs corruption).
Any advise on what kind of workload to test?
What about having single journal per device as opposed to partition/fs?
What I've found of quick look at jbd2 code, it doesn't seem to be a
problem to set up single journal for several filesystems on the same device.
This will give an advantage of single commit per commit interval as
opposed to several commits per likely to be same interval.
--
Regards,
Andrei.
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Barriers
2013-05-13 7:54 Barriers Sidorov, Andrei
@ 2013-05-14 21:41 ` Jan Kara
2013-05-15 5:43 ` Barriers Dave Chinner
0 siblings, 1 reply; 3+ messages in thread
From: Jan Kara @ 2013-05-14 21:41 UTC (permalink / raw)
To: Sidorov, Andrei; +Cc: ext4 development
Hi,
On Mon 13-05-13 07:54:48, Sidorov, Andrei wrote:
> I'm curious if anyone ever tried fua-only jbd? If done with fua's, there
> will be no cache flushes at all, thus no occasional spikes. NCQ fua
> journalling is potentially more efficient than cache flush.
If you have such HW, it may be an interesting thing to try.
> I know, stale data will be unavoidable (however unlikely) in fua-based
> implementation. It is a compromise between ordered,nobarrier (fs
> corruption is likely to happen upon power loss) and ordered,barrier (no
> fs corruption).
Well, it will achieve guarantees of barrier,data=writeback mode.
> Any advise on what kind of workload to test?
Not really...
> What about having single journal per device as opposed to partition/fs?
> What I've found of quick look at jbd2 code, it doesn't seem to be a
> problem to set up single journal for several filesystems on the same device.
> This will give an advantage of single commit per commit interval as
> opposed to several commits per likely to be same interval.
Yes, that should be relatively easily possible and might be interesting
for other usecases as well (e.g. when you want to utilize one fast device
for the journal and use it for several slow disks).
Honza
--
Jan Kara <jack@suse.cz>
SUSE Labs, CR
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Barriers
2013-05-14 21:41 ` Barriers Jan Kara
@ 2013-05-15 5:43 ` Dave Chinner
0 siblings, 0 replies; 3+ messages in thread
From: Dave Chinner @ 2013-05-15 5:43 UTC (permalink / raw)
To: Jan Kara; +Cc: Sidorov, Andrei, ext4 development
On Tue, May 14, 2013 at 11:41:58PM +0200, Jan Kara wrote:
> Hi,
>
> On Mon 13-05-13 07:54:48, Sidorov, Andrei wrote:
> > I'm curious if anyone ever tried fua-only jbd? If done with fua's, there
> > will be no cache flushes at all, thus no occasional spikes. NCQ fua
> > journalling is potentially more efficient than cache flush.
> If you have such HW, it may be an interesting thing to try.
Be careful there - you still need to guarantee all metadata that is
being overwritten in the log has been flushed to stable storage, so
even when you are using FUA for the journal writes you still need a
pre-write flush to occur.
Another tricky case here is that if you are using an external log,
the flush needs to occur on the metadata device, not the log device,
and it needs to complete before you submit the FUA journal write.
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2013-05-15 5:43 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-05-13 7:54 Barriers Sidorov, Andrei
2013-05-14 21:41 ` Barriers Jan Kara
2013-05-15 5:43 ` Barriers Dave Chinner
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).