10 times higher disk load with btrfs

All of lore.kernel.org
 help / color / mirror / Atom feed

* 10 times higher disk load with btrfs
@ 2015-01-05 18:36 Stefan Priebe
  2015-01-05 19:19 ` Stefan Priebe
  0 siblings, 1 reply; 8+ messages in thread
From: Stefan Priebe @ 2015-01-05 18:36 UTC (permalink / raw)
  To: ceph-devel

Hi devs,

while btrfs is now declared as stable ;-) i wanted to retest btrfs on 
our production cluster on 2 out of 54 osds. So if they crash it doesn't 
hurt.

While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same 
OSDs after formatting them with btrfs have spikes of 190MB/s every 4-7s.

Why does just another filesystem raises the disk load by a factor of 10?

I'm running dumpling.

Greets Stefan

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 10 times higher disk load with btrfs
  2015-01-05 18:36 10 times higher disk load with btrfs Stefan Priebe
@ 2015-01-05 19:19 ` Stefan Priebe
  2015-01-05 19:25   ` Sage Weil
  0 siblings, 1 reply; 8+ messages in thread
From: Stefan Priebe @ 2015-01-05 19:19 UTC (permalink / raw)
  To: ceph-devel


Am 05.01.2015 um 19:36 schrieb Stefan Priebe:
> Hi devs,
>
> while btrfs is now declared as stable ;-) i wanted to retest btrfs on
> our production cluster on 2 out of 54 osds. So if they crash it doesn't
> hurt.
>
> While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same
> OSDs after formatting them with btrfs have spikes of 190MB/s every 4-7s.
>
> Why does just another filesystem raises the disk load by a factor of 10?

OK this seems to happen cause ceph is creating every 5s a new subvolume 
/ snap. Is this really expected / needed?

Stefan

>
> I'm running dumpling.
>
> Greets Stefan

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 10 times higher disk load with btrfs
  2015-01-05 19:19 ` Stefan Priebe
@ 2015-01-05 19:25   ` Sage Weil
  2015-01-05 20:20     ` Stefan Priebe
  0 siblings, 1 reply; 8+ messages in thread
From: Sage Weil @ 2015-01-05 19:25 UTC (permalink / raw)
  To: Stefan Priebe; +Cc: ceph-devel

On Mon, 5 Jan 2015, Stefan Priebe wrote:
> 
> Am 05.01.2015 um 19:36 schrieb Stefan Priebe:
> > Hi devs,
> > 
> > while btrfs is now declared as stable ;-) i wanted to retest btrfs on
> > our production cluster on 2 out of 54 osds. So if they crash it doesn't
> > hurt.
> > 
> > While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same
> > OSDs after formatting them with btrfs have spikes of 190MB/s every 4-7s.
> > 
> > Why does just another filesystem raises the disk load by a factor of 10?
> 
> OK this seems to happen cause ceph is creating every 5s a new subvolume /
> snap. Is this really expected / needed?

You can disable it with

 filestore btrfs snap = false

I'm curious how much this drops the load down; originally the 
snaps were no more expensive than a regular sync but perhaps this 
has changed...

sage

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 10 times higher disk load with btrfs
  2015-01-05 19:25   ` Sage Weil
@ 2015-01-05 20:20     ` Stefan Priebe
  2015-01-05 20:29       ` Mark Nelson
  0 siblings, 1 reply; 8+ messages in thread
From: Stefan Priebe @ 2015-01-05 20:20 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

Hi Sage,

Am 05.01.2015 um 20:25 schrieb Sage Weil:
> On Mon, 5 Jan 2015, Stefan Priebe wrote:
>>
>> Am 05.01.2015 um 19:36 schrieb Stefan Priebe:
>>> Hi devs,
>>>
>>> while btrfs is now declared as stable ;-) i wanted to retest btrfs on
>>> our production cluster on 2 out of 54 osds. So if they crash it doesn't
>>> hurt.
>>>
>>> While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same
>>> OSDs after formatting them with btrfs have spikes of 190MB/s every 4-7s.
>>>
>>> Why does just another filesystem raises the disk load by a factor of 10?
>>
>> OK this seems to happen cause ceph is creating every 5s a new subvolume /
>> snap. Is this really expected / needed?
>
> You can disable it with
>
>   filestore btrfs snap = false
>
> I'm curious how much this drops the load down; originally the
> snaps were no more expensive than a regular sync but perhaps this
> has changed...

- with XFS the average write is at 9Mb/s
- with btrfs (filestore_btrfs_snap=true) write is at 40Mb/s
- with btrfs (filestore_btrfs_snap=false) write is at 20Mb/s

Stefan

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 10 times higher disk load with btrfs
  2015-01-05 20:20     ` Stefan Priebe
@ 2015-01-05 20:29       ` Mark Nelson
  2015-01-05 20:33         ` Stefan Priebe
  0 siblings, 1 reply; 8+ messages in thread
From: Mark Nelson @ 2015-01-05 20:29 UTC (permalink / raw)
  To: Stefan Priebe, Sage Weil; +Cc: ceph-devel



On 01/05/2015 02:20 PM, Stefan Priebe wrote:
> Hi Sage,
>
> Am 05.01.2015 um 20:25 schrieb Sage Weil:
>> On Mon, 5 Jan 2015, Stefan Priebe wrote:
>>>
>>> Am 05.01.2015 um 19:36 schrieb Stefan Priebe:
>>>> Hi devs,
>>>>
>>>> while btrfs is now declared as stable ;-) i wanted to retest btrfs on
>>>> our production cluster on 2 out of 54 osds. So if they crash it doesn't
>>>> hurt.
>>>>
>>>> While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same
>>>> OSDs after formatting them with btrfs have spikes of 190MB/s every
>>>> 4-7s.
>>>>
>>>> Why does just another filesystem raises the disk load by a factor of
>>>> 10?
>>>
>>> OK this seems to happen cause ceph is creating every 5s a new
>>> subvolume /
>>> snap. Is this really expected / needed?
>>
>> You can disable it with
>>
>>   filestore btrfs snap = false
>>
>> I'm curious how much this drops the load down; originally the
>> snaps were no more expensive than a regular sync but perhaps this
>> has changed...
>
> - with XFS the average write is at 9Mb/s
> - with btrfs (filestore_btrfs_snap=true) write is at 40Mb/s
> - with btrfs (filestore_btrfs_snap=false) write is at 20Mb/s

Is that the average and not the spikes?  It looks like before the spikes 
were 20MB/s and 190MB/s?

>
> Stefan
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 10 times higher disk load with btrfs
  2015-01-05 20:29       ` Mark Nelson
@ 2015-01-05 20:33         ` Stefan Priebe
       [not found]           ` <713980925.3244877.1420515858378.JavaMail.zimbra@oxygem.tv>
  0 siblings, 1 reply; 8+ messages in thread
From: Stefan Priebe @ 2015-01-05 20:33 UTC (permalink / raw)
  To: mnelson, Sage Weil; +Cc: ceph-devel


Am 05.01.2015 um 21:29 schrieb Mark Nelson:
>
>
> On 01/05/2015 02:20 PM, Stefan Priebe wrote:
>> Hi Sage,
>>
>> Am 05.01.2015 um 20:25 schrieb Sage Weil:
>>> On Mon, 5 Jan 2015, Stefan Priebe wrote:
>>>>
>>>> Am 05.01.2015 um 19:36 schrieb Stefan Priebe:
>>>>> Hi devs,
>>>>>
>>>>> while btrfs is now declared as stable ;-) i wanted to retest btrfs on
>>>>> our production cluster on 2 out of 54 osds. So if they crash it
>>>>> doesn't
>>>>> hurt.
>>>>>
>>>>> While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same
>>>>> OSDs after formatting them with btrfs have spikes of 190MB/s every
>>>>> 4-7s.
>>>>>
>>>>> Why does just another filesystem raises the disk load by a factor of
>>>>> 10?
>>>>
>>>> OK this seems to happen cause ceph is creating every 5s a new
>>>> subvolume /
>>>> snap. Is this really expected / needed?
>>>
>>> You can disable it with
>>>
>>>   filestore btrfs snap = false
>>>
>>> I'm curious how much this drops the load down; originally the
>>> snaps were no more expensive than a regular sync but perhaps this
>>> has changed...
>>
>> - with XFS the average write is at 9Mb/s
>> - with btrfs (filestore_btrfs_snap=true) write is at 40Mb/s
>> - with btrfs (filestore_btrfs_snap=false) write is at 20Mb/s
>
> Is that the average and not the spikes?  It looks like before the spikes
> were 20MB/s and 190MB/s?

Yes these are average values.

Spikes:
- with XFS the spike write is at 20Mb/s
- with btrfs (filestore_btrfs_snap=true) spike write is 200Mb/s
- with btrfs (filestore_btrfs_snap=false) spike is still 185Mb/s but avg 
is 1/2 (20Mb/s) see above


>
>>
>> Stefan
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 10 times higher disk load with btrfs
       [not found]           ` <713980925.3244877.1420515858378.JavaMail.zimbra@oxygem.tv>
@ 2015-01-06  3:44             ` Alexandre DERUMIER
  2015-01-06  7:01               ` Stefan Priebe - Profihost AG
  0 siblings, 1 reply; 8+ messages in thread
From: Alexandre DERUMIER @ 2015-01-06  3:44 UTC (permalink / raw)
  To: Stefan Priebe; +Cc: Mark Nelson, Sage Weil, ceph-devel

Hi Stefan,

Do you see a difference if you force filestore journal writeahead for btrfs instead parrallel ?

filestore journal writeahead = 1
filestore journal parallel = 0

----- Mail original -----
De: "Stefan Priebe" <s.priebe@profihost.ag>
À: "Mark Nelson" <mnelson@redhat.com>, "Sage Weil" <sage@newdream.net>
Cc: "ceph-devel" <ceph-devel@vger.kernel.org>
Envoyé: Lundi 5 Janvier 2015 21:33:22
Objet: Re: 10 times higher disk load with btrfs

Am 05.01.2015 um 21:29 schrieb Mark Nelson: 
> 
> 
> On 01/05/2015 02:20 PM, Stefan Priebe wrote: 
>> Hi Sage, 
>> 
>> Am 05.01.2015 um 20:25 schrieb Sage Weil: 
>>> On Mon, 5 Jan 2015, Stefan Priebe wrote: 
>>>> 
>>>> Am 05.01.2015 um 19:36 schrieb Stefan Priebe: 
>>>>> Hi devs, 
>>>>> 
>>>>> while btrfs is now declared as stable ;-) i wanted to retest btrfs on 
>>>>> our production cluster on 2 out of 54 osds. So if they crash it 
>>>>> doesn't 
>>>>> hurt. 
>>>>> 
>>>>> While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same 
>>>>> OSDs after formatting them with btrfs have spikes of 190MB/s every 
>>>>> 4-7s. 
>>>>> 
>>>>> Why does just another filesystem raises the disk load by a factor of 
>>>>> 10? 
>>>> 
>>>> OK this seems to happen cause ceph is creating every 5s a new 
>>>> subvolume / 
>>>> snap. Is this really expected / needed? 
>>> 
>>> You can disable it with 
>>> 
>>> filestore btrfs snap = false 
>>> 
>>> I'm curious how much this drops the load down; originally the 
>>> snaps were no more expensive than a regular sync but perhaps this 
>>> has changed... 
>> 
>> - with XFS the average write is at 9Mb/s 
>> - with btrfs (filestore_btrfs_snap=true) write is at 40Mb/s 
>> - with btrfs (filestore_btrfs_snap=false) write is at 20Mb/s 
> 
> Is that the average and not the spikes? It looks like before the spikes 
> were 20MB/s and 190MB/s? 

Yes these are average values. 

Spikes: 
- with XFS the spike write is at 20Mb/s 
- with btrfs (filestore_btrfs_snap=true) spike write is 200Mb/s 
- with btrfs (filestore_btrfs_snap=false) spike is still 185Mb/s but avg 
is 1/2 (20Mb/s) see above 


> 
>> 
>> Stefan 
>> -- 
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in 
>> the body of a message to majordomo@vger.kernel.org 
>> More majordomo info at http://vger.kernel.org/majordomo-info.html 
-- 
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in 
the body of a message to majordomo@vger.kernel.org 
More majordomo info at http://vger.kernel.org/majordomo-info.html 
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: 10 times higher disk load with btrfs
  2015-01-06  3:44             ` Alexandre DERUMIER
@ 2015-01-06  7:01               ` Stefan Priebe - Profihost AG
  0 siblings, 0 replies; 8+ messages in thread
From: Stefan Priebe - Profihost AG @ 2015-01-06  7:01 UTC (permalink / raw)
  To: Alexandre DERUMIER; +Cc: Mark Nelson, Sage Weil, ceph-devel

Hi,
Am 06.01.2015 um 04:44 schrieb Alexandre DERUMIER:
> Hi Stefan,
> 
> Do you see a difference if you force filestore journal writeahead for btrfs instead parrallel ?
> 
> filestore journal writeahead = 1
> filestore journal parallel = 0

i already tested filestore btrfs snap = false which automatically
disabled parallel write.

Stefan

> ----- Mail original -----
> De: "Stefan Priebe" <s.priebe@profihost.ag>
> À: "Mark Nelson" <mnelson@redhat.com>, "Sage Weil" <sage@newdream.net>
> Cc: "ceph-devel" <ceph-devel@vger.kernel.org>
> Envoyé: Lundi 5 Janvier 2015 21:33:22
> Objet: Re: 10 times higher disk load with btrfs
> 
> Am 05.01.2015 um 21:29 schrieb Mark Nelson: 
>>
>>
>> On 01/05/2015 02:20 PM, Stefan Priebe wrote: 
>>> Hi Sage, 
>>>
>>> Am 05.01.2015 um 20:25 schrieb Sage Weil: 
>>>> On Mon, 5 Jan 2015, Stefan Priebe wrote: 
>>>>>
>>>>> Am 05.01.2015 um 19:36 schrieb Stefan Priebe: 
>>>>>> Hi devs, 
>>>>>>
>>>>>> while btrfs is now declared as stable ;-) i wanted to retest btrfs on 
>>>>>> our production cluster on 2 out of 54 osds. So if they crash it 
>>>>>> doesn't 
>>>>>> hurt. 
>>>>>>
>>>>>> While if those OSDs run XFS have spikes of 20MB/s every 4-7s. The same 
>>>>>> OSDs after formatting them with btrfs have spikes of 190MB/s every 
>>>>>> 4-7s. 
>>>>>>
>>>>>> Why does just another filesystem raises the disk load by a factor of 
>>>>>> 10? 
>>>>>
>>>>> OK this seems to happen cause ceph is creating every 5s a new 
>>>>> subvolume / 
>>>>> snap. Is this really expected / needed? 
>>>>
>>>> You can disable it with 
>>>>
>>>> filestore btrfs snap = false 
>>>>
>>>> I'm curious how much this drops the load down; originally the 
>>>> snaps were no more expensive than a regular sync but perhaps this 
>>>> has changed... 
>>>
>>> - with XFS the average write is at 9Mb/s 
>>> - with btrfs (filestore_btrfs_snap=true) write is at 40Mb/s 
>>> - with btrfs (filestore_btrfs_snap=false) write is at 20Mb/s 
>>
>> Is that the average and not the spikes? It looks like before the spikes 
>> were 20MB/s and 190MB/s? 
> 
> Yes these are average values. 
> 
> Spikes: 
> - with XFS the spike write is at 20Mb/s 
> - with btrfs (filestore_btrfs_snap=true) spike write is 200Mb/s 
> - with btrfs (filestore_btrfs_snap=false) spike is still 185Mb/s but avg 
> is 1/2 (20Mb/s) see above 
> 
> 
>>
>>>
>>> Stefan 
>>> -- 
>>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in 
>>> the body of a message to majordomo@vger.kernel.org 
>>> More majordomo info at http://vger.kernel.org/majordomo-info.html 
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2015-01-06  7:01 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-01-05 18:36 10 times higher disk load with btrfs Stefan Priebe
2015-01-05 19:19 ` Stefan Priebe
2015-01-05 19:25   ` Sage Weil
2015-01-05 20:20     ` Stefan Priebe
2015-01-05 20:29       ` Mark Nelson
2015-01-05 20:33         ` Stefan Priebe
     [not found]           ` <713980925.3244877.1420515858378.JavaMail.zimbra@oxygem.tv>
2015-01-06  3:44             ` Alexandre DERUMIER
2015-01-06  7:01               ` Stefan Priebe - Profihost AG

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.