* active MDS and disk write cache
@ 2012-10-31 17:02 Matt Weil
  2012-10-31 20:01 ` Sam Lang
  2012-11-03 17:43 ` Gregory Farnum
  0 siblings, 2 replies; 4+ messages in thread
From: Matt Weil @ 2012-10-31 17:02 UTC (permalink / raw)
  To: ceph-devel

I have a system with a bunch of RAM that I want to remain the active MDS 
but still have a backup.

This config doesn't seem to be working.  I can make linuscs92 the active 
by stopping and starting the mds on linuscs95. It would be nice for 
linuscs92 to be the active from the start.

[mds.linuscs95]
	host = linuscs95
	mds standby replay = true
	mds standby for name = linuscs92

Also about disk write cache.

If the data center, servers and storage units are backed by redundant 
power sources, batteries and a generator, is there less concern about 
data loss?

Has anyone turned the disk write cache on and seen a great improvement?

Thanks

Matt


* Re: active MDS and disk write cache
  2012-10-31 17:02 active MDS and disk write cache Matt Weil
@ 2012-10-31 20:01 ` Sam Lang
  2012-10-31 20:07   ` Matt Weil
  2012-11-03 17:43 ` Gregory Farnum
  1 sibling, 1 reply; 4+ messages in thread
From: Sam Lang @ 2012-10-31 20:01 UTC (permalink / raw)
  To: Matt Weil; +Cc: ceph-devel

On 10/31/2012 12:02 PM, Matt Weil wrote:
> I have a system with a bunch of RAM that I want to remain the active MDS
> but still have a backup.
>
> This config doesn't seem to be working.  I can make linuscs92 the active
> by stopping and starting the mds on linuscs95. It would be nice for
> linuscs92 to be the active from the start.
>
> [mds.linuscs95]
>      host = linuscs95
>      mds standby replay = true
>      mds standby for name = linuscs92

Hi Matt,

Can you post your ceph config?  Once you start up your ceph cluster, do 
you see that linuscs92 is the standby and linuscs95 is the active?  How 
are you starting your cluster?

>
> Also about disk write cache.
>
> If the data center, servers and storage units are backed by redundant
> power sources, batteries and a generator, is there less concern about
> data loss?

Yes, if you have that setup you can enable the disk cache and be 
reasonably assured that a power failure of a single component won't 
cause a loss of data.

>
> Has anyone turned the disk write cache on and seen a great improvement?

This is dependent on your hardware.  If you have raid controllers with 
battery backed cache, enabling/disabling the disk caches probably won't 
make much of a difference to performance.  Rather than speculate though, 
I would encourage you to benchmark it using Jens Axboe's fio benchmark:

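cat > write.fio <<EOF
; random writes to a 128 MiB file, bypassing the page cache
[randwrite]
rw=randwrite
size=128m
direct=1
; sequential writes to a 1 GiB file, bypassing the page cache
[write]
rw=write
size=1024m
direct=1
EOF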

# enable cache
hdparm -W 1 /dev/sdX

# test with cache
fio write.fio

# disable cache
hdparm -W 0 /dev/sdX

# test without cache
fio write.fio

Compare the output of the two fio benchmarks.  Don't forget to return 
the disk cache to its previously enabled/disabled state!
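
A minimal sketch of how the steps above could be wrapped so that the
original cache setting is restored automatically.  The function names
(parse_wcache, bench_wcache) and the log file names are made up for
illustration.  It assumes hdparm reports the state as a line of the form
"write-caching =  1 (on)", that write.fio is in the current directory,
and that it is run as root.

```shell
#!/bin/sh
# Read `hdparm -W /dev/sdX` output on stdin and print 1 (cache on) or
# 0 (cache off).  Prints nothing if no "write-caching = N" line is found.
parse_wcache() {
    sed -n 's/^[[:space:]]*write-caching[[:space:]]*=[[:space:]]*\([01]\).*/\1/p'
}

# Run write.fio once with the write cache on and once with it off,
# then put the cache back into the state it was in before the test.
# Hypothetical helper: $1 is the disk, e.g. /dev/sdX.
bench_wcache() {
    dev=$1
    orig=$(hdparm -W "$dev" | parse_wcache)
    if [ -z "$orig" ]; then
        echo "cannot read write-cache state of $dev" >&2
        return 1
    fi
    for state in 1 0; do
        hdparm -W "$state" "$dev" >/dev/null
        # Results go to fio-wcache-1.log (cache on) and fio-wcache-0.log (off).
        fio --output="fio-wcache-$state.log" write.fio
    done
    hdparm -W "$orig" "$dev" >/dev/null
}
```

After sourcing the file, "bench_wcache /dev/sdX" runs both tests.  Since
the job file names no target, fio creates its test files in the current
directory, so run it from a filesystem that lives on the disk under test.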

-sam

>
> Thanks
>
> Matt



* Re: active MDS and disk write cache
  2012-10-31 20:01 ` Sam Lang
@ 2012-10-31 20:07   ` Matt Weil
  0 siblings, 0 replies; 4+ messages in thread
From: Matt Weil @ 2012-10-31 20:07 UTC (permalink / raw)
  To: Sam Lang; +Cc: ceph-devel


>
> Hi Matt,
>
> Can you post your ceph config?  Once you start up your ceph cluster, do
> you see that linuscs92 is the standby and linuscs95 is the active?  How
> are you starting your cluster?
>
service ceph -a start

And yes, linuscs95 comes up as the active MDS.


[global]
	; enable secure authentication
;	auth supported = cephx
;	log_to_syslog = true
;	keyring = /etc/ceph/keyring.bin
	; monitors
	;  You need at least one.  You need at least three if you want to
	;  tolerate any node failures.  Always create an odd number.
	journal dio = true
	osd op threads = 24
	osd disk threads = 24
	filestore op threads = 6
	filestore queue max ops = 24

	osd client message size cap = 14000000
	ms dispatch throttle bytes =  17500000

[mon]
	mon data = /vol/disk2/data/mon$id
	; some minimal logging (just message traffic) to aid debugging
;	debug ms = 1

[mon.0]
	host = linuscs92
	mon addr = 10.0.30.10:6789

#[mon.1]
#	host = linuscs93
#	mon addr = 10.0.30.11:6789

[mon.1]
	host = linuscs95
	mon addr = 10.0.30.13:6789

	; mds
	;  You need at least one.  Define two to get a standby.

[mds]
	; where the mds keeps its secret encryption keys
;	keyring = /etc/ceph/keyring.$name

[mds.linuscs92]
	host = linuscs92

[mds.linuscs95]
	host = linuscs95
	mds standby replay = true
	mds standby for name = linuscs92

#[mds.linuscs94]
#    host = linuscs94

	; osd
	;  You need at least one.  Two if you want data to be replicated.
	;  Define as many as you like.

[osd]
	osd journal size = 1024
;	keyring = /etc/ceph/keyring.$name

[osd.0]
	host = linuscs92
	osd data = /vol/disk2/data/osd$id
	osd journal = /vol/disk1/data/osd$id/journal

[osd.1]
	host = linuscs93
	osd data = /vol/disk2/data/osd$id
	osd journal = /vol/disk1/data/osd$id/journal
	
[osd.2]
	host = linuscs94
	osd data = /vol/disk2/data/osd$id
	osd journal = /vol/disk1/data/osd$id/journal
	
[osd.3]
	host = linuscs95
	osd data = /vol/disk2/data/osd$id
	osd journal = /vol/disk1/data/osd$id/journal
	
[osd.4]
	host = linuscs96
	osd data = /vol/disk2/data/osd$id
	osd journal = /vol/disk1/data/osd$id/journal

[osd.5]
	host = linuscs97
	osd data = /vol/disk2/data/osd$id
	osd journal = /vol/disk1/data/osd$id/journal




* Re: active MDS and disk write cache
  2012-10-31 17:02 active MDS and disk write cache Matt Weil
  2012-10-31 20:01 ` Sam Lang
@ 2012-11-03 17:43 ` Gregory Farnum
  1 sibling, 0 replies; 4+ messages in thread
From: Gregory Farnum @ 2012-11-03 17:43 UTC (permalink / raw)
  To: Matt Weil; +Cc: ceph-devel

I suspect "mon force standby active" is your problem:
http://ceph.com/wiki/Standby-replay_modes#mon_force_standby_active

Try disabling that?
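
A minimal sketch of what that could look like in ceph.conf, assuming the
option is read by the monitors and so belongs in the [mon] section, and
that the monitors are restarted afterwards to pick it up:

```ini
[mon]
	; assumption: placed under [mon] because the monitors apply this option
	mon force standby active = false
```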

On Wed, Oct 31, 2012 at 6:02 PM, Matt Weil <mweil@genome.wustl.edu> wrote:
> I have a system with a bunch of RAM that I want to remain the active MDS but
> still have a backup.
>
> This config doesn't seem to be working.  I can make linuscs92 the active by
> stopping and starting the mds on linuscs95. It would be nice for linuscs92
> to be the active from the start.
>
> [mds.linuscs95]
>         host = linuscs95
>         mds standby replay = true
>         mds standby for name = linuscs92
>
> Also about disk write cache.
>
> If the data center, servers and storage units are backed by redundant power
> sources, batteries and a generator, is there less concern about data loss?

As long as your disk caches respect write flushing and ordering
commands, everything should be safe regardless of whether they're
battery-backed or not. :)
-Greg
