* How to reduce active mds number
@ 2011-04-22 9:20 AnnyRen
2011-04-22 15:46 ` Sage Weil
0 siblings, 1 reply; 5+ messages in thread
From: AnnyRen @ 2011-04-22 9:20 UTC (permalink / raw)
To: ceph-devel
Hi,
I'm testing MDS expasion,
my original environment has 3 MON, 2 MDS (one active, one standby), 10
OSD which are built on 15 physical machines.
the following is my step to test add/remove active MDS:
Step1: check mds status
root@MON1:~# ceph mds dump -o -
4097: 192.168.200.184:6800/2095 '0' mds0.1 up:active seq 66
4107: 192.168.200.185:6800/1911 '1' mds-1.0 up:standby seq 2
There are two mds, one is active, another is standby
Step2: ceph mds set_max_mds 2
Step3: ceph -s
2011-04-22 08:33:03.836554 mds e18: 2/2/2 up {0=up:active,1=up:active}
or
root@MON1:~# ceph mds dump -o -
max_mds 2
in 0,1
up {0=4097,1=4107}
failed
stopped
4097: 192.168.200.184:6800/2095 '0' mds0.1 up:active seq 66
4107: 192.168.200.185:6800/1911 '1' mds1.1 up:active seq 25635
Step4: I want to set one mds to standby, so I execute "ceph mds set_max_mds 1"
root@MON1:~# ceph mds set_max_mds 1
2011-04-22 08:35:29.964667 mon <- [mds,set_max_mds,1]
2011-04-22 08:35:30.133274 mon0 -> 'max_mds = 1' (0)
Step5: Then I check mds status
root@MON1:~# ceph mds stat
2011-04-22 08:39:04.160139 mon <- [mds,stat]
2011-04-22 08:39:04.160823 mon0 -> 'e19: 2/2/1 up {0=up:active,1=up:active}' (0)
^^^^^^
What does the 2/2/1 means ?
I found that the number of active MDS still remains two...
Could you tell me how to reduce the active mds number from 2 to 1...?
and what the 2/2/1 means ?
Thanks a lot ! :)
Best Regards,
Annie
^ permalink raw reply [flat|nested] 5+ messages in thread* Re: How to reduce active mds number 2011-04-22 9:20 How to reduce active mds number AnnyRen @ 2011-04-22 15:46 ` Sage Weil 2011-05-03 2:07 ` doki74216 0 siblings, 1 reply; 5+ messages in thread From: Sage Weil @ 2011-04-22 15:46 UTC (permalink / raw) To: AnnyRen; +Cc: ceph-devel On Fri, 22 Apr 2011, AnnyRen wrote: > Hi, > > I'm testing MDS expasion, > my original environment has 3 MON, 2 MDS (one active, one standby), 10 > OSD which are built on 15 physical machines. > > the following is my step to test add/remove active MDS: > > > Step1: check mds status > > root@MON1:~# ceph mds dump -o - > 4097: 192.168.200.184:6800/2095 '0' mds0.1 up:active seq 66 > 4107: 192.168.200.185:6800/1911 '1' mds-1.0 up:standby seq 2 > There are two mds, one is active, another is standby > > Step2: ceph mds set_max_mds 2 > > Step3: ceph -s > 2011-04-22 08:33:03.836554 mds e18: 2/2/2 up {0=up:active,1=up:active} > > or > root@MON1:~# ceph mds dump -o - > max_mds 2 > in 0,1 > up {0=4097,1=4107} > failed > stopped > 4097: 192.168.200.184:6800/2095 '0' mds0.1 up:active seq 66 > 4107: 192.168.200.185:6800/1911 '1' mds1.1 up:active seq 25635 > > > Step4: I want to set one mds to standby, so I execute "ceph mds set_max_mds 1" > > root@MON1:~# ceph mds set_max_mds 1 > 2011-04-22 08:35:29.964667 mon <- [mds,set_max_mds,1] > 2011-04-22 08:35:30.133274 mon0 -> 'max_mds = 1' (0) > > > Step5: Then I check mds status > > root@MON1:~# ceph mds stat > 2011-04-22 08:39:04.160139 mon <- [mds,stat] > 2011-04-22 08:39:04.160823 mon0 -> 'e19: 2/2/1 up {0=up:active,1=up:active}' (0) > ^^^^^^ > What does the 2/2/1 means ? > > I found that the number of active MDS still remains two... > Could you tell me how to reduce the active mds number from 2 to 1...? > and what the 2/2/1 means ? 2 are "up", 2 are "in" (the cluster set includes 2), and 1 is the target/max. You can do $ ceph mds stop 1 to tell mds1 to export everything and shut down cleanly. sage ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: How to reduce active mds number 2011-04-22 15:46 ` Sage Weil @ 2011-05-03 2:07 ` doki74216 2011-05-03 17:34 ` Tommi Virtanen 0 siblings, 1 reply; 5+ messages in thread From: doki74216 @ 2011-05-03 2:07 UTC (permalink / raw) To: ceph-devel Hi, I am testing the same case. My system is 1 MON , 2 MDS, 3OSD. Here is my steps: 1) First, I set the max number of MDS to 2 , so there are two active MDSs. 192.168.200.184:6800/2095 '0' mds0.1 up:active seq 66 192.168.200.185:6800/1911 '1' mds1.1 up:active seq 25635 2) I want to set the active one to standby. I excute"ceph mds set_max_mds 1" and "ceph mds stop 0". But here is why I confuse: mds0 becomes stopping not standby.?? It shows: 192.138.200.185:6800/14465 ‘1’mds0.6 up:stopping seq 14 192.138.200.184:6800/15442 ‘0’mds1.1 up:active seq 210 3) When I excute"ceph mds set_max_mds 2", mds0 doesn't become active, I type"ceph mds dump -o -" But it shows: 192.138.200.185:6800/14465 ‘1’mds0.6 up:stopping seq 14 export_targets=1 192.138.200.184:6800/15442 ‘0’mds1.1 up:active seq 210 There is no two active MDSes, why? 4) Therefore, I still hope that there are one active mds and one standby mds(by default). I restart the system. I execute" /etc/init.d/ceph -a stop" and " /etc/init.d/ceph -a start" I type"ceph -s" But it shows: mds e52: 2/2/1 up {0=up:rejoin, 1=up:resolve} ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ what means? After awhile I execue”ceph mds stat” , it shows: ‘e61: 2/2/1 up {o=up:relay, 1=up:relay}’ (0) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ what means? 5) I set the man active number to one because I still want the system by default value(one active and one standby). I execute"ceph mds set_max_mds 1" It shows: mds e80: 2/2/1 up {0=up:rejoin, 1=up:rejoin}, 1:up:standby mds e80: 2/2/1 up {0=up:active, 1=up:rejoin}, 1:up:standby mds e80: 2/2/1 up {0=up:active, 1=up:active}, 1:up:standby ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ I don't know what it means. Why there are three statuses? Aren't there just 2 MDSes in my system? I execue"ceph -s" It shows: mds e94: 2/2/1 up {0=up:active,1=up:active}, 1 up:standby Please help me to solve these questions.. And teach me how to set the MDS by the default(1 active & 1 standby)? Thank you very much ^^ Best regards Stefanie Chen -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: How to reduce active mds number 2011-05-03 2:07 ` doki74216 @ 2011-05-03 17:34 ` Tommi Virtanen 2011-05-03 17:51 ` Tommi Virtanen 0 siblings, 1 reply; 5+ messages in thread From: Tommi Virtanen @ 2011-05-03 17:34 UTC (permalink / raw) To: doki74216@gmail.com; +Cc: ceph-devel On Tue, May 03, 2011 at 10:07:01AM +0800, doki74216@gmail.com wrote: > 2) I want to set the active one to standby. I excute"ceph mds > set_max_mds 1" and "ceph mds stop 0". > But here is why I confuse: > mds0 becomes stopping not standby.?? > It shows: > 192.138.200.185:6800/14465 ‘1’mds0.6 up:stopping seq 14 > 192.138.200.184:6800/15442 ‘0’mds1.1 up:active seq 210 I think the confusion here is that "ceph mds stop 0" really tells it to stop, not to go to standby. Once it finishes stopping (= has safely exported all its data to the other mds), you can start it again, and then it'll get to be standby. This ASCII art from the source might explain. Ignore the extra detail: boot --> standby, creating, or starting. dne ----> creating -----> active* ^ ^___________/ / ^ ^ | / / | destroying / / | ^ / / | | / / | stopped <---- stopping* <-/ / | \ / | ----- starting* ----/ | | failed | \ | \--> replay* --> reconnect* --> rejoin* * = can fail Your mds is still in the state "stopping". To get to standby, it needs to finish that, then get to "boot" again (by being restarted), and then it can enter standby. On the other hand, I can reproduce the problem of an MDS just being in state "stopping" for along time. I'll see what others have to say about that. > 3) When I excute"ceph mds set_max_mds 2", mds0 doesn't become active, > I type"ceph mds dump -o -" > But it shows: > 192.138.200.185:6800/14465 ‘1’mds0.6 up:stopping seq 14 export_targets=1 > 192.138.200.184:6800/15442 ‘0’mds1.1 up:active seq 210 > There is no two active MDSes, why? Your mds is still in the state "stopping". It needs to finish that first. If you've set max_mds==2, it should get to active once it's done. > 4) Therefore, I still hope that there are one active mds and one > standby mds(by default). > I restart the system. I execute" /etc/init.d/ceph -a stop" and " > /etc/init.d/ceph -a start" > I type"ceph -s" > But it shows: > mds e52: 2/2/1 up {0=up:rejoin, 1=up:resolve} > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ what means? > After awhile I execue”ceph mds stat” , it shows: > ‘e61: 2/2/1 up {o=up:relay, 1=up:relay}’ (0) > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ what means? The word "relay" is never mentioned in the source tree. And seeing that "o" in there makes me think you're typing these lines in. If you're copying log lines to email, please use copy-paste and don't type them in manually; variation from the exact message makes helping you harder. "replay" is when an MDS is starting up and reading its journal, replaying the operations stored there against the final data storage. "resolve" is when the MDSes go through their journals and figure out how to handle operations that can cross MDS boundaries, such as renames across directories. > 5) I set the man active number to one because I still want the system > by default value(one active and one standby). > I execute"ceph mds set_max_mds 1" > It shows: > mds e80: 2/2/1 up {0=up:rejoin, 1=up:rejoin}, 1:up:standby > mds e80: 2/2/1 up {0=up:active, 1=up:rejoin}, 1:up:standby > mds e80: 2/2/1 up {0=up:active, 1=up:active}, 1:up:standby > ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ > I don't know what it means. > Why there are three statuses? Aren't there just 2 MDSes in my system? > I execue"ceph -s" > It shows: > mds e94: 2/2/1 up {0=up:active,1=up:active}, 1 up:standby > > Please help me to solve these questions.. > And teach me how to set the MDS by the default(1 active & 1 standby)? > Thank you very much ^^ I don't have a good answer to this one. The /1 in the 2/2/1 means your max_mds really is 1. It might be a question of there being no automatic transition from active to standby. I can reproduce this problem locally, and will try to figure it out. -- :(){ :|:&};: -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: How to reduce active mds number 2011-05-03 17:34 ` Tommi Virtanen @ 2011-05-03 17:51 ` Tommi Virtanen 0 siblings, 0 replies; 5+ messages in thread From: Tommi Virtanen @ 2011-05-03 17:51 UTC (permalink / raw) To: doki74216@gmail.com; +Cc: ceph-devel On Tue, May 03, 2011 at 10:34:45AM -0700, Tommi Virtanen wrote: > On the other hand, I can reproduce the problem of an MDS just being in > state "stopping" for along time. I'll see what others have to say > about that. Filed as issue #1048. http://tracker.newdream.net/issues/1048 -- :(){ :|:&};: ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2011-05-03 17:51 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2011-04-22 9:20 How to reduce active mds number AnnyRen 2011-04-22 15:46 ` Sage Weil 2011-05-03 2:07 ` doki74216 2011-05-03 17:34 ` Tommi Virtanen 2011-05-03 17:51 ` Tommi Virtanen
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.