Upgrade/rollback

All of lore.kernel.org
 help / color / mirror / Atom feed

* Upgrade/rollback
@ 2015-02-11 12:09 GuangYang
  2015-02-11 15:17 ` Upgrade/rollback Gregory Farnum
  0 siblings, 1 reply; 7+ messages in thread
From: GuangYang @ 2015-02-11 12:09 UTC (permalink / raw)
  To: ceph-devel@vger.kernel.org

Hi ceph-devel,
Recently we are trying the upgrade from Firefly to Giant and it goes pretty smoothly, however, the problem is that it does not support rollback and seems like that is by design. For example, there is new feature flag / metadata [1] added in the new version and they are persisted. As a result, the old version of software does not recognize those values and will crash themselves.

Ideally we never rollback, but for unknown reasons we couldn't fix in a timely manner, we might want to rollback first. Is that something we will consider to handle?

[1] https://github.com/ceph/ceph/commit/1fe8b846641486cc294fe7e1d2450132c38d2dba

Thanks,
Guang 		 	   		  --
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Upgrade/rollback
  2015-02-11 12:09 Upgrade/rollback GuangYang
@ 2015-02-11 15:17 ` Gregory Farnum
  2015-02-11 17:26   ` Upgrade/rollback Sage Weil
  0 siblings, 1 reply; 7+ messages in thread
From: Gregory Farnum @ 2015-02-11 15:17 UTC (permalink / raw)
  To: GuangYang; +Cc: ceph-devel@vger.kernel.org

On Wed, Feb 11, 2015 at 4:09 AM, GuangYang <yguang11@outlook.com> wrote:
> Hi ceph-devel,
> Recently we are trying the upgrade from Firefly to Giant and it goes pretty smoothly, however, the problem is that it does not support rollback and seems like that is by design. For example, there is new feature flag / metadata [1] added in the new version and they are persisted. As a result, the old version of software does not recognize those values and will crash themselves.
>
> Ideally we never rollback, but for unknown reasons we couldn't fix in a timely manner, we might want to rollback first. Is that something we will consider to handle?

We are unlikely to ever do this. We've talked about rollback options,
but allowing rollback means either:
1) never adding information to the disk format which matters,
2) having a separate switchover point (besides the code upgrade) which
enables all the disk change bits and which doesn't allow you to roll
back.

The first option is obviously infeasible. The second one dramatically
increases the amount of code which can be buggy, increases the testing
version, and doesn't really solve the problem since you still have a
hard point of no return.
-Greg

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Upgrade/rollback
  2015-02-11 15:17 ` Upgrade/rollback Gregory Farnum
@ 2015-02-11 17:26   ` Sage Weil
  2015-02-12  7:40     ` Upgrade/rollback Ric Wheeler
  2015-02-12  8:48     ` Upgrade/rollback GuangYang
  0 siblings, 2 replies; 7+ messages in thread
From: Sage Weil @ 2015-02-11 17:26 UTC (permalink / raw)
  To: Gregory Farnum; +Cc: GuangYang, ceph-devel@vger.kernel.org

On Wed, 11 Feb 2015, Gregory Farnum wrote:
> On Wed, Feb 11, 2015 at 4:09 AM, GuangYang <yguang11@outlook.com> wrote:
> > Hi ceph-devel,
> > Recently we are trying the upgrade from Firefly to Giant and it goes pretty smoothly, however, the problem is that it does not support rollback and seems like that is by design. For example, there is new feature flag / metadata [1] added in the new version and they are persisted. As a result, the old version of software does not recognize those values and will crash themselves.
> >
> > Ideally we never rollback, but for unknown reasons we couldn't fix in a timely manner, we might want to rollback first. Is that something we will consider to handle?
> 
> We are unlikely to ever do this. We've talked about rollback options,
> but allowing rollback means either:
> 1) never adding information to the disk format which matters,
> 2) having a separate switchover point (besides the code upgrade) which
> enables all the disk change bits and which doesn't allow you to roll
> back.
> 
> The first option is obviously infeasible. The second one dramatically
> increases the amount of code which can be buggy, increases the testing
> version, and doesn't really solve the problem since you still have a
> hard point of no return.

The only rollback plan we do have currently is to start testing and 
supporting rollback within a major release, e.g. 0.80.8 -> 0.80.7.  We 
(normally) don't add any incompatible changes within a stable release so 
this won't be a major change except that right now none of the downgrade 
scenarios are tested.

sage

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Upgrade/rollback
  2015-02-11 17:26   ` Upgrade/rollback Sage Weil
@ 2015-02-12  7:40     ` Ric Wheeler
  2015-02-12  8:48     ` Upgrade/rollback GuangYang
  1 sibling, 0 replies; 7+ messages in thread
From: Ric Wheeler @ 2015-02-12  7:40 UTC (permalink / raw)
  To: Sage Weil, Gregory Farnum; +Cc: GuangYang, ceph-devel@vger.kernel.org

On 02/11/2015 07:26 PM, Sage Weil wrote:
> On Wed, 11 Feb 2015, Gregory Farnum wrote:
>> On Wed, Feb 11, 2015 at 4:09 AM, GuangYang <yguang11@outlook.com> wrote:
>>> Hi ceph-devel,
>>> Recently we are trying the upgrade from Firefly to Giant and it goes pretty smoothly, however, the problem is that it does not support rollback and seems like that is by design. For example, there is new feature flag / metadata [1] added in the new version and they are persisted. As a result, the old version of software does not recognize those values and will crash themselves.
>>>
>>> Ideally we never rollback, but for unknown reasons we couldn't fix in a timely manner, we might want to rollback first. Is that something we will consider to handle?
>> We are unlikely to ever do this. We've talked about rollback options,
>> but allowing rollback means either:
>> 1) never adding information to the disk format which matters,
>> 2) having a separate switchover point (besides the code upgrade) which
>> enables all the disk change bits and which doesn't allow you to roll
>> back.
>>
>> The first option is obviously infeasible. The second one dramatically
>> increases the amount of code which can be buggy, increases the testing
>> version, and doesn't really solve the problem since you still have a
>> hard point of no return.
> The only rollback plan we do have currently is to start testing and
> supporting rollback within a major release, e.g. 0.80.8 -> 0.80.7.  We
> (normally) don't add any incompatible changes within a stable release so
> this won't be a major change except that right now none of the downgrade
> scenarios are tested.
>
> sage
>

This makes sense to me.

ric


^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: Upgrade/rollback
  2015-02-11 17:26   ` Upgrade/rollback Sage Weil
  2015-02-12  7:40     ` Upgrade/rollback Ric Wheeler
@ 2015-02-12  8:48     ` GuangYang
  2015-02-12 14:57       ` Upgrade/rollback Gregory Farnum
  1 sibling, 1 reply; 7+ messages in thread
From: GuangYang @ 2015-02-12  8:48 UTC (permalink / raw)
  To: Sage Weil, Gregory Farnum; +Cc: ceph-devel@vger.kernel.org

Thanks Sage and Greg for the response.

> 2) having a separate switchover point (besides the code upgrade) which
>  enables all the disk change bits and which doesn't allow you to roll
>  back.
Let me give two examples which prevent us rollback from Giant to Firefly.

Example #1:
In Giant, there is a new feature flag 'CEPH_FEATURE_ERASURE_CODE_PLUGINS_V2' added/persisted, and monitor would  check the persisted list against the list released along with the software version upon starting, it refuse to start if the list mismatch. However, although the feature is added in Giant, it is not being used until we create a new pool with the profile, which is very unlikely to happen.
1) is it possible to persist the new feature bit when the feature is being used (this looks like complicated to implement). 2) When loading the persisted bit, is it possible to check if it is actually used by someone?

Example #2:
Patch [1] added a new k/v to the PG log which cannot be recognized by old version of binary (PGLog::read_log), as a result, it takes the newly added entry as a pg_log_entry.
Is it possible to recognize pg_log_entry with a concrete pattern and just ignore those that the binary cannot recognize?


For there two cases, we may be able to erase the newly added entries and then roll back (correct me if I am wrong here), but I think there might be more complicated cases which make the rollback impossible. And accept that risk for upgrading.

[1] httpsgithub.com/ceph/ceph/commit/1fe8b846641486cc294fe7e1d2450132c38d2dba

Thanks,
Guang


----------------------------------------
> Date: Wed, 11 Feb 2015 09:26:14 -0800
> From: sage@newdream.net
> To: greg@gregs42.com
> CC: yguang11@outlook.com; ceph-devel@vger.kernel.org
> Subject: Re: Upgrade/rollback
>
> On Wed, 11 Feb 2015, Gregory Farnum wrote:
>> On Wed, Feb 11, 2015 at 4:09 AM, GuangYang <yguang11@outlook.com> wrote:
>>> Hi ceph-devel,
>>> Recently we are trying the upgrade from Firefly to Giant and it goes pretty smoothly, however, the problem is that it does not support rollback and seems like that is by design. For example, there is new feature flag / metadata [1] added in the new version and they are persisted. As a result, the old version of software does not recognize those values and will crash themselves.
>>>
>>> Ideally we never rollback, but for unknown reasons we couldn't fix in a timely manner, we might want to rollback first. Is that something we will consider to handle?
>>
>> We are unlikely to ever do this. We've talked about rollback options,
>> but allowing rollback means either:
>> 1) never adding information to the disk format which matters,
>> 2) having a separate switchover point (besides the code upgrade) which
>> enables all the disk change bits and which doesn't allow you to roll
>> back.
>>
>> The first option is obviously infeasible. The second one dramatically
>> increases the amount of code which can be buggy, increases the testing
>> version, and doesn't really solve the problem since you still have a
>> hard point of no return.
>
> The only rollback plan we do have currently is to start testing and
> supporting rollback within a major release, e.g. 0.80.8 -> 0.80.7. We
> (normally) don't add any incompatible changes within a stable release so
> this won't be a major change except that right now none of the downgrade
> scenarios are tested.
>
> sage
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
 		 	   		  --
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Upgrade/rollback
  2015-02-12  8:48     ` Upgrade/rollback GuangYang
@ 2015-02-12 14:57       ` Gregory Farnum
  2015-02-13  1:45         ` Upgrade/rollback GuangYang
  0 siblings, 1 reply; 7+ messages in thread
From: Gregory Farnum @ 2015-02-12 14:57 UTC (permalink / raw)
  To: GuangYang; +Cc: ceph-devel@vger.kernel.org

On Thu, Feb 12, 2015 at 12:48 AM, GuangYang <yguang11@outlook.com> wrote:
> Thanks Sage and Greg for the response.
>
>> 2) having a separate switchover point (besides the code upgrade) which
>>  enables all the disk change bits and which doesn't allow you to roll
>>  back.
> Let me give two examples which prevent us rollback from Giant to Firefly.
>
> Example #1:
> In Giant, there is a new feature flag 'CEPH_FEATURE_ERASURE_CODE_PLUGINS_V2' added/persisted, and monitor would  check the persisted list against the list released along with the software version upon starting, it refuse to start if the list mismatch. However, although the feature is added in Giant, it is not being used until we create a new pool with the profile, which is very unlikely to happen.
> 1) is it possible to persist the new feature bit when the feature is being used (this looks like complicated to implement). 2) When loading the persisted bit, is it possible to check if it is actually used by someone?
>
> Example #2:
> Patch [1] added a new k/v to the PG log which cannot be recognized by old version of binary (PGLog::read_log), as a result, it takes the newly added entry as a pg_log_entry.
> Is it possible to recognize pg_log_entry with a concrete pattern and just ignore those that the binary cannot recognize?
>
>
> For there two cases, we may be able to erase the newly added entries and then roll back (correct me if I am wrong here), but I think there might be more complicated cases which make the rollback impossible. And accept that risk for upgrading.

For these two specific cases, maybe. But you're missing more
fundamental things: often changes to data structures are about
behavior changes that the daemon needs to understand in order to make
any sense of the data. For instance, any upgrades to CRUSH need to be
understood by everybody participating in the cluster. We could
narrowly have the parsing code ignore anything it doesn't understand,
but then when it does calculations about past_intervals or current
mappings it would be wrong!

Or in the example #2 you have, the extra data is a bug fix that
prevents the OSD doing extra work. But what if it was actually about
changing the shared PG state? In that case you might have OSDs with
their PG in different states depending on how far they'd gotten when
rolled back to the old code.

It's just not a feasible problem, because while there are plenty of
things that we could route around, we still face the inevitable
collision point I described when we do add data of shared import. :(
-Greg

^ permalink raw reply	[flat|nested] 7+ messages in thread

* RE: Upgrade/rollback
  2015-02-12 14:57       ` Upgrade/rollback Gregory Farnum
@ 2015-02-13  1:45         ` GuangYang
  0 siblings, 0 replies; 7+ messages in thread
From: GuangYang @ 2015-02-13  1:45 UTC (permalink / raw)
  To: Gregory Farnum; +Cc: ceph-devel@vger.kernel.org

----------------------------------------
> Date: Thu, 12 Feb 2015 06:57:19 -0800
> Subject: Re: Upgrade/rollback
> From: greg@gregs42.com
> To: yguang11@outlook.com
> CC: ceph-devel@vger.kernel.org
>
> On Thu, Feb 12, 2015 at 12:48 AM, GuangYang <yguang11@outlook.com> wrote:
>> Thanks Sage and Greg for the response.
>>
>>> 2) having a separate switchover point (besides the code upgrade) which
>>> enables all the disk change bits and which doesn't allow you to roll
>>> back.
>> Let me give two examples which prevent us rollback from Giant to Firefly.
>>
>> Example #1:
>> In Giant, there is a new feature flag 'CEPH_FEATURE_ERASURE_CODE_PLUGINS_V2' added/persisted, and monitor would check the persisted list against the list released along with the software version upon starting, it refuse to start if the list mismatch. However, although the feature is added in Giant, it is not being used until we create a new pool with the profile, which is very unlikely to happen.
>> 1) is it possible to persist the new feature bit when the feature is being used (this looks like complicated to implement). 2) When loading the persisted bit, is it possible to check if it is actually used by someone?
>>
>> Example #2:
>> Patch [1] added a new k/v to the PG log which cannot be recognized by old version of binary (PGLog::read_log), as a result, it takes the newly added entry as a pg_log_entry.
>> Is it possible to recognize pg_log_entry with a concrete pattern and just ignore those that the binary cannot recognize?
>>
>>
>> For there two cases, we may be able to erase the newly added entries and then roll back (correct me if I am wrong here), but I think there might be more complicated cases which make the rollback impossible. And accept that risk for upgrading.
>
> For these two specific cases, maybe. But you're missing more
> fundamental things: often changes to data structures are about
> behavior changes that the daemon needs to understand in order to make
> any sense of the data. For instance, any upgrades to CRUSH need to be
> understood by everybody participating in the cluster. We could
> narrowly have the parsing code ignore anything it doesn't understand,
> but then when it does calculations about past_intervals or current
> mappings it would be wrong!
>
> Or in the example #2 you have, the extra data is a bug fix that
> prevents the OSD doing extra work. But what if it was actually about
> changing the shared PG state? In that case you might have OSDs with
> their PG in different states depending on how far they'd gotten when
> rolled back to the old code.
>
> It's just not a feasible problem, because while there are plenty of
> things that we could route around, we still face the inevitable
> collision point I described when we do add data of shared import. :(
That makes total sense. Thanks Greg!

> -Greg
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
 		 	   		  

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2015-02-13  1:45 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-02-11 12:09 Upgrade/rollback GuangYang
2015-02-11 15:17 ` Upgrade/rollback Gregory Farnum
2015-02-11 17:26   ` Upgrade/rollback Sage Weil
2015-02-12  7:40     ` Upgrade/rollback Ric Wheeler
2015-02-12  8:48     ` Upgrade/rollback GuangYang
2015-02-12 14:57       ` Upgrade/rollback Gregory Farnum
2015-02-13  1:45         ` Upgrade/rollback GuangYang

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.