From: Abhishek Lekshmanan <abhishek@suse.com>
To: Yehuda Sadeh-Weinraub <yehuda@redhat.com>
Cc: Abhishek Lekshmanan <abhishek@suse.com>,
Ceph Devel <ceph-devel@vger.kernel.org>
Subject: Re: RGW Multisite delete wierdness
Date: Fri, 03 Jun 2016 11:16:12 +0200 [thread overview]
Message-ID: <87inxqaazn.fsf@suse.com> (raw)
In-Reply-To: <CADRKj5SMz1xe+9yQuc71g4+OiQ_qC=TC3srW8LrKBF_GmmFPvQ@mail.gmail.com>
Yehuda Sadeh-Weinraub writes:
> On Fri, Jun 3, 2016 at 1:28 AM, Abhishek Lekshmanan <abhishek@suse.com> wrote:
>>
>> Yehuda Sadeh-Weinraub writes:
>>
>>> On Thu, Jun 2, 2016 at 6:01 AM, Abhishek Lekshmanan <abhishek@suse.com> wrote:
>>>> [..]
>>>> Yehuda Sadeh-Weinraub writes:
>>>>>
>>>>> Yes, that would be a normal behaviour. The primary should not have
>>>>> concurrent sync operations on the same object if object has not
>>>>> completed previous sync operations. Looking at the log it really seems
>>>>> that we don't identify the concurrent sync operation on the same
>>>>> object. This should have been fixed by commit
>>>>> edea6d58dd25995bcc1ed4fc5be6f72ce4a6835a. Can you try to verify what
>>>>> went wrong there (whether can_do_op() returned true and why)?
>>>>
>>>> Looked into this a bit, can_do_op() has returned true for the case when
>>>> primary issues a Fetch (or GET) and when a delete is issued,(even though
>>>> the Fetch is still not complete yet) by putting a debug log around when
>>>> we clear the keys, both the delete op and the get op creates and deletes
>>>> the same key successfully.
>>>>
>>>> Which makes me suspect, that different instances of
>>>> RGWBucketIncSyncShardMarkerTrack are at play here, leading to different
>>>> independent values for key_to_marker. Is that possible?
>>>>
>>> Shouldn't happen, but maybe something went wrong. Try adding some more
>>> info to the log message to see if that's the case.
>>
>> I just added a debug log whenever a new instance of
>> RGWBucketIncSyncShardMarkerTrack was created, and when we check/delete
>> keys, in all cases, ie. when a GET was called and/or when a DELETE was
>> called, it was a newer instance of marker_tracker that was being invoked.
>> Also a few lines before always showed this:
>>
>> incremental_sync(): async update notification: mybucket:62bc922d-f295-4067-ae36-e23e2f231aad.24099.1:-1
>>
>> which seems to be called whenever we're creating a new SingleEntry CR?
>> (the value of modified_iter was the same in every case)
>>
>> Also looking at the cases where the deletion succeeded in the secondary
>> zone, it seemed here too can_do_op had succeeded every time, the
>> difference was in this case either the Object GET came from the remote
>> site after original site had already processed the DELETE or in other
>> cases, the GET in remote site was processed in time before the DELETE.
>>
>>
>
> Can you provide the log? I'm still not sure how you'd have different
> tracker markers for the same bucket instance, as we take a lease to
> prevent concurrent updates to the same bucket shard. This should
> happen in the async updates too.
id: c24eed72-47d4-452c-8d0e-86d96be8fff1
radosgw.8001.log is the master (and in this case the remote site)
>
> Yehuda
prev parent reply other threads:[~2016-06-03 9:16 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-04-19 16:10 RGW Multisite delete wierdness Abhishek Lekshmanan
2016-04-19 17:52 ` Yehuda Sadeh-Weinraub
2016-04-19 17:54 ` Abhishek L
2016-04-19 18:08 ` Yehuda Sadeh-Weinraub
2016-04-22 0:40 ` Yehuda Sadeh-Weinraub
2016-04-25 8:17 ` Abhishek Lekshmanan
2016-04-25 18:46 ` Yehuda Sadeh-Weinraub
2016-04-25 19:44 ` Abhishek L
2016-04-26 17:37 ` Abhishek Lekshmanan
2016-04-26 22:21 ` Yehuda Sadeh-Weinraub
2016-04-26 23:12 ` Yehuda Sadeh-Weinraub
2016-04-27 20:02 ` Abhishek L
2016-04-27 20:15 ` Yehuda Sadeh-Weinraub
2016-04-27 21:50 ` Yehuda Sadeh-Weinraub
2016-05-31 9:21 ` Abhishek Lekshmanan
2016-05-31 11:06 ` Yehuda Sadeh-Weinraub
2016-06-02 13:01 ` Abhishek Lekshmanan
2016-06-02 13:09 ` Yehuda Sadeh-Weinraub
2016-06-03 8:28 ` Abhishek Lekshmanan
2016-06-03 9:00 ` Yehuda Sadeh-Weinraub
2016-06-03 9:09 ` Yehuda Sadeh-Weinraub
2016-06-03 9:16 ` Abhishek Lekshmanan [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87inxqaazn.fsf@suse.com \
--to=abhishek@suse.com \
--cc=ceph-devel@vger.kernel.org \
--cc=yehuda@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox