* Paxos and long-lasting deleted data
@ 2013-01-31 18:17 Andrey Korolyov
2013-01-31 18:31 ` Gregory Farnum
2013-01-31 18:41 ` Joao Eduardo Luis
0 siblings, 2 replies; 10+ messages in thread
From: Andrey Korolyov @ 2013-01-31 18:17 UTC (permalink / raw)
To: ceph-devel
Hi,
Please take a look, this data remains for days and seems not to be
deleted in future too:
pool name category KB objects clones
degraded unfound rd rd KB wr
wr KB
data - 0 0 0
0 0 0 0 0
0
install - 15736833 3856 0
0 0 16 3 464648
60970390
metadata - 0 0 0
0 0 0 0 0
0
prod-rack0 - 364027905 88895 0
0 0 32 0 267626
689034186
rbd - 4194305 1027 0
0 0 4 1 11269
25165828
total used 6900914368 93778
total avail 18335469376
total space 25236383744
for pool in $(rados lspools) ; do rbd ls -l $pool ; done | grep -v
SIZE | awk '{ sum += $2} END { print sum }'
rbd: pool data doesn't contain rbd images
rbd: pool metadata doesn't contain rbd images
526360
I have same thing before, but not so contrast as there. Cluster was
put on moderate failure test, dropping one or two osds at once under
I/O pressure with replication factor three.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-01-31 18:17 Paxos and long-lasting deleted data Andrey Korolyov
@ 2013-01-31 18:31 ` Gregory Farnum
2013-01-31 18:50 ` Andrey Korolyov
2013-01-31 18:41 ` Joao Eduardo Luis
1 sibling, 1 reply; 10+ messages in thread
From: Gregory Farnum @ 2013-01-31 18:31 UTC (permalink / raw)
To: Andrey Korolyov; +Cc: ceph-devel
Can you pastebin the output of "rados -p rbd ls"?
On Thu, Jan 31, 2013 at 10:17 AM, Andrey Korolyov <andrey@xdel.ru> wrote:
> Hi,
>
> Please take a look, this data remains for days and seems not to be
> deleted in future too:
>
> pool name category KB objects clones
> degraded unfound rd rd KB wr
> wr KB
> data - 0 0 0
> 0 0 0 0 0
> 0
> install - 15736833 3856 0
> 0 0 16 3 464648
> 60970390
> metadata - 0 0 0
> 0 0 0 0 0
> 0
> prod-rack0 - 364027905 88895 0
> 0 0 32 0 267626
> 689034186
> rbd - 4194305 1027 0
> 0 0 4 1 11269
> 25165828
> total used 6900914368 93778
> total avail 18335469376
> total space 25236383744
>
> for pool in $(rados lspools) ; do rbd ls -l $pool ; done | grep -v
> SIZE | awk '{ sum += $2} END { print sum }'
> rbd: pool data doesn't contain rbd images
> rbd: pool metadata doesn't contain rbd images
> 526360
>
> I have same thing before, but not so contrast as there. Cluster was
> put on moderate failure test, dropping one or two osds at once under
> I/O pressure with replication factor three.
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-01-31 18:17 Paxos and long-lasting deleted data Andrey Korolyov
2013-01-31 18:31 ` Gregory Farnum
@ 2013-01-31 18:41 ` Joao Eduardo Luis
1 sibling, 0 replies; 10+ messages in thread
From: Joao Eduardo Luis @ 2013-01-31 18:41 UTC (permalink / raw)
To: Andrey Korolyov; +Cc: ceph-devel
On 01/31/2013 06:17 PM, Andrey Korolyov wrote:
> Hi,
>
> Please take a look, this data remains for days and seems not to be
> deleted in future too:
>
> pool name category KB objects clones
> degraded unfound rd rd KB wr
> wr KB
> data - 0 0 0
> 0 0 0 0 0
> 0
> install - 15736833 3856 0
> 0 0 16 3 464648
> 60970390
> metadata - 0 0 0
> 0 0 0 0 0
> 0
> prod-rack0 - 364027905 88895 0
> 0 0 32 0 267626
> 689034186
> rbd - 4194305 1027 0
> 0 0 4 1 11269
> 25165828
> total used 6900914368 93778
> total avail 18335469376
> total space 25236383744
>
> for pool in $(rados lspools) ; do rbd ls -l $pool ; done | grep -v
> SIZE | awk '{ sum += $2} END { print sum }'
> rbd: pool data doesn't contain rbd images
> rbd: pool metadata doesn't contain rbd images
> 526360
>
> I have same thing before, but not so contrast as there. Cluster was
> put on moderate failure test, dropping one or two osds at once under
> I/O pressure with replication factor three.
Just wondering if there was something else you wanted to discuss on your
email given the email subject. Wanted by any chance discuss anything
regarding Paxos?
-Joao
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-01-31 18:31 ` Gregory Farnum
@ 2013-01-31 18:50 ` Andrey Korolyov
2013-01-31 18:56 ` Gregory Farnum
0 siblings, 1 reply; 10+ messages in thread
From: Andrey Korolyov @ 2013-01-31 18:50 UTC (permalink / raw)
To: Gregory Farnum; +Cc: ceph-devel, Joao Eduardo Luis
http://xdel.ru/downloads/ceph-log/rados-out.txt.gz
On Thu, Jan 31, 2013 at 10:31 PM, Gregory Farnum <greg@inktank.com> wrote:
> Can you pastebin the output of "rados -p rbd ls"?
>
> On Thu, Jan 31, 2013 at 10:17 AM, Andrey Korolyov <andrey@xdel.ru> wrote:
>> Hi,
>>
>> Please take a look, this data remains for days and seems not to be
>> deleted in future too:
>>
>> pool name category KB objects clones
>> degraded unfound rd rd KB wr
>> wr KB
>> data - 0 0 0
>> 0 0 0 0 0
>> 0
>> install - 15736833 3856 0
>> 0 0 16 3 464648
>> 60970390
>> metadata - 0 0 0
>> 0 0 0 0 0
>> 0
>> prod-rack0 - 364027905 88895 0
>> 0 0 32 0 267626
>> 689034186
>> rbd - 4194305 1027 0
>> 0 0 4 1 11269
>> 25165828
>> total used 6900914368 93778
>> total avail 18335469376
>> total space 25236383744
>>
>> for pool in $(rados lspools) ; do rbd ls -l $pool ; done | grep -v
>> SIZE | awk '{ sum += $2} END { print sum }'
>> rbd: pool data doesn't contain rbd images
>> rbd: pool metadata doesn't contain rbd images
>> 526360
>>
>> I have same thing before, but not so contrast as there. Cluster was
>> put on moderate failure test, dropping one or two osds at once under
>> I/O pressure with replication factor three.
>Just wondering if there was something else you wanted to discuss on your email given the email subject. Wanted by any >chance discuss anything regarding Paxos?
Sorry, please nevermind, just thought about paxos-like behavior and
suddenly put that in a title, instead of ``osd data placement''.
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-01-31 18:50 ` Andrey Korolyov
@ 2013-01-31 18:56 ` Gregory Farnum
2013-01-31 19:18 ` Andrey Korolyov
0 siblings, 1 reply; 10+ messages in thread
From: Gregory Farnum @ 2013-01-31 18:56 UTC (permalink / raw)
To: Andrey Korolyov, Dan Mick, Josh Durgin; +Cc: ceph-devel
On Thu, Jan 31, 2013 at 10:50 AM, Andrey Korolyov <andrey@xdel.ru> wrote:
> http://xdel.ru/downloads/ceph-log/rados-out.txt.gz
>
>
> On Thu, Jan 31, 2013 at 10:31 PM, Gregory Farnum <greg@inktank.com> wrote:
>> Can you pastebin the output of "rados -p rbd ls"?
Well, that sure is a lot of rbd objects. Looks like a tool mismatch or
a bug in whatever version you were using. Can you describe how you got
into this state, what versions of the servers and client tools you
used, etc?
-Greg
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-01-31 18:56 ` Gregory Farnum
@ 2013-01-31 19:18 ` Andrey Korolyov
2013-02-03 19:45 ` Andrey Korolyov
0 siblings, 1 reply; 10+ messages in thread
From: Andrey Korolyov @ 2013-01-31 19:18 UTC (permalink / raw)
To: Gregory Farnum; +Cc: Dan Mick, Josh Durgin, ceph-devel
On Thu, Jan 31, 2013 at 10:56 PM, Gregory Farnum <greg@inktank.com> wrote:
> On Thu, Jan 31, 2013 at 10:50 AM, Andrey Korolyov <andrey@xdel.ru> wrote:
>> http://xdel.ru/downloads/ceph-log/rados-out.txt.gz
>>
>>
>> On Thu, Jan 31, 2013 at 10:31 PM, Gregory Farnum <greg@inktank.com> wrote:
>>> Can you pastebin the output of "rados -p rbd ls"?
>
>
> Well, that sure is a lot of rbd objects. Looks like a tool mismatch or
> a bug in whatever version you were using. Can you describe how you got
> into this state, what versions of the servers and client tools you
> used, etc?
> -Greg
That`s relatively fresh data moved into bare new cluster after couple
of days of 0.56.1 release, and tool/daemons version kept consistently
the same at any moment. All garbage data belongs to the same pool
prefix(3.) on which I have put a bunch of VM` images lately, cluster
may have been experienced split-brain problem for a short times during
crash-tests with no workload at all and standard crash tests on osd
removal/readdition during moderate workload. Killed osds have been
returned before,at the time and after process of data rearrangement on
``osd down'' timeout. Is it possible to do a little clean somehow
without pool re-creation?
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-01-31 19:18 ` Andrey Korolyov
@ 2013-02-03 19:45 ` Andrey Korolyov
2013-02-03 21:46 ` Gregory Farnum
0 siblings, 1 reply; 10+ messages in thread
From: Andrey Korolyov @ 2013-02-03 19:45 UTC (permalink / raw)
To: Gregory Farnum; +Cc: Dan Mick, Josh Durgin, ceph-devel
On Thu, Jan 31, 2013 at 11:18 PM, Andrey Korolyov <andrey@xdel.ru> wrote:
> On Thu, Jan 31, 2013 at 10:56 PM, Gregory Farnum <greg@inktank.com> wrote:
>> On Thu, Jan 31, 2013 at 10:50 AM, Andrey Korolyov <andrey@xdel.ru> wrote:
>>> http://xdel.ru/downloads/ceph-log/rados-out.txt.gz
>>>
>>>
>>> On Thu, Jan 31, 2013 at 10:31 PM, Gregory Farnum <greg@inktank.com> wrote:
>>>> Can you pastebin the output of "rados -p rbd ls"?
>>
>>
>> Well, that sure is a lot of rbd objects. Looks like a tool mismatch or
>> a bug in whatever version you were using. Can you describe how you got
>> into this state, what versions of the servers and client tools you
>> used, etc?
>> -Greg
>
> That`s relatively fresh data moved into bare new cluster after couple
> of days of 0.56.1 release, and tool/daemons version kept consistently
> the same at any moment. All garbage data belongs to the same pool
> prefix(3.) on which I have put a bunch of VM` images lately, cluster
> may have been experienced split-brain problem for a short times during
> crash-tests with no workload at all and standard crash tests on osd
> removal/readdition during moderate workload. Killed osds have been
> returned before,at the time and after process of data rearrangement on
> ``osd down'' timeout. Is it possible to do a little clean somehow
> without pool re-creation?
Just an update: this data stayed after pool deletion, so there is
probably a way to delete garbage bytes on live pool without doing any
harm(hope so), since it is can be dissected from actual pool pool data
placement, in theory.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-02-03 19:45 ` Andrey Korolyov
@ 2013-02-03 21:46 ` Gregory Farnum
2013-02-04 5:31 ` Andrey Korolyov
0 siblings, 1 reply; 10+ messages in thread
From: Gregory Farnum @ 2013-02-03 21:46 UTC (permalink / raw)
To: Andrey Korolyov; +Cc: Dan Mick, Josh Durgin, ceph-devel
On Sunday, February 3, 2013 at 11:45 AM, Andrey Korolyov wrote:
> Just an update: this data stayed after pool deletion, so there is
> probably a way to delete garbage bytes on live pool without doing any
> harm(hope so), since it is can be dissected from actual pool pool data
> placement, in theory.
What? You mean you deleted the pool and the data in use by the cluster didn't drop? If that's the case, check and see if it's still at the same level — pool deletes are asynchronous and throttled to prevent impacting client operations too much.
-Greg
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-02-03 21:46 ` Gregory Farnum
@ 2013-02-04 5:31 ` Andrey Korolyov
2013-02-04 8:46 ` Chen, Xiaoxi
0 siblings, 1 reply; 10+ messages in thread
From: Andrey Korolyov @ 2013-02-04 5:31 UTC (permalink / raw)
To: Gregory Farnum; +Cc: Dan Mick, Josh Durgin, ceph-devel
On Mon, Feb 4, 2013 at 1:46 AM, Gregory Farnum <greg@inktank.com> wrote:
> On Sunday, February 3, 2013 at 11:45 AM, Andrey Korolyov wrote:
>> Just an update: this data stayed after pool deletion, so there is
>> probably a way to delete garbage bytes on live pool without doing any
>> harm(hope so), since it is can be dissected from actual pool pool data
>> placement, in theory.
>
>
> What? You mean you deleted the pool and the data in use by the cluster didn't drop? If that's the case, check and see if it's still at the same level — pool deletes are asynchronous and throttled to prevent impacting client operations too much.
Yep, of course, I meant this exactly - I have waited until ``ceph -w''
values was stabilized for a long period, then checked that a bunch of
files with same prefix as in deleted pool remains, then I purged them
manually. I`m not sure if this data was in use at the moment of pool
removal, as I mentioned above, it`s just garbage produced during
periods when cluster was degraded heavily.
> -Greg
>
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Paxos and long-lasting deleted data
2013-02-04 5:31 ` Andrey Korolyov
@ 2013-02-04 8:46 ` Chen, Xiaoxi
0 siblings, 0 replies; 10+ messages in thread
From: Chen, Xiaoxi @ 2013-02-04 8:46 UTC (permalink / raw)
To: Andrey Korolyov; +Cc: Gregory Farnum, Dan Mick, Josh Durgin, ceph-devel
I have hit the same issue,when I try to remove a pool which contains a lot of data,the delete finished,both ceph -w and iostat show no activity.
But the 'used' field remain a large number(alought less than original valie). interate ls all the lefted pool solve the inconsistency.
在 2013-2-4,13:32,"Andrey Korolyov" <andrey@xdel.ru> 写道:
> On Mon, Feb 4, 2013 at 1:46 AM, Gregory Farnum <greg@inktank.com> wrote:
>> On Sunday, February 3, 2013 at 11:45 AM, Andrey Korolyov wrote:
>>> Just an update: this data stayed after pool deletion, so there is
>>> probably a way to delete garbage bytes on live pool without doing any
>>> harm(hope so), since it is can be dissected from actual pool pool data
>>> placement, in theory.
>>
>>
>> What? You mean you deleted the pool and the data in use by the cluster didn't drop? If that's the case, check and see if it's still at the same level ― pool deletes are asynchronous and throttled to prevent impacting client operations too much.
>
> Yep, of course, I meant this exactly - I have waited until ``ceph -w''
> values was stabilized for a long period, then checked that a bunch of
> files with same prefix as in deleted pool remains, then I purged them
> manually. I`m not sure if this data was in use at the moment of pool
> removal, as I mentioned above, it`s just garbage produced during
> periods when cluster was degraded heavily.
>
>> -Greg
>>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2013-02-04 8:46 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-01-31 18:17 Paxos and long-lasting deleted data Andrey Korolyov
2013-01-31 18:31 ` Gregory Farnum
2013-01-31 18:50 ` Andrey Korolyov
2013-01-31 18:56 ` Gregory Farnum
2013-01-31 19:18 ` Andrey Korolyov
2013-02-03 19:45 ` Andrey Korolyov
2013-02-03 21:46 ` Gregory Farnum
2013-02-04 5:31 ` Andrey Korolyov
2013-02-04 8:46 ` Chen, Xiaoxi
2013-01-31 18:41 ` Joao Eduardo Luis
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.