All of lore.kernel.org
 help / color / mirror / Atom feed
* CephFS and the next jewel release v10.2.3
@ 2016-08-23  8:59 Loic Dachary
  2016-08-24 18:11 ` Gregory Farnum
  0 siblings, 1 reply; 8+ messages in thread
From: Loic Dachary @ 2016-08-23  8:59 UTC (permalink / raw)
  To: John Spray; +Cc: Ceph Development, Abhishek Varshney, Gregory Farnum

Hi John,

The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?

Cheers

P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.

-- 
Loïc Dachary, Artisan Logiciel Libre

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: CephFS and the next jewel release v10.2.3
  2016-08-23  8:59 CephFS and the next jewel release v10.2.3 Loic Dachary
@ 2016-08-24 18:11 ` Gregory Farnum
  2016-08-24 20:46   ` Gregory Farnum
  0 siblings, 1 reply; 8+ messages in thread
From: Gregory Farnum @ 2016-08-24 18:11 UTC (permalink / raw)
  To: Loic Dachary; +Cc: John Spray, Ceph Development, Abhishek Varshney

On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote:
> Hi John,
>
> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?
>
> Cheers
>
> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.

Ugh, I went to check these valgrind failures and they're a failed
mutex assert on shutdown. Let me dig into what's happening. :(
-Greg

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: CephFS and the next jewel release v10.2.3
  2016-08-24 18:11 ` Gregory Farnum
@ 2016-08-24 20:46   ` Gregory Farnum
  2016-08-24 21:02     ` Loic Dachary
  0 siblings, 1 reply; 8+ messages in thread
From: Gregory Farnum @ 2016-08-24 20:46 UTC (permalink / raw)
  To: Loic Dachary, Patrick Donnelly
  Cc: John Spray, Ceph Development, Abhishek Varshney

On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote:
> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote:
>> Hi John,
>>
>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?
>>
>> Cheers
>>
>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.
>
> Ugh, I went to check these valgrind failures and they're a failed
> mutex assert on shutdown. Let me dig into what's happening. :(
> -Greg

Hmm, we seem to have some badly-colliding backports. If you look at
commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line
242, you'll see it does mds_lock.Unlock() twice in quick succession.
:(

One of them came in from 7c2eab19, and the other from 1d3a8168. I
think maybe the patches got reordered in their backport order or
something.

Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick
review? Loic, can you then run the failed tests against a backport
branch which includes that patch?
-Greg

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: CephFS and the next jewel release v10.2.3
  2016-08-24 20:46   ` Gregory Farnum
@ 2016-08-24 21:02     ` Loic Dachary
  2016-08-24 21:03       ` Gregory Farnum
  0 siblings, 1 reply; 8+ messages in thread
From: Loic Dachary @ 2016-08-24 21:02 UTC (permalink / raw)
  To: Gregory Farnum, Patrick Donnelly; +Cc: Ceph Development



On 24/08/2016 22:46, Gregory Farnum wrote:
> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote:
>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote:
>>> Hi John,
>>>
>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?
>>>
>>> Cheers
>>>
>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.
>>
>> Ugh, I went to check these valgrind failures and they're a failed
>> mutex assert on shutdown. Let me dig into what's happening. :(
>> -Greg
> 
> Hmm, we seem to have some badly-colliding backports. If you look at
> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line
> 242, you'll see it does mds_lock.Unlock() twice in quick succession.
> :(
> 
> One of them came in from 7c2eab19, and the other from 1d3a8168. I
> think maybe the patches got reordered in their backport order or
> something.
> 
> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick
> review? Loic, can you then run the failed tests against a backport
> branch which includes that patch?

https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix !

> -Greg
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 

-- 
Loïc Dachary, Artisan Logiciel Libre

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: CephFS and the next jewel release v10.2.3
  2016-08-24 21:02     ` Loic Dachary
@ 2016-08-24 21:03       ` Gregory Farnum
  2016-08-25 21:13         ` Loic Dachary
  0 siblings, 1 reply; 8+ messages in thread
From: Gregory Farnum @ 2016-08-24 21:03 UTC (permalink / raw)
  To: Loic Dachary; +Cc: Patrick Donnelly, Ceph Development

On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote:
>
>
> On 24/08/2016 22:46, Gregory Farnum wrote:
>> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote:
>>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote:
>>>> Hi John,
>>>>
>>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?
>>>>
>>>> Cheers
>>>>
>>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.
>>>
>>> Ugh, I went to check these valgrind failures and they're a failed
>>> mutex assert on shutdown. Let me dig into what's happening. :(
>>> -Greg
>>
>> Hmm, we seem to have some badly-colliding backports. If you look at
>> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line
>> 242, you'll see it does mds_lock.Unlock() twice in quick succession.
>> :(
>>
>> One of them came in from 7c2eab19, and the other from 1d3a8168. I
>> think maybe the patches got reordered in their backport order or
>> something.
>>
>> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick
>> review? Loic, can you then run the failed tests against a backport
>> branch which includes that patch?
>
> https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix !

Sure. Assuming that works out everything else looked good to me. :)

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: CephFS and the next jewel release v10.2.3
  2016-08-24 21:03       ` Gregory Farnum
@ 2016-08-25 21:13         ` Loic Dachary
  2016-08-25 21:33           ` Gregory Farnum
  0 siblings, 1 reply; 8+ messages in thread
From: Loic Dachary @ 2016-08-25 21:13 UTC (permalink / raw)
  To: Gregory Farnum; +Cc: Patrick Donnelly, Ceph Development

Hi,

The run completed at http://pulpito.ceph.com/loic-2016-08-25_06:39:08-fs-jewel-backports-distro-basic-smithi/ with no valgrind error but a few other failures. Note that it is based on the jewel-backports branch which contains a few more backports in addition to https://github.com/ceph/ceph/pull/10847 and I'm not sure what to blame yet. I'll look into it tomorrow unless you beat me to it :-)

Cheers

On 24/08/2016 23:03, Gregory Farnum wrote:
> On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote:
>>
>>
>> On 24/08/2016 22:46, Gregory Farnum wrote:
>>> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote:
>>>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote:
>>>>> Hi John,
>>>>>
>>>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?
>>>>>
>>>>> Cheers
>>>>>
>>>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.
>>>>
>>>> Ugh, I went to check these valgrind failures and they're a failed
>>>> mutex assert on shutdown. Let me dig into what's happening. :(
>>>> -Greg
>>>
>>> Hmm, we seem to have some badly-colliding backports. If you look at
>>> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line
>>> 242, you'll see it does mds_lock.Unlock() twice in quick succession.
>>> :(
>>>
>>> One of them came in from 7c2eab19, and the other from 1d3a8168. I
>>> think maybe the patches got reordered in their backport order or
>>> something.
>>>
>>> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick
>>> review? Loic, can you then run the failed tests against a backport
>>> branch which includes that patch?
>>
>> https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix !
> 
> Sure. Assuming that works out everything else looked good to me. :)
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 

-- 
Loïc Dachary, Artisan Logiciel Libre

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: CephFS and the next jewel release v10.2.3
  2016-08-25 21:13         ` Loic Dachary
@ 2016-08-25 21:33           ` Gregory Farnum
  2016-08-25 22:47             ` Loic Dachary
  0 siblings, 1 reply; 8+ messages in thread
From: Gregory Farnum @ 2016-08-25 21:33 UTC (permalink / raw)
  To: Loic Dachary; +Cc: Patrick Donnelly, Ceph Development

You've got two jobs never getting locks, 2 dead on Java linker issues
(which probably never worked; they aren't any kind of release
blocker), and one OSD crash. The crash is

2016-08-25 09:21:45.616945 7fa22623a700 -1 os/filestore/FileStore.cc:
In function 'void
FileStore::_do_transaction(ObjectStore::Transaction&, uint64_t, int,
ThreadPool::TPHandle*)' thread 7fa22623a700 time 2016-08-25
09:21:45.612229
os/filestore/FileStore.cc: 2912: FAILED assert(0 == "unexpected error")

That most often is an fs/disk issue of some kind, but the kernel logs
are sadly empty so I can't guarantee it.


On Thu, Aug 25, 2016 at 2:13 PM, Loic Dachary <loic@dachary.org> wrote:
> Hi,
>
> The run completed at http://pulpito.ceph.com/loic-2016-08-25_06:39:08-fs-jewel-backports-distro-basic-smithi/ with no valgrind error but a few other failures. Note that it is based on the jewel-backports branch which contains a few more backports in addition to https://github.com/ceph/ceph/pull/10847 and I'm not sure what to blame yet. I'll look into it tomorrow unless you beat me to it :-)
>
> Cheers
>
> On 24/08/2016 23:03, Gregory Farnum wrote:
>> On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote:
>>>
>>>
>>> On 24/08/2016 22:46, Gregory Farnum wrote:
>>>> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote:
>>>>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote:
>>>>>> Hi John,
>>>>>>
>>>>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?
>>>>>>
>>>>>> Cheers
>>>>>>
>>>>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.
>>>>>
>>>>> Ugh, I went to check these valgrind failures and they're a failed
>>>>> mutex assert on shutdown. Let me dig into what's happening. :(
>>>>> -Greg
>>>>
>>>> Hmm, we seem to have some badly-colliding backports. If you look at
>>>> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line
>>>> 242, you'll see it does mds_lock.Unlock() twice in quick succession.
>>>> :(
>>>>
>>>> One of them came in from 7c2eab19, and the other from 1d3a8168. I
>>>> think maybe the patches got reordered in their backport order or
>>>> something.
>>>>
>>>> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick
>>>> review? Loic, can you then run the failed tests against a backport
>>>> branch which includes that patch?
>>>
>>> https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix !
>>
>> Sure. Assuming that works out everything else looked good to me. :)
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>
> --
> Loïc Dachary, Artisan Logiciel Libre

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: CephFS and the next jewel release v10.2.3
  2016-08-25 21:33           ` Gregory Farnum
@ 2016-08-25 22:47             ` Loic Dachary
  0 siblings, 0 replies; 8+ messages in thread
From: Loic Dachary @ 2016-08-25 22:47 UTC (permalink / raw)
  To: Gregory Farnum; +Cc: Patrick Donnelly, Ceph Development

https://github.com/ceph/ceph/pull/10847 is merged

On 25/08/2016 23:33, Gregory Farnum wrote:
> You've got two jobs never getting locks, 2 dead on Java linker issues
> (which probably never worked; they aren't any kind of release
> blocker), and one OSD crash. The crash is
> 
> 2016-08-25 09:21:45.616945 7fa22623a700 -1 os/filestore/FileStore.cc:
> In function 'void
> FileStore::_do_transaction(ObjectStore::Transaction&, uint64_t, int,
> ThreadPool::TPHandle*)' thread 7fa22623a700 time 2016-08-25
> 09:21:45.612229
> os/filestore/FileStore.cc: 2912: FAILED assert(0 == "unexpected error")
> 
> That most often is an fs/disk issue of some kind, but the kernel logs
> are sadly empty so I can't guarantee it.
> 
> 
> On Thu, Aug 25, 2016 at 2:13 PM, Loic Dachary <loic@dachary.org> wrote:
>> Hi,
>>
>> The run completed at http://pulpito.ceph.com/loic-2016-08-25_06:39:08-fs-jewel-backports-distro-basic-smithi/ with no valgrind error but a few other failures. Note that it is based on the jewel-backports branch which contains a few more backports in addition to https://github.com/ceph/ceph/pull/10847 and I'm not sure what to blame yet. I'll look into it tomorrow unless you beat me to it :-)
>>
>> Cheers
>>
>> On 24/08/2016 23:03, Gregory Farnum wrote:
>>> On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote:
>>>>
>>>>
>>>> On 24/08/2016 22:46, Gregory Farnum wrote:
>>>>> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote:
>>>>>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote:
>>>>>>> Hi John,
>>>>>>>
>>>>>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ?
>>>>>>>
>>>>>>> Cheers
>>>>>>>
>>>>>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns.
>>>>>>
>>>>>> Ugh, I went to check these valgrind failures and they're a failed
>>>>>> mutex assert on shutdown. Let me dig into what's happening. :(
>>>>>> -Greg
>>>>>
>>>>> Hmm, we seem to have some badly-colliding backports. If you look at
>>>>> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line
>>>>> 242, you'll see it does mds_lock.Unlock() twice in quick succession.
>>>>> :(
>>>>>
>>>>> One of them came in from 7c2eab19, and the other from 1d3a8168. I
>>>>> think maybe the patches got reordered in their backport order or
>>>>> something.
>>>>>
>>>>> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick
>>>>> review? Loic, can you then run the failed tests against a backport
>>>>> branch which includes that patch?
>>>>
>>>> https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix !
>>>
>>> Sure. Assuming that works out everything else looked good to me. :)
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>
>>
>> --
>> Loïc Dachary, Artisan Logiciel Libre
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 

-- 
Loïc Dachary, Artisan Logiciel Libre

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2016-08-26  0:18 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-08-23  8:59 CephFS and the next jewel release v10.2.3 Loic Dachary
2016-08-24 18:11 ` Gregory Farnum
2016-08-24 20:46   ` Gregory Farnum
2016-08-24 21:02     ` Loic Dachary
2016-08-24 21:03       ` Gregory Farnum
2016-08-25 21:13         ` Loic Dachary
2016-08-25 21:33           ` Gregory Farnum
2016-08-25 22:47             ` Loic Dachary

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.