* CephFS and the next jewel release v10.2.3 @ 2016-08-23 8:59 Loic Dachary 2016-08-24 18:11 ` Gregory Farnum 0 siblings, 1 reply; 8+ messages in thread From: Loic Dachary @ 2016-08-23 8:59 UTC (permalink / raw) To: John Spray; +Cc: Ceph Development, Abhishek Varshney, Gregory Farnum Hi John, The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? Cheers P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. -- Loïc Dachary, Artisan Logiciel Libre ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: CephFS and the next jewel release v10.2.3 2016-08-23 8:59 CephFS and the next jewel release v10.2.3 Loic Dachary @ 2016-08-24 18:11 ` Gregory Farnum 2016-08-24 20:46 ` Gregory Farnum 0 siblings, 1 reply; 8+ messages in thread From: Gregory Farnum @ 2016-08-24 18:11 UTC (permalink / raw) To: Loic Dachary; +Cc: John Spray, Ceph Development, Abhishek Varshney On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote: > Hi John, > > The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? > > Cheers > > P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. Ugh, I went to check these valgrind failures and they're a failed mutex assert on shutdown. Let me dig into what's happening. :( -Greg ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: CephFS and the next jewel release v10.2.3 2016-08-24 18:11 ` Gregory Farnum @ 2016-08-24 20:46 ` Gregory Farnum 2016-08-24 21:02 ` Loic Dachary 0 siblings, 1 reply; 8+ messages in thread From: Gregory Farnum @ 2016-08-24 20:46 UTC (permalink / raw) To: Loic Dachary, Patrick Donnelly Cc: John Spray, Ceph Development, Abhishek Varshney On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote: > On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote: >> Hi John, >> >> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? >> >> Cheers >> >> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. > > Ugh, I went to check these valgrind failures and they're a failed > mutex assert on shutdown. Let me dig into what's happening. :( > -Greg Hmm, we seem to have some badly-colliding backports. If you look at commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line 242, you'll see it does mds_lock.Unlock() twice in quick succession. :( One of them came in from 7c2eab19, and the other from 1d3a8168. I think maybe the patches got reordered in their backport order or something. Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick review? Loic, can you then run the failed tests against a backport branch which includes that patch? -Greg ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: CephFS and the next jewel release v10.2.3 2016-08-24 20:46 ` Gregory Farnum @ 2016-08-24 21:02 ` Loic Dachary 2016-08-24 21:03 ` Gregory Farnum 0 siblings, 1 reply; 8+ messages in thread From: Loic Dachary @ 2016-08-24 21:02 UTC (permalink / raw) To: Gregory Farnum, Patrick Donnelly; +Cc: Ceph Development On 24/08/2016 22:46, Gregory Farnum wrote: > On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote: >> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote: >>> Hi John, >>> >>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? >>> >>> Cheers >>> >>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. >> >> Ugh, I went to check these valgrind failures and they're a failed >> mutex assert on shutdown. Let me dig into what's happening. :( >> -Greg > > Hmm, we seem to have some badly-colliding backports. If you look at > commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line > 242, you'll see it does mds_lock.Unlock() twice in quick succession. > :( > > One of them came in from 7c2eab19, and the other from 1d3a8168. I > think maybe the patches got reordered in their backport order or > something. > > Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick > review? Loic, can you then run the failed tests against a backport > branch which includes that patch? https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix ! > -Greg > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > -- Loïc Dachary, Artisan Logiciel Libre ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: CephFS and the next jewel release v10.2.3 2016-08-24 21:02 ` Loic Dachary @ 2016-08-24 21:03 ` Gregory Farnum 2016-08-25 21:13 ` Loic Dachary 0 siblings, 1 reply; 8+ messages in thread From: Gregory Farnum @ 2016-08-24 21:03 UTC (permalink / raw) To: Loic Dachary; +Cc: Patrick Donnelly, Ceph Development On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote: > > > On 24/08/2016 22:46, Gregory Farnum wrote: >> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote: >>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote: >>>> Hi John, >>>> >>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? >>>> >>>> Cheers >>>> >>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. >>> >>> Ugh, I went to check these valgrind failures and they're a failed >>> mutex assert on shutdown. Let me dig into what's happening. :( >>> -Greg >> >> Hmm, we seem to have some badly-colliding backports. If you look at >> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line >> 242, you'll see it does mds_lock.Unlock() twice in quick succession. >> :( >> >> One of them came in from 7c2eab19, and the other from 1d3a8168. I >> think maybe the patches got reordered in their backport order or >> something. >> >> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick >> review? Loic, can you then run the failed tests against a backport >> branch which includes that patch? > > https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix ! Sure. Assuming that works out everything else looked good to me. :) ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: CephFS and the next jewel release v10.2.3 2016-08-24 21:03 ` Gregory Farnum @ 2016-08-25 21:13 ` Loic Dachary 2016-08-25 21:33 ` Gregory Farnum 0 siblings, 1 reply; 8+ messages in thread From: Loic Dachary @ 2016-08-25 21:13 UTC (permalink / raw) To: Gregory Farnum; +Cc: Patrick Donnelly, Ceph Development Hi, The run completed at http://pulpito.ceph.com/loic-2016-08-25_06:39:08-fs-jewel-backports-distro-basic-smithi/ with no valgrind error but a few other failures. Note that it is based on the jewel-backports branch which contains a few more backports in addition to https://github.com/ceph/ceph/pull/10847 and I'm not sure what to blame yet. I'll look into it tomorrow unless you beat me to it :-) Cheers On 24/08/2016 23:03, Gregory Farnum wrote: > On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote: >> >> >> On 24/08/2016 22:46, Gregory Farnum wrote: >>> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote: >>>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote: >>>>> Hi John, >>>>> >>>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? >>>>> >>>>> Cheers >>>>> >>>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. >>>> >>>> Ugh, I went to check these valgrind failures and they're a failed >>>> mutex assert on shutdown. Let me dig into what's happening. :( >>>> -Greg >>> >>> Hmm, we seem to have some badly-colliding backports. If you look at >>> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line >>> 242, you'll see it does mds_lock.Unlock() twice in quick succession. >>> :( >>> >>> One of them came in from 7c2eab19, and the other from 1d3a8168. I >>> think maybe the patches got reordered in their backport order or >>> something. >>> >>> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick >>> review? Loic, can you then run the failed tests against a backport >>> branch which includes that patch? >> >> https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix ! > > Sure. Assuming that works out everything else looked good to me. :) > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > -- Loïc Dachary, Artisan Logiciel Libre ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: CephFS and the next jewel release v10.2.3 2016-08-25 21:13 ` Loic Dachary @ 2016-08-25 21:33 ` Gregory Farnum 2016-08-25 22:47 ` Loic Dachary 0 siblings, 1 reply; 8+ messages in thread From: Gregory Farnum @ 2016-08-25 21:33 UTC (permalink / raw) To: Loic Dachary; +Cc: Patrick Donnelly, Ceph Development You've got two jobs never getting locks, 2 dead on Java linker issues (which probably never worked; they aren't any kind of release blocker), and one OSD crash. The crash is 2016-08-25 09:21:45.616945 7fa22623a700 -1 os/filestore/FileStore.cc: In function 'void FileStore::_do_transaction(ObjectStore::Transaction&, uint64_t, int, ThreadPool::TPHandle*)' thread 7fa22623a700 time 2016-08-25 09:21:45.612229 os/filestore/FileStore.cc: 2912: FAILED assert(0 == "unexpected error") That most often is an fs/disk issue of some kind, but the kernel logs are sadly empty so I can't guarantee it. On Thu, Aug 25, 2016 at 2:13 PM, Loic Dachary <loic@dachary.org> wrote: > Hi, > > The run completed at http://pulpito.ceph.com/loic-2016-08-25_06:39:08-fs-jewel-backports-distro-basic-smithi/ with no valgrind error but a few other failures. Note that it is based on the jewel-backports branch which contains a few more backports in addition to https://github.com/ceph/ceph/pull/10847 and I'm not sure what to blame yet. I'll look into it tomorrow unless you beat me to it :-) > > Cheers > > On 24/08/2016 23:03, Gregory Farnum wrote: >> On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote: >>> >>> >>> On 24/08/2016 22:46, Gregory Farnum wrote: >>>> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote: >>>>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote: >>>>>> Hi John, >>>>>> >>>>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? >>>>>> >>>>>> Cheers >>>>>> >>>>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. >>>>> >>>>> Ugh, I went to check these valgrind failures and they're a failed >>>>> mutex assert on shutdown. Let me dig into what's happening. :( >>>>> -Greg >>>> >>>> Hmm, we seem to have some badly-colliding backports. If you look at >>>> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line >>>> 242, you'll see it does mds_lock.Unlock() twice in quick succession. >>>> :( >>>> >>>> One of them came in from 7c2eab19, and the other from 1d3a8168. I >>>> think maybe the patches got reordered in their backport order or >>>> something. >>>> >>>> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick >>>> review? Loic, can you then run the failed tests against a backport >>>> branch which includes that patch? >>> >>> https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix ! >> >> Sure. Assuming that works out everything else looked good to me. :) >> -- >> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html >> > > -- > Loïc Dachary, Artisan Logiciel Libre ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: CephFS and the next jewel release v10.2.3 2016-08-25 21:33 ` Gregory Farnum @ 2016-08-25 22:47 ` Loic Dachary 0 siblings, 0 replies; 8+ messages in thread From: Loic Dachary @ 2016-08-25 22:47 UTC (permalink / raw) To: Gregory Farnum; +Cc: Patrick Donnelly, Ceph Development https://github.com/ceph/ceph/pull/10847 is merged On 25/08/2016 23:33, Gregory Farnum wrote: > You've got two jobs never getting locks, 2 dead on Java linker issues > (which probably never worked; they aren't any kind of release > blocker), and one OSD crash. The crash is > > 2016-08-25 09:21:45.616945 7fa22623a700 -1 os/filestore/FileStore.cc: > In function 'void > FileStore::_do_transaction(ObjectStore::Transaction&, uint64_t, int, > ThreadPool::TPHandle*)' thread 7fa22623a700 time 2016-08-25 > 09:21:45.612229 > os/filestore/FileStore.cc: 2912: FAILED assert(0 == "unexpected error") > > That most often is an fs/disk issue of some kind, but the kernel logs > are sadly empty so I can't guarantee it. > > > On Thu, Aug 25, 2016 at 2:13 PM, Loic Dachary <loic@dachary.org> wrote: >> Hi, >> >> The run completed at http://pulpito.ceph.com/loic-2016-08-25_06:39:08-fs-jewel-backports-distro-basic-smithi/ with no valgrind error but a few other failures. Note that it is based on the jewel-backports branch which contains a few more backports in addition to https://github.com/ceph/ceph/pull/10847 and I'm not sure what to blame yet. I'll look into it tomorrow unless you beat me to it :-) >> >> Cheers >> >> On 24/08/2016 23:03, Gregory Farnum wrote: >>> On Wed, Aug 24, 2016 at 2:02 PM, Loic Dachary <loic@dachary.org> wrote: >>>> >>>> >>>> On 24/08/2016 22:46, Gregory Farnum wrote: >>>>> On Wed, Aug 24, 2016 at 11:11 AM, Gregory Farnum <gfarnum@redhat.com> wrote: >>>>>> On Tue, Aug 23, 2016 at 1:59 AM, Loic Dachary <loic@dachary.org> wrote: >>>>>>> Hi John, >>>>>>> >>>>>>> The next jewel release as found at https://github.com/ceph/ceph/tree/jewel passed the fs suite (http://tracker.ceph.com/issues/16344#note-30 except for three valgrind failures). Do you think the jewel branch is ready for QE to start their own round of testing ? >>>>>>> >>>>>>> Cheers >>>>>>> >>>>>>> P.S. CC'ing Greg because John is on vacation, in case a decision is to be made before he returns. >>>>>> >>>>>> Ugh, I went to check these valgrind failures and they're a failed >>>>>> mutex assert on shutdown. Let me dig into what's happening. :( >>>>>> -Greg >>>>> >>>>> Hmm, we seem to have some badly-colliding backports. If you look at >>>>> commit ac449472df2978123192d5f22247d8b8bc5efb28 and MDSRank.cc line >>>>> 242, you'll see it does mds_lock.Unlock() twice in quick succession. >>>>> :( >>>>> >>>>> One of them came in from 7c2eab19, and the other from 1d3a8168. I >>>>> think maybe the patches got reordered in their backport order or >>>>> something. >>>>> >>>>> Patrick, can you give https://github.com/ceph/ceph/pull/10847 a quick >>>>> review? Loic, can you then run the failed tests against a backport >>>>> branch which includes that patch? >>>> >>>> https://github.com/ceph/ceph/pull/10847 has been pushed as part of the jewel-backports integration branch right now and I'll schedule a fs run as soon as it finishes building. Thanks for the quick fix ! >>> >>> Sure. Assuming that works out everything else looked good to me. :) >>> -- >>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>> >> >> -- >> Loïc Dachary, Artisan Logiciel Libre > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > -- Loïc Dachary, Artisan Logiciel Libre ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2016-08-26 0:18 UTC | newest] Thread overview: 8+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2016-08-23 8:59 CephFS and the next jewel release v10.2.3 Loic Dachary 2016-08-24 18:11 ` Gregory Farnum 2016-08-24 20:46 ` Gregory Farnum 2016-08-24 21:02 ` Loic Dachary 2016-08-24 21:03 ` Gregory Farnum 2016-08-25 21:13 ` Loic Dachary 2016-08-25 21:33 ` Gregory Farnum 2016-08-25 22:47 ` Loic Dachary
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.