From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id BC2A3C432C0 for ; Tue, 26 Nov 2019 23:58:24 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 8D6F22064A for ; Tue, 26 Nov 2019 23:58:24 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727051AbfKZX6X (ORCPT ); Tue, 26 Nov 2019 18:58:23 -0500 Received: from dcvr.yhbt.net ([64.71.152.64]:38290 "EHLO dcvr.yhbt.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726380AbfKZX6X (ORCPT ); Tue, 26 Nov 2019 18:58:23 -0500 Received: from localhost (dcvr.yhbt.net [127.0.0.1]) by dcvr.yhbt.net (Postfix) with ESMTP id D2BC51F463; Tue, 26 Nov 2019 23:58:22 +0000 (UTC) Date: Tue, 26 Nov 2019 23:58:22 +0000 From: Eric Wong To: Johannes Schindelin Cc: Jeff King , Emily Shaffer , git@vger.kernel.org Subject: Re: GitGitGadget on git/git, was Re: Should we auto-close PRs on git/git? Message-ID: <20191126235822.GA19066@dcvr> References: <20191121105414.GA16238@sigill.intra.peff.net> <20191125143023.GF494@sigill.intra.peff.net> <20191126215648.GA18872@dcvr> <20191126224044.GA13328@dcvr> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: Sender: git-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org Johannes Schindelin wrote: > Hi Eric, > > On Tue, 26 Nov 2019, Eric Wong wrote: > > > Johannes Schindelin wrote: > > > On Tue, 26 Nov 2019, Eric Wong wrote: > > > > Johannes Schindelin wrote: > > > > > The biggest obstacle is that at least one of those Pipelines requires > > > > > access to a clone of public-inbox.org/git, and cloning that is rather > > > > > expensive. Even a shallow fetch would be super expensive, by virtue of > > > > > _all_ the mails being blobs reachable from the tip commit's tree. > > > > > > > > Fwiw, lore.kernel.org/git/$EPOCH.git ought to be somewhat cheaper, > > > > but it's a different (more scalable) format which requires SQLite: > > > > > > > > https://public-inbox.org/public-inbox-v2-format.html > > > > > > Is this incremental? GitGitGadget needs this to be incremental ;-) > > > > Incremental as far as "git fetch" goes? Of course :> > > The "m" file is overwritten with every commit, so the tree size > > stays at 1 (tree growth was a major scalability problem in v1). > > Let me try again: > > GitGitGadget "reads" the mail via the incremental clone, remembering the > hash of the latest processed commit. When the Azure Pipeline runs, it > first fetches, and if the commit is still the same, does nothing but exit > with success. If the commit is different, it looks at the mails that were > added, via `git log -p ..`. > > Is that possible with the v2 format? Of course, yes. The Xapian and SQLite indexing also works the same way "git log prev..tip" and storing the latest commit hash.