From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 101C2492187 for ; Thu, 2 Jul 2026 13:09:19 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=170.10.133.124 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1782997762; cv=none; b=oxxFSM7gtiLZhHl7oFZVPNxVnVObf5JckkVzPOeKqUByEccd8lh+hPtGSqvh2KbCsAnxt0PXAVaC40yyQxJf+WmAGaGRTPS5Rq7EexatBB1mE0mbMdA+LgLrR/uoe2VEm3B8wSLLW0uNpIUTZSnTD8TJiRXAdV3uA6qnwrUpTGE= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1782997762; c=relaxed/simple; bh=fiJrezrPzd/66Esvi9NNyo7ek1v+7YfLuLcqu8NbMQ4=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=saDQ68dBjpmztsekv03nze3w7SaGaf+miLQlqWhgxt8XPH/TWd8T2uXdn3QIWad28F7jwEFoyeVnNka++17nT2q+o1dmXZnH314HW0JXiTNtiU+1M63OKz8kSm40+BPnliAsgB2yjHkkLm7aY5n/9XHExy+WJzvfFufp1m6BVYo= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=redhat.com; spf=pass smtp.mailfrom=redhat.com; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b=e5xSdVp9; arc=none smtp.client-ip=170.10.133.124 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=redhat.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=redhat.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="e5xSdVp9" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1782997759; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=pk2pYhN52w0a/kFsyxeCmsifCUQ0v8hhl1JGZ0yb2rk=; b=e5xSdVp9T9b8A8dVDrIaiGy+Lvt04MOedLt0Rs8EvCdQxxee7hADQUMr+zvRRP66ko7Uqg B6TBIo6TDUy7d3y/jlEPxlLiLLlQdcwFQubYMpBdCXsep6syOqEhwomBPCF68o5WzBYr7n fiLXkKPJJQ70xUSloCclbRWV2vq1Fuo= Received: from mx-prod-mc-03.mail-002.prod.us-west-2.aws.redhat.com (ec2-54-186-198-63.us-west-2.compute.amazonaws.com [54.186.198.63]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-642-9HHLOQTTMlCktmfHsFKvaQ-1; Thu, 02 Jul 2026 09:09:13 -0400 X-MC-Unique: 9HHLOQTTMlCktmfHsFKvaQ-1 X-Mimecast-MFC-AGG-ID: 9HHLOQTTMlCktmfHsFKvaQ_1782997751 Received: from mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com (mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com [10.30.177.4]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mx-prod-mc-03.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id C51CB1944EB6; Thu, 2 Jul 2026 13:09:10 +0000 (UTC) Received: from bfoster (unknown [10.22.80.59]) by mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id 29A923001D87; Thu, 2 Jul 2026 13:09:08 +0000 (UTC) Date: Thu, 2 Jul 2026 09:09:05 -0400 From: Brian Foster To: Lorenzo Stoakes Cc: Laurent Pinchart , "Vlastimil Babka (SUSE)" , Jori Koolstra , Christian Brauner , Linus Torvalds , Jonathan Corbet , Jens Axboe , David Hildenbrand , Jeff Layton , workflows@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org Subject: Re: [PATCH RFC] coding-assistants: simplify attribution Message-ID: References: <20260701-work-coding-assistants-v1-1-a20a94d1d606@kernel.org> <20260702093844.GA3491311@killaraus.ideasonboard.com> Precedence: bulk X-Mailing-List: linux-doc@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Scanned-By: MIMEDefang 3.4.1 on 10.30.177.4 On Thu, Jul 02, 2026 at 01:18:06PM +0100, Lorenzo Stoakes wrote: > On Thu, Jul 02, 2026 at 07:57:45AM -0400, Brian Foster wrote: > > On Thu, Jul 02, 2026 at 10:44:09AM +0100, Lorenzo Stoakes wrote: > > > On Thu, Jul 02, 2026 at 12:38:44PM +0300, Laurent Pinchart wrote: > > > > On Thu, Jul 02, 2026 at 10:44:34AM +0200, Vlastimil Babka (SUSE) wrote: > > > > > On 7/2/26 10:12, Jori Koolstra wrote: > > > > > > Ah, I still reigniting this discussion again :) > > > > > > > > > > > > What about a combination of what David and Jeff say? The whole point > > > > > > seems to me that the salient information is not that an LLM was used (or > > > > > > are we going to tag Sashiko as well or any other LLM-based code review > > > > > > tool?), but what is was used to do. This information may be relevant for > > > > > > how the review is approached. The latter should perhaps only be in the > > > > > > cover letter and then we can drop the assisted-by tags altogether. > > > > > > > > > > > > The question about enforcement remains. > > > > > > > > > > It's not possible to enforce it. People can deny it if the tag is missing > > > > > and you confront them and even though the submission has many signs of being > > > > > obviously LLM, there is no definite proof. We've seen (likely, as there's no > > > > > proof!) that happen in mm. > > > > > > > > > > Such situation then penalizes those who disclose so obviously they won't. > > > > > > > > I think there's also a penality for those who don't disclose when > > > > they're told they should: it will lower trust. Kernel development is > > > > largely based on a trust model. If a contributor decides to adopt a > > > > deceiptful behaviour, they can expect maintainers to raise the bar for > > > > accepting patches, when not rejecting them outright. > > > > > > Yes, I explicitly said this in response to somebody for whom there was > > > overwhelming evidence they were submitting AI slop, and that they'd need to > > > build it back up again. > > > > > > It's precisely the issue as I see it. > > > > > > But others within the community disagreed with me, so it turned into a very > > > long and draining discussion that I don't particularly wish to repeat. > > > > > > So we really need clarity on it being OK to do this (I remember saying this > > > last year when I made an ultimately unsuccessful submission to the > > > maintainer's summit about all this :) > > > > > > What matters overall is being able to _quickly_ dismiss AI slop so that > > > asymmetry between LLM generation + maintainer time isn't exploited. > > > > > > And ultimately I think the trust model will end up being 'newcomes have 0, > > > now build it up'. > > > > > > Which sucks but this issue is simply existential for open source. > > > > > > > Has anybody tried throwing any of the obvious LLM slop submissions we > > have seen into one of these LLM detector things? To be clear, I've never > > tried those so I'm certainly no authority on if they even work reliably, > > but if so I wonder if something like that is a potential solution for > > elminating the worst cases.. > > > > I.e., suppose we had some Sashiko type LLM/bot whose job was mainly to > > detect purely LLM generated content based on some minimum level of > > confidence and reply with a loud and clear message to the thread. Maybe > > that would be a clear enough signal to maintainers and reviewers that > > something is not worth prioritizing for review.. Maybe also some "slop > > detected" feedback would help disincentivize flinging slop onto the > > lists. At the very least that could be something that is more easily > > configured/enabled per-subsystem without having to use per-subsystem > > commit tags. > > Yup I thought of this, have done this on series and they do detect it > reliably. > Interesting. > But then it becomes an arms race. People will get AI to try to defeat AI > detection. So I'm not sure it's a safe road to go down. > Yeah that's fair, but my thinking is the goal would be more to detect slop content as opposed to just LLM usage. The trend of these discussions has been that we generally don't care too much about LLM use in general when it's used properly as a tool, but rather as you stated earlier we want to be able to quickly dismiss slop to avoid wasting maintainer and reviewer time. It seems like we're generally able to sniff out pure LLM generated content in most cases. That suggests we should be able document some common patterns that tend to describe slop such that an LLM bot could detect and flag it. For example, "Slop bot has detected comments have LLM prose and just repeat what the code does," "Slop bot has detected unnecessary code volume and duplication," etc. etc. So the idea with that would be that defeating the slop bot is the point. Consider it a sort of informal/minimal goal before a submitter is entitled to human review. But yeah, I guess it's still just a handwavy theory that you'd be able to reliably disentagle detection of general LLM code generation from something that more qualifies as LLM slop. Just thinking out loud. Brian > > > > Brian > > > > Thanks, Lorenzo >