From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail-pf1-f181.google.com (mail-pf1-f181.google.com [209.85.210.181]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id EDB7F7494 for ; Thu, 19 Sep 2024 01:43:18 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.210.181 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1726710200; cv=none; b=ueElb0GLB7Gdd9r+w6tUh7uZnJLogu6Ww8RjkpaZ0WbCKEB7V6GeifbNjNLcq2pPFGK36p3KPCmnt4rhYUr4OY5SZmajcHEW1V79UFv9I7gK3kbYSW1CqM+Nr5zTtU/jszFtlw8vXuG+HRGLl7nyY9wjDwcUCYMlTEa2QgAw/iE= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1726710200; c=relaxed/simple; bh=oNXPqtyDgLu9Kh2ue6NBD6CfWTrY2WHFnxBQrLR39sg=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=av7xSP3zGwD17hW+ECugot2v1IJJhi4yMzLnYFS+k0rWzaLBzTLu+zQ403HtsypFwiqkTpnrmt5IqO3nmWLU+PlysFbMZTFFWUCMST8Bz5lR1NKxcqxBjy7jj4b2p599r7jpWOnDHoYsuUeYiLE7ctqVOtFBwmj8ijJwXD3kBhM= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=fromorbit.com; spf=pass smtp.mailfrom=fromorbit.com; dkim=pass (2048-bit key) header.d=fromorbit-com.20230601.gappssmtp.com header.i=@fromorbit-com.20230601.gappssmtp.com header.b=zBDJ6s/7; arc=none smtp.client-ip=209.85.210.181 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=fromorbit.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=fromorbit.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=fromorbit-com.20230601.gappssmtp.com header.i=@fromorbit-com.20230601.gappssmtp.com header.b="zBDJ6s/7" Received: by mail-pf1-f181.google.com with SMTP id d2e1a72fcca58-718d6ad6050so214012b3a.0 for ; Wed, 18 Sep 2024 18:43:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fromorbit-com.20230601.gappssmtp.com; s=20230601; t=1726710198; x=1727314998; darn=vger.kernel.org; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:from:to:cc:subject:date:message-id:reply-to; bh=+jwxQJbxpwl4voWuXzYITDQeat0ESjsMo2U9Mc7bSDY=; b=zBDJ6s/7s54hp1CPEEoM5xvQtVnHO9JpYjNVxYHuwzcCmu0iUhSxVkE1Nz3Qa7lNdP /1cxGp7hj9VIbmrZvqdfkEv4CNhjjqNYgJKbLcplGlWvuwch1HPxXgoXPAfDS8ilZfMF cLxTIeLj45WtloEoPSfyGPGQSN3CVJPq0H8g7P9oX+ffFL8vvdbD3+m5ZpWtfMnu5AuI XMtXmNGFzwVw3D6hFPoVVGcYqaaJ+IBpdMfNbF1j4Z1sc2WXoB47QYRstLaH0Gjv19r4 YJlXquYfhTpv3wP/Mq2RkBeUmvl3vy1sHoZKx9Hf3FZ9jq0XAwGK9sJG2RoQyYr+gZgL h28g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1726710198; x=1727314998; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=+jwxQJbxpwl4voWuXzYITDQeat0ESjsMo2U9Mc7bSDY=; b=wTD/jhdqukgGoRW+C1Zdq4HDWt+LeuohMUQZ9ZSA6Elf1tM5CjK0sdrXdTiZKaSY8W zhCb3NDJVmvJ1TYYs9BHhSHWHXtL8dev9GJ3lRN3c12YDSW2U2u5sKbC5bC/50ZXb5qf U17/nYlDj4lbiI9iIsLbnQZLYo37vBNqgMMXo15QtptWdAOsXq59wqtHGDGMQDS2B6iF kjykkqD+EtT6yDsk6//YzkI789wZPiGTf3bt5jKcfLTqmXlTsIuhqTyo+rVtncemAn2R RFWF6+tL5NZyOXNEI90FxPN6LFRaA2tgxZ8fWOHhJprR3iAyjgqQ5pL/yYH+ai2lQBMd 9oaA== X-Forwarded-Encrypted: i=1; AJvYcCU11Rhji8Tv+w3c6chy7hKVVa0NgXTOt5eSfhYRg9B4TkI1MOFwOy/R2GBew1a0nqGf8A2kb2s15GdVBlI=@vger.kernel.org X-Gm-Message-State: AOJu0Yz9tWlLOXez3e2vZsTJsNLsKqeE9UzXU0g4ZmRXhxSf8+P2IBxI xfB3BpEAQ0nD5T39fkeF2HViK15adgfkdsWbGRVH8KWk16mC2I2rorswvU4sdHQ= X-Google-Smtp-Source: AGHT+IHP3QnTzJJQUGix1BZtfPAN61RzG3kG6wmczzL7AUu1v04MsD1wNi6JBs9VM4Ciz1/aEAynVA== X-Received: by 2002:a05:6a21:3942:b0:1d2:bb49:908b with SMTP id adf61e73a8af0-1d2bb499130mr25408416637.18.1726710198379; Wed, 18 Sep 2024 18:43:18 -0700 (PDT) Received: from dread.disaster.area (pa49-179-78-197.pa.nsw.optusnet.com.au. [49.179.78.197]) by smtp.gmail.com with ESMTPSA id d2e1a72fcca58-719918df7ccsm150940b3a.40.2024.09.18.18.43.17 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 18 Sep 2024 18:43:17 -0700 (PDT) Received: from dave by dread.disaster.area with local (Exim 4.96) (envelope-from ) id 1sr6CR-0072c8-0x; Thu, 19 Sep 2024 11:43:15 +1000 Date: Thu, 19 Sep 2024 11:43:15 +1000 From: Dave Chinner To: Matthew Wilcox Cc: Chris Mason , Jens Axboe , Linus Torvalds , Christian Theune , linux-mm@kvack.org, "linux-xfs@vger.kernel.org" , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Daniel Dao , regressions@lists.linux.dev, regressions@leemhuis.info Subject: Re: Known and unfixed active data loss bug in MM + XFS with large folios since Dec 2021 (any kernel from 6.1 upwards) Message-ID: References: <74cceb67-2e71-455f-a4d4-6c5185ef775b@meta.com> <52d45d22-e108-400e-a63f-f50ef1a0ae1a@meta.com> <5bee194c-9cd3-47e7-919b-9f352441f855@kernel.dk> <459beb1c-defd-4836-952c-589203b7005c@meta.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: On Wed, Sep 18, 2024 at 02:34:57PM +0100, Matthew Wilcox wrote: > On Wed, Sep 18, 2024 at 11:28:52AM +0200, Chris Mason wrote: > > I think the bug was in __filemap_add_folio()'s usage of xarray_split_alloc() > > and the tree changing before taking the lock. It's just a guess, but that > > was always my biggest suspect. > > Oh god, that's it. > > there should have been an xas_reset() after calling xas_split_alloc(). > > and 6758c1128ceb calls xas_reset() after calling xas_split_alloc(). Should we be asking for 6758c1128ceb to be backported to all stable kernels then? -Dave. -- Dave Chinner david@fromorbit.com