public inbox for linux-btrfs@vger.kernel.org
 help / color / mirror / Atom feed
From: Nikolay Borisov <nborisov@suse.com>
To: Zygo Blaxell <ce3g8jdj@umail.furryterror.org>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: WARNING at fs/btrfs/delayed-ref.c:296 btrfs_merge_delayed_refs+0x3dc/0x410 (new on 5.0.4, not in 5.0.3)
Date: Tue, 26 Mar 2019 17:13:53 +0200	[thread overview]
Message-ID: <d5a66daf-cb5a-0acc-3131-7ba150658f06@suse.com> (raw)
In-Reply-To: <20190326150936.GI16651@hungrycats.org>



On 26.03.19 г. 17:09 ч., Zygo Blaxell wrote:
> On Tue, Mar 26, 2019 at 10:42:31AM +0200, Nikolay Borisov wrote:
>>
>>
>> On 26.03.19 г. 6:30 ч., Zygo Blaxell wrote:
>>> On Mon, Mar 25, 2019 at 10:50:28PM -0400, Zygo Blaxell wrote:
>>>> Running balance, rsync, and dedupe, I get kernel warnings every few
>>>> minutes on 5.0.4.  No warnings on 5.0.3 under similar conditions.
>>>>
>>>> Mount options are:  flushoncommit,space_cache=v2,compress=zstd.
>>>>
>>>> There are two different stacks on the warnings.  This one comes from
>>>> btrfs balance:
>>>
>>> [snip]
>>>
>>> Possibly unrelated, but I'm also repeatably getting this in 5.0.4 and
>>> not 5.0.3, after about 5 hours of uptime.  Different processes, same
>>> kernel stack:
>>>
>>> 	[Mon Mar 25 23:35:17 2019] kworker/u8:4: page allocation failure: order:0, mode:0x404000(GFP_NOWAIT|__GFP_COMP), nodemask=(null),cpuset=/,mems_allowed=0
>>> 	[Mon Mar 25 23:35:17 2019] CPU: 2 PID: 29518 Comm: kworker/u8:4 Tainted: G        W         5.0.4-zb64-303ce93b05c9+ #1
>>
>> What commits does this kernel include because it doesn't seem to be a
>> pristine upstream 5.0.4 ? Also what you are seeing below is definitely a
>> bug in MM. The question is whether it's due to your doing faulty
>> backports in the kernel or it's due to something that got automatically
>> backported to 5.0.4
> 
> That was the first thing I thought of, so I reverted to vanilla 5.0.4,
> repeated the test, and obtained the same result.
> 
> You may have a point about non-btrfs patches in 5.0.4, though.
> I previously tested 5.0.3 with most of the 5.0.4 fs/btrfs commits
> already included by cherry-pick:
> 
> 	1098803b8cb7 Btrfs: fix deadlock between clone/dedupe and rename
> 	3486142a68e3 Btrfs: fix corruption reading shared and compressed extents after hole punching
> 	fb9c36acfab1 btrfs: scrub: fix circular locking dependency warning
> 	9d7b327affb8 Btrfs: setup a nofs context for memory allocation at __btrfs_set_acl
> 	80dcd07c27df Btrfs: setup a nofs context for memory allocation at btrfs_create_tree()
> 
> The commits that are in 5.0.4 but not in my last 5.0.3 test run are:
> 
> 	ebbb48419e8a btrfs: init csum_list before possible free
> 	88e610ae4c3a btrfs: ensure that a DUP or RAID1 block group has exactly two stripes
> 	9c58f2ada4fa btrfs: drop the lock on error in btrfs_dev_replace_cancel
> 
> and I don't see how those commits could lead to the observed changes
> in behavior.  I didn't include them for 5.0.3 because my test scenario
> doesn't execute the code they touch.  So the problem might be outside
> of btrfs completely.

I think it might very well be outside of btrfs because you are seeing an
order 0 failure when you have plenty of order 0 free pages. That's
definitely something you might want to report to mm.

> 

  reply	other threads:[~2019-03-26 15:13 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-03-26  2:50 WARNING at fs/btrfs/delayed-ref.c:296 btrfs_merge_delayed_refs+0x3dc/0x410 (new on 5.0.4, not in 5.0.3) Zygo Blaxell
2019-03-26  4:30 ` Zygo Blaxell
2019-03-26  8:42   ` Nikolay Borisov
2019-03-26 15:09     ` Zygo Blaxell
2019-03-26 15:13       ` Nikolay Borisov [this message]
2019-03-26 15:45         ` Zygo Blaxell
2019-03-26 15:30   ` BUG: workqueue lockup - pool cpus=0-3 flags=0x4 nice=0 stuck for 20915s! (on 4.20+) Zygo Blaxell
2019-05-15 20:27   ` btrfs + KASAN + SLAB + lvmcache + rsync + kernel 5.0 = page allocation failure and/or OOM kills (solved...ish) Zygo Blaxell
2019-04-29 22:34 ` WARNING at fs/btrfs/delayed-ref.c:296 btrfs_merge_delayed_refs+0x3dc/0x410 (still on 5.0.10) Zygo Blaxell
2020-02-04  3:16   ` WARNING at fs/btrfs/delayed-ref.c:296 btrfs_merge_delayed_refs+0x3dc/0x410 (fixed in 5.2) Zygo Blaxell

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=d5a66daf-cb5a-0acc-3131-7ba150658f06@suse.com \
    --to=nborisov@suse.com \
    --cc=ce3g8jdj@umail.furryterror.org \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox