netdev.vger.kernel.org archive mirror
* [TEST] nft-flowtable-sh flaking after pulling first chunk of the merge window
@ 2025-01-23 16:04 Jakub Kicinski
  2025-01-23 17:10 ` Pablo Neira Ayuso
  2025-01-29 11:21 ` Pablo Neira Ayuso
  0 siblings, 2 replies; 5+ messages in thread
From: Jakub Kicinski @ 2025-01-23 16:04 UTC (permalink / raw)
  To: Pablo Neira Ayuso; +Cc: netdev, fw

Hi!

Could be very bad luck, but after we fast-forwarded net-next yesterday
we have had 3 failures in less than 24h in nft_flowtable.sh:

https://netdev.bots.linux.dev/contest.html?test=nft-flowtable-sh

# FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  2113852 exceeds expected value 2097152, reply counter  60
https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh/stdout

# FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  3530493 exceeds expected value 3478585, reply counter  60
https://netdev-3.bots.linux.dev/vmksft-nf/results/960022/10-nft-flowtable-sh/stdout

# FAIL: dscp counters do not match, expected dscp3 and dscp0 > 0 but got  1431 , 0 
https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh-retry/stdout



* Re: [TEST] nft-flowtable-sh flaking after pulling first chunk of the merge window
  2025-01-23 16:04 [TEST] nft-flowtable-sh flaking after pulling first chunk of the merge window Jakub Kicinski
@ 2025-01-23 17:10 ` Pablo Neira Ayuso
  2025-01-29 11:21 ` Pablo Neira Ayuso
  1 sibling, 0 replies; 5+ messages in thread
From: Pablo Neira Ayuso @ 2025-01-23 17:10 UTC (permalink / raw)
  To: Jakub Kicinski; +Cc: netdev, fw

Hi Jakub,

On Thu, Jan 23, 2025 at 08:04:44AM -0800, Jakub Kicinski wrote:
> Hi!
> 
> Could be very bad luck, but after we fast-forwarded net-next yesterday
> we have had 3 failures in less than 24h in nft_flowtable.sh:
> 
> https://netdev.bots.linux.dev/contest.html?test=nft-flowtable-sh
> 
> # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  2113852 exceeds expected value 2097152, reply counter  60
> https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh/stdout
> 
> # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  3530493 exceeds expected value 3478585, reply counter  60
> https://netdev-3.bots.linux.dev/vmksft-nf/results/960022/10-nft-flowtable-sh/stdout
> 
> # FAIL: dscp counters do not match, expected dscp3 and dscp0 > 0 but got  1431 , 0 
> https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh-retry/stdout

Thanks for your report, let me take a look.


* Re: [TEST] nft-flowtable-sh flaking after pulling first chunk of the merge window
  2025-01-23 16:04 [TEST] nft-flowtable-sh flaking after pulling first chunk of the merge window Jakub Kicinski
  2025-01-23 17:10 ` Pablo Neira Ayuso
@ 2025-01-29 11:21 ` Pablo Neira Ayuso
  2025-01-30  1:00   ` Jakub Kicinski
  1 sibling, 1 reply; 5+ messages in thread
From: Pablo Neira Ayuso @ 2025-01-29 11:21 UTC (permalink / raw)
  To: Jakub Kicinski; +Cc: netdev, fw, netfilter-devel

Hi Jakub,

On Thu, Jan 23, 2025 at 08:04:44AM -0800, Jakub Kicinski wrote:
> Hi!
> 
> Could be very bad luck, but after we fast-forwarded net-next yesterday
> we have had 3 failures in less than 24h in nft_flowtable.sh:
> 
> https://netdev.bots.linux.dev/contest.html?test=nft-flowtable-sh
> 
> # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  2113852 exceeds expected value 2097152, reply counter  60
> https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh/stdout
> 
> # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  3530493 exceeds expected value 3478585, reply counter  60
> https://netdev-3.bots.linux.dev/vmksft-nf/results/960022/10-nft-flowtable-sh/stdout

This is reporting a flow in the forward chain whose byte counter goes over
the size of the file; that is, a flow that does not follow the flowtable path.
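
To put numbers on it, the overshoot can be worked out from the two FAIL
lines quoted above. This is a minimal Python sketch, assuming the message
format shown in those reports; the regex and the function name
parse_pmtu_fail are illustrative and are not taken from the test script.

```python
import re

# Pattern for the "original counter N exceeds expected value M" FAIL line.
# The format is inferred from the two reports quoted above (an assumption).
PMTU_FAIL = re.compile(
    r"original counter\s+(\d+)\s+exceeds expected value\s+(\d+),"
    r"\s*reply counter\s+(\d+)"
)


def parse_pmtu_fail(line):
    """Return (original, expected, reply, overshoot), or None if no match."""
    m = PMTU_FAIL.search(line)
    if m is None:
        return None
    original, expected, reply = (int(g) for g in m.groups())
    return original, expected, reply, original - expected


# The two pmtu-discovery failures from the report, copied verbatim.
reports = [
    "# FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : "
    "original counter  2113852 exceeds expected value 2097152, reply counter  60",
    "# FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : "
    "original counter  3530493 exceeds expected value 3478585, reply counter  60",
]

for line in reports:
    original, expected, reply, overshoot = parse_pmtu_fail(line)
    print(f"original={original} expected={expected} overshoot={overshoot} bytes")
```

That is 16700 and 51908 bytes over the expected value, respectively.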

> # FAIL: dscp counters do not match, expected dscp3 and dscp0 > 0 but got  1431 , 0 
> https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh-retry/stdout

This is reporting that a flow occasionally does not follow the flowtable
path: dscp3 gets bumped from the forward chain.

I can see this last dscp test FAIL, though rarely, when running this test
in a loop here.
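
For anyone who wants to try reproducing a rare failure like this, here is
a minimal Python sketch of a loop runner. It assumes the test signals
failure through a non-zero exit status and prints lines containing "FAIL";
the function name run_in_loop and the invocation shown afterwards are
hypothetical, not part of the selftests.

```python
import subprocess
import sys


def run_in_loop(cmd, iterations):
    """Run cmd repeatedly; return the 1-based iterations that failed."""
    failed = []
    for i in range(1, iterations + 1):
        # Capture stdout and stderr together so FAIL lines are not lost.
        result = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,
            text=True,
        )
        # Assumption: a non-zero exit status means the run failed.
        if result.returncode != 0:
            failed.append(i)
            # Print only the FAIL lines so the failing sub-test is visible.
            for line in result.stdout.splitlines():
                if "FAIL" in line:
                    print(f"[run {i}] {line}")
    return failed
```

Something like run_in_loop(["./nft_flowtable.sh"], 100) would then return
the list of failing iterations; the path and the iteration count are
placeholders to adjust locally.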

Just a follow-up; I am still diagnosing.


* Re: [TEST] nft-flowtable-sh flaking after pulling first chunk of the merge window
  2025-01-29 11:21 ` Pablo Neira Ayuso
@ 2025-01-30  1:00   ` Jakub Kicinski
  2025-02-05 23:20     ` Pablo Neira Ayuso
  0 siblings, 1 reply; 5+ messages in thread
From: Jakub Kicinski @ 2025-01-30  1:00 UTC (permalink / raw)
  To: Pablo Neira Ayuso; +Cc: netdev, fw, netfilter-devel

On Wed, 29 Jan 2025 12:21:24 +0100 Pablo Neira Ayuso wrote:
> > Could be very bad luck, but after we fast-forwarded net-next yesterday
> > we have had 3 failures in less than 24h in nft_flowtable.sh:
> > 
> > https://netdev.bots.linux.dev/contest.html?test=nft-flowtable-sh
> > 
> > # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  2113852 exceeds expected value 2097152, reply counter  60
> > https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh/stdout
> > 
> > # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  3530493 exceeds expected value 3478585, reply counter  60
> > https://netdev-3.bots.linux.dev/vmksft-nf/results/960022/10-nft-flowtable-sh/stdout  
> 
> This is reporting a flow in the forward chain whose byte counter goes over
> the size of the file; that is, a flow that does not follow the flowtable path.
> 
> > # FAIL: dscp counters do not match, expected dscp3 and dscp0 > 0 but got  1431 , 0 
> > https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh-retry/stdout  
> 
> This is reporting that a flow occasionally does not follow the flowtable
> path: dscp3 gets bumped from the forward chain.
> 
> I can see this last dscp test FAIL, though rarely, when running this test
> in a loop here.
> 
> Just a follow-up; I am still diagnosing.

Thanks for the update!

FWIW we have hit 4 more flakes since I reported it to you last week
(the first link from the previous message will take you to them).
All four are in dscp_fwd.


* Re: [TEST] nft-flowtable-sh flaking after pulling first chunk of the merge window
  2025-01-30  1:00   ` Jakub Kicinski
@ 2025-02-05 23:20     ` Pablo Neira Ayuso
  0 siblings, 0 replies; 5+ messages in thread
From: Pablo Neira Ayuso @ 2025-02-05 23:20 UTC (permalink / raw)
  To: Jakub Kicinski; +Cc: netdev, fw, netfilter-devel

Hi Jakub,

On Wed, Jan 29, 2025 at 05:00:57PM -0800, Jakub Kicinski wrote:
> On Wed, 29 Jan 2025 12:21:24 +0100 Pablo Neira Ayuso wrote:
> > > Could be very bad luck, but after we fast-forwarded net-next yesterday
> > > we have had 3 failures in less than 24h in nft_flowtable.sh:
> > > 
> > > https://netdev.bots.linux.dev/contest.html?test=nft-flowtable-sh
> > > 
> > > # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  2113852 exceeds expected value 2097152, reply counter  60
> > > https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh/stdout
> > > 
> > > # FAIL: flow offload for ns1/ns2 with masquerade and pmtu discovery : original counter  3530493 exceeds expected value 3478585, reply counter  60
> > > https://netdev-3.bots.linux.dev/vmksft-nf/results/960022/10-nft-flowtable-sh/stdout  
> > 
> > This is reporting a flow in the forward chain whose byte counter goes over
> > the size of the file; that is, a flow that does not follow the flowtable path.
> > 
> > > # FAIL: dscp counters do not match, expected dscp3 and dscp0 > 0 but got  1431 , 0 
> > > https://netdev-3.bots.linux.dev/vmksft-nf/results/960740/11-nft-flowtable-sh-retry/stdout  
> > 
> > This is reporting that a flow occasionally does not follow the flowtable
> > path: dscp3 gets bumped from the forward chain.
> > 
> > I can see this last dscp test FAIL, though rarely, when running this test
> > in a loop here.
> > 
> > Just a follow-up; I am still diagnosing.
> 
> Thanks for the update!
> 
> FWIW we have hit 4 more flakes since I reported it to you last week
> (the first link from the previous message will take you to them).
> All four are in dscp_fwd.

Just another follow-up on this. I am testing a revert here of:

  b8baac3b9c5c ("netfilter: flowtable: teardown flow if cached mtu is stale")

nft_flowtable.sh shows too-frequent re-offloads (create/teardown
cycles) with fragments, which can lead to no packets following the
flowtable path, as dscp_fwd reports.

Let me give it more testing; then, if results are positive, I will
formally propose this revert.

Thanks.

