Igt-dev Archive on lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH i-g-t] Bump aborting on network failure deadline to 40 seconds
@ 2025-02-06 15:21 Peter Senna Tschudin
  2025-02-06 21:06 ` ✓ Xe.CI.BAT: success for " Patchwork
                   ` (8 more replies)
  0 siblings, 9 replies; 26+ messages in thread
From: Peter Senna Tschudin @ 2025-02-06 15:21 UTC (permalink / raw)
  To: igt-dev
  Cc: Peter Senna Tschudin, juha-pekka.heikkila, katarzyna.piecielska,
	ryszard.knop, ewelina.musial, adrinael, mateusz.grabski,
	konrad.b.brodzik

Commit ddfde25f16ba ("runner: Add support for aborting on network
failure") introduced a 20 second deadline for the DUT’s network
to recover after a suspend/resume cycle. If the network isn’t
back up within that time, igt_runner aborts the test run to save logs
and prevent potential log loss from an imminent power cycle.

This deadline was set to accommodate our internal CI system, which
checks for DUT network connectivity every 5 seconds and retries up
to 3 times at 20 second intervals. If it fails 3 consecutive checks,
it triggers a power cycle on the DUT.

Although our internal CI system can be configured with a longer
wait time, extending it further would unnecessarily prolong tests
in cases of DUT hangs.

Bumping the deadline to 40 seconds keeps the abort mechanism safely
within our internal CI system retry window while improving chances
of preventing a premature abort. For upstream testing on Jenkins,
the deadlines vary from 16 and 25 minutes, and this change has
no impact.

CC: juha-pekka.heikkila@intel.com
CC: katarzyna.piecielska@intel.com
CC: ryszard.knop@intel.com
CC: ewelina.musial@intel.com
CC: adrinael@adrinael.net
CC: mateusz.grabski@intel.com
CC: konrad.b.brodzik@intel.com
Signed-off-by: Peter Senna Tschudin <peter.senna@linux.intel.com>
---
 runner/executor.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/runner/executor.c b/runner/executor.c
index 999e7f719..2abb18732 100644
--- a/runner/executor.c
+++ b/runner/executor.c
@@ -218,11 +218,11 @@ static bool load_ping_config_from_env(void)
 
 /*
  * On some hosts, getting network back up after suspend takes
- * upwards of 10 seconds. 20 seconds should be enough to see
+ * upwards of 10 seconds. 40 seconds should be enough to see
  * if network comes back at all, and hopefully not too long to
  * make external monitoring freak out.
  */
-#define PING_ABORT_DEADLINE 20
+#define PING_ABORT_DEADLINE 40
 
 static bool can_ping(void)
 {
-- 
2.34.1


^ permalink raw reply related	[flat|nested] 26+ messages in thread

end of thread, other threads:[~2025-02-14  9:26 UTC | newest]

Thread overview: 26+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-02-06 15:21 [PATCH i-g-t] Bump aborting on network failure deadline to 40 seconds Peter Senna Tschudin
2025-02-06 21:06 ` ✓ Xe.CI.BAT: success for " Patchwork
2025-02-07  2:00 ` ✗ Xe.CI.Full: failure " Patchwork
2025-02-07  7:51   ` Peter Senna Tschudin
2025-02-07  8:13     ` Ravali, JupallyX
2025-02-07 14:00 ` ✓ Xe.CI.BAT: success " Patchwork
2025-02-07 14:07 ` ✗ i915.CI.BAT: failure " Patchwork
2025-02-07 14:43   ` Peter Senna Tschudin
2025-02-10  5:35     ` Ravali, JupallyX
2025-02-07 21:01 ` ✓ Xe.CI.Full: success " Patchwork
2025-02-10  5:33 ` ✓ i915.CI.BAT: " Patchwork
2025-02-10  7:08 ` ✗ i915.CI.Full: failure " Patchwork
2025-02-10  8:26   ` Peter Senna Tschudin
2025-02-10 14:43     ` Ravali, JupallyX
2025-02-10 10:36 ` ✓ i915.CI.Full: success " Patchwork
2025-02-11  9:21 ` [PATCH i-g-t] " Kamil Konieczny
2025-02-11  9:55   ` Peter Senna Tschudin
2025-02-11 11:59     ` Piecielska, Katarzyna
2025-02-11 13:38       ` Peter Senna Tschudin
2025-02-12 10:06       ` Knop, Ryszard
2025-02-12 13:52         ` Kamil Konieczny
2025-02-12 14:31           ` Knop, Ryszard
2025-02-12 14:59     ` Kamil Konieczny
2025-02-12 15:38       ` Peter Senna Tschudin
2025-02-13 13:37         ` Kamil Konieczny
2025-02-14  9:26           ` Peter Senna Tschudin

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox