Re: [EXTERNAL] [PATCH v4 10/11] selftests: ceph: add validation harness

The Linux Kernel Mailing List
 help / color / mirror / Atom feed

From: Viacheslav Dubeyko <vdubeyko@redhat.com>
To: Alex Markuze <amarkuze@redhat.com>, ceph-devel@vger.kernel.org
Cc: linux-kernel@vger.kernel.org, idryomov@gmail.com
Subject: Re: [EXTERNAL] [PATCH v4 10/11] selftests: ceph: add validation harness
Date: Thu, 07 May 2026 12:33:47 -0700	[thread overview]
Message-ID: <56dfb4b66773b70733464ce17e344e9fdc8ae756.camel@redhat.com> (raw)
In-Reply-To: <20260507122737.2804094-11-amarkuze@redhat.com>

On Thu, 2026-05-07 at 12:27 +0000, Alex Markuze wrote:
> Add a one-shot validation wrapper that orchestrates the full reset
> test suite with per-stage watchdog timeouts and a final status check.
> 
> The harness runs five stages: baseline (no resets), corner cases,
> moderate stress, aggressive stress, and a post-run status validation.
> Each stage runs with an independent timeout so a hang in one stage
> does not block the entire run.
> 
> Signed-off-by: Alex Markuze <amarkuze@redhat.com>
> ---
>  .../filesystems/ceph/run_validation.sh        | 350 ++++++++++++++++++
>  1 file changed, 350 insertions(+)
>  create mode 100755 tools/testing/selftests/filesystems/ceph/run_validation.sh
> 
> diff --git a/tools/testing/selftests/filesystems/ceph/run_validation.sh b/tools/testing/selftests/filesystems/ceph/run_validation.sh
> new file mode 100755
> index 000000000000..5d521e4f9e9b
> --- /dev/null
> +++ b/tools/testing/selftests/filesystems/ceph/run_validation.sh
> @@ -0,0 +1,350 @@
> +#!/bin/bash
> +# SPDX-License-Identifier: GPL-2.0
> +#
> +# CephFS client reset - single-command validation.
> +# Runs all test stages in sequence with per-stage timeouts.
> +# If any stage hangs (filesystem stuck, process blocked), the
> +# timeout kills it and reports failure.
> +#
> +# Usage:
> +#   sudo ./run_validation.sh --mount-point /mnt/mycephfs
> +#
> +# Expected output on success:
> +#
> +#   === CephFS Client Reset Validation ===
> +#   [stage 1/5] baseline         PASS  (60s, no resets)
> +#   [stage 2/5] corner_cases     PASS  (4/4 passed)
> +#   [stage 3/5] moderate         PASS  (120s, resets every 5-15s)
> +#   [stage 4/5] aggressive       PASS  (120s, resets every 1-5s)
> +#   [stage 5/5] status_check     PASS  (phase=idle, last_errno=0)
> +#
> +#   RESULT: 5/5 stages passed
> +#   Artifacts: /tmp/ceph_reset_validation_<timestamp>
> +
> +set -uo pipefail
> +
> +KSFT_SKIP=4
> +SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
> +
> +# kselftest auto-detect: when invoked with no arguments (e.g. by
> +# "make run_tests"), find a CephFS mount automatically or skip.
> +if [[ $# -eq 0 ]]; then
> +	MOUNT_POINT="$(findmnt -t ceph -n -o TARGET 2>/dev/null | head -1)"
> +	if [[ -z "$MOUNT_POINT" ]]; then
> +		echo "SKIP: No CephFS mount found and --mount-point not specified"
> +		exit "$KSFT_SKIP"
> +	fi
> +	exec "$0" --mount-point "$MOUNT_POINT"
> +fi
> +
> +MOUNT_POINT=""
> +CLIENT_ID=""
> +declare -a CLIENT_ARGS=()
> +declare -a DEBUGFS_ARGS=()
> +RUN_ID="$(date +%Y%m%d-%H%M%S)"
> +OUT_DIR="/tmp/ceph_reset_validation_${RUN_ID}"
> +DEBUGFS_ROOT="/sys/kernel/debug/ceph"
> +
> +# Timeout margins: stage runtime + cooldown + validation + safety buffer
> +STAGE1_TIMEOUT=120    # 60s run + 20s cooldown + 40s buffer
> +STAGE2_TIMEOUT=300    # 4 corner cases, 30s each worst case + buffer
> +STAGE3_TIMEOUT=240    # 120s run + 20s cooldown + 100s buffer
> +STAGE4_TIMEOUT=240    # 120s run + 20s cooldown + 100s buffer
> +STAGE5_TIMEOUT=10     # just reading debugfs
> +
> +PASS=0
> +FAIL=0
> +TOTAL=5
> +
> +usage()
> +{
> +	cat <<EOF
> +Usage: $0 --mount-point <cephfs_mount> [options]
> +
> +Required:
> +  --mount-point PATH    CephFS mount point
> +
> +Options:
> +  --out-dir PATH        Artifact directory (default: /tmp/ceph_reset_validation_<ts>)
> +  --client-id ID        Ceph debugfs client id (optional)
> +  --debugfs-root PATH   Debugfs Ceph root (default: /sys/kernel/debug/ceph)
> +  --help                Show this message
> +EOF
> +}
> +
> +stage_result()
> +{
> +	local num="$1"
> +	local name="$2"
> +	local status="$3"
> +	local detail="$4"
> +
> +	if [[ "$status" == "PASS" ]]; then
> +		PASS=$((PASS + 1))
> +	else
> +		FAIL=$((FAIL + 1))
> +	fi
> +	printf '[stage %d/%d] %-16s %s  (%s)\n' "$num" "$TOTAL" "$name" "$status" "$detail"
> +}
> +
> +# Run a command with a timeout. Returns 0 on success, 1 on failure/timeout.
> +# Sets RUN_TIMED_OUT=1 if killed by timeout.
> +#
> +# The stage command runs in its own session/process group (via setsid).
> +# On timeout the entire process group is killed, not just the top-level
> +# script PID.  This is required because stage scripts (reset_stress.sh,
> +# reset_corner_cases.sh) spawn child processes - I/O workers, rename
> +# workers, reset injectors, samplers - that would otherwise survive the
> +# timeout and bleed into later stages, invalidating results.
> +RUN_TIMED_OUT=0
> +
> +run_with_timeout()
> +{
> +	local timeout_sec="$1"
> +	local logfile="$2"
> +	shift 2
> +
> +	RUN_TIMED_OUT=0
> +
> +	# Start the stage in its own session via setsid so all descendant
> +	# processes share a process group that we can kill atomically.
> +	# In a non-interactive script, background children are not process
> +	# group leaders, so setsid(1) calls setsid(2) directly (no extra
> +	# fork) and the PID we capture IS the group leader.
> +	setsid "$@" > "$logfile" 2>&1 &
> +	local pid=$!
> +
> +	# Watchdog: on timeout, kill the entire process group
> +	(
> +		sleep "$timeout_sec"
> +		if kill -0 "$pid" 2>/dev/null; then
> +			echo "TIMEOUT: stage exceeded ${timeout_sec}s, killing process group $pid" >> "$logfile"
> +			kill -TERM -- -"$pid" 2>/dev/null
> +			sleep 2
> +			kill -KILL -- -"$pid" 2>/dev/null
> +		fi
> +	) &
> +	local watchdog_pid=$!
> +
> +	# Wait for the stage command
> +	wait "$pid" 2>/dev/null
> +	local rc=$?
> +
> +	# Kill the watchdog if it's still running
> +	kill "$watchdog_pid" 2>/dev/null
> +	wait "$watchdog_pid" 2>/dev/null
> +
> +	# Check if it was killed by timeout
> +	if grep -q "^TIMEOUT:" "$logfile" 2>/dev/null; then
> +		RUN_TIMED_OUT=1
> +		return 1
> +	fi
> +
> +	return "$rc"
> +}
> +
> +find_status_path()
> +{
> +	local entry
> +
> +	if [[ -n "$CLIENT_ID" ]]; then
> +		if [[ -r "$DEBUGFS_ROOT/$CLIENT_ID/reset/status" ]]; then
> +			echo "$DEBUGFS_ROOT/$CLIENT_ID/reset/status"
> +			return 0
> +		fi
> +		return 1
> +	fi
> +
> +	for entry in "$DEBUGFS_ROOT"/*/; do
> +		if [[ -r "${entry}reset/status" ]]; then
> +			echo "${entry}reset/status"
> +			return 0
> +		fi
> +	done
> +	return 1
> +}
> +
> +read_status_field()
> +{
> +	local status_path="$1"
> +	local field="$2"
> +	awk -F': ' -v key="$field" '$1 == key {print $2}' "$status_path" 2>/dev/null
> +}
> +
> +# --- Parse arguments -------------------------------------------------------
> +
> +while [[ $# -gt 0 ]]; do
> +	case "$1" in
> +	--mount-point)  MOUNT_POINT="$2"; shift 2 ;;
> +	--out-dir)      OUT_DIR="$2"; shift 2 ;;
> +	--client-id)    CLIENT_ID="$2"; shift 2 ;;
> +	--debugfs-root) DEBUGFS_ROOT="$2"; shift 2 ;;
> +	--help|-h)      usage; exit 0 ;;
> +	*)              echo "Unknown option: $1" >&2; usage; exit 2 ;;
> +	esac
> +done
> +
> +if [[ -z "$MOUNT_POINT" ]]; then
> +	echo "SKIP: --mount-point is required" >&2
> +	usage
> +	exit "$KSFT_SKIP"
> +fi
> +
> +if [[ ! -d "$MOUNT_POINT" ]]; then
> +	echo "SKIP: Mount point does not exist: $MOUNT_POINT" >&2
> +	exit "$KSFT_SKIP"
> +fi
> +
> +# Auto-detect client id when not specified, so all stages (including
> +# stage 5 status check) use the same client consistently.
> +if [[ -z "$CLIENT_ID" ]]; then
> +	candidates=()
> +	for entry in "$DEBUGFS_ROOT"/*/; do
> +		name="$(basename "$entry")"
> +		if [[ -r "${entry}reset/status" ]]; then
> +			candidates+=("$name")
> +		fi
> +	done
> +	if [[ ${#candidates[@]} -eq 1 ]]; then
> +		CLIENT_ID="${candidates[0]}"
> +	elif [[ ${#candidates[@]} -gt 1 ]]; then
> +		echo "SKIP: Multiple Ceph clients found (${candidates[*]}). Use --client-id." >&2
> +		exit "$KSFT_SKIP"
> +	fi
> +fi
> +
> +if [[ -n "$CLIENT_ID" ]]; then
> +	CLIENT_ARGS=(--client-id "$CLIENT_ID")
> +fi
> +DEBUGFS_ARGS=(--debugfs-root "$DEBUGFS_ROOT")
> +
> +# Quick sanity: can we write to the mount?
> +if ! touch "$MOUNT_POINT/.validation_probe_$$" 2>/dev/null; then
> +	echo "SKIP: Mount point is not writable: $MOUNT_POINT" >&2
> +	exit "$KSFT_SKIP"
> +fi
> +rm -f "$MOUNT_POINT/.validation_probe_$$"
> +
> +mkdir -p "$OUT_DIR"
> +
> +echo ""
> +echo "=== CephFS Client Reset Validation ==="
> +echo ""
> +
> +# --- Stage 1: Baseline (no resets) -----------------------------------------
> +
> +stage1_out="$OUT_DIR/stage1_baseline"
> +if run_with_timeout "$STAGE1_TIMEOUT" "$stage1_out.log" \
> +	"$SCRIPT_DIR/reset_stress.sh" \
> +	--mount-point "$MOUNT_POINT" \
> +	--profile baseline \
> +	--no-reset \
> +	--duration-sec 60 \
> +	"${CLIENT_ARGS[@]}" \
> +	"${DEBUGFS_ARGS[@]}" \
> +	--out-dir "$stage1_out"; then
> +	stage_result 1 "baseline" "PASS" "60s, no resets"
> +elif [[ "$RUN_TIMED_OUT" -eq 1 ]]; then
> +	stage_result 1 "baseline" "FAIL" "HUNG: killed after ${STAGE1_TIMEOUT}s"
> +else
> +	stage_result 1 "baseline" "FAIL" "see $stage1_out.log"
> +fi
> +
> +# --- Stage 2: Corner cases -------------------------------------------------
> +
> +stage2_out="$OUT_DIR/stage2_corner_cases"
> +mkdir -p "$stage2_out"
> +if run_with_timeout "$STAGE2_TIMEOUT" "$stage2_out/output.log" \
> +	"$SCRIPT_DIR/reset_corner_cases.sh" \
> +	"${CLIENT_ARGS[@]}" \
> +	"${DEBUGFS_ARGS[@]}" \
> +	--mount-point "$MOUNT_POINT"; then
> +	pass_line=$(grep -Eo '[0-9]+ passed, [0-9]+ failed, [0-9]+ skipped' "$stage2_out/output.log" | tail -1)
> +	stage_result 2 "corner_cases" "PASS" "${pass_line:-all tests passed}"
> +elif [[ "$RUN_TIMED_OUT" -eq 1 ]]; then
> +	stage_result 2 "corner_cases" "FAIL" "HUNG: killed after ${STAGE2_TIMEOUT}s"
> +else
> +	fail_line=$(grep -c 'FAIL' "$stage2_out/output.log" 2>/dev/null || echo "?")
> +	stage_result 2 "corner_cases" "FAIL" "${fail_line} failures, see $stage2_out/output.log"
> +fi
> +
> +# --- Stage 3: Moderate resets -----------------------------------------------
> +
> +stage3_out="$OUT_DIR/stage3_moderate"
> +if run_with_timeout "$STAGE3_TIMEOUT" "$stage3_out.log" \
> +	"$SCRIPT_DIR/reset_stress.sh" \
> +	--mount-point "$MOUNT_POINT" \
> +	--profile moderate \
> +	--duration-sec 120 \
> +	"${CLIENT_ARGS[@]}" \
> +	"${DEBUGFS_ARGS[@]}" \
> +	--out-dir "$stage3_out"; then
> +	stage_result 3 "moderate" "PASS" "120s, resets every 5-15s"
> +elif [[ "$RUN_TIMED_OUT" -eq 1 ]]; then
> +	stage_result 3 "moderate" "FAIL" "HUNG: killed after ${STAGE3_TIMEOUT}s"
> +else
> +	stage_result 3 "moderate" "FAIL" "see $stage3_out.log"
> +fi
> +
> +# --- Stage 4: Aggressive resets ---------------------------------------------
> +
> +stage4_out="$OUT_DIR/stage4_aggressive"
> +if run_with_timeout "$STAGE4_TIMEOUT" "$stage4_out.log" \
> +	"$SCRIPT_DIR/reset_stress.sh" \
> +	--mount-point "$MOUNT_POINT" \
> +	--profile aggressive \
> +	--duration-sec 120 \
> +	"${CLIENT_ARGS[@]}" \
> +	"${DEBUGFS_ARGS[@]}" \
> +	--out-dir "$stage4_out"; then
> +	stage_result 4 "aggressive" "PASS" "120s, resets every 1-5s"
> +elif [[ "$RUN_TIMED_OUT" -eq 1 ]]; then
> +	stage_result 4 "aggressive" "FAIL" "HUNG: killed after ${STAGE4_TIMEOUT}s"
> +else
> +	stage_result 4 "aggressive" "FAIL" "see $stage4_out.log"
> +fi
> +
> +# --- Stage 5: Post-run status check ----------------------------------------
> +
> +status_path=""
> +if status_path=$(find_status_path); then
> +	phase=$(read_status_field "$status_path" "phase")
> +	last_errno=$(read_status_field "$status_path" "last_errno")
> +	failure_count=$(read_status_field "$status_path" "failure_count")
> +	drain_timed_out=$(read_status_field "$status_path" "drain_timed_out")
> +	sessions_reset=$(read_status_field "$status_path" "sessions_reset")
> +	blocked=$(read_status_field "$status_path" "blocked_requests")
> +
> +	# Save full status
> +	cat "$status_path" > "$OUT_DIR/final_status.txt" 2>/dev/null
> +
> +	errors=""
> +	[[ "$phase" != "idle" ]] && errors="${errors}phase=$phase "
> +	[[ "$last_errno" != "0" ]] && errors="${errors}last_errno=$last_errno "
> +	[[ "$failure_count" != "0" && -n "$failure_count" ]] && errors="${errors}failure_count=$failure_count "
> +	[[ "$blocked" != "0" ]] && errors="${errors}blocked_requests=$blocked "
> +
> +	if [[ -z "$errors" ]]; then
> +		detail="phase=$phase, last_errno=$last_errno, failure_count=${failure_count:-0}"
> +		[[ "$drain_timed_out" == "yes" ]] && detail="$detail, drain_timed_out=yes"
> +		[[ -n "$sessions_reset" ]] && detail="$detail, sessions_reset=$sessions_reset"
> +		stage_result 5 "status_check" "PASS" "$detail"
> +	else
> +		stage_result 5 "status_check" "FAIL" "$errors"
> +	fi
> +else
> +	stage_result 5 "status_check" "FAIL" "cannot read reset/status"
> +fi
> +
> +# --- Summary ----------------------------------------------------------------
> +
> +echo ""
> +if [[ "$FAIL" -eq 0 ]]; then
> +	echo "RESULT: $PASS/$TOTAL stages passed"
> +else
> +	echo "RESULT: $PASS/$TOTAL stages passed, $FAIL FAILED"
> +fi
> +echo "Artifacts: $OUT_DIR"
> +echo ""
> +
> +exit "$FAIL"

Reviewed-by: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>

Thanks,
Slava.

next prev parent reply	other threads:[~2026-05-07 19:33 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-05-07 12:27 [PATCH v4 00/11] ceph: manual client session reset Alex Markuze
2026-05-07 12:27 ` [PATCH v4 01/11] ceph: convert inode flags to named bit positions and atomic bitops Alex Markuze
2026-05-07 18:35   ` Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 02/11] ceph: use proper endian conversion for flock_len in reconnect Alex Markuze
2026-05-07 12:27 ` [PATCH v4 03/11] ceph: harden send_mds_reconnect and handle active-MDS peer reset Alex Markuze
2026-05-07 18:43   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 04/11] ceph: add diagnostic timeout loop to wait_caps_flush() Alex Markuze
2026-05-07 19:01   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 05/11] ceph: add client reset state machine and session teardown Alex Markuze
2026-05-07 19:17   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 06/11] ceph: add manual reset debugfs control and tracepoints Alex Markuze
2026-05-07 19:22   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 07/11] selftests: ceph: add reset consistency checker Alex Markuze
2026-05-07 19:24   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 08/11] selftests: ceph: add reset stress test Alex Markuze
2026-05-07 19:29   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 09/11] selftests: ceph: add reset corner-case tests Alex Markuze
2026-05-07 19:31   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 12:27 ` [PATCH v4 10/11] selftests: ceph: add validation harness Alex Markuze
2026-05-07 19:33   ` Viacheslav Dubeyko [this message]
2026-05-07 12:27 ` [PATCH v4 11/11] selftests: ceph: wire up Ceph reset kselftests and documentation Alex Markuze
2026-05-07 19:38   ` [EXTERNAL] " Viacheslav Dubeyko
2026-05-07 18:28 ` [EXTERNAL] [PATCH v4 00/11] ceph: manual client session reset Viacheslav Dubeyko
2026-05-08 17:49   ` Viacheslav Dubeyko

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=56dfb4b66773b70733464ce17e344e9fdc8ae756.camel@redhat.com \
    --to=vdubeyko@redhat.com \
    --cc=amarkuze@redhat.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=idryomov@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox