From: Sebastian Ott <sebott@redhat.com>
To: Zenghui Yu <yuzenghui@huawei.com>
Cc: Marc Zyngier <maz@kernel.org>,
Oliver Upton <oliver.upton@linux.dev>,
Colton Lewis <coltonlewis@google.com>,
Ricardo Koller <ricarkol@google.com>,
Joey Gouly <joey.gouly@arm.com>,
Suzuki K Poulose <suzuki.poulose@arm.com>,
Shuah Khan <shuah@kernel.org>,
linux-arm-kernel@lists.infradead.org, kvmarm@lists.linux.dev,
linux-kernel@vger.kernel.org, linux-kselftest@vger.kernel.org
Subject: Re: [PATCH v2 0/3] KVM: arm64: selftests: arch_timer_edge_cases fixes
Date: Wed, 4 Jun 2025 23:17:15 +0200 (CEST) [thread overview]
Message-ID: <77f07e94-82a2-5dc4-2483-e2ecff151e66@redhat.com> (raw)
In-Reply-To: <9b9f7099-4e81-9b74-a1ac-37cd4965675b@redhat.com>
On Wed, 4 Jun 2025, Sebastian Ott wrote:
> On Tue, 3 Jun 2025, Zenghui Yu wrote:
>> On 2025/5/27 22:24, Sebastian Ott wrote:
>>> Some small fixes for arch_timer_edge_cases that I stumbled upon
>>> while debugging failures for this selftest on ampere-one.
>>>
>>> Changes since v1: modified patch 3 based on suggestions from Marc.
>>>
>>> I've done some tests with this on various machines - seems to be all
>>> good, however on ampere-one I now hit this in 10% of the runs:
>>> ==== Test Assertion Failure ====
>>> arm64/arch_timer_edge_cases.c:481: timer_get_cntct(timer) >= DEF_CNT +
>>> (timer_get_cntfrq() * (uint64_t)(delta_2_ms) / 1000)
>>> pid=166657 tid=166657 errno=4 - Interrupted system call
>>> 1 0x0000000000404db3: test_run at arch_timer_edge_cases.c:933
>>> 2 0x0000000000401f9f: main at arch_timer_edge_cases.c:1062
>>> 3 0x0000ffffaedd625b: ?? ??:0
>>> 4 0x0000ffffaedd633b: ?? ??:0
>>> 5 0x00000000004020af: _start at ??:?
>>> timer_get_cntct(timer) >= DEF_CNT + msec_to_cycles(delta_2_ms)
>>>
>>> This is not new, it was just hidden behind the other failure. I'll
>>> try to figure out what this is about (seems to be independent of
>>> the wait time)..
>>
>> Not sure if you have figured it out. I can easily reproduce it on my box
>> and I *guess* it is that we have some random XVAL values when we enable
>> the timer..
>
> Yes, I think so, too.
>
>> test_reprogramming_timer()
>> {
>> local_irq_disable();
>> reset_timer_state(timer, DEF_CNT);
>
> My first attempt was to also initialize cval here
Forgot to mention that I did this because my tests have shown
that the interrupt didn't only trigger early (like before the
reprogrammed delta) but instantly. This seemed to work but I think
the order in set_tval_irq() is the actual issue.
>
>>
>> /* Program the timer to DEF_CNT + delta_1_ms. */
>> set_tval_irq(timer, msec_to_cycles(delta_1_ms), CTL_ENABLE);
>>
>> [...]
>> }
>>
>> set_tval_irq()
>> {
>> timer_set_ctl(timer, ctl);
>>
>> // There is a window that we enable the timer with *random* XVAL
>> // values and we may get the unexpected interrupt.. And it's
>> // unlikely that KVM can be aware of TVAL's change (and
>> // re-evaluate the interrupt's pending state) before hitting the
>> // GUEST_ASSERT().
>>
>> timer_set_tval(timer, tval_cycles);
>
> Yes, I stumbled over this as well. I've always assumed that this order is
> becauase of this from the architecture "If CNTV_CTL_EL0.ENABLE is 0, the
> value returned is UNKNOWN." However re-reading that part today I realized
> that this only concerns register reads.
>
> Maybe somone on cc knows why it's in that order?
>
> I'm currently testing this with the above swapped and it's looking good,
> so far.
>
>> }
>>
>> I'm not familiar with the test so I'm not 100% sure that this is the
>> root cause. But I hope this helps with your analysis ;-) .
>
> It did, thanks!
>
> Sebastian
>
next prev parent reply other threads:[~2025-06-04 21:17 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-05-27 14:24 [PATCH v2 0/3] KVM: arm64: selftests: arch_timer_edge_cases fixes Sebastian Ott
2025-05-27 14:24 ` [PATCH v2 1/3] KVM: arm64: selftests: fix help text for arch_timer_edge_cases Sebastian Ott
2025-05-27 14:24 ` [PATCH v2 2/3] KVM: arm64: selftests: fix thread migration in arch_timer_edge_cases Sebastian Ott
2025-05-27 14:24 ` [PATCH v2 3/3] KVM: arm64: selftests: arch_timer_edge_cases - determine effective counter width Sebastian Ott
2025-06-03 12:35 ` [PATCH v2 0/3] KVM: arm64: selftests: arch_timer_edge_cases fixes Zenghui Yu
2025-06-04 20:58 ` Sebastian Ott
2025-06-04 21:17 ` Sebastian Ott [this message]
2025-06-05 6:46 ` Marc Zyngier
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=77f07e94-82a2-5dc4-2483-e2ecff151e66@redhat.com \
--to=sebott@redhat.com \
--cc=coltonlewis@google.com \
--cc=joey.gouly@arm.com \
--cc=kvmarm@lists.linux.dev \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-kselftest@vger.kernel.org \
--cc=maz@kernel.org \
--cc=oliver.upton@linux.dev \
--cc=ricarkol@google.com \
--cc=shuah@kernel.org \
--cc=suzuki.poulose@arm.com \
--cc=yuzenghui@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).