* [Qemu-devel] [PATCH] replay: wake up vCPU when replaying
@ 2018-07-03 8:52 Pavel Dovgalyuk
2018-07-09 11:24 ` Pavel Dovgalyuk
0 siblings, 1 reply; 5+ messages in thread
From: Pavel Dovgalyuk @ 2018-07-03 8:52 UTC (permalink / raw)
To: qemu-devel
Cc: alex.bennee, pbonzini, maria.klimushenkova, dovgaluk,
pavel.dovgaluk
In record/replay icount mode vCPU thread and iothread synchronize
the execution using the checkpoints.
vCPU thread processes the virtual timers and iothread processes all others.
When iothread wants to wake up sleeping vCPU thread, it sends dummy queued
work. Therefore it could be the following sequence of the events in
record mode:
- IO: sending dummy work
- IO: processing timers
- CPU: wakeup
- CPU: clearing dummy work
- CPU: processing virtual timers
But due to the races in replay mode the sequence may change:
- IO: sending dummy work
- CPU: wakeup
- CPU: clearing dummy work
- CPU: sleeping again because nothing to do
- IO: Processing timers
- CPU: zzzz
In this case vCPU will not wake up, because dummy work is not to be set up
again.
This patch tries to wake up the vCPU when it sleeps and the icount warp
checkpoint isn't met. It means that vCPU has something to do, because
there are no other reasons of non-matching warp checkpoint.
Signed-off-by: Pavel Dovgalyuk <Pavel.Dovgaluk@ispras.ru>
---
cpus.c | 15 ++++++++++-----
1 file changed, 10 insertions(+), 5 deletions(-)
diff --git a/cpus.c b/cpus.c
index 181ce33..bad6a33 100644
--- a/cpus.c
+++ b/cpus.c
@@ -539,11 +539,6 @@ void qemu_start_warp_timer(void)
return;
}
- /* warp clock deterministically in record/replay mode */
- if (!replay_checkpoint(CHECKPOINT_CLOCK_WARP_START)) {
- return;
- }
-
if (!all_cpu_threads_idle()) {
return;
}
@@ -553,6 +548,16 @@ void qemu_start_warp_timer(void)
return;
}
+ /* warp clock deterministically in record/replay mode */
+ if (!replay_checkpoint(CHECKPOINT_CLOCK_WARP_START)) {
+ /* vCPU is sleeping and warp can't be started.
+ It is probably a race condition: notification sent
+ to vCPU was processed in advance and vCPU went to sleep.
+ Therefore we have to wake it up for doing someting. */
+ qemu_clock_notify(QEMU_CLOCK_VIRTUAL);
+ return;
+ }
+
/* We want to use the earliest deadline from ALL vm_clocks */
clock = qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL_RT);
deadline = qemu_clock_deadline_ns_all(QEMU_CLOCK_VIRTUAL);
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] [PATCH] replay: wake up vCPU when replaying
2018-07-03 8:52 [Qemu-devel] [PATCH] replay: wake up vCPU when replaying Pavel Dovgalyuk
@ 2018-07-09 11:24 ` Pavel Dovgalyuk
2018-09-10 13:04 ` Paolo Bonzini
0 siblings, 1 reply; 5+ messages in thread
From: Pavel Dovgalyuk @ 2018-07-09 11:24 UTC (permalink / raw)
To: 'Pavel Dovgalyuk', qemu-devel
Cc: alex.bennee, pbonzini, maria.klimushenkova
There are some situations when this patch still doesn't help.
I think this happens due to the race condition in qemu_tcg_rr_wait_io_event
static void qemu_tcg_rr_wait_io_event(CPUState *cpu)
{
while (all_cpu_threads_idle()) {
stop_tcg_kick_timer();
qemu_cond_wait(cpu->halt_cond, &qemu_global_mutex);
}
start_tcg_kick_timer();
qemu_wait_io_event_common(cpu);
}
all_cpu_threads_idle() returns true when there is no queued work.
But between this call and qemu_cond_wait() iothread may add queued work
and the vCPU thread will sleep infinitely.
Does anyone have an idea how to fix this?
Pavel Dovgalyuk
> -----Original Message-----
> From: Pavel Dovgalyuk [mailto:Pavel.Dovgaluk@ispras.ru]
> Sent: Tuesday, July 03, 2018 11:53 AM
> To: qemu-devel@nongnu.org
> Cc: alex.bennee@linaro.org; pbonzini@redhat.com; maria.klimushenkova@ispras.ru;
> dovgaluk@ispras.ru; pavel.dovgaluk@ispras.ru
> Subject: [PATCH] replay: wake up vCPU when replaying
>
> In record/replay icount mode vCPU thread and iothread synchronize
> the execution using the checkpoints.
> vCPU thread processes the virtual timers and iothread processes all others.
> When iothread wants to wake up sleeping vCPU thread, it sends dummy queued
> work. Therefore it could be the following sequence of the events in
> record mode:
> - IO: sending dummy work
> - IO: processing timers
> - CPU: wakeup
> - CPU: clearing dummy work
> - CPU: processing virtual timers
>
> But due to the races in replay mode the sequence may change:
> - IO: sending dummy work
> - CPU: wakeup
> - CPU: clearing dummy work
> - CPU: sleeping again because nothing to do
> - IO: Processing timers
> - CPU: zzzz
>
> In this case vCPU will not wake up, because dummy work is not to be set up
> again.
>
> This patch tries to wake up the vCPU when it sleeps and the icount warp
> checkpoint isn't met. It means that vCPU has something to do, because
> there are no other reasons of non-matching warp checkpoint.
>
> Signed-off-by: Pavel Dovgalyuk <Pavel.Dovgaluk@ispras.ru>
> ---
> cpus.c | 15 ++++++++++-----
> 1 file changed, 10 insertions(+), 5 deletions(-)
>
> diff --git a/cpus.c b/cpus.c
> index 181ce33..bad6a33 100644
> --- a/cpus.c
> +++ b/cpus.c
> @@ -539,11 +539,6 @@ void qemu_start_warp_timer(void)
> return;
> }
>
> - /* warp clock deterministically in record/replay mode */
> - if (!replay_checkpoint(CHECKPOINT_CLOCK_WARP_START)) {
> - return;
> - }
> -
> if (!all_cpu_threads_idle()) {
> return;
> }
> @@ -553,6 +548,16 @@ void qemu_start_warp_timer(void)
> return;
> }
>
> + /* warp clock deterministically in record/replay mode */
> + if (!replay_checkpoint(CHECKPOINT_CLOCK_WARP_START)) {
> + /* vCPU is sleeping and warp can't be started.
> + It is probably a race condition: notification sent
> + to vCPU was processed in advance and vCPU went to sleep.
> + Therefore we have to wake it up for doing someting. */
> + qemu_clock_notify(QEMU_CLOCK_VIRTUAL);
> + return;
> + }
> +
> /* We want to use the earliest deadline from ALL vm_clocks */
> clock = qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL_RT);
> deadline = qemu_clock_deadline_ns_all(QEMU_CLOCK_VIRTUAL);
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] [PATCH] replay: wake up vCPU when replaying
2018-07-09 11:24 ` Pavel Dovgalyuk
@ 2018-09-10 13:04 ` Paolo Bonzini
2018-09-11 7:37 ` Pavel Dovgalyuk
0 siblings, 1 reply; 5+ messages in thread
From: Paolo Bonzini @ 2018-09-10 13:04 UTC (permalink / raw)
To: Pavel Dovgalyuk, 'Pavel Dovgalyuk', qemu-devel
Cc: alex.bennee, maria.klimushenkova
On 09/07/2018 13:24, Pavel Dovgalyuk wrote:
> static void qemu_tcg_rr_wait_io_event(CPUState *cpu)
> {
> while (all_cpu_threads_idle()) {
> stop_tcg_kick_timer();
> qemu_cond_wait(cpu->halt_cond, &qemu_global_mutex);
> }
>
> start_tcg_kick_timer();
>
> qemu_wait_io_event_common(cpu);
> }
>
> all_cpu_threads_idle() returns true when there is no queued work.
> But between this call and qemu_cond_wait() iothread may add queued work
> and the vCPU thread will sleep infinitely.
Maybe queue_work_on_cpu is called outside BQL?
Paolo
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] [PATCH] replay: wake up vCPU when replaying
2018-09-10 13:04 ` Paolo Bonzini
@ 2018-09-11 7:37 ` Pavel Dovgalyuk
2018-09-11 11:07 ` Paolo Bonzini
0 siblings, 1 reply; 5+ messages in thread
From: Pavel Dovgalyuk @ 2018-09-11 7:37 UTC (permalink / raw)
To: 'Paolo Bonzini', 'Pavel Dovgalyuk', qemu-devel
Cc: alex.bennee, maria.klimushenkova
> From: Paolo Bonzini [mailto:pbonzini@redhat.com]
> On 09/07/2018 13:24, Pavel Dovgalyuk wrote:
> > static void qemu_tcg_rr_wait_io_event(CPUState *cpu)
> > {
> > while (all_cpu_threads_idle()) {
> > stop_tcg_kick_timer();
> > qemu_cond_wait(cpu->halt_cond, &qemu_global_mutex);
> > }
> >
> > start_tcg_kick_timer();
> >
> > qemu_wait_io_event_common(cpu);
> > }
> >
> > all_cpu_threads_idle() returns true when there is no queued work.
> > But between this call and qemu_cond_wait() iothread may add queued work
> > and the vCPU thread will sleep infinitely.
>
> Maybe queue_work_on_cpu is called outside BQL?
I don't remember now.
However, rr series includes the better version of that patch.
Pavel Dovgalyuk
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] [PATCH] replay: wake up vCPU when replaying
2018-09-11 7:37 ` Pavel Dovgalyuk
@ 2018-09-11 11:07 ` Paolo Bonzini
0 siblings, 0 replies; 5+ messages in thread
From: Paolo Bonzini @ 2018-09-11 11:07 UTC (permalink / raw)
To: Pavel Dovgalyuk, 'Pavel Dovgalyuk', qemu-devel
Cc: alex.bennee, maria.klimushenkova
On 11/09/2018 09:37, Pavel Dovgalyuk wrote:
>> From: Paolo Bonzini [mailto:pbonzini@redhat.com]
>> On 09/07/2018 13:24, Pavel Dovgalyuk wrote:
>>> static void qemu_tcg_rr_wait_io_event(CPUState *cpu)
>>> {
>>> while (all_cpu_threads_idle()) {
>>> stop_tcg_kick_timer();
>>> qemu_cond_wait(cpu->halt_cond, &qemu_global_mutex);
>>> }
>>>
>>> start_tcg_kick_timer();
>>>
>>> qemu_wait_io_event_common(cpu);
>>> }
>>>
>>> all_cpu_threads_idle() returns true when there is no queued work.
>>> But between this call and qemu_cond_wait() iothread may add queued work
>>> and the vCPU thread will sleep infinitely.
>>
>> Maybe queue_work_on_cpu is called outside BQL?
>
> I don't remember now.
> However, rr series includes the better version of that patch.
Good, thanks.
Paolo
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2018-09-11 11:08 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2018-07-03 8:52 [Qemu-devel] [PATCH] replay: wake up vCPU when replaying Pavel Dovgalyuk
2018-07-09 11:24 ` Pavel Dovgalyuk
2018-09-10 13:04 ` Paolo Bonzini
2018-09-11 7:37 ` Pavel Dovgalyuk
2018-09-11 11:07 ` Paolo Bonzini
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).