From: Jason Wang <jasowang@redhat.com>
To: Cornelia Huck <cohuck@redhat.com>
Cc: mst@redhat.com, virtio-comment@lists.oasis-open.org,
eperezma@redhat.com, lulu@redhat.com, rob.miller@broadcom.com,
stefanha@redhat.com, pasic@linux.ibm.com, sgarzare@redhat.com
Subject: Re: [virtio-comment] Re: [PATCH] virtio-pci: introduce VIRITO_F_QUEUE_STATE
Date: Tue, 16 Mar 2021 10:53:37 +0800 [thread overview]
Message-ID: <0bd20def-b7fe-0bb7-a660-e5745b727289@redhat.com> (raw)
In-Reply-To: <20210315162432.14f5476a.cohuck@redhat.com>
在 2021/3/15 下午11:24, Cornelia Huck 写道:
> On Mon, 15 Mar 2021 10:58:46 +0800
> Jason Wang <jasowang@redhat.com> wrote:
>
>> This patch adds the ability to save and restore virtqueue state via a
>> new field in the common configuration infrastructure.
>>
>> To simply the implementation, no new device status is introduced. For
>> device, the requirements is not to forget the queue state after
>> virtio reset and clear the virtqueue state upon ACKNOWLEDGE. For
>> driver, it must set the virtqueue state before setting DRIVER_OK.
>>
>> To save a virtqueue state, the driver then need:
>>
>> 1) reset device
>> 2) read virtqueue statue
>>
>> To restore a virtqueue state, the driver need:
>>
>> 1) reset device
>> 2) perform necessary setups (e.g features negotiation)
>> 3) write virtqueue state
>> 4) set DRIVER_OK
>>
>> The main user should be live migration.
>>
>> Signed-off-by: Jason Wang <jasowang@redhat.com>
>> ---
>> content.tex | 38 +++++++++++++++++++++++++++++++++++++
>> virtqueue-state-packed-le.c | 7 +++++++
>> virtqueue-state-split-le.c | 4 ++++
>> 3 files changed, 49 insertions(+)
>> create mode 100644 virtqueue-state-packed-le.c
>> create mode 100644 virtqueue-state-split-le.c
>>
>> diff --git a/content.tex b/content.tex
>> index 620c0e2..d7bff25 100644
>> --- a/content.tex
>> +++ b/content.tex
>> @@ -837,6 +837,7 @@ \subsubsection{Common configuration structure layout}\label{sec:Virtio Transport
>> le64 queue_driver; /* read-write */
>> le64 queue_device; /* read-write */
>> le16 queue_notify_data; /* read-only for driver */
>> + le64 queue_state; /* read-write */
>> };
>> \end{lstlisting}
>>
>> @@ -916,6 +917,29 @@ \subsubsection{Common configuration structure layout}\label{sec:Virtio Transport
>> may benefit from providing another value, for example an internal virtqueue
>> identifier, or an internal offset related to the virtqueue number.
>> \end{note}
>> +
>> +\item[\field{queue_state}]
>> + This field exists only if VIRTIO_F_QUEUE_STATE has been
>> + negotiated. The driver will use this field to get or set the
>> + virtqueue state by reading or writing the 64bit from the
>> + field.
>> + When VIRTIO_F_RING_PACKED has not been negotiated, the driver
>> + can set and get the following states:
>> + \lstinputlisting{virtqueue-state-split-le.c}
>> + The field \field{last_avail_idx} is the location where the
>> + device read for next index from the available ring.
>> + When VIRTIO_F_RING_PACKED has been negotiated, the driver can
>> + set and get the following states:
>> + \lstinputlisting{virtqueue-state-packed-le.c}
>> + The field \field{last_avail_idx} is the next location where
>> + device read for the next descriptor from the descriptor
>> + ring. The field \field{last_avail_wrap_counter} is the last
>> + driver ring wrap counter that is observed by the device. The
>> + field \field{used_idx} is the next location where device write
>> + used descriptor do descriptor ring. The field
>> + \field{used_wrap_counter} is the wrap counter that is used by
>> + the device. See also \ref{sec:Packed Virtqueues / Driver and Device Ring Wrap Counters}.
> While queue_state is a pci-specific field, I don't think any of this is
> transport-specific. I think the description of the layout for the queue
> state should move into a generic section, and this part only reference
> it.
Yes, will move it, probably "basic facility" part.
>
>> +
>> \end{description}
>>
>> \devicenormative{\paragraph}{Common configuration structure layout}{Virtio Transport Options / Virtio Over PCI Bus / PCI Device Layout / Common configuration structure layout}
>> @@ -964,6 +988,10 @@ \subsubsection{Common configuration structure layout}\label{sec:Virtio Transport
>> present either a value of 0 or a power of 2 in
>> \field{queue_size}.
>>
>> +If VIRTIO_F_QUEUE_STATE has been negotiated, a device MUST NOT clear
>> +the queue state upon reset and MUST reset the queue state when
>> +ACKNOWLEDGE has been set through \field{device status} bit.
> What happens if a driver tries to read the queue status outside of this
> window? Should it get zeroes? Unpredictable values?
I'm not sure having normative like this can help. Can we leave it to
device? I had a driver normative to clarify when should driver read or
write to the value.
>
>> +
>> \drivernormative{\paragraph}{Common configuration structure layout}{Virtio Transport Options / Virtio Over PCI Bus / PCI Device Layout / Common configuration structure layout}
>>
>> The driver MUST NOT write to \field{device_feature}, \field{num_queues}, \field{config_generation}, \field{queue_notify_off} or \field{queue_notify_data}.
>> @@ -981,6 +1009,13 @@ \subsubsection{Common configuration structure layout}\label{sec:Virtio Transport
>>
>> The driver MUST NOT write a 0 to \field{queue_enable}.
>>
>> +If VIRTIO_F_QUEUE_STATE has been negotiated, a driver SHOULD set the
>> +state of each virtqueue through \field{queue_state} before setting the
>> +DRIVER_OK \field{device status} bit and SHOULD NOT write to
>> +\field{queue_state} after setting the DRIVER_OK \field{device status}
>> +bit. If a driver want to get the virtqueue state, it MUST first reset
>> +the device then read state from \field{queue_state}.
> What should the driver do with a 'fresh' device? Does it need to start
> out with a reset, read the (zero) state, and then write it back?
If 'fresh' means a normal probe procedure, in this case we don't need to
get the virtqueue state. What we need is to set a proper state. For
split virtqueue, the driver should write 0 (as last_avail_idx). For
packed virtqueue, the driver shoudl write:
{.last_avail_idx = 0, .last_avail_wrap_counter=1, .used_idx=0,
used_wrap_counter=1}.
If 'fresh' means start device after migration, we need to set the
virtqueue state to what source gives us:
in src:
1) reset device
2) read virtqueue states
3) pass virtqueue states to dst
in dst:
1) receivce virtqueue states from src
2) reset device
3) perform necesssary setup ( feature neogitaion etc.)
4) set the virtqueue states we received from src.
Btw, it looks to me we need to clearify that "The driver MUST write to
queue_state after FEATURE_OK but before DRIVER_OK)
Thanks
>
>> +
>> \subsubsection{Notification structure layout}\label{sec:Virtio Transport Options / Virtio Over PCI Bus / PCI Device Layout / Notification capability}
>>
>> The notification location is found using the VIRTIO_PCI_CAP_NOTIFY_CFG
>> @@ -6596,6 +6631,9 @@ \chapter{Reserved Feature Bits}\label{sec:Reserved Feature Bits}
>> transport specific.
>> For more details about driver notifications over PCI see \ref{sec:Virtio Transport Options / Virtio Over PCI Bus / PCI-specific Initialization And Device Operation / Available Buffer Notifications}.
>>
>> +\item[VIRTIO_F_QUEUE_STATE(40)] This feature indicates that the driver
>> + can set and get the virtqueue state.
> Here is probably the best place to put the layout description from the
> pci section above, and to refer to the pci-specific implementation
> (just as it is done for the driver notifications right above.)
>
>> +
>> \end{description}
>>
>> \drivernormative{\section}{Reserved Feature Bits}{Reserved Feature Bits}
>> diff --git a/virtqueue-state-packed-le.c b/virtqueue-state-packed-le.c
>> new file mode 100644
>> index 0000000..f21f9c2
>> --- /dev/null
>> +++ b/virtqueue-state-packed-le.c
>> @@ -0,0 +1,7 @@
>> +le64 {
>> + last_avail_idx : 15;
>> + last_avail_wrap_counter : 1;
>> + used_idx : 15;
>> + used_wrap_counter : 1;
>> + reserved : 32;
>> +};
>> diff --git a/virtqueue-state-split-le.c b/virtqueue-state-split-le.c
>> new file mode 100644
>> index 0000000..daeb4a3
>> --- /dev/null
>> +++ b/virtqueue-state-split-le.c
>> @@ -0,0 +1,4 @@
>> +le64 {
>> + last_avail_idx : 16;
>> + reserved: 48;
>> +};
>
> This publicly archived list offers a means to provide input to the
> OASIS Virtual I/O Device (VIRTIO) TC.
>
> In order to verify user consent to the Feedback License terms and
> to minimize spam in the list archive, subscription is required
> before posting.
>
> Subscribe: virtio-comment-subscribe@lists.oasis-open.org
> Unsubscribe: virtio-comment-unsubscribe@lists.oasis-open.org
> List help: virtio-comment-help@lists.oasis-open.org
> List archive: https://lists.oasis-open.org/archives/virtio-comment/
> Feedback License: https://www.oasis-open.org/who/ipr/feedback_license.pdf
> List Guidelines: https://www.oasis-open.org/policies-guidelines/mailing-lists
> Committee: https://www.oasis-open.org/committees/virtio/
> Join OASIS: https://www.oasis-open.org/join/
>
This publicly archived list offers a means to provide input to the
OASIS Virtual I/O Device (VIRTIO) TC.
In order to verify user consent to the Feedback License terms and
to minimize spam in the list archive, subscription is required
before posting.
Subscribe: virtio-comment-subscribe@lists.oasis-open.org
Unsubscribe: virtio-comment-unsubscribe@lists.oasis-open.org
List help: virtio-comment-help@lists.oasis-open.org
List archive: https://lists.oasis-open.org/archives/virtio-comment/
Feedback License: https://www.oasis-open.org/who/ipr/feedback_license.pdf
List Guidelines: https://www.oasis-open.org/policies-guidelines/mailing-lists
Committee: https://www.oasis-open.org/committees/virtio/
Join OASIS: https://www.oasis-open.org/join/
next prev parent reply other threads:[~2021-03-16 2:53 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-03-15 2:58 [virtio-comment] [PATCH] virtio-pci: introduce VIRITO_F_QUEUE_STATE Jason Wang
2021-03-15 12:24 ` [virtio-comment] " Eugenio Perez Martin
2021-03-16 6:08 ` Jason Wang
2021-03-16 7:37 ` Eugenio Perez Martin
2021-03-15 15:24 ` Cornelia Huck
2021-03-16 2:53 ` Jason Wang [this message]
2021-03-16 11:06 ` Cornelia Huck
2021-03-17 3:43 ` Jason Wang
2021-03-17 12:57 ` Cornelia Huck
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=0bd20def-b7fe-0bb7-a660-e5745b727289@redhat.com \
--to=jasowang@redhat.com \
--cc=cohuck@redhat.com \
--cc=eperezma@redhat.com \
--cc=lulu@redhat.com \
--cc=mst@redhat.com \
--cc=pasic@linux.ibm.com \
--cc=rob.miller@broadcom.com \
--cc=sgarzare@redhat.com \
--cc=stefanha@redhat.com \
--cc=virtio-comment@lists.oasis-open.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox