* [PATCH] virtio_scsi: remove ACCESS_ONCE() and smp_read_barrier_depends()
@ 2014-05-07 14:34 Ming Lei
2014-05-07 15:50 ` Paolo Bonzini
0 siblings, 1 reply; 5+ messages in thread
From: Ming Lei @ 2014-05-07 14:34 UTC (permalink / raw)
To: James E.J. Bottomley, Paolo Bonzini
Cc: linux-scsi, Wanlong Gao, Rusty Russell, Ming Lei
Access to tgt->req_vq is strictly serialized by spin_lock
of tgt->tgt_lock, so the ACCESS_ONCE() isn't necessary.
smp_read_barrier_depends() in virtscsi_req_done was introduced
to order reading req_vq and decreasing tgt->reqs, but it isn't
needed now because req_vq is read from
scsi->req_vqs[vq->index - VIRTIO_SCSI_VQ_BASE] instead of
tgt->req_vq, so remove the unnecessary barrier.
Also remove related comment about the barrier.
Cc: Paolo Bonzini <pbonzini@redhat.com>
Signed-off-by: Ming Lei <tom.leiming@gmail.com>
---
drivers/scsi/virtio_scsi.c | 49 +++-----------------------------------------
1 file changed, 3 insertions(+), 46 deletions(-)
diff --git a/drivers/scsi/virtio_scsi.c b/drivers/scsi/virtio_scsi.c
index 16bfd50..697fa53 100644
--- a/drivers/scsi/virtio_scsi.c
+++ b/drivers/scsi/virtio_scsi.c
@@ -73,15 +73,8 @@ struct virtio_scsi_vq {
* queue, and also lets the driver optimize the IRQ affinity for the virtqueues
* (each virtqueue's affinity is set to the CPU that "owns" the queue).
*
- * An interesting effect of this policy is that only writes to req_vq need to
- * take the tgt_lock. Read can be done outside the lock because:
- *
- * - writes of req_vq only occur when atomic_inc_return(&tgt->reqs) returns 1.
- * In that case, no other CPU is reading req_vq: even if they were in
- * virtscsi_queuecommand_multi, they would be spinning on tgt_lock.
- *
- * - reads of req_vq only occur when the target is not idle (reqs != 0).
- * A CPU that enters virtscsi_queuecommand_multi will not modify req_vq.
+ * tgt_lock is held to serialize reading and writing req_vq. Reading req_vq
+ * could be done locklessly, but we do not do it yet.
*
* Similarly, decrements of reqs are never concurrent with writes of req_vq.
* Thus they can happen outside the tgt_lock, provided of course we make reqs
@@ -238,38 +231,6 @@ static void virtscsi_req_done(struct virtqueue *vq)
int index = vq->index - VIRTIO_SCSI_VQ_BASE;
struct virtio_scsi_vq *req_vq = &vscsi->req_vqs[index];
- /*
- * Read req_vq before decrementing the reqs field in
- * virtscsi_complete_cmd.
- *
- * With barriers:
- *
- * CPU #0 virtscsi_queuecommand_multi (CPU #1)
- * ------------------------------------------------------------
- * lock vq_lock
- * read req_vq
- * read reqs (reqs = 1)
- * write reqs (reqs = 0)
- * increment reqs (reqs = 1)
- * write req_vq
- *
- * Possible reordering without barriers:
- *
- * CPU #0 virtscsi_queuecommand_multi (CPU #1)
- * ------------------------------------------------------------
- * lock vq_lock
- * read reqs (reqs = 1)
- * write reqs (reqs = 0)
- * increment reqs (reqs = 1)
- * write req_vq
- * read (wrong) req_vq
- *
- * We do not need a full smp_rmb, because req_vq is required to get
- * to tgt->reqs: tgt is &vscsi->tgt[sc->device->id], where sc is stored
- * in the virtqueue as the user token.
- */
- smp_read_barrier_depends();
-
virtscsi_vq_done(vscsi, req_vq, virtscsi_complete_cmd);
};
@@ -560,12 +521,8 @@ static struct virtio_scsi_vq *virtscsi_pick_vq(struct virtio_scsi *vscsi,
spin_lock_irqsave(&tgt->tgt_lock, flags);
- /*
- * The memory barrier after atomic_inc_return matches
- * the smp_read_barrier_depends() in virtscsi_req_done.
- */
if (atomic_inc_return(&tgt->reqs) > 1)
- vq = ACCESS_ONCE(tgt->req_vq);
+ vq = tgt->req_vq;
else {
queue_num = smp_processor_id();
while (unlikely(queue_num >= vscsi->num_queues))
--
1.7.9.5
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH] virtio_scsi: remove ACCESS_ONCE() and smp_read_barrier_depends()
2014-05-07 14:34 [PATCH] virtio_scsi: remove ACCESS_ONCE() and smp_read_barrier_depends() Ming Lei
@ 2014-05-07 15:50 ` Paolo Bonzini
2014-05-07 16:03 ` Ming Lei
0 siblings, 1 reply; 5+ messages in thread
From: Paolo Bonzini @ 2014-05-07 15:50 UTC (permalink / raw)
To: Ming Lei, James E.J. Bottomley; +Cc: linux-scsi, Wanlong Gao, Rusty Russell
Il 07/05/2014 16:34, Ming Lei ha scritto:
> *
> - * An interesting effect of this policy is that only writes to req_vq need to
> - * take the tgt_lock. Read can be done outside the lock because:
> - *
> - * - writes of req_vq only occur when atomic_inc_return(&tgt->reqs) returns 1.
> - * In that case, no other CPU is reading req_vq: even if they were in
> - * virtscsi_queuecommand_multi, they would be spinning on tgt_lock.
> - *
> - * - reads of req_vq only occur when the target is not idle (reqs != 0).
> - * A CPU that enters virtscsi_queuecommand_multi will not modify req_vq.
> + * tgt_lock is held to serialize reading and writing req_vq. Reading req_vq
> + * could be done locklessly, but we do not do it yet.
> *
> * Similarly, decrements of reqs are never concurrent with writes of req_vq.
> * Thus they can happen outside the tgt_lock, provided of course we make reqs
The part you deleted explains _why_ decrements of reqs are never
concurrent with writes of req_vq. Perhaps:
* An interesting effect of this policy is that only increments to reqs and writes
* to req_vq need to take the tgt_lock. Reads of req_vq and decrements or req_vq
* can be done outside the lock because:
*
* - writes of req_vq only occur when atomic_inc_return(&tgt->reqs) returns 1.
* In that case, no other CPU is reading req_vq: even if they were in
* virtscsi_queuecommand_multi, they would be spinning on tgt_lock.
*
* - reads of req_vq only occur when the target is not idle (reqs != 0).
* A CPU that enters virtscsi_queuecommand_multi will not modify req_vq.
*
* - likewise, decrements of reqs only occur when reqs != 0. If the decremented
* value is zero, the first CPU that enters virtscsi_queuecommand_multi will
* modify req_vq and the others will spin on tgt_lock.
*
* We do not try to read req_vq locklessly for simplicity, so tgt_lock is used
* to serialize reads of req_vq too.
Paolo
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] virtio_scsi: remove ACCESS_ONCE() and smp_read_barrier_depends()
2014-05-07 15:50 ` Paolo Bonzini
@ 2014-05-07 16:03 ` Ming Lei
2014-05-07 16:12 ` Paolo Bonzini
0 siblings, 1 reply; 5+ messages in thread
From: Ming Lei @ 2014-05-07 16:03 UTC (permalink / raw)
To: Paolo Bonzini
Cc: James E.J. Bottomley, Linux SCSI List, Wanlong Gao, Rusty Russell
On Wed, May 7, 2014 at 11:50 PM, Paolo Bonzini <pbonzini@redhat.com> wrote:
> Il 07/05/2014 16:34, Ming Lei ha scritto:
>> *
>> - * An interesting effect of this policy is that only writes to req_vq need to
>> - * take the tgt_lock. Read can be done outside the lock because:
>> - *
>> - * - writes of req_vq only occur when atomic_inc_return(&tgt->reqs) returns 1.
>> - * In that case, no other CPU is reading req_vq: even if they were in
>> - * virtscsi_queuecommand_multi, they would be spinning on tgt_lock.
>> - *
>> - * - reads of req_vq only occur when the target is not idle (reqs != 0).
>> - * A CPU that enters virtscsi_queuecommand_multi will not modify req_vq.
>> + * tgt_lock is held to serialize reading and writing req_vq. Reading req_vq
>> + * could be done locklessly, but we do not do it yet.
>> *
>> * Similarly, decrements of reqs are never concurrent with writes of req_vq.
>> * Thus they can happen outside the tgt_lock, provided of course we make reqs
>
> The part you deleted explains _why_ decrements of reqs are never
> concurrent with writes of req_vq. Perhaps:
The below part might not need, since the 1st paragraph has
explained the basic principle, also looks current virtscsi_pick_vq()
isn't very difficult to understand.
>
> * An interesting effect of this policy is that only increments to reqs and writes
> * to req_vq need to take the tgt_lock. Reads of req_vq and decrements or req_vq
> * can be done outside the lock because:
> *
> * - writes of req_vq only occur when atomic_inc_return(&tgt->reqs) returns 1.
> * In that case, no other CPU is reading req_vq: even if they were in
> * virtscsi_queuecommand_multi, they would be spinning on tgt_lock.
> *
> * - reads of req_vq only occur when the target is not idle (reqs != 0).
> * A CPU that enters virtscsi_queuecommand_multi will not modify req_vq.
The above two parts should be very clear from current code in
virtscsi_pick_vq(), so I am not sure if we need the description.
> *
> * - likewise, decrements of reqs only occur when reqs != 0. If the decremented
> * value is zero, the first CPU that enters virtscsi_queuecommand_multi will
> * modify req_vq and the others will spin on tgt_lock.
The fact should be obvious too, :-)
> *
> * We do not try to read req_vq locklessly for simplicity, so tgt_lock is used
> * to serialize reads of req_vq too.
Maybe it is better to just mention it isn't done yet, and maybe someone
will figure out simple way to support lockless reading req_vq.
Thanks,
--
Ming Lei
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] virtio_scsi: remove ACCESS_ONCE() and smp_read_barrier_depends()
2014-05-07 16:03 ` Ming Lei
@ 2014-05-07 16:12 ` Paolo Bonzini
2014-05-07 16:29 ` Ming Lei
0 siblings, 1 reply; 5+ messages in thread
From: Paolo Bonzini @ 2014-05-07 16:12 UTC (permalink / raw)
To: Ming Lei
Cc: James E.J. Bottomley, Linux SCSI List, Wanlong Gao, Rusty Russell
Il 07/05/2014 18:03, Ming Lei ha scritto:
>> *
>> * - likewise, decrements of reqs only occur when reqs != 0. If the decremented
>> * value is zero, the first CPU that enters virtscsi_queuecommand_multi will
>> * modify req_vq and the others will spin on tgt_lock.
>
> The fact should be obvious too,
Perhaps, but in your patch you're leaving a "Similarly" that doesn't
apply anymore.
> > * We do not try to read req_vq locklessly for simplicity, so tgt_lock is used
> > * to serialize reads of req_vq too.
>
> Maybe it is better to just mention it isn't done yet, and maybe someone
> will figure out simple way to support lockless reading req_vq.
The one I sketched is not too hard.
Paolo
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] virtio_scsi: remove ACCESS_ONCE() and smp_read_barrier_depends()
2014-05-07 16:12 ` Paolo Bonzini
@ 2014-05-07 16:29 ` Ming Lei
0 siblings, 0 replies; 5+ messages in thread
From: Ming Lei @ 2014-05-07 16:29 UTC (permalink / raw)
To: Paolo Bonzini
Cc: James E.J. Bottomley, Linux SCSI List, Wanlong Gao, Rusty Russell
On Thu, May 8, 2014 at 12:12 AM, Paolo Bonzini <pbonzini@redhat.com> wrote:
> Il 07/05/2014 18:03, Ming Lei ha scritto:
>>>
>>> *
>>>
>>> * - likewise, decrements of reqs only occur when reqs != 0. If the
>>> decremented
>>> * value is zero, the first CPU that enters virtscsi_queuecommand_multi
>>> will
>>> * modify req_vq and the others will spin on tgt_lock.
>>
>>
>> The fact should be obvious too,
>
>
> Perhaps, but in your patch you're leaving a "Similarly" that doesn't apply
> anymore.
OK, I will remove the 'Similarly', ;-)
Thanks,
--
Ming Lei
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2014-05-07 16:29 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-05-07 14:34 [PATCH] virtio_scsi: remove ACCESS_ONCE() and smp_read_barrier_depends() Ming Lei
2014-05-07 15:50 ` Paolo Bonzini
2014-05-07 16:03 ` Ming Lei
2014-05-07 16:12 ` Paolo Bonzini
2014-05-07 16:29 ` Ming Lei
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox