All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Gregory Haskins <gregory.haskins@gmail.com>
Cc: Eric Dumazet <eric.dumazet@gmail.com>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	netdev@vger.kernel.org,
	virtualization@lists.linux-foundation.org, kvm@vger.kernel.org,
	linux-kernel@vger.kernel.org, mingo@elte.hu, linux-mm@kvack.org,
	akpm@linux-foundation.org, hpa@zytor.com,
	Rusty Russell <rusty@rustcorp.com.au>,
	s.hetze@linux-ag.com
Subject: Re: [PATCHv7 3/3] vhost_net: a kernel-level virtio server
Date: Tue, 3 Nov 2009 15:57:44 -0800	[thread overview]
Message-ID: <20091103235744.GF6726@linux.vnet.ibm.com> (raw)
In-Reply-To: <4AF072EE.9020202@gmail.com>

On Tue, Nov 03, 2009 at 01:14:06PM -0500, Gregory Haskins wrote:
> Gregory Haskins wrote:
> > Eric Dumazet wrote:
> >> Michael S. Tsirkin a écrit :
> >>> +static void handle_tx(struct vhost_net *net)
> >>> +{
> >>> +	struct vhost_virtqueue *vq = &net->dev.vqs[VHOST_NET_VQ_TX];
> >>> +	unsigned head, out, in, s;
> >>> +	struct msghdr msg = {
> >>> +		.msg_name = NULL,
> >>> +		.msg_namelen = 0,
> >>> +		.msg_control = NULL,
> >>> +		.msg_controllen = 0,
> >>> +		.msg_iov = vq->iov,
> >>> +		.msg_flags = MSG_DONTWAIT,
> >>> +	};
> >>> +	size_t len, total_len = 0;
> >>> +	int err, wmem;
> >>> +	size_t hdr_size;
> >>> +	struct socket *sock = rcu_dereference(vq->private_data);
> >>> +	if (!sock)
> >>> +		return;
> >>> +
> >>> +	wmem = atomic_read(&sock->sk->sk_wmem_alloc);
> >>> +	if (wmem >= sock->sk->sk_sndbuf)
> >>> +		return;
> >>> +
> >>> +	use_mm(net->dev.mm);
> >>> +	mutex_lock(&vq->mutex);
> >>> +	vhost_no_notify(vq);
> >>> +
> >> using rcu_dereference() and mutex_lock() at the same time seems wrong, I suspect
> >> that your use of RCU is not correct.
> >>
> >> 1) rcu_dereference() should be done inside a read_rcu_lock() section, and
> >>    we are not allowed to sleep in such a section.
> >>    (Quoting Documentation/RCU/whatisRCU.txt :
> >>      It is illegal to block while in an RCU read-side critical section, )
> >>
> >> 2) mutex_lock() can sleep (ie block)
> >>
> > 
> > 
> > Michael,
> >   I warned you that this needed better documentation ;)
> > 
> > Eric,
> >   I think I flagged this once before, but Michael convinced me that it
> > was indeed "ok", if but perhaps a bit unconventional.  I will try to
> > find the thread.
> > 
> > Kind Regards,
> > -Greg
> > 
> 
> Here it is:
> 
> http://lkml.org/lkml/2009/8/12/173

What was happening in that case was that the rcu_dereference()
was being used in a workqueue item.  The role of rcu_read_lock()
was taken on be the start of execution of the workqueue item, of
rcu_read_unlock() by the end of execution of the workqueue item, and
of synchronize_rcu() by flush_workqueue().  This does work, at least
assuming that flush_workqueue() operates as advertised, which it appears
to at first glance.

The above code looks somewhat different, however -- I don't see
handle_tx() being executed in the context of a work queue.  Instead
it appears to be in an interrupt handler.

So what is the story?  Using synchronize_irq() or some such?

							Thanx, Paul

WARNING: multiple messages have this Message-ID (diff)
From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Gregory Haskins <gregory.haskins@gmail.com>
Cc: Eric Dumazet <eric.dumazet@gmail.com>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	netdev@vger.kernel.org,
	virtualization@lists.linux-foundation.org, kvm@vger.kernel.org,
	linux-kernel@vger.kernel.org, mingo@elte.hu, linux-mm@kvack.org,
	akpm@linux-foundation.org, hpa@zytor.com,
	Rusty Russell <rusty@rustcorp.com.au>,
	s.hetze@linux-ag.com
Subject: Re: [PATCHv7 3/3] vhost_net: a kernel-level virtio server
Date: Tue, 3 Nov 2009 15:57:44 -0800	[thread overview]
Message-ID: <20091103235744.GF6726@linux.vnet.ibm.com> (raw)
In-Reply-To: <4AF072EE.9020202@gmail.com>

On Tue, Nov 03, 2009 at 01:14:06PM -0500, Gregory Haskins wrote:
> Gregory Haskins wrote:
> > Eric Dumazet wrote:
> >> Michael S. Tsirkin a écrit :
> >>> +static void handle_tx(struct vhost_net *net)
> >>> +{
> >>> +	struct vhost_virtqueue *vq = &net->dev.vqs[VHOST_NET_VQ_TX];
> >>> +	unsigned head, out, in, s;
> >>> +	struct msghdr msg = {
> >>> +		.msg_name = NULL,
> >>> +		.msg_namelen = 0,
> >>> +		.msg_control = NULL,
> >>> +		.msg_controllen = 0,
> >>> +		.msg_iov = vq->iov,
> >>> +		.msg_flags = MSG_DONTWAIT,
> >>> +	};
> >>> +	size_t len, total_len = 0;
> >>> +	int err, wmem;
> >>> +	size_t hdr_size;
> >>> +	struct socket *sock = rcu_dereference(vq->private_data);
> >>> +	if (!sock)
> >>> +		return;
> >>> +
> >>> +	wmem = atomic_read(&sock->sk->sk_wmem_alloc);
> >>> +	if (wmem >= sock->sk->sk_sndbuf)
> >>> +		return;
> >>> +
> >>> +	use_mm(net->dev.mm);
> >>> +	mutex_lock(&vq->mutex);
> >>> +	vhost_no_notify(vq);
> >>> +
> >> using rcu_dereference() and mutex_lock() at the same time seems wrong, I suspect
> >> that your use of RCU is not correct.
> >>
> >> 1) rcu_dereference() should be done inside a read_rcu_lock() section, and
> >>    we are not allowed to sleep in such a section.
> >>    (Quoting Documentation/RCU/whatisRCU.txt :
> >>      It is illegal to block while in an RCU read-side critical section, )
> >>
> >> 2) mutex_lock() can sleep (ie block)
> >>
> > 
> > 
> > Michael,
> >   I warned you that this needed better documentation ;)
> > 
> > Eric,
> >   I think I flagged this once before, but Michael convinced me that it
> > was indeed "ok", if but perhaps a bit unconventional.  I will try to
> > find the thread.
> > 
> > Kind Regards,
> > -Greg
> > 
> 
> Here it is:
> 
> http://lkml.org/lkml/2009/8/12/173

What was happening in that case was that the rcu_dereference()
was being used in a workqueue item.  The role of rcu_read_lock()
was taken on be the start of execution of the workqueue item, of
rcu_read_unlock() by the end of execution of the workqueue item, and
of synchronize_rcu() by flush_workqueue().  This does work, at least
assuming that flush_workqueue() operates as advertised, which it appears
to at first glance.

The above code looks somewhat different, however -- I don't see
handle_tx() being executed in the context of a work queue.  Instead
it appears to be in an interrupt handler.

So what is the story?  Using synchronize_irq() or some such?

							Thanx, Paul

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

WARNING: multiple messages have this Message-ID (diff)
From: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>
To: Gregory Haskins <gregory.haskins@gmail.com>
Cc: Eric Dumazet <eric.dumazet@gmail.com>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	netdev@vger.kernel.org,
	virtualization@lists.linux-foundation.org, kvm@vger.kernel.org,
	linux-kernel@vger.kernel.org, mingo@elte.hu, linux-mm@kvack.org,
	akpm@linux-foundation.org, hpa@zytor.com,
	Rusty Russell <rusty@rustcorp.com.au>,
	s.hetze@linux-ag.com
Subject: Re: [PATCHv7 3/3] vhost_net: a kernel-level virtio server
Date: Tue, 3 Nov 2009 15:57:44 -0800	[thread overview]
Message-ID: <20091103235744.GF6726@linux.vnet.ibm.com> (raw)
In-Reply-To: <4AF072EE.9020202@gmail.com>

On Tue, Nov 03, 2009 at 01:14:06PM -0500, Gregory Haskins wrote:
> Gregory Haskins wrote:
> > Eric Dumazet wrote:
> >> Michael S. Tsirkin a ecrit :
> >>> +static void handle_tx(struct vhost_net *net)
> >>> +{
> >>> +	struct vhost_virtqueue *vq = &net->dev.vqs[VHOST_NET_VQ_TX];
> >>> +	unsigned head, out, in, s;
> >>> +	struct msghdr msg = {
> >>> +		.msg_name = NULL,
> >>> +		.msg_namelen = 0,
> >>> +		.msg_control = NULL,
> >>> +		.msg_controllen = 0,
> >>> +		.msg_iov = vq->iov,
> >>> +		.msg_flags = MSG_DONTWAIT,
> >>> +	};
> >>> +	size_t len, total_len = 0;
> >>> +	int err, wmem;
> >>> +	size_t hdr_size;
> >>> +	struct socket *sock = rcu_dereference(vq->private_data);
> >>> +	if (!sock)
> >>> +		return;
> >>> +
> >>> +	wmem = atomic_read(&sock->sk->sk_wmem_alloc);
> >>> +	if (wmem >= sock->sk->sk_sndbuf)
> >>> +		return;
> >>> +
> >>> +	use_mm(net->dev.mm);
> >>> +	mutex_lock(&vq->mutex);
> >>> +	vhost_no_notify(vq);
> >>> +
> >> using rcu_dereference() and mutex_lock() at the same time seems wrong, I suspect
> >> that your use of RCU is not correct.
> >>
> >> 1) rcu_dereference() should be done inside a read_rcu_lock() section, and
> >>    we are not allowed to sleep in such a section.
> >>    (Quoting Documentation/RCU/whatisRCU.txt :
> >>      It is illegal to block while in an RCU read-side critical section, )
> >>
> >> 2) mutex_lock() can sleep (ie block)
> >>
> > 
> > 
> > Michael,
> >   I warned you that this needed better documentation ;)
> > 
> > Eric,
> >   I think I flagged this once before, but Michael convinced me that it
> > was indeed "ok", if but perhaps a bit unconventional.  I will try to
> > find the thread.
> > 
> > Kind Regards,
> > -Greg
> > 
> 
> Here it is:
> 
> http://lkml.org/lkml/2009/8/12/173

What was happening in that case was that the rcu_dereference()
was being used in a workqueue item.  The role of rcu_read_lock()
was taken on be the start of execution of the workqueue item, of
rcu_read_unlock() by the end of execution of the workqueue item, and
of synchronize_rcu() by flush_workqueue().  This does work, at least
assuming that flush_workqueue() operates as advertised, which it appears
to at first glance.

The above code looks somewhat different, however -- I don't see
handle_tx() being executed in the context of a work queue.  Instead
it appears to be in an interrupt handler.

So what is the story?  Using synchronize_irq() or some such?

							Thanx, Paul

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  parent reply	other threads:[~2009-11-03 23:57 UTC|newest]

Thread overview: 107+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <cover.1257267892.git.mst@redhat.com>
2009-11-03 17:24 ` [PATCHv7 1/3] tun: export underlying socket Michael S. Tsirkin
2009-11-03 17:24   ` Michael S. Tsirkin
2009-11-03 17:24   ` Michael S. Tsirkin
2009-11-04 13:30   ` David Miller
2009-11-04 13:30     ` David Miller
2009-11-04 13:30   ` David Miller
2009-11-03 17:24 ` Michael S. Tsirkin
2009-11-03 17:24 ` [PATCHv7 2/3] mm: export use_mm/unuse_mm to modules Michael S. Tsirkin
2009-11-03 17:24 ` Michael S. Tsirkin
2009-11-03 17:24   ` Michael S. Tsirkin
2009-11-03 17:24   ` Michael S. Tsirkin
2009-11-03 17:32   ` Gregory Haskins
2009-11-03 17:32   ` Gregory Haskins
2009-11-03 17:24 ` [PATCHv7 3/3] vhost_net: a kernel-level virtio server Michael S. Tsirkin
2009-11-03 17:24   ` Michael S. Tsirkin
2009-11-03 17:24   ` Michael S. Tsirkin
2009-11-03 18:03   ` Eric Dumazet
2009-11-03 18:03     ` Eric Dumazet
2009-11-03 18:03     ` Eric Dumazet
2009-11-03 18:08     ` Gregory Haskins
2009-11-03 18:14       ` Gregory Haskins
2009-11-03 18:14       ` Gregory Haskins
2009-11-03 18:51         ` Eric Dumazet
2009-11-03 18:51           ` Eric Dumazet
2009-11-03 18:51           ` Eric Dumazet
2009-11-03 19:50           ` Gregory Haskins
2009-11-03 19:50           ` Gregory Haskins
2009-11-03 19:58           ` Michael S. Tsirkin
2009-11-03 19:58           ` Michael S. Tsirkin
2009-11-03 19:58             ` Michael S. Tsirkin
2009-11-03 19:58             ` Michael S. Tsirkin
2009-11-03 21:11             ` Eric Dumazet
2009-11-03 21:11               ` Eric Dumazet
2009-11-03 21:11               ` Eric Dumazet
2009-11-04 12:04               ` Michael S. Tsirkin
2009-11-04 12:04               ` Michael S. Tsirkin
2009-11-04 12:04                 ` Michael S. Tsirkin
2009-11-04 12:04                 ` Michael S. Tsirkin
2009-11-03 21:11             ` Eric Dumazet
2009-11-03 18:51         ` Eric Dumazet
2009-11-03 23:57         ` Paul E. McKenney [this message]
2009-11-03 23:57           ` Paul E. McKenney
2009-11-03 23:57           ` Paul E. McKenney
2009-11-04 11:57           ` Michael S. Tsirkin
2009-11-04 11:57             ` Michael S. Tsirkin
2009-11-04 11:57             ` Michael S. Tsirkin
2009-11-04 17:25             ` Paul E. McKenney
2009-11-04 17:25             ` Paul E. McKenney
2009-11-04 17:25               ` Paul E. McKenney
2009-11-04 17:25               ` Paul E. McKenney
2009-11-04 17:33               ` Eric Dumazet
2009-11-04 17:33               ` Eric Dumazet
2009-11-04 17:33                 ` Eric Dumazet
2009-11-04 17:33                 ` Eric Dumazet
2009-11-04 19:06               ` Michael S. Tsirkin
2009-11-04 19:06               ` Michael S. Tsirkin
2009-11-04 19:06                 ` Michael S. Tsirkin
2009-11-04 19:12                 ` Gregory Haskins
2009-11-04 19:12                 ` Gregory Haskins
2009-11-06  5:01               ` Rusty Russell
2009-11-06  5:01                 ` Rusty Russell
2009-11-06 16:30                 ` Paul E. McKenney
2009-11-06 16:30                   ` Paul E. McKenney
2009-11-08  4:09                   ` Rusty Russell
2009-11-08  4:09                   ` Rusty Russell
2009-11-08  4:09                     ` Rusty Russell
2009-11-08 19:36                     ` Paul E. McKenney
2009-11-08 19:36                       ` Paul E. McKenney
2009-11-08 19:36                     ` Paul E. McKenney
2009-11-06 16:30                 ` Paul E. McKenney
2009-11-06  5:01               ` Rusty Russell
2009-11-04 11:57           ` Michael S. Tsirkin
2009-11-03 23:57         ` Paul E. McKenney
2009-11-03 18:08     ` Gregory Haskins
2009-11-03 19:55     ` Michael S. Tsirkin
2009-11-03 19:55     ` Michael S. Tsirkin
2009-11-03 19:55       ` Michael S. Tsirkin
2009-11-03 19:55       ` Michael S. Tsirkin
2009-11-03 18:03   ` Eric Dumazet
2009-11-04 11:08   ` Andi Kleen
2009-11-04 11:08   ` Andi Kleen
2009-11-04 11:08     ` Andi Kleen
2009-11-04 12:10     ` Michael S. Tsirkin
2009-11-04 12:10     ` Michael S. Tsirkin
2009-11-04 12:10       ` Michael S. Tsirkin
2009-11-04 12:59       ` Andi Kleen
2009-11-04 12:59         ` Andi Kleen
2009-11-04 13:08         ` Michael S. Tsirkin
2009-11-04 13:08         ` Michael S. Tsirkin
2009-11-04 13:08           ` Michael S. Tsirkin
2009-11-04 13:15           ` Andi Kleen
2009-11-04 13:15             ` Andi Kleen
2009-11-04 13:17             ` Michael S. Tsirkin
2009-11-04 13:17             ` Michael S. Tsirkin
2009-11-04 13:17               ` Michael S. Tsirkin
2009-11-04 13:37               ` Andi Kleen
2009-11-04 13:37               ` Andi Kleen
2009-11-04 13:37                 ` Andi Kleen
2009-11-04 13:41                 ` Michael S. Tsirkin
2009-11-04 13:41                   ` Michael S. Tsirkin
2009-11-04 16:37                   ` Michael S. Tsirkin
2009-11-04 16:37                     ` Michael S. Tsirkin
2009-11-04 16:37                   ` Michael S. Tsirkin
2009-11-04 13:41                 ` Michael S. Tsirkin
2009-11-04 13:15           ` Andi Kleen
2009-11-04 12:59       ` Andi Kleen
2009-11-03 17:24 ` Michael S. Tsirkin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20091103235744.GF6726@linux.vnet.ibm.com \
    --to=paulmck@linux.vnet.ibm.com \
    --cc=akpm@linux-foundation.org \
    --cc=eric.dumazet@gmail.com \
    --cc=gregory.haskins@gmail.com \
    --cc=hpa@zytor.com \
    --cc=kvm@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=mingo@elte.hu \
    --cc=mst@redhat.com \
    --cc=netdev@vger.kernel.org \
    --cc=rusty@rustcorp.com.au \
    --cc=s.hetze@linux-ag.com \
    --cc=virtualization@lists.linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.