From: "Michael S. Tsirkin" <mst@redhat.com>
To: Pawel Moll <pawel.moll@arm.com>
Cc: "virtualization@lists.linux-foundation.org"
<virtualization@lists.linux-foundation.org>
Subject: Re: [RFC] virtio-mmio: Update the device to OASIS spec version
Date: Thu, 15 Jan 2015 19:51:32 +0200 [thread overview]
Message-ID: <20150115175132.GA30453@redhat.com> (raw)
In-Reply-To: <1421343158.4601.7.camel@arm.com>
On Thu, Jan 15, 2015 at 05:32:38PM +0000, Pawel Moll wrote:
> On Thu, 2015-01-15 at 16:51 +0000, Michael S. Tsirkin wrote:
> > > + uint64_t addr = virt_to_phys(info->queue);
> >
> > Kernel normally uses u64 for this type.
>
> Sure, well spotted.
>
> > > +
> > > + writel(addr & 0xffffffff,
> > > + vm_dev->base + VIRTIO_MMIO_QUEUE_DESC_LOW);
> > > + writel((addr >> 32) & 0xffffffff,
> > > + vm_dev->base + VIRTIO_MMIO_QUEUE_DESC_HIGH);
> > > +
> > > + addr += info->num * sizeof(struct vring_desc);
> > > + writel(addr & 0xffffffff,
> > > + vm_dev->base + VIRTIO_MMIO_QUEUE_AVAIL_LOW);
> > > + writel((addr >> 32) & 0xffffffff,
> > > + vm_dev->base + VIRTIO_MMIO_QUEUE_AVAIL_HIGH);
> >
> > 0xffffffff isn't really needed, is it?
>
> I admit I'm never sure what are the narrowing side effects. You are
> probably right that u64 >> 32 will be always 32 bit.
>
> > > +
> > > + addr += sizeof(struct vring_avail) + info->num * sizeof(__u16);
> > > + addr += VIRTIO_MMIO_VRING_ALIGN - 1;
> > > + addr &= ~(VIRTIO_MMIO_VRING_ALIGN - 1);
> >
> >
> > Host no longer knows the alignment, so why is it needed?
>
> [skipped the spec reference, it's a separate discussion]
>
> > I think you shouldn't use VIRTIO_MMIO_VRING_ALIGN in non-legacy code:
> > it's a legacy thing.
>
> But I still need to pass something to vring_new_virtqueue() below, don't
> I? And it will allocate the queue based on some alignment value. I can't
> see anything that would create the layout for me, neither in mainline
> nor in next. Have I missed something? (wouldn't be surprised if I have)
No, but it's no longer a virtio thing - just an optimization
choice by a specific driver. So please just stick e.g. PAGE_SIZE there.
And maybe add a TODO item - we can optimize by allocating chunks
separately.
> > > + writel(addr & 0xffffffff,
> > > + vm_dev->base + VIRTIO_MMIO_QUEUE_USED_LOW);
> > > + writel((addr >> 32) & 0xffffffff,
> > > + vm_dev->base + VIRTIO_MMIO_QUEUE_USED_HIGH);
> > > +
> > > + writel(1, vm_dev->base + VIRTIO_MMIO_QUEUE_READY);
> > > + }
> > >
> > > /* Create the vring */
> > > vq = vring_new_virtqueue(index, info->num, VIRTIO_MMIO_VRING_ALIGN, vdev,
>
> [...]
>
> > > +static struct device_attribute vm_dev_attr_version =
> > > + __ATTR(version, S_IRUGO, vm_dev_attr_version_show, NULL);
> > > +
> > > static int virtio_mmio_probe(struct platform_device *pdev)
> > > {
> > > struct virtio_mmio_device *vm_dev;
> >
> > We already expose feature bits - this one really necessary?
>
> Necessary? Of course not, just a debugging feature, really, to see what
> version of control registers are available. Useful - I strongly believe
> so.
Yes but the point is the same info is already available
in core: just look at feature bit 31.
If you think it's important enough to expose in a decoded
way, let's add this in core?
> > > @@ -476,16 +501,26 @@ static int virtio_mmio_probe(struct platform_device *pdev)
> > >
> > > /* Check device version */
> > > vm_dev->version = readl(vm_dev->base + VIRTIO_MMIO_VERSION);
> > > - if (vm_dev->version != 1) {
> > > + if (vm_dev->version < 1 || vm_dev->version > 2) {
> > > dev_err(&pdev->dev, "Version %ld not supported!\n",
> > > vm_dev->version);
> > > return -ENXIO;
> > > }
> > >
> > > vm_dev->vdev.id.device = readl(vm_dev->base + VIRTIO_MMIO_DEVICE_ID);
> > > + if (vm_dev->vdev.id.device == 0) {
> > > + /*
> > > + * ID 0 means a dummy (placeholder) device, skip quietly
> > > + * (as in: no error) with no further actions
> > > + */
> > > + return 0;
> >
> > Necessary?
> > We don't have drivers for this id anyway.
>
> I'm not sure if you are joking or not, after the battle we fought over
> it.
Sorry, I don't remember anymore. Just asking.
> The short answer is: yes. Necessary.
>
> "4.2.2 MMIO Device Register Layout
>
> [...]
>
> Virtio Subsystem Device ID
> See 5 Device Types for possible values. Value zero (0x0) is used to de-
> fine a system memory map with placeholder devices at static, well known
> addresses, assigning functions to them depending on user’s needs.
>
> [...]
>
> 4.2.2.2 Driver Requirements: MMIO Device Register Layout
>
> The driver MUST ignore a device with DeviceID 0x0, but MUST NOT report
> any error."
Absolutely. So what happens if you drop these code lines?
There's no driver registered for this ID, so it's just ignored.
Seems like what spec is asking for, no?
> > > + }
> >
> > Need to also
> > 1. validate that feature bit VIRTIO_1 is set
> > 2. validate that ID is not for a legacy device
> >
> > otherwise device specific drivers might get invoked
> > on future devices (e.g. when we update balloon for 1.0)
> > and they not do the right thing.
>
> I'm not following you, but I admit I haven't though this problem
> thoroughly. If you can volunteer an example of things going on, it would
> be useful. Either way, I'll think about it again.
1. you need to check ID 0, and assume rev 0. If device also
says it needs rev 1, fail. E.g. see my patch for virtio_pci_modern:
if (virtio_device_is_legacy_only(vp_dev->vdev.id))
return -ENODEV;
you can find the code in my tree, see below.
2. it's easy - just get features on probe and validate VIRTIO_1
bit is set.
s390 does it differently since same device supports version 1 and 0.
No example yet - I forgot to code this up for virtio pci. I'll copy you
on patch.
> > @@ -496,7 +531,8 @@ static int virtio_mmio_remove(struct platform_device *pdev)
> > > {
> > > struct virtio_mmio_device *vm_dev = platform_get_drvdata(pdev);
> > >
> > > - unregister_virtio_device(&vm_dev->vdev);
> > > + if (vm_dev)
> > > + unregister_virtio_device(&vm_dev->vdev);
> > >
> >
> > Will remove ever be called if probe fails?
>
> No.
Then this if isn't necessary: vm_dev is always set.
> > > -/* Guest's memory page size in bytes - Write Only */
> > > +/* Guest's memory page size in bytes - Write Only
> > > + * LEGACY DEVICES ONLY! */
> >
> > This is not the preferred style for multi-line comments :)
>
> Fact. Will fix.
>
> > Also - maybe add a flag to selectively disable legacy
> > or modern macros?
> > Might be clearer than comments that, after all, never compile.
>
> As in, a bunch of #ifdefs disabling the legacy lines of code? Doable,
> although I'm not sure how beautiful would that be. Will have a look, but
> it probably would only make sense with CONFIG_VIRTIO_MMIO_LEGACY option.
>
> Paweł
Not necessarily - the point is for userspace to be able to
avoid getting useless legacy macros by means of a simple #define.
Again, take a look at my tree:
git://git.kernel.org/pub/scm/linux/kernel/git/mst/vhost.git vhost-next
--
MST
_______________________________________________
Virtualization mailing list
Virtualization@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/virtualization
next prev parent reply other threads:[~2015-01-15 17:51 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-12-19 18:38 [RFC] virtio-mmio: Update the device to OASIS spec version Pawel Moll
2015-01-15 16:51 ` Michael S. Tsirkin
2015-01-15 17:12 ` Michael S. Tsirkin
2015-01-15 17:15 ` Pawel Moll
2015-01-15 17:19 ` Michael S. Tsirkin
2015-01-16 9:58 ` Cornelia Huck
2015-01-15 17:32 ` Pawel Moll
2015-01-15 17:51 ` Michael S. Tsirkin [this message]
2015-01-15 18:11 ` Pawel Moll
2015-01-15 18:29 ` Michael S. Tsirkin
2015-01-15 18:42 ` Pawel Moll
2015-01-15 19:12 ` Michael S. Tsirkin
2015-01-19 17:45 ` Pawel Moll
2015-01-19 18:36 ` Michael S. Tsirkin
2015-01-20 17:18 ` Pawel Moll
2015-01-20 17:44 ` Michael S. Tsirkin
2015-01-20 17:51 ` Pawel Moll
2015-01-20 17:56 ` Michael S. Tsirkin
2015-01-15 18:17 ` Michael S. Tsirkin
2015-01-15 18:39 ` Michael S. Tsirkin
2015-01-15 18:51 ` Pawel Moll
2015-01-15 19:12 ` Michael S. Tsirkin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150115175132.GA30453@redhat.com \
--to=mst@redhat.com \
--cc=pawel.moll@arm.com \
--cc=virtualization@lists.linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).