* [RFC 0/2] Memory mapped virtio device
@ 2011-09-02 12:24 Pawel Moll
[not found] ` <1314966257-7503-2-git-send-email-pawel.moll@arm.com>
0 siblings, 1 reply; 9+ messages in thread
From: Pawel Moll @ 2011-09-02 12:24 UTC (permalink / raw)
To: linux-arm-kernel
Hi All,
This is a proposal of a memory mapped virtio device.
The main goal was to provide an equivalent of the virtio PCI device,
happily used by KVM an qemu on x86, which could be used by PCI-less
platforms (common in ARM world). The interface exposed by my device
is very similar to the original virtio_pci one, maybe just a little
bit simplified (no sophisticated IRQ infrastructure here). Of course
all the control registers live in memory space, instead of PCI IO
space (writel vs iowrite32).
This raises an obvious question whether these two implementation
could share some common code. Probably some constant values could
be shared (like VIRTIO_*_HOST_FEATURES or VIRTIO_*_VRING_ALIGN),
maybe struct virtio_*_vq_info as well. On the other hand the vring
interface itself is an abstraction layer already... I really don't
know, all comments and ideas appreciated.
The second thing is AMBAness of the driver... I've decided to
make it an AMBA device rather than simple platform device to add
at least a touch of "dicoverability". The idea was simple - one
can add "permanent" virtio AMBA device in the kernel placing it
somewhere in reserved area of real hardware, so the kernel running
on hardware will not get correct amba_id value and ignore the
device. With the eve of Device-Trees-Everywhere this argument is
slightly out of date, as qemu could add the virtio device even in
runtime. So maybe I should convert it into a platform_device then?
This brings additional problem though. Magnus Damm already proposed
a platform virtio device here:
http://thread.gmane.org/gmane.linux.ports.sh.devel/11554
however it's something completely different, as his use case is a
heterogeneous system and the virtio is used mainly as a transport
between nodes. It also uses lguest-based interface, which is much
less applicable to the qemu reality.
The second patch simply makes it possible to use virtio framework
on ARM.
Let me just add that Peter Maydell (Linaro qemu magician) kindly
agreed to work on the qemu side of the problem once we have some
consensus regarding the interface.
Pawel Moll (2):
virtio: Add AMBA bus driver for virtio device
arm: Add VIRTUALIZATION configuration menu and virtio options
arch/arm/Kconfig | 16 ++
drivers/virtio/Kconfig | 11 +
drivers/virtio/Makefile | 1 +
drivers/virtio/virtio_amba.c | 443 ++++++++++++++++++++++++++++++++++++++++++
include/linux/virtio_amba.h | 62 ++++++
5 files changed, 533 insertions(+), 0 deletions(-)
create mode 100644 drivers/virtio/virtio_amba.c
create mode 100644 include/linux/virtio_amba.h
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC 1/2] virtio: Add AMBA bus driver for virtio device
[not found] ` <1314966257-7503-2-git-send-email-pawel.moll@arm.com>
@ 2011-09-12 1:31 ` Rusty Russell
2011-09-12 16:45 ` Pawel Moll
0 siblings, 1 reply; 9+ messages in thread
From: Rusty Russell @ 2011-09-12 1:31 UTC (permalink / raw)
To: linux-arm-kernel
On Fri, 2 Sep 2011 13:24:16 +0100, Pawel Moll <pawel.moll@arm.com> wrote:
> This patch, based on virtio PCI driver, adds support for virtio
> AMBA device. This should allow environments like qemu to use
> virtio-based block & network devices.
Cool work! I like the concept, but a few implementation notes below:
> + * offset width name description
> + * ------ ----- ------------- -----------------
> + *
> + * 0x000 32 HostFeatures Features supported by the host
> + * 0x004 32 GuestFeatures Features activated by the guest
You need to make these at least 64 bits. Lguest makes them variable
width, in fact.
> + * 0x008 32 QueuePFN PFN for the currently selected queue
> + * 0x00c 32 QueueNum Queue size for the currently selected queue
You should, I believe, seriously consider allowing the guest to set the
queue size, rather than the host (perhaps the host could suggest one,
but the guest should be able to override it).
Anthony or Michael might suggest other changes, since they are most
familiar with virtio_pci limitations...
Cheers,
Rusty.
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC 1/2] virtio: Add AMBA bus driver for virtio device
2011-09-12 1:31 ` [RFC 1/2] virtio: Add AMBA bus driver for " Rusty Russell
@ 2011-09-12 16:45 ` Pawel Moll
[not found] ` <1315846301-13892-1-git-send-email-pawel.moll@arm.com>
2011-09-13 2:58 ` [RFC 1/2] virtio: Add AMBA " Rusty Russell
0 siblings, 2 replies; 9+ messages in thread
From: Pawel Moll @ 2011-09-12 16:45 UTC (permalink / raw)
To: linux-arm-kernel
On Mon, 2011-09-12 at 11:01 +0930, Rusty Russell wrote:
> Cool work! I like the concept, but a few implementation notes below:
Glad to hear that it makes any sense :-)
I'll post the RFC v2 in a second - it's a platform bus driver now, so
it's not ARM specific any more...
> > + * offset width name description
> > + * ------ ----- ------------- -----------------
> > + *
> > + * 0x000 32 HostFeatures Features supported by the host
> > + * 0x004 32 GuestFeatures Features activated by the guest
>
> You need to make these at least 64 bits. Lguest makes them variable
> width, in fact.
No problem, I've reserved 128 bits in the modified registers layout (see
RFC v2), I can even change it into "offset/value" interface, which would
give you variable width. One thing I don't get is how to access feature
bits > 31? The get_features() seems
to be limited to 32:
struct virtio_config_ops {
<...>
u32 (*get_features)(struct virtio_device *vdev);
<...>
}
> > + * 0x008 32 QueuePFN PFN for the currently selected queue
> > + * 0x00c 32 QueueNum Queue size for the currently selected queue
>
> You should, I believe, seriously consider allowing the guest to set the
> queue size, rather than the host (perhaps the host could suggest one,
> but the guest should be able to override it).
I guess the QueueNum could be simply a read/write register - guest can
read suggestion and override it writing it back. The same question
though - how can guest change it? (I mean some API?)
> Anthony or Michael might suggest other changes, since they are most
> familiar with virtio_pci limitations...
I've added them on the RFC v2 Cc as well - all feedback more then
welcome!
Cheers!
Pawe?
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC v2] arm: Add platform bus driver for virtio device
[not found] ` <1315846301-13892-1-git-send-email-pawel.moll@arm.com>
@ 2011-09-12 17:27 ` Anthony Liguori
2011-09-13 8:58 ` Pawel Moll
0 siblings, 1 reply; 9+ messages in thread
From: Anthony Liguori @ 2011-09-12 17:27 UTC (permalink / raw)
To: linux-arm-kernel
On 09/12/2011 11:51 AM, Pawel Moll wrote:
> This patch, based on virtio PCI driver, adds support for memory
> mapped (platform) virtio device. This should allow environments
> like qemu to use virtio-based block& network devices.
>
> One can define and register a platform device which resources
> will describe memory mapped control registers and "mailbox"
> interrupt. Such device can be also instantiated using the Device
> Tree node with compatible property equal "virtio,platform".
>
> Note: Work in progress...
Are you planning on sending patches to QEMU for this? I think it makes
sense to start in QEMU with this effort and make a proper spec from
which you can write the driver against.
benh has also written a platform virtio transports for use with Power.
Ben, could you take a look and see if it's worth merging the two efforts?
Regards,
Anthony Liguori
>
> Cc: Rusty Russell<rusty@rustcorp.com.au>
> Cc: Anthony Liguori<aliguori@us.ibm.com>
> Cc: Michael S.Tsirkin<mst@redhat.com>
> Cc: Magnus Damm<magnus.damm@gmail.com>
> Signed-off-by: Pawel Moll<pawel.moll@arm.com>
> ---
> drivers/virtio/Kconfig | 11 +
> drivers/virtio/Makefile | 1 +
> drivers/virtio/virtio_platform.c | 424 ++++++++++++++++++++++++++++++++++++++
> include/linux/virtio_platform.h | 62 ++++++
> 4 files changed, 498 insertions(+), 0 deletions(-)
> create mode 100644 drivers/virtio/virtio_platform.c
> create mode 100644 include/linux/virtio_platform.h
>
> diff --git a/drivers/virtio/Kconfig b/drivers/virtio/Kconfig
> index 57e493b..63edf72 100644
> --- a/drivers/virtio/Kconfig
> +++ b/drivers/virtio/Kconfig
> @@ -35,4 +35,15 @@ config VIRTIO_BALLOON
>
> If unsure, say M.
>
> + config VIRTIO_PLATFORM
> + tristate "Platform bus driver for virtio devices (EXPERIMENTAL)"
> + depends on EXPERIMENTAL
> + select VIRTIO
> + select VIRTIO_RING
> + ---help---
> + This drivers provides support for memory mapped (platform) virtio
> + based paravirtual device driver.
> +
> + If unsure, say N.
> +
> endmenu
> diff --git a/drivers/virtio/Makefile b/drivers/virtio/Makefile
> index 6738c44..4d175c0 100644
> --- a/drivers/virtio/Makefile
> +++ b/drivers/virtio/Makefile
> @@ -1,4 +1,5 @@
> obj-$(CONFIG_VIRTIO) += virtio.o
> obj-$(CONFIG_VIRTIO_RING) += virtio_ring.o
> +obj-$(CONFIG_VIRTIO_PLATFORM) += virtio_platform.o
> obj-$(CONFIG_VIRTIO_PCI) += virtio_pci.o
> obj-$(CONFIG_VIRTIO_BALLOON) += virtio_balloon.o
> diff --git a/drivers/virtio/virtio_platform.c b/drivers/virtio/virtio_platform.c
> new file mode 100644
> index 0000000..b16027b
> --- /dev/null
> +++ b/drivers/virtio/virtio_platform.c
> @@ -0,0 +1,424 @@
> +/*
> + * Virtio platform device driver
> + *
> + * Copyright 2011, ARM Ltd.
> + *
> + * This module allows virtio devices to be used over a virtual platform device.
> + *
> + * Registers layout:
> + *
> + * offset width name description
> + * ------ ----- ------------- -----------------
> + *
> + * 0x000 32 MagicValue Magic value "virt" (0x74726976 LE)
> + * 0x004 32 DeviceID Virtio device ID
> + * 0x008 32 VendorID Virtio vendor ID
> + *
> + * 0x010 32 HostFeatures Features supported by the host
> + * 0x020 32 GuestFeatures Features activated by the guest
> + *
> + * 0x030 32 QueuePFN PFN for the currently selected queue
> + * 0x034 32 QueueNum Queue size for the currently selected queue
> + * 0x038 32 QueueSel Queue selector
> + * 0x03c 32 QueueNotify Queue notifier
> + *
> + * 0x040 32 InterruptACK Interrupt acknowledge register
> + * 0x050 8 Status Device status register
> + *
> + * 0x100
> + * ... Device-specific configuration space
> + * 0xfff
> + *
> + * Based on Virtio PCI driver by Anthony Liguori, copyright IBM Corp. 2007
> + *
> + * This work is licensed under the terms of the GNU GPL, version 2 or later.
> + * See the COPYING file in the top-level directory.
> + */
> +
> +#include<linux/highmem.h>
> +#include<linux/interrupt.h>
> +#include<linux/io.h>
> +#include<linux/list.h>
> +#include<linux/module.h>
> +#include<linux/platform_device.h>
> +#include<linux/slab.h>
> +#include<linux/spinlock.h>
> +#include<linux/virtio.h>
> +#include<linux/virtio_config.h>
> +#include<linux/virtio_platform.h>
> +#include<linux/virtio_ring.h>
> +
> +
> +
> +#define to_virtio_plat_device(_plat_dev) \
> + container_of(_plat_dev, struct virtio_plat_device, vdev)
> +
> +struct virtio_plat_device {
> + struct virtio_device vdev;
> + struct platform_device *pdev;
> +
> + void __iomem *base;
> +
> + /* a list of queues so we can dispatch IRQs */
> + spinlock_t lock;
> + struct list_head virtqueues;
> +};
> +
> +struct virtio_plat_vq_info {
> + /* the actual virtqueue */
> + struct virtqueue *vq;
> +
> + /* the number of entries in the queue */
> + int num;
> +
> + /* the index of the queue */
> + int queue_index;
> +
> + /* the virtual address of the ring queue */
> + void *queue;
> +
> + /* the list node for the virtqueues list */
> + struct list_head node;
> +};
> +
> +
> +
> +/* Configuration interface */
> +
> +static u32 va_get_features(struct virtio_device *vdev)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> +
> + /* When someone needs more than 32 feature bits, we'll need to
> + * steal a bit to indicate that the rest are somewhere else. */
> + return readl(vpdev->base + VIRTIO_PLAT_HOST_FEATURES);
> +}
> +
> +static void va_finalize_features(struct virtio_device *vdev)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> +
> + /* Give virtio_ring a chance to accept features. */
> + vring_transport_features(vdev);
> +
> + /* We only support 32 feature bits. */
> + BUILD_BUG_ON(ARRAY_SIZE(vdev->features) != 1);
> + writel(vdev->features[0], vpdev->base + VIRTIO_PLAT_GUEST_FEATURES);
> +}
> +
> +static void va_get(struct virtio_device *vdev, unsigned offset,
> + void *buf, unsigned len)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> + u8 *ptr = buf;
> + int i;
> +
> + for (i = 0; i< len; i++)
> + ptr[i] = readb(vpdev->base + VIRTIO_PLAT_CONFIG + offset + i);
> +}
> +
> +static void va_set(struct virtio_device *vdev, unsigned offset,
> + const void *buf, unsigned len)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> + const u8 *ptr = buf;
> + int i;
> +
> + for (i = 0; i< len; i++)
> + writeb(ptr[i], vpdev->base + VIRTIO_PLAT_CONFIG + offset + i);
> +}
> +
> +static u8 va_get_status(struct virtio_device *vdev)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> +
> + return readb(vpdev->base + VIRTIO_PLAT_STATUS)& 0xff;
> +}
> +
> +static void va_set_status(struct virtio_device *vdev, u8 status)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> +
> + /* We should never be setting status to 0. */
> + BUG_ON(status == 0);
> +
> + writeb(status, vpdev->base + VIRTIO_PLAT_STATUS);
> +}
> +
> +static void va_reset(struct virtio_device *vdev)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> +
> + /* 0 status means a reset. */
> + writeb(0, vpdev->base + VIRTIO_PLAT_STATUS);
> +}
> +
> +
> +
> +/* Transport interface */
> +
> +/* the notify function used when creating a virt queue */
> +static void va_notify(struct virtqueue *vq)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vq->vdev);
> + struct virtio_plat_vq_info *info = vq->priv;
> +
> + /* We write the queue's selector into the notification register to
> + * signal the other end */
> + writel(info->queue_index, vpdev->base + VIRTIO_PLAT_QUEUE_NOTIFY);
> +}
> +
> +/* Notify all virtqueues on an interrupt. */
> +static irqreturn_t va_interrupt(int irq, void *opaque)
> +{
> + struct virtio_plat_device *vpdev = opaque;
> + struct virtio_plat_vq_info *info;
> + irqreturn_t ret = IRQ_NONE;
> + unsigned long flags;
> +
> + writel(1, vpdev->base + VIRTIO_PLAT_INTERRUPT_ACK);
> +
> + spin_lock_irqsave(&vpdev->lock, flags);
> + list_for_each_entry(info,&vpdev->virtqueues, node) {
> + if (vring_interrupt(irq, info->vq) == IRQ_HANDLED)
> + ret = IRQ_HANDLED;
> + }
> + spin_unlock_irqrestore(&vpdev->lock, flags);
> +
> + return ret;
> +}
> +
> +
> +
> +static void va_del_vq(struct virtqueue *vq)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vq->vdev);
> + struct virtio_plat_vq_info *info = vq->priv;
> + unsigned long flags, size;
> +
> + spin_lock_irqsave(&vpdev->lock, flags);
> + list_del(&info->node);
> + spin_unlock_irqrestore(&vpdev->lock, flags);
> +
> + writel(info->queue_index, vpdev->base + VIRTIO_PLAT_QUEUE_SEL);
> +
> + vring_del_virtqueue(vq);
> +
> + /* Select and deactivate the queue */
> + writel(0, vpdev->base + VIRTIO_PLAT_QUEUE_PFN);
> +
> + size = PAGE_ALIGN(vring_size(info->num, VIRTIO_PLAT_VRING_ALIGN));
> + free_pages_exact(info->queue, size);
> + kfree(info);
> +}
> +
> +static void va_del_vqs(struct virtio_device *vdev)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> + struct virtqueue *vq, *n;
> +
> + list_for_each_entry_safe(vq, n,&vdev->vqs, list)
> + va_del_vq(vq);
> +
> + free_irq(platform_get_irq(vpdev->pdev, 0), vpdev);
> +}
> +
> +
> +
> +static struct virtqueue *va_setup_vq(struct virtio_device *vdev, unsigned index,
> + void (*callback)(struct virtqueue *vq),
> + const char *name)
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> + struct virtio_plat_vq_info *info;
> + struct virtqueue *vq;
> + unsigned long flags, size;
> + u16 num;
> + int err;
> +
> + /* Select the queue we're interested in */
> + writel(index, vpdev->base + VIRTIO_PLAT_QUEUE_SEL);
> +
> + /* Check if queue is either not available or already active. */
> + num = readl(vpdev->base + VIRTIO_PLAT_QUEUE_NUM);
> + if (!num || readl(vpdev->base + VIRTIO_PLAT_QUEUE_PFN)) {
> + err = -ENOENT;
> + goto error_available;
> + }
> +
> + /* Allocate and fill out our structure the represents an active
> + * queue */
> + info = kmalloc(sizeof(struct virtio_plat_vq_info), GFP_KERNEL);
> + if (!info) {
> + err = -ENOMEM;
> + goto error_kmalloc;
> + }
> +
> + info->queue_index = index;
> + info->num = num;
> +
> + size = PAGE_ALIGN(vring_size(num, VIRTIO_PLAT_VRING_ALIGN));
> + info->queue = alloc_pages_exact(size, GFP_KERNEL | __GFP_ZERO);
> + if (info->queue == NULL) {
> + err = -ENOMEM;
> + goto error_alloc_pages;
> + }
> +
> + /* Activate the queue */
> + writel(virt_to_phys(info->queue)>> VIRTIO_PLAT_QUEUE_ADDR_SHIFT,
> + vpdev->base + VIRTIO_PLAT_QUEUE_PFN);
> +
> + /* Create the vring */
> + vq = vring_new_virtqueue(info->num, VIRTIO_PLAT_VRING_ALIGN,
> + vdev, info->queue, va_notify, callback, name);
> + if (!vq) {
> + err = -ENOMEM;
> + goto error_new_virtqueue;
> + }
> +
> + vq->priv = info;
> + info->vq = vq;
> +
> + spin_lock_irqsave(&vpdev->lock, flags);
> + list_add(&info->node,&vpdev->virtqueues);
> + spin_unlock_irqrestore(&vpdev->lock, flags);
> +
> + return vq;
> +
> +error_new_virtqueue:
> + writel(0, vpdev->base + VIRTIO_PLAT_QUEUE_PFN);
> + free_pages_exact(info->queue, size);
> +error_alloc_pages:
> + kfree(info);
> +error_kmalloc:
> +error_available:
> + return ERR_PTR(err);
> +}
> +
> +static int va_find_vqs(struct virtio_device *vdev, unsigned nvqs,
> + struct virtqueue *vqs[],
> + vq_callback_t *callbacks[],
> + const char *names[])
> +{
> + struct virtio_plat_device *vpdev = to_virtio_plat_device(vdev);
> + unsigned int irq = platform_get_irq(vpdev->pdev, 0);
> + int i, err;
> +
> + err = request_irq(irq, va_interrupt, IRQF_SHARED,
> + dev_name(&vdev->dev), vpdev);
> + if (err)
> + return err;
> +
> + for (i = 0; i< nvqs; ++i) {
> + vqs[i] = va_setup_vq(vdev, i, callbacks[i], names[i]);
> + if (IS_ERR(vqs[i])) {
> + va_del_vqs(vdev);
> + free_irq(irq, vpdev);
> + return PTR_ERR(vqs[i]);
> + }
> + }
> +
> + return 0;
> +}
> +
> +
> +
> +static struct virtio_config_ops virtio_plat_config_ops = {
> + .get = va_get,
> + .set = va_set,
> + .get_status = va_get_status,
> + .set_status = va_set_status,
> + .reset = va_reset,
> + .find_vqs = va_find_vqs,
> + .del_vqs = va_del_vqs,
> + .get_features = va_get_features,
> + .finalize_features = va_finalize_features,
> +};
> +
> +
> +
> +/* Platform device */
> +
> +static int __devinit virtio_plat_probe(struct platform_device *pdev)
> +{
> + struct virtio_plat_device *vpdev;
> + struct resource *mem;
> +
> + mem = platform_get_resource(pdev, IORESOURCE_MEM, 0);
> + if (!mem)
> + return -EINVAL;
> +
> + if (!devm_request_mem_region(&pdev->dev, mem->start,
> + resource_size(mem), pdev->name))
> + return -EBUSY;
> +
> + vpdev = devm_kzalloc(&pdev->dev, sizeof(struct virtio_plat_device),
> + GFP_KERNEL);
> + if (!vpdev)
> + return -ENOMEM;
> +
> + vpdev->vdev.dev.parent =&pdev->dev;
> + vpdev->vdev.config =&virtio_plat_config_ops;
> + vpdev->pdev = pdev;
> + INIT_LIST_HEAD(&vpdev->virtqueues);
> + spin_lock_init(&vpdev->lock);
> +
> + vpdev->base = devm_ioremap(&pdev->dev, mem->start, resource_size(mem));
> + if (vpdev->base == NULL)
> + return -EFAULT;
> +
> + /* TODO: check magic value (VIRTIO_PLAT_MAGIC_VALUE) */
> +
> + vpdev->vdev.id.device = readl(vpdev->base + VIRTIO_PLAT_DEVICE_ID);
> + vpdev->vdev.id.vendor = readl(vpdev->base + VIRTIO_PLAT_VENDOR_ID);
> +
> + platform_set_drvdata(pdev, vpdev);
> +
> + return register_virtio_device(&vpdev->vdev);
> +}
> +
> +static int __devexit virtio_plat_remove(struct platform_device *pdev)
> +{
> + struct virtio_plat_device *vpdev = platform_get_drvdata(pdev);
> +
> + unregister_virtio_device(&vpdev->vdev);
> +
> + return 0;
> +}
> +
> +
> +
> +/* Platform driver */
> +
> +static struct of_device_id virtio_plat_match[] = {
> + { .compatible = "virtio,platform", },
> + {},
> +};
> +MODULE_DEVICE_TABLE(of, virtio_plat_match);
> +
> +static struct platform_driver virtio_plat_driver = {
> + .probe = virtio_plat_probe,
> + .remove = __devexit_p(virtio_plat_remove),
> + .driver = {
> + .name = "virtio-platform",
> + .owner = THIS_MODULE,
> + .of_match_table = virtio_plat_match,
> + },
> +};
> +
> +static int __init virtio_plat_init(void)
> +{
> + return platform_driver_register(&virtio_plat_driver);
> +}
> +
> +static void __exit virtio_plat_exit(void)
> +{
> + platform_driver_unregister(&virtio_plat_driver);
> +}
> +
> +module_init(virtio_plat_init);
> +module_exit(virtio_plat_exit);
> +
> +MODULE_AUTHOR("Pawel Moll<pawel.moll@arm.com>");
> +MODULE_DESCRIPTION("Platform bus driver for virtio devices");
> +MODULE_LICENSE("GPL");
> diff --git a/include/linux/virtio_platform.h b/include/linux/virtio_platform.h
> new file mode 100644
> index 0000000..d4b26f1
> --- /dev/null
> +++ b/include/linux/virtio_platform.h
> @@ -0,0 +1,62 @@
> +/*
> + * Virtio platform device driver
> + *
> + * Copyright 2011, ARM Ltd.
> + *
> + * Based on Virtio PCI driver by Anthony Liguori, copyright IBM Corp. 2007
> + *
> + * This header is BSD licensed so anyone can use the definitions to implement
> + * compatible drivers/servers.
> + */
> +
> +#ifndef _LINUX_VIRTIO_PLATFORM_H
> +#define _LINUX_VIRTIO_PLATFORM_H
> +
> +/* Magic value ("virt" string == 0x74726976 Little Endian word */
> +#define VIRTIO_PLAT_MAGIC_VALUE 0x000
> +
> +/* Virtio device ID */
> +#define VIRTIO_PLAT_DEVICE_ID 0x004
> +
> +/* Virtio vendor ID */
> +#define VIRTIO_PLAT_VENDOR_ID 0x008
> +
> +/* Bitmask of the features supported by the host (32-bit register) */
> +#define VIRTIO_PLAT_HOST_FEATURES 0x010
> +
> +/* Bitmask of features activated by the guest (32-bit register) */
> +#define VIRTIO_PLAT_GUEST_FEATURES 0x020
> +
> +/* PFN for the currently selected queue (32-bit register) */
> +#define VIRTIO_PLAT_QUEUE_PFN 0x030
> +
> +/* Queue size for the currently selected queue (32-bit register) */
> +#define VIRTIO_PLAT_QUEUE_NUM 0x034
> +
> +/* Queue selector (32-bit register) */
> +#define VIRTIO_PLAT_QUEUE_SEL 0x038
> +
> +/* Queue notifier (32-bit register) */
> +#define VIRTIO_PLAT_QUEUE_NOTIFY 0x03c
> +
> +/* Interrupt acknowledge (32-bit register) */
> +#define VIRTIO_PLAT_INTERRUPT_ACK 0x040
> +
> +/* Device status register (8-bit register) */
> +#define VIRTIO_PLAT_STATUS 0x050
> +
> +/* The config space is defined by each driver as
> + * the per-driver configuration space */
> +#define VIRTIO_PLAT_CONFIG 0x100
> +
> +
> +
> +/* How many bits to shift physical queue address written to QUEUE_PFN.
> + * 12 is historical, and due to 4kb page size. */
> +#define VIRTIO_PLAT_QUEUE_ADDR_SHIFT 12
> +
> +/* The alignment to use between consumer and producer parts of vring.
> + * Page size again. */
> +#define VIRTIO_PLAT_VRING_ALIGN 4096
> +
> +#endif
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC 1/2] virtio: Add AMBA bus driver for virtio device
2011-09-12 16:45 ` Pawel Moll
[not found] ` <1315846301-13892-1-git-send-email-pawel.moll@arm.com>
@ 2011-09-13 2:58 ` Rusty Russell
2011-09-13 9:40 ` Pawel Moll
1 sibling, 1 reply; 9+ messages in thread
From: Rusty Russell @ 2011-09-13 2:58 UTC (permalink / raw)
To: linux-arm-kernel
On Mon, 12 Sep 2011 17:45:56 +0100, Pawel Moll <pawel.moll@arm.com> wrote:
> On Mon, 2011-09-12 at 11:01 +0930, Rusty Russell wrote:
> > You need to make these at least 64 bits. Lguest makes them variable
> > width, in fact.
>
> No problem, I've reserved 128 bits in the modified registers layout (see
> RFC v2), I can even change it into "offset/value" interface, which would
> give you variable width. One thing I don't get is how to access feature
> bits > 31? The get_features() seems
> to be limited to 32:
>
> struct virtio_config_ops {
> <...>
> u32 (*get_features)(struct virtio_device *vdev);
> <...>
> }
There's a patch for that already :)
> > > + * 0x008 32 QueuePFN PFN for the currently selected queue
> > > + * 0x00c 32 QueueNum Queue size for the currently selected queue
> >
> > You should, I believe, seriously consider allowing the guest to set the
> > queue size, rather than the host (perhaps the host could suggest one,
> > but the guest should be able to override it).
>
> I guess the QueueNum could be simply a read/write register - guest can
> read suggestion and override it writing it back. The same question
> though - how can guest change it? (I mean some API?)
We can sort that out later... the Guest may try some ambitious large
allocation and fall back if it fails.
> > Anthony or Michael might suggest other changes, since they are most
> > familiar with virtio_pci limitations...
>
> I've added them on the RFC v2 Cc as well - all feedback more then
> welcome!
Excellent... of course, the virtualization list is down at the moment,
making this a bit more awkward.
Cheers,
Rusty.
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC v2] arm: Add platform bus driver for virtio device
2011-09-12 17:27 ` [RFC v2] arm: Add platform " Anthony Liguori
@ 2011-09-13 8:58 ` Pawel Moll
0 siblings, 0 replies; 9+ messages in thread
From: Pawel Moll @ 2011-09-13 8:58 UTC (permalink / raw)
To: linux-arm-kernel
On Mon, 2011-09-12 at 12:27 -0500, Anthony Liguori wrote:
> > This patch, based on virtio PCI driver, adds support for memory
> > mapped (platform) virtio device. This should allow environments
> > like qemu to use virtio-based block& network devices.
>
> Are you planning on sending patches to QEMU for this? I think it makes
> sense to start in QEMU with this effort and make a proper spec from
> which you can write the driver against.
Even better :-) Peter Maydell (Linaro QEMU magician) agreed to do this
himself, once we reach some agreement regarding the interface.
Cheers!
Pawe?
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC 1/2] virtio: Add AMBA bus driver for virtio device
2011-09-13 2:58 ` [RFC 1/2] virtio: Add AMBA " Rusty Russell
@ 2011-09-13 9:40 ` Pawel Moll
2011-09-14 0:18 ` Rusty Russell
0 siblings, 1 reply; 9+ messages in thread
From: Pawel Moll @ 2011-09-13 9:40 UTC (permalink / raw)
To: linux-arm-kernel
> > No problem, I've reserved 128 bits in the modified registers layout (see
> > RFC v2), I can even change it into "offset/value" interface, which would
> > give you variable width. One thing I don't get is how to access feature
> > bits > 31? The get_features() seems
> > to be limited to 32:
> >
> > struct virtio_config_ops {
> > <...>
> > u32 (*get_features)(struct virtio_device *vdev);
> > <...>
> > }
>
> There's a patch for that already :)
Ok, so how about one of the following two options:
1. 4 * 32-bit words of feature flags, 128 bits in total:
* 0x010 32 HostFeatures0 Features supported by the host, bits 0-31
* 0x014 32 HostFeatures1 Features supported by the host, bits 32-63
* 0x018 32 HostFeatures2 Features supported by the host, bits 64-95
* 0x01c 32 HostFeatures3 Features supported by the host, bits 96-127
* 0x020 32 GuestFeatures0 Features activated by the guest, bits 0-31
* 0x024 32 GuestFeatures1 Features activated by the guest, bits 32-63
* 0x028 32 GuestFeatures2 Features activated by the guest, bits 64-95
* 0x02c 32 GuestFeatures3 Features activated by the guest, bits 96-127
2. "Select word -> read flags" interface:
* 0x010 32 HostFeaturesFlags Features supported by the host
* 0x014 32 HostFeaturesWord Set of features to access
* 0x020 32 GuestFeaturesFlags Features activated by the guest
* 0x024 32 GuestFeaturesWord Set of features to set
In this case the algorithm would be:
writel(0, HostFeaturesWord);
bits_0_31 = readl(HostFeaturesFlags);
writel(1, HostFeaturesWord);
bits_32_63 = readl(HostFeaturesFlags);
<etc>
The latter seems more "future-proof" to me, if slightly more complex...
Any other suggestions?
> > > > + * 0x008 32 QueuePFN PFN for the currently selected queue
> > > > + * 0x00c 32 QueueNum Queue size for the currently selected queue
> > >
> > > You should, I believe, seriously consider allowing the guest to set the
> > > queue size, rather than the host (perhaps the host could suggest one,
> > > but the guest should be able to override it).
> >
> > I guess the QueueNum could be simply a read/write register - guest can
> > read suggestion and override it writing it back. The same question
> > though - how can guest change it? (I mean some API?)
>
> We can sort that out later... the Guest may try some ambitious large
> allocation and fall back if it fails.
So, to my mind, the negotiations could look like this (from the Guest's
point of view):
1. (optional) The Guest is asking what size is "suggested" by the Host:
size = readl(QueueNum);
2. The Guest requests size it would like to use:
writel(optimal_size(size), QueueNum);
3. The Host does the best it can to accommodate the Guest and the Guest
checks the effective size:
size = real(QueueNum);
Does it make some sense?
> > > Anthony or Michael might suggest other changes, since they are most
> > > familiar with virtio_pci limitations...
> >
> > I've added them on the RFC v2 Cc as well - all feedback more then
> > welcome!
>
> Excellent... of course, the virtualization list is down at the moment,
> making this a bit more awkward.
Haha - at least once I can't blame our corporate IT for making my job
harder ;-)
Cheers!
Pawe?
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC 1/2] virtio: Add AMBA bus driver for virtio device
2011-09-13 9:40 ` Pawel Moll
@ 2011-09-14 0:18 ` Rusty Russell
2011-09-14 16:47 ` Pawel Moll
0 siblings, 1 reply; 9+ messages in thread
From: Rusty Russell @ 2011-09-14 0:18 UTC (permalink / raw)
To: linux-arm-kernel
On Tue, 13 Sep 2011 10:40:32 +0100, Pawel Moll <pawel.moll@arm.com> wrote:
> > > No problem, I've reserved 128 bits in the modified registers layout (see
> > > RFC v2), I can even change it into "offset/value" interface, which would
> > > give you variable width. One thing I don't get is how to access feature
> > > bits > 31? The get_features() seems
> > > to be limited to 32:
> > >
> > > struct virtio_config_ops {
> > > <...>
> > > u32 (*get_features)(struct virtio_device *vdev);
> > > <...>
> > > }
> >
> > There's a patch for that already :)
>
> Ok, so how about one of the following two options:
>
> 1. 4 * 32-bit words of feature flags, 128 bits in total:
>
> * 0x010 32 HostFeatures0 Features supported by the host, bits 0-31
> * 0x014 32 HostFeatures1 Features supported by the host, bits 32-63
> * 0x018 32 HostFeatures2 Features supported by the host, bits 64-95
> * 0x01c 32 HostFeatures3 Features supported by the host, bits 96-127
>
> * 0x020 32 GuestFeatures0 Features activated by the guest, bits 0-31
> * 0x024 32 GuestFeatures1 Features activated by the guest, bits 32-63
> * 0x028 32 GuestFeatures2 Features activated by the guest, bits 64-95
> * 0x02c 32 GuestFeatures3 Features activated by the guest, bits 96-127
>
> 2. "Select word -> read flags" interface:
>
> * 0x010 32 HostFeaturesFlags Features supported by the host
> * 0x014 32 HostFeaturesWord Set of features to access
>
> * 0x020 32 GuestFeaturesFlags Features activated by the guest
> * 0x024 32 GuestFeaturesWord Set of features to set
>
> In this case the algorithm would be:
>
> writel(0, HostFeaturesWord);
> bits_0_31 = readl(HostFeaturesFlags);
> writel(1, HostFeaturesWord);
> bits_32_63 = readl(HostFeaturesFlags);
> <etc>
>
> The latter seems more "future-proof" to me, if slightly more complex...
> Any other suggestions?
The former is almost certainly sufficient (remember, we can abuse the
final feature bit to indicate there are extended features somewhere
else, if we really have to). But agree the latter is preferable: this
isn't time-critical.
> > We can sort that out later... the Guest may try some ambitious large
> > allocation and fall back if it fails.
>
> So, to my mind, the negotiations could look like this (from the Guest's
> point of view):
>
> 1. (optional) The Guest is asking what size is "suggested" by the Host:
>
> size = readl(QueueNum);
>
> 2. The Guest requests size it would like to use:
>
> writel(optimal_size(size), QueueNum);
>
> 3. The Host does the best it can to accommodate the Guest and the Guest
> checks the effective size:
>
> size = real(QueueNum);
>
> Does it make some sense?
Does Host care? It is sufficient for it to publish min/max values?
You might want to throw the alignment value in there; there has been
talk about adding that to PCI, for really small rings (less than a
page).
Cheers,
Rusty.
^ permalink raw reply [flat|nested] 9+ messages in thread
* [RFC 1/2] virtio: Add AMBA bus driver for virtio device
2011-09-14 0:18 ` Rusty Russell
@ 2011-09-14 16:47 ` Pawel Moll
0 siblings, 0 replies; 9+ messages in thread
From: Pawel Moll @ 2011-09-14 16:47 UTC (permalink / raw)
To: linux-arm-kernel
> > 2. "Select word -> read flags" interface:
> >
> > * 0x010 32 HostFeaturesFlags Features supported by the host
> > * 0x014 32 HostFeaturesWord Set of features to access
> >
> > * 0x020 32 GuestFeaturesFlags Features activated by the guest
> > * 0x024 32 GuestFeaturesWord Set of features to set
> >
> > In this case the algorithm would be:
> >
> > writel(0, HostFeaturesWord);
> > bits_0_31 = readl(HostFeaturesFlags);
> > writel(1, HostFeaturesWord);
> > bits_32_63 = readl(HostFeaturesFlags);
> > <etc>
> >
> > The latter seems more "future-proof" to me, if slightly more complex...
> > Any other suggestions?
>
> The former is almost certainly sufficient (remember, we can abuse the
> final feature bit to indicate there are extended features somewhere
> else, if we really have to). But agree the latter is preferable: this
> isn't time-critical.
Ok, I'll do this in v3 then.
> > > We can sort that out later... the Guest may try some ambitious large
> > > allocation and fall back if it fails.
> >
> > So, to my mind, the negotiations could look like this (from the Guest's
> > point of view):
> >
> > 1. (optional) The Guest is asking what size is "suggested" by the Host:
> >
> > size = readl(QueueNum);
> >
> > 2. The Guest requests size it would like to use:
> >
> > writel(optimal_size(size), QueueNum);
> >
> > 3. The Host does the best it can to accommodate the Guest and the Guest
> > checks the effective size:
> >
> > size = real(QueueNum);
> >
> > Does it make some sense?
>
> Does Host care? It is sufficient for it to publish min/max values?
Hm. You're right, it should no care. Most likely the queue size will
impact some Host's memory allocation size, but that's the Host's
problem :-)
At the same time, how the Host would know what the max value is?
Generally it looks like the matter was getting back to value suggested
by the Host (as it is now) that may be changed by the Guest. Single R/W
register should be fine, I believe?
> You might want to throw the alignment value in there; there has been
> talk about adding that to PCI, for really small rings (less than a
> page).
Sure thing, will do.
v3 will be ready tomorrow.
Cheers!
Pawe?
^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2011-09-14 16:47 UTC | newest]
Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-09-02 12:24 [RFC 0/2] Memory mapped virtio device Pawel Moll
[not found] ` <1314966257-7503-2-git-send-email-pawel.moll@arm.com>
2011-09-12 1:31 ` [RFC 1/2] virtio: Add AMBA bus driver for " Rusty Russell
2011-09-12 16:45 ` Pawel Moll
[not found] ` <1315846301-13892-1-git-send-email-pawel.moll@arm.com>
2011-09-12 17:27 ` [RFC v2] arm: Add platform " Anthony Liguori
2011-09-13 8:58 ` Pawel Moll
2011-09-13 2:58 ` [RFC 1/2] virtio: Add AMBA " Rusty Russell
2011-09-13 9:40 ` Pawel Moll
2011-09-14 0:18 ` Rusty Russell
2011-09-14 16:47 ` Pawel Moll
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).