* [PATCH v3 0/2] EEH Error Injection Support for VFIO Devices @ 2015-03-20 19:58 Gavin Shan 2015-03-20 19:58 ` [PATCH v3 1/2] powerpc/eeh: Introduce eeh_pe_inject_err() Gavin Shan 2015-03-20 19:58 ` [PATCH v3 2/2] drivers/vfio: Support EEH error injection Gavin Shan 0 siblings, 2 replies; 14+ messages in thread From: Gavin Shan @ 2015-03-20 19:58 UTC (permalink / raw) To: linuxppc-dev; +Cc: kvm, aik, Gavin Shan, agraf, alex.williamson, david Those two patches are extention to EEH support for VFIO PCI devices, which allows to inject EEH errors to VFIO PCI devices from userspace for testing purpose. Changelog ========= v2 -> v3: * Use offsetofend(), instead of sizeof(struct vfio_eeh_pe_op) to calculate argument buffer size. v1 -> v2: * Use EEH_ERR_FUNC_{MIN,MAX} to validate PCI error function. * Put additional arguments for error injection to union in struct vfio_eeh_pe_op. Gavin Shan (2): powerpc/eeh: Introduce eeh_pe_inject_err() drivers/vfio: Support EEH error injection Documentation/vfio.txt | 12 ++++++++++++ arch/powerpc/include/asm/eeh.h | 26 ++++++++++++++++++++++++++ arch/powerpc/kernel/eeh.c | 35 +++++++++++++++++++++++++++++++++++ drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- 5 files changed, 118 insertions(+), 1 deletion(-) -- 1.8.3.2 ^ permalink raw reply [flat|nested] 14+ messages in thread
* [PATCH v3 1/2] powerpc/eeh: Introduce eeh_pe_inject_err() 2015-03-20 19:58 [PATCH v3 0/2] EEH Error Injection Support for VFIO Devices Gavin Shan @ 2015-03-20 19:58 ` Gavin Shan 2015-03-23 1:33 ` David Gibson 2015-03-20 19:58 ` [PATCH v3 2/2] drivers/vfio: Support EEH error injection Gavin Shan 1 sibling, 1 reply; 14+ messages in thread From: Gavin Shan @ 2015-03-20 19:58 UTC (permalink / raw) To: linuxppc-dev; +Cc: kvm, aik, Gavin Shan, agraf, alex.williamson, david The patch defines PCI error types and functions in eeh.h and exports function eeh_pe_inject_err(), which will be called by VFIO driver to inject the specified PCI error to the indicated PE for testing purpose. Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> --- arch/powerpc/include/asm/eeh.h | 26 ++++++++++++++++++++++++++ arch/powerpc/kernel/eeh.c | 35 +++++++++++++++++++++++++++++++++++ 2 files changed, 61 insertions(+) diff --git a/arch/powerpc/include/asm/eeh.h b/arch/powerpc/include/asm/eeh.h index 61912fc..85a17de 100644 --- a/arch/powerpc/include/asm/eeh.h +++ b/arch/powerpc/include/asm/eeh.h @@ -194,6 +194,30 @@ enum { #define EEH_RESET_FUNDAMENTAL 3 /* Fundamental reset */ #define EEH_LOG_TEMP 1 /* EEH temporary error log */ #define EEH_LOG_PERM 2 /* EEH permanent error log */ +#define EEH_ERR_TYPE_32 0 /* 32-bits PCI error */ +#define EEH_ERR_TYPE_64 1 /* 64-bits PCI error */ +#define EEH_ERR_FUNC_MIN 0 +#define EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ +#define EEH_ERR_FUNC_LD_MEM_DATA 1 +#define EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ +#define EEH_ERR_FUNC_LD_IO_DATA 3 +#define EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ +#define EEH_ERR_FUNC_LD_CFG_DATA 5 +#define EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ +#define EEH_ERR_FUNC_ST_MEM_DATA 7 +#define EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ +#define EEH_ERR_FUNC_ST_IO_DATA 9 +#define EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ +#define EEH_ERR_FUNC_ST_CFG_DATA 11 +#define EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ +#define EEH_ERR_FUNC_DMA_RD_DATA 13 +#define EEH_ERR_FUNC_DMA_RD_MASTER 14 +#define EEH_ERR_FUNC_DMA_RD_TARGET 15 +#define EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ +#define EEH_ERR_FUNC_DMA_WR_DATA 17 +#define EEH_ERR_FUNC_DMA_WR_MASTER 18 +#define EEH_ERR_FUNC_DMA_WR_TARGET 19 +#define EEH_ERR_FUNC_MAX 19 struct eeh_ops { char *name; @@ -293,6 +317,8 @@ int eeh_pe_set_option(struct eeh_pe *pe, int option); int eeh_pe_get_state(struct eeh_pe *pe); int eeh_pe_reset(struct eeh_pe *pe, int option); int eeh_pe_configure(struct eeh_pe *pe); +int eeh_pe_inject_err(struct eeh_pe *pe, int type, int func, + unsigned long addr, unsigned long mask); /** * EEH_POSSIBLE_ERROR() -- test for possible MMIO failure. diff --git a/arch/powerpc/kernel/eeh.c b/arch/powerpc/kernel/eeh.c index 76253eb..daa68a1 100644 --- a/arch/powerpc/kernel/eeh.c +++ b/arch/powerpc/kernel/eeh.c @@ -1636,6 +1636,41 @@ int eeh_pe_configure(struct eeh_pe *pe) } EXPORT_SYMBOL_GPL(eeh_pe_configure); +/** + * eeh_pe_inject_err - Injecting the specified PCI error to the indicated PE + * @pe: the indicated PE + * @type: error type + * @function: error function + * @addr: address + * @mask: address mask + * + * The routine is called to inject the specified PCI error, which + * is determined by @type and @function, to the indicated PE for + * testing purpose. + */ +int eeh_pe_inject_err(struct eeh_pe *pe, int type, int func, + unsigned long addr, unsigned long mask) +{ + /* Invalid PE ? */ + if (!pe) + return -ENODEV; + + /* Unsupported operation ? */ + if (!eeh_ops || !eeh_ops->err_inject) + return -ENOENT; + + /* Check on PCI error type */ + if (type != EEH_ERR_TYPE_32 && type != EEH_ERR_TYPE_64) + return -EINVAL; + + /* Check on PCI error function */ + if (func < EEH_ERR_FUNC_MIN || func > EEH_ERR_FUNC_MAX) + return -EINVAL; + + return eeh_ops->err_inject(pe, type, func, addr, mask); +} +EXPORT_SYMBOL_GPL(eeh_pe_inject_err); + static int proc_eeh_show(struct seq_file *m, void *v) { if (!eeh_enabled()) { -- 1.8.3.2 ^ permalink raw reply related [flat|nested] 14+ messages in thread
* Re: [PATCH v3 1/2] powerpc/eeh: Introduce eeh_pe_inject_err() 2015-03-20 19:58 ` [PATCH v3 1/2] powerpc/eeh: Introduce eeh_pe_inject_err() Gavin Shan @ 2015-03-23 1:33 ` David Gibson 0 siblings, 0 replies; 14+ messages in thread From: David Gibson @ 2015-03-23 1:33 UTC (permalink / raw) To: Gavin Shan; +Cc: aik, linuxppc-dev, alex.williamson, agraf, kvm [-- Attachment #1: Type: text/plain, Size: 599 bytes --] On Sat, Mar 21, 2015 at 06:58:44AM +1100, Gavin Shan wrote: > The patch defines PCI error types and functions in eeh.h and > exports function eeh_pe_inject_err(), which will be called by > VFIO driver to inject the specified PCI error to the indicated > PE for testing purpose. > > Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> -- David Gibson | I'll have my music baroque, and my code david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ | _way_ _around_! http://www.ozlabs.org/~dgibson [-- Attachment #2: Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-20 19:58 [PATCH v3 0/2] EEH Error Injection Support for VFIO Devices Gavin Shan 2015-03-20 19:58 ` [PATCH v3 1/2] powerpc/eeh: Introduce eeh_pe_inject_err() Gavin Shan @ 2015-03-20 19:58 ` Gavin Shan 2015-03-23 1:39 ` David Gibson 1 sibling, 1 reply; 14+ messages in thread From: Gavin Shan @ 2015-03-20 19:58 UTC (permalink / raw) To: linuxppc-dev; +Cc: kvm, aik, Gavin Shan, agraf, alex.williamson, david The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) to inject the specified EEH error, which is represented by (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> --- Documentation/vfio.txt | 12 ++++++++++++ drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- 3 files changed, 57 insertions(+), 1 deletion(-) diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt index 96978ec..c6e11a3 100644 --- a/Documentation/vfio.txt +++ b/Documentation/vfio.txt @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: .... + /* Inject EEH error, which is expected to be caused by 32-bits + * config load. + */ + pe_op.op = VFIO_EEH_PE_INJECT_ERR; + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; + pe_op.err.addr = 0ul; + pe_op.err.mask = 0ul; + ioctl(container, VFIO_EEH_PE_OP, &pe_op); + + .... + /* When 0xFF's returned from reading PCI config space or IO BARs * of the PCI device. Check the PE's state to see if that has been * frozen. diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c index 5fa42db..38edeb4 100644 --- a/drivers/vfio/vfio_spapr_eeh.c +++ b/drivers/vfio/vfio_spapr_eeh.c @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, case VFIO_EEH_PE_CONFIGURE: ret = eeh_pe_configure(pe); break; + case VFIO_EEH_PE_INJECT_ERR: + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); + if (op.argsz < minsz) + return -EINVAL; + if (copy_from_user(&op, (void __user *)arg, minsz)) + return -EFAULT; + + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, + op.err.addr, op.err.mask); + break; default: ret = -EINVAL; } diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h index 82889c3..f68e962 100644 --- a/include/uapi/linux/vfio.h +++ b/include/uapi/linux/vfio.h @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { * - unfreeze IO/DMA for frozen PE; * - read PE state; * - reset PE; - * - configure PE. + * - configure PE; + * - inject EEH error. */ +struct vfio_eeh_pe_err { + __u32 type; + __u32 func; + __u64 addr; + __u64 mask; +}; + struct vfio_eeh_pe_op { __u32 argsz; __u32 flags; __u32 op; + union { + struct vfio_eeh_pe_err err; + }; }; #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 #define VFIO_EEH_PE_OP _IO(VFIO_TYPE, VFIO_BASE + 21) -- 1.8.3.2 ^ permalink raw reply related [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-20 19:58 ` [PATCH v3 2/2] drivers/vfio: Support EEH error injection Gavin Shan @ 2015-03-23 1:39 ` David Gibson 2015-03-23 1:56 ` Gavin Shan 0 siblings, 1 reply; 14+ messages in thread From: David Gibson @ 2015-03-23 1:39 UTC (permalink / raw) To: Gavin Shan; +Cc: aik, linuxppc-dev, alex.williamson, agraf, kvm [-- Attachment #1: Type: text/plain, Size: 4813 bytes --] On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: > The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) > to inject the specified EEH error, which is represented by > (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. > > Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> Reviewed-by: David Gibson <david@gibson.dropbear.id.au> > --- > Documentation/vfio.txt | 12 ++++++++++++ > drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ > include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- > 3 files changed, 57 insertions(+), 1 deletion(-) > > diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt > index 96978ec..c6e11a3 100644 > --- a/Documentation/vfio.txt > +++ b/Documentation/vfio.txt > @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: > > .... > > + /* Inject EEH error, which is expected to be caused by 32-bits > + * config load. > + */ > + pe_op.op = VFIO_EEH_PE_INJECT_ERR; > + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; > + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; > + pe_op.err.addr = 0ul; > + pe_op.err.mask = 0ul; > + ioctl(container, VFIO_EEH_PE_OP, &pe_op); > + > + .... > + > /* When 0xFF's returned from reading PCI config space or IO BARs > * of the PCI device. Check the PE's state to see if that has been > * frozen. > diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c > index 5fa42db..38edeb4 100644 > --- a/drivers/vfio/vfio_spapr_eeh.c > +++ b/drivers/vfio/vfio_spapr_eeh.c > @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, > case VFIO_EEH_PE_CONFIGURE: > ret = eeh_pe_configure(pe); > break; > + case VFIO_EEH_PE_INJECT_ERR: > + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); > + if (op.argsz < minsz) > + return -EINVAL; > + if (copy_from_user(&op, (void __user *)arg, minsz)) > + return -EFAULT; > + > + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, > + op.err.addr, op.err.mask); > + break; > default: > ret = -EINVAL; > } > diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h > index 82889c3..f68e962 100644 > --- a/include/uapi/linux/vfio.h > +++ b/include/uapi/linux/vfio.h > @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { > * - unfreeze IO/DMA for frozen PE; > * - read PE state; > * - reset PE; > - * - configure PE. > + * - configure PE; > + * - inject EEH error. > */ > +struct vfio_eeh_pe_err { > + __u32 type; > + __u32 func; > + __u64 addr; > + __u64 mask; > +}; > + > struct vfio_eeh_pe_op { > __u32 argsz; > __u32 flags; > __u32 op; > + union { > + struct vfio_eeh_pe_err err; > + }; > }; > > #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ > @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { > #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ > #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ > #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ > +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ > +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ > +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ > +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ > +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 > +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ > +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 > +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ > +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 > +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ > +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 > +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ > +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 > +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ > +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 > +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ > +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 > +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 > +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 > +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ > +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 > +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 > +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 These all seem to duplicate (under slightly different names) the defines you just added in arch/powerpc/include/asm/eeh.h. It would make more sense to only have the uapi copy, surely. -- David Gibson | I'll have my music baroque, and my code david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ | _way_ _around_! http://www.ozlabs.org/~dgibson [-- Attachment #2: Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 1:39 ` David Gibson @ 2015-03-23 1:56 ` Gavin Shan 2015-03-23 3:43 ` David Gibson 0 siblings, 1 reply; 14+ messages in thread From: Gavin Shan @ 2015-03-23 1:56 UTC (permalink / raw) To: David Gibson; +Cc: kvm, aik, Gavin Shan, agraf, linuxppc-dev, alex.williamson On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) >> to inject the specified EEH error, which is represented by >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> > >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> > >> --- >> Documentation/vfio.txt | 12 ++++++++++++ >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- >> 3 files changed, 57 insertions(+), 1 deletion(-) >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt >> index 96978ec..c6e11a3 100644 >> --- a/Documentation/vfio.txt >> +++ b/Documentation/vfio.txt >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: >> >> .... >> >> + /* Inject EEH error, which is expected to be caused by 32-bits >> + * config load. >> + */ >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; >> + pe_op.err.addr = 0ul; >> + pe_op.err.mask = 0ul; >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); >> + >> + .... >> + >> /* When 0xFF's returned from reading PCI config space or IO BARs >> * of the PCI device. Check the PE's state to see if that has been >> * frozen. >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c >> index 5fa42db..38edeb4 100644 >> --- a/drivers/vfio/vfio_spapr_eeh.c >> +++ b/drivers/vfio/vfio_spapr_eeh.c >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, >> case VFIO_EEH_PE_CONFIGURE: >> ret = eeh_pe_configure(pe); >> break; >> + case VFIO_EEH_PE_INJECT_ERR: >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); >> + if (op.argsz < minsz) >> + return -EINVAL; >> + if (copy_from_user(&op, (void __user *)arg, minsz)) >> + return -EFAULT; >> + >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, >> + op.err.addr, op.err.mask); >> + break; >> default: >> ret = -EINVAL; >> } >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h >> index 82889c3..f68e962 100644 >> --- a/include/uapi/linux/vfio.h >> +++ b/include/uapi/linux/vfio.h >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { >> * - unfreeze IO/DMA for frozen PE; >> * - read PE state; >> * - reset PE; >> - * - configure PE. >> + * - configure PE; >> + * - inject EEH error. >> */ >> +struct vfio_eeh_pe_err { >> + __u32 type; >> + __u32 func; >> + __u64 addr; >> + __u64 mask; >> +}; >> + >> struct vfio_eeh_pe_op { >> __u32 argsz; >> __u32 flags; >> __u32 op; >> + union { >> + struct vfio_eeh_pe_err err; >> + }; >> }; >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 > >These all seem to duplicate (under slightly different names) the >defines you just added in arch/powerpc/include/asm/eeh.h. It would >make more sense to only have the uapi copy, surely. > David, Thanks for your time. The defines in eeh.h is used not only for mapping those constants defined in vfio.h, but also for validating arguments from debugfs entry "err_injct". So it'd better to keep them separate: one set is used by userspace, another set is used by host kernel. By the way, Alex Williamson, could you please ACK this if you don't find obvious errors in the patch. I really want this to be merged to 4.1. Thanks, Gavin >-- >David Gibson | I'll have my music baroque, and my code >david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ > | _way_ _around_! >http://www.ozlabs.org/~dgibson ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 1:56 ` Gavin Shan @ 2015-03-23 3:43 ` David Gibson 2015-03-23 5:03 ` Gavin Shan 0 siblings, 1 reply; 14+ messages in thread From: David Gibson @ 2015-03-23 3:43 UTC (permalink / raw) To: Gavin Shan; +Cc: aik, linuxppc-dev, alex.williamson, agraf, kvm [-- Attachment #1: Type: text/plain, Size: 5953 bytes --] On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: > On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: > >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: > >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) > >> to inject the specified EEH error, which is represented by > >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. > >> > >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> > > > >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> > > > >> --- > >> Documentation/vfio.txt | 12 ++++++++++++ > >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ > >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- > >> 3 files changed, 57 insertions(+), 1 deletion(-) > >> > >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt > >> index 96978ec..c6e11a3 100644 > >> --- a/Documentation/vfio.txt > >> +++ b/Documentation/vfio.txt > >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: > >> > >> .... > >> > >> + /* Inject EEH error, which is expected to be caused by 32-bits > >> + * config load. > >> + */ > >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; > >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; > >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; > >> + pe_op.err.addr = 0ul; > >> + pe_op.err.mask = 0ul; > >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); > >> + > >> + .... > >> + > >> /* When 0xFF's returned from reading PCI config space or IO BARs > >> * of the PCI device. Check the PE's state to see if that has been > >> * frozen. > >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c > >> index 5fa42db..38edeb4 100644 > >> --- a/drivers/vfio/vfio_spapr_eeh.c > >> +++ b/drivers/vfio/vfio_spapr_eeh.c > >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, > >> case VFIO_EEH_PE_CONFIGURE: > >> ret = eeh_pe_configure(pe); > >> break; > >> + case VFIO_EEH_PE_INJECT_ERR: > >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); > >> + if (op.argsz < minsz) > >> + return -EINVAL; > >> + if (copy_from_user(&op, (void __user *)arg, minsz)) > >> + return -EFAULT; > >> + > >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, > >> + op.err.addr, op.err.mask); > >> + break; > >> default: > >> ret = -EINVAL; > >> } > >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h > >> index 82889c3..f68e962 100644 > >> --- a/include/uapi/linux/vfio.h > >> +++ b/include/uapi/linux/vfio.h > >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { > >> * - unfreeze IO/DMA for frozen PE; > >> * - read PE state; > >> * - reset PE; > >> - * - configure PE. > >> + * - configure PE; > >> + * - inject EEH error. > >> */ > >> +struct vfio_eeh_pe_err { > >> + __u32 type; > >> + __u32 func; > >> + __u64 addr; > >> + __u64 mask; > >> +}; > >> + > >> struct vfio_eeh_pe_op { > >> __u32 argsz; > >> __u32 flags; > >> __u32 op; > >> + union { > >> + struct vfio_eeh_pe_err err; > >> + }; > >> }; > >> > >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ > >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { > >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ > >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ > >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ > >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ > >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ > >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ > >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ > >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 > >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ > >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 > >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ > >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 > >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ > >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 > >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ > >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 > >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ > >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 > >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ > >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 > >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 > >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 > >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ > >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 > >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 > >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 > > > >These all seem to duplicate (under slightly different names) the > >defines you just added in arch/powerpc/include/asm/eeh.h. It would > >make more sense to only have the uapi copy, surely. > > > > David, Thanks for your time. The defines in eeh.h is used not only > for mapping those constants defined in vfio.h, but also for validating > arguments from debugfs entry "err_injct". So it'd better to keep > them separate: one set is used by userspace, another set is used > by host kernel. I don't understand why this is an argument for duplicating them. They have the same meanings and values, why can't you use the uapi versions everywhere? > By the way, Alex Williamson, could you please ACK this if you don't > find obvious errors in the patch. I really want this to be merged > to 4.1. > > Thanks, > Gavin > > > -- David Gibson | I'll have my music baroque, and my code david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ | _way_ _around_! http://www.ozlabs.org/~dgibson [-- Attachment #2: Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 3:43 ` David Gibson @ 2015-03-23 5:03 ` Gavin Shan 2015-03-23 5:10 ` David Gibson 0 siblings, 1 reply; 14+ messages in thread From: Gavin Shan @ 2015-03-23 5:03 UTC (permalink / raw) To: David Gibson; +Cc: kvm, aik, Gavin Shan, agraf, linuxppc-dev, alex.williamson On Mon, Mar 23, 2015 at 02:43:03PM +1100, David Gibson wrote: >On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: >> On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: >> >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: >> >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) >> >> to inject the specified EEH error, which is represented by >> >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. >> >> >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> >> > >> >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> >> > >> >> --- >> >> Documentation/vfio.txt | 12 ++++++++++++ >> >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ >> >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- >> >> 3 files changed, 57 insertions(+), 1 deletion(-) >> >> >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt >> >> index 96978ec..c6e11a3 100644 >> >> --- a/Documentation/vfio.txt >> >> +++ b/Documentation/vfio.txt >> >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: >> >> >> >> .... >> >> >> >> + /* Inject EEH error, which is expected to be caused by 32-bits >> >> + * config load. >> >> + */ >> >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; >> >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; >> >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; >> >> + pe_op.err.addr = 0ul; >> >> + pe_op.err.mask = 0ul; >> >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); >> >> + >> >> + .... >> >> + >> >> /* When 0xFF's returned from reading PCI config space or IO BARs >> >> * of the PCI device. Check the PE's state to see if that has been >> >> * frozen. >> >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c >> >> index 5fa42db..38edeb4 100644 >> >> --- a/drivers/vfio/vfio_spapr_eeh.c >> >> +++ b/drivers/vfio/vfio_spapr_eeh.c >> >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, >> >> case VFIO_EEH_PE_CONFIGURE: >> >> ret = eeh_pe_configure(pe); >> >> break; >> >> + case VFIO_EEH_PE_INJECT_ERR: >> >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); >> >> + if (op.argsz < minsz) >> >> + return -EINVAL; >> >> + if (copy_from_user(&op, (void __user *)arg, minsz)) >> >> + return -EFAULT; >> >> + >> >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, >> >> + op.err.addr, op.err.mask); >> >> + break; >> >> default: >> >> ret = -EINVAL; >> >> } >> >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h >> >> index 82889c3..f68e962 100644 >> >> --- a/include/uapi/linux/vfio.h >> >> +++ b/include/uapi/linux/vfio.h >> >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { >> >> * - unfreeze IO/DMA for frozen PE; >> >> * - read PE state; >> >> * - reset PE; >> >> - * - configure PE. >> >> + * - configure PE; >> >> + * - inject EEH error. >> >> */ >> >> +struct vfio_eeh_pe_err { >> >> + __u32 type; >> >> + __u32 func; >> >> + __u64 addr; >> >> + __u64 mask; >> >> +}; >> >> + >> >> struct vfio_eeh_pe_op { >> >> __u32 argsz; >> >> __u32 flags; >> >> __u32 op; >> >> + union { >> >> + struct vfio_eeh_pe_err err; >> >> + }; >> >> }; >> >> >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ >> >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { >> >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ >> >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ >> >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ >> >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ >> >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ >> >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 >> > >> >These all seem to duplicate (under slightly different names) the >> >defines you just added in arch/powerpc/include/asm/eeh.h. It would >> >make more sense to only have the uapi copy, surely. >> > >> >> David, Thanks for your time. The defines in eeh.h is used not only >> for mapping those constants defined in vfio.h, but also for validating >> arguments from debugfs entry "err_injct". So it'd better to keep >> them separate: one set is used by userspace, another set is used >> by host kernel. > >I don't understand why this is an argument for duplicating them. They >have the same meanings and values, why can't you use the uapi versions >everywhere? > We can drop the constants defined in eeh.h and just use those defines in vfio.h. But I prefer separate constants for some ovbious reasons except the one I mentioned above :) - We already had separate sets of constants for EEH and VFIO. For example, VFIO_EEH_PE_STATE_* in include/uapi/vfio.h always have counter part in arch/powerpc/asm/include/eeh.h - The UAPI interface is expected to be stable, or back-compatible when introducing changes to it. The defines (at least their names) in eeh.h doesn't have to be stable. - I would like keep the code between EEH/VFIO independent to each other enough. Thanks, Gavin >> By the way, Alex Williamson, could you please ACK this if you don't >> find obvious errors in the patch. I really want this to be merged >> to 4.1. >> >> Thanks, >> Gavin >> >> >> > >-- >David Gibson | I'll have my music baroque, and my code >david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ > | _way_ _around_! >http://www.ozlabs.org/~dgibson ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 5:03 ` Gavin Shan @ 2015-03-23 5:10 ` David Gibson 2015-03-23 5:20 ` Gavin Shan 0 siblings, 1 reply; 14+ messages in thread From: David Gibson @ 2015-03-23 5:10 UTC (permalink / raw) To: Gavin Shan; +Cc: aik, linuxppc-dev, alex.williamson, agraf, kvm [-- Attachment #1: Type: text/plain, Size: 7688 bytes --] On Mon, Mar 23, 2015 at 04:03:59PM +1100, Gavin Shan wrote: > On Mon, Mar 23, 2015 at 02:43:03PM +1100, David Gibson wrote: > >On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: > >> On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: > >> >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: > >> >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) > >> >> to inject the specified EEH error, which is represented by > >> >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. > >> >> > >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> > >> > > >> >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> > >> > > >> >> --- > >> >> Documentation/vfio.txt | 12 ++++++++++++ > >> >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ > >> >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- > >> >> 3 files changed, 57 insertions(+), 1 deletion(-) > >> >> > >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt > >> >> index 96978ec..c6e11a3 100644 > >> >> --- a/Documentation/vfio.txt > >> >> +++ b/Documentation/vfio.txt > >> >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: > >> >> > >> >> .... > >> >> > >> >> + /* Inject EEH error, which is expected to be caused by 32-bits > >> >> + * config load. > >> >> + */ > >> >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; > >> >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; > >> >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; > >> >> + pe_op.err.addr = 0ul; > >> >> + pe_op.err.mask = 0ul; > >> >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); > >> >> + > >> >> + .... > >> >> + > >> >> /* When 0xFF's returned from reading PCI config space or IO BARs > >> >> * of the PCI device. Check the PE's state to see if that has been > >> >> * frozen. > >> >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c > >> >> index 5fa42db..38edeb4 100644 > >> >> --- a/drivers/vfio/vfio_spapr_eeh.c > >> >> +++ b/drivers/vfio/vfio_spapr_eeh.c > >> >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, > >> >> case VFIO_EEH_PE_CONFIGURE: > >> >> ret = eeh_pe_configure(pe); > >> >> break; > >> >> + case VFIO_EEH_PE_INJECT_ERR: > >> >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); > >> >> + if (op.argsz < minsz) > >> >> + return -EINVAL; > >> >> + if (copy_from_user(&op, (void __user *)arg, minsz)) > >> >> + return -EFAULT; > >> >> + > >> >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, > >> >> + op.err.addr, op.err.mask); > >> >> + break; > >> >> default: > >> >> ret = -EINVAL; > >> >> } > >> >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h > >> >> index 82889c3..f68e962 100644 > >> >> --- a/include/uapi/linux/vfio.h > >> >> +++ b/include/uapi/linux/vfio.h > >> >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { > >> >> * - unfreeze IO/DMA for frozen PE; > >> >> * - read PE state; > >> >> * - reset PE; > >> >> - * - configure PE. > >> >> + * - configure PE; > >> >> + * - inject EEH error. > >> >> */ > >> >> +struct vfio_eeh_pe_err { > >> >> + __u32 type; > >> >> + __u32 func; > >> >> + __u64 addr; > >> >> + __u64 mask; > >> >> +}; > >> >> + > >> >> struct vfio_eeh_pe_op { > >> >> __u32 argsz; > >> >> __u32 flags; > >> >> __u32 op; > >> >> + union { > >> >> + struct vfio_eeh_pe_err err; > >> >> + }; > >> >> }; > >> >> > >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ > >> >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { > >> >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ > >> >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ > >> >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ > >> >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ > >> >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ > >> >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ > >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ > >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 > >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ > >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 > >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ > >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 > >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ > >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 > >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ > >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 > >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ > >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 > >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 > >> > > >> >These all seem to duplicate (under slightly different names) the > >> >defines you just added in arch/powerpc/include/asm/eeh.h. It would > >> >make more sense to only have the uapi copy, surely. > >> > > >> > >> David, Thanks for your time. The defines in eeh.h is used not only > >> for mapping those constants defined in vfio.h, but also for validating > >> arguments from debugfs entry "err_injct". So it'd better to keep > >> them separate: one set is used by userspace, another set is used > >> by host kernel. > > > >I don't understand why this is an argument for duplicating them. They > >have the same meanings and values, why can't you use the uapi versions > >everywhere? > > > > We can drop the constants defined in eeh.h and just use those defines > in vfio.h. But I prefer separate constants for some ovbious reasons > except the one I mentioned above :) > > - We already had separate sets of constants for EEH and VFIO. For example, > VFIO_EEH_PE_STATE_* in include/uapi/vfio.h always have counter part in > arch/powerpc/asm/include/eeh.h Oh.. I see your point. Yes, the EEH internal code shouldn't need to use the VFIO defined constants. But it's still silly to duplicate, I think the correct think would be to expose the EEH constants in uapi, and use those for VFIO as well. > - The UAPI interface is expected to be stable, or back-compatible when > introducing changes to it. The defines (at least their names) in eeh.h > doesn't have to be stable. But is there any reason not to make them stable. The VFIO case shows that we need to lock down an interface somewhere. I don't see that there's an advantage to having a not locked down interface in addition to a locked down one. > - I would like keep the code between EEH/VFIO independent to each other > enough. > > Thanks, > Gavin > > >> By the way, Alex Williamson, could you please ACK this if you don't > >> find obvious errors in the patch. I really want this to be merged > >> to 4.1. > >> > >> Thanks, > >> Gavin > >> > >> > >> > > > > -- David Gibson | I'll have my music baroque, and my code david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ | _way_ _around_! http://www.ozlabs.org/~dgibson [-- Attachment #2: Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 5:10 ` David Gibson @ 2015-03-23 5:20 ` Gavin Shan 2015-03-23 16:14 ` Alex Williamson 0 siblings, 1 reply; 14+ messages in thread From: Gavin Shan @ 2015-03-23 5:20 UTC (permalink / raw) To: David Gibson; +Cc: kvm, aik, Gavin Shan, agraf, linuxppc-dev, alex.williamson On Mon, Mar 23, 2015 at 04:10:20PM +1100, David Gibson wrote: >On Mon, Mar 23, 2015 at 04:03:59PM +1100, Gavin Shan wrote: >> On Mon, Mar 23, 2015 at 02:43:03PM +1100, David Gibson wrote: >> >On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: >> >> On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: >> >> >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: >> >> >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) >> >> >> to inject the specified EEH error, which is represented by >> >> >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. >> >> >> >> >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> >> >> > >> >> >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> >> >> > >> >> >> --- >> >> >> Documentation/vfio.txt | 12 ++++++++++++ >> >> >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ >> >> >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- >> >> >> 3 files changed, 57 insertions(+), 1 deletion(-) >> >> >> >> >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt >> >> >> index 96978ec..c6e11a3 100644 >> >> >> --- a/Documentation/vfio.txt >> >> >> +++ b/Documentation/vfio.txt >> >> >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: >> >> >> >> >> >> .... >> >> >> >> >> >> + /* Inject EEH error, which is expected to be caused by 32-bits >> >> >> + * config load. >> >> >> + */ >> >> >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; >> >> >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; >> >> >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; >> >> >> + pe_op.err.addr = 0ul; >> >> >> + pe_op.err.mask = 0ul; >> >> >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); >> >> >> + >> >> >> + .... >> >> >> + >> >> >> /* When 0xFF's returned from reading PCI config space or IO BARs >> >> >> * of the PCI device. Check the PE's state to see if that has been >> >> >> * frozen. >> >> >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c >> >> >> index 5fa42db..38edeb4 100644 >> >> >> --- a/drivers/vfio/vfio_spapr_eeh.c >> >> >> +++ b/drivers/vfio/vfio_spapr_eeh.c >> >> >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, >> >> >> case VFIO_EEH_PE_CONFIGURE: >> >> >> ret = eeh_pe_configure(pe); >> >> >> break; >> >> >> + case VFIO_EEH_PE_INJECT_ERR: >> >> >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); >> >> >> + if (op.argsz < minsz) >> >> >> + return -EINVAL; >> >> >> + if (copy_from_user(&op, (void __user *)arg, minsz)) >> >> >> + return -EFAULT; >> >> >> + >> >> >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, >> >> >> + op.err.addr, op.err.mask); >> >> >> + break; >> >> >> default: >> >> >> ret = -EINVAL; >> >> >> } >> >> >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h >> >> >> index 82889c3..f68e962 100644 >> >> >> --- a/include/uapi/linux/vfio.h >> >> >> +++ b/include/uapi/linux/vfio.h >> >> >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { >> >> >> * - unfreeze IO/DMA for frozen PE; >> >> >> * - read PE state; >> >> >> * - reset PE; >> >> >> - * - configure PE. >> >> >> + * - configure PE; >> >> >> + * - inject EEH error. >> >> >> */ >> >> >> +struct vfio_eeh_pe_err { >> >> >> + __u32 type; >> >> >> + __u32 func; >> >> >> + __u64 addr; >> >> >> + __u64 mask; >> >> >> +}; >> >> >> + >> >> >> struct vfio_eeh_pe_op { >> >> >> __u32 argsz; >> >> >> __u32 flags; >> >> >> __u32 op; >> >> >> + union { >> >> >> + struct vfio_eeh_pe_err err; >> >> >> + }; >> >> >> }; >> >> >> >> >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ >> >> >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { >> >> >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ >> >> >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ >> >> >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ >> >> >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ >> >> >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ >> >> >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 >> >> > >> >> >These all seem to duplicate (under slightly different names) the >> >> >defines you just added in arch/powerpc/include/asm/eeh.h. It would >> >> >make more sense to only have the uapi copy, surely. >> >> > >> >> >> >> David, Thanks for your time. The defines in eeh.h is used not only >> >> for mapping those constants defined in vfio.h, but also for validating >> >> arguments from debugfs entry "err_injct". So it'd better to keep >> >> them separate: one set is used by userspace, another set is used >> >> by host kernel. >> > >> >I don't understand why this is an argument for duplicating them. They >> >have the same meanings and values, why can't you use the uapi versions >> >everywhere? >> > >> >> We can drop the constants defined in eeh.h and just use those defines >> in vfio.h. But I prefer separate constants for some ovbious reasons >> except the one I mentioned above :) >> >> - We already had separate sets of constants for EEH and VFIO. For example, >> VFIO_EEH_PE_STATE_* in include/uapi/vfio.h always have counter part in >> arch/powerpc/asm/include/eeh.h > >Oh.. I see your point. > >Yes, the EEH internal code shouldn't need to use the VFIO defined >constants. > >But it's still silly to duplicate, I think the correct think would be >to expose the EEH constants in uapi, and use those for VFIO as well. > I'm not sure. If Alex.W agree on this, I can have one patch to address it in future :-) >> - The UAPI interface is expected to be stable, or back-compatible when >> introducing changes to it. The defines (at least their names) in eeh.h >> doesn't have to be stable. > >But is there any reason not to make them stable. The VFIO case shows >that we need to lock down an interface somewhere. I don't see that >there's an advantage to having a not locked down interface in addition >to a locked down one. > I was saying that the UAPI interace should be stable. I don't see there're any reasons the UAPI interface will be changed in near future :) Thanks, Gavin >> - I would like keep the code between EEH/VFIO independent to each other >> enough. >> >> Thanks, >> Gavin >> >> >> By the way, Alex Williamson, could you please ACK this if you don't >> >> find obvious errors in the patch. I really want this to be merged >> >> to 4.1. >> >> >> >> Thanks, >> >> Gavin >> >> >> >> >> >> >> > >> >> > >-- >David Gibson | I'll have my music baroque, and my code >david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ > | _way_ _around_! >http://www.ozlabs.org/~dgibson ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 5:20 ` Gavin Shan @ 2015-03-23 16:14 ` Alex Williamson 2015-03-23 22:47 ` Gavin Shan 0 siblings, 1 reply; 14+ messages in thread From: Alex Williamson @ 2015-03-23 16:14 UTC (permalink / raw) To: Gavin Shan; +Cc: aik, linuxppc-dev, agraf, kvm, David Gibson On Mon, 2015-03-23 at 16:20 +1100, Gavin Shan wrote: > On Mon, Mar 23, 2015 at 04:10:20PM +1100, David Gibson wrote: > >On Mon, Mar 23, 2015 at 04:03:59PM +1100, Gavin Shan wrote: > >> On Mon, Mar 23, 2015 at 02:43:03PM +1100, David Gibson wrote: > >> >On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: > >> >> On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: > >> >> >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: > >> >> >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) > >> >> >> to inject the specified EEH error, which is represented by > >> >> >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. > >> >> >> > >> >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> > >> >> > > >> >> >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> > >> >> > > >> >> >> --- > >> >> >> Documentation/vfio.txt | 12 ++++++++++++ > >> >> >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ > >> >> >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- > >> >> >> 3 files changed, 57 insertions(+), 1 deletion(-) > >> >> >> > >> >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt > >> >> >> index 96978ec..c6e11a3 100644 > >> >> >> --- a/Documentation/vfio.txt > >> >> >> +++ b/Documentation/vfio.txt > >> >> >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: > >> >> >> > >> >> >> .... > >> >> >> > >> >> >> + /* Inject EEH error, which is expected to be caused by 32-bits > >> >> >> + * config load. > >> >> >> + */ > >> >> >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; > >> >> >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; > >> >> >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; > >> >> >> + pe_op.err.addr = 0ul; > >> >> >> + pe_op.err.mask = 0ul; > >> >> >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); > >> >> >> + > >> >> >> + .... > >> >> >> + > >> >> >> /* When 0xFF's returned from reading PCI config space or IO BARs > >> >> >> * of the PCI device. Check the PE's state to see if that has been > >> >> >> * frozen. > >> >> >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c > >> >> >> index 5fa42db..38edeb4 100644 > >> >> >> --- a/drivers/vfio/vfio_spapr_eeh.c > >> >> >> +++ b/drivers/vfio/vfio_spapr_eeh.c > >> >> >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, > >> >> >> case VFIO_EEH_PE_CONFIGURE: > >> >> >> ret = eeh_pe_configure(pe); > >> >> >> break; > >> >> >> + case VFIO_EEH_PE_INJECT_ERR: > >> >> >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); > >> >> >> + if (op.argsz < minsz) > >> >> >> + return -EINVAL; > >> >> >> + if (copy_from_user(&op, (void __user *)arg, minsz)) > >> >> >> + return -EFAULT; > >> >> >> + > >> >> >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, > >> >> >> + op.err.addr, op.err.mask); > >> >> >> + break; > >> >> >> default: > >> >> >> ret = -EINVAL; > >> >> >> } > >> >> >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h > >> >> >> index 82889c3..f68e962 100644 > >> >> >> --- a/include/uapi/linux/vfio.h > >> >> >> +++ b/include/uapi/linux/vfio.h > >> >> >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { > >> >> >> * - unfreeze IO/DMA for frozen PE; > >> >> >> * - read PE state; > >> >> >> * - reset PE; > >> >> >> - * - configure PE. > >> >> >> + * - configure PE; > >> >> >> + * - inject EEH error. > >> >> >> */ > >> >> >> +struct vfio_eeh_pe_err { > >> >> >> + __u32 type; > >> >> >> + __u32 func; > >> >> >> + __u64 addr; > >> >> >> + __u64 mask; > >> >> >> +}; > >> >> >> + > >> >> >> struct vfio_eeh_pe_op { > >> >> >> __u32 argsz; > >> >> >> __u32 flags; > >> >> >> __u32 op; > >> >> >> + union { > >> >> >> + struct vfio_eeh_pe_err err; > >> >> >> + }; > >> >> >> }; > >> >> >> > >> >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ > >> >> >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { > >> >> >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ > >> >> >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ > >> >> >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ > >> >> >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ > >> >> >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ > >> >> >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 > >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 > >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 > >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 > >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 > >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 > >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 > >> >> > > >> >> >These all seem to duplicate (under slightly different names) the > >> >> >defines you just added in arch/powerpc/include/asm/eeh.h. It would > >> >> >make more sense to only have the uapi copy, surely. > >> >> > > >> >> > >> >> David, Thanks for your time. The defines in eeh.h is used not only > >> >> for mapping those constants defined in vfio.h, but also for validating > >> >> arguments from debugfs entry "err_injct". So it'd better to keep > >> >> them separate: one set is used by userspace, another set is used > >> >> by host kernel. > >> > > >> >I don't understand why this is an argument for duplicating them. They > >> >have the same meanings and values, why can't you use the uapi versions > >> >everywhere? > >> > > >> > >> We can drop the constants defined in eeh.h and just use those defines > >> in vfio.h. But I prefer separate constants for some ovbious reasons > >> except the one I mentioned above :) > >> > >> - We already had separate sets of constants for EEH and VFIO. For example, > >> VFIO_EEH_PE_STATE_* in include/uapi/vfio.h always have counter part in > >> arch/powerpc/asm/include/eeh.h > > > >Oh.. I see your point. > > > >Yes, the EEH internal code shouldn't need to use the VFIO defined > >constants. > > > >But it's still silly to duplicate, I think the correct think would be > >to expose the EEH constants in uapi, and use those for VFIO as well. > > > > I'm not sure. If Alex.W agree on this, I can have one patch to address > it in future :-) I agree with David, there should be a uapi eeh header and the vfio documentation should define that as the valid set of functions. > >> - The UAPI interface is expected to be stable, or back-compatible when > >> introducing changes to it. The defines (at least their names) in eeh.h > >> doesn't have to be stable. > > > >But is there any reason not to make them stable. The VFIO case shows > >that we need to lock down an interface somewhere. I don't see that > >there's an advantage to having a not locked down interface in addition > >to a locked down one. > > > > I was saying that the UAPI interace should be stable. I don't see > there're any reasons the UAPI interface will be changed in near > future :) When I asked previously whether a translation layer was needed between the vfio uapi and kernel internal eeh api I was told that the values were defined by the spapr spec and therefore not changeable. If that's true, we shouldn't need to worry about exporting them via uapi. Thanks, Alex > >> - I would like keep the code between EEH/VFIO independent to each other > >> enough. > >> > >> Thanks, > >> Gavin > >> > >> >> By the way, Alex Williamson, could you please ACK this if you don't > >> >> find obvious errors in the patch. I really want this to be merged > >> >> to 4.1. > >> >> > >> >> Thanks, > >> >> Gavin > >> >> > >> >> > >> >> > >> > > >> > >> > > > >-- > >David Gibson | I'll have my music baroque, and my code > >david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ > > | _way_ _around_! > >http://www.ozlabs.org/~dgibson > > ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 16:14 ` Alex Williamson @ 2015-03-23 22:47 ` Gavin Shan 2015-03-24 1:22 ` David Gibson 0 siblings, 1 reply; 14+ messages in thread From: Gavin Shan @ 2015-03-23 22:47 UTC (permalink / raw) To: Alex Williamson; +Cc: kvm, aik, Gavin Shan, agraf, linuxppc-dev, David Gibson On Mon, Mar 23, 2015 at 10:14:59AM -0600, Alex Williamson wrote: >On Mon, 2015-03-23 at 16:20 +1100, Gavin Shan wrote: >> On Mon, Mar 23, 2015 at 04:10:20PM +1100, David Gibson wrote: >> >On Mon, Mar 23, 2015 at 04:03:59PM +1100, Gavin Shan wrote: >> >> On Mon, Mar 23, 2015 at 02:43:03PM +1100, David Gibson wrote: >> >> >On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: >> >> >> On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: >> >> >> >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: >> >> >> >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) >> >> >> >> to inject the specified EEH error, which is represented by >> >> >> >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. >> >> >> >> >> >> >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> >> >> >> > >> >> >> >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> >> >> >> > >> >> >> >> --- >> >> >> >> Documentation/vfio.txt | 12 ++++++++++++ >> >> >> >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ >> >> >> >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- >> >> >> >> 3 files changed, 57 insertions(+), 1 deletion(-) >> >> >> >> >> >> >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt >> >> >> >> index 96978ec..c6e11a3 100644 >> >> >> >> --- a/Documentation/vfio.txt >> >> >> >> +++ b/Documentation/vfio.txt >> >> >> >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: >> >> >> >> >> >> >> >> .... >> >> >> >> >> >> >> >> + /* Inject EEH error, which is expected to be caused by 32-bits >> >> >> >> + * config load. >> >> >> >> + */ >> >> >> >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; >> >> >> >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; >> >> >> >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; >> >> >> >> + pe_op.err.addr = 0ul; >> >> >> >> + pe_op.err.mask = 0ul; >> >> >> >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); >> >> >> >> + >> >> >> >> + .... >> >> >> >> + >> >> >> >> /* When 0xFF's returned from reading PCI config space or IO BARs >> >> >> >> * of the PCI device. Check the PE's state to see if that has been >> >> >> >> * frozen. >> >> >> >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c >> >> >> >> index 5fa42db..38edeb4 100644 >> >> >> >> --- a/drivers/vfio/vfio_spapr_eeh.c >> >> >> >> +++ b/drivers/vfio/vfio_spapr_eeh.c >> >> >> >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, >> >> >> >> case VFIO_EEH_PE_CONFIGURE: >> >> >> >> ret = eeh_pe_configure(pe); >> >> >> >> break; >> >> >> >> + case VFIO_EEH_PE_INJECT_ERR: >> >> >> >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); >> >> >> >> + if (op.argsz < minsz) >> >> >> >> + return -EINVAL; >> >> >> >> + if (copy_from_user(&op, (void __user *)arg, minsz)) >> >> >> >> + return -EFAULT; >> >> >> >> + >> >> >> >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, >> >> >> >> + op.err.addr, op.err.mask); >> >> >> >> + break; >> >> >> >> default: >> >> >> >> ret = -EINVAL; >> >> >> >> } >> >> >> >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h >> >> >> >> index 82889c3..f68e962 100644 >> >> >> >> --- a/include/uapi/linux/vfio.h >> >> >> >> +++ b/include/uapi/linux/vfio.h >> >> >> >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { >> >> >> >> * - unfreeze IO/DMA for frozen PE; >> >> >> >> * - read PE state; >> >> >> >> * - reset PE; >> >> >> >> - * - configure PE. >> >> >> >> + * - configure PE; >> >> >> >> + * - inject EEH error. >> >> >> >> */ >> >> >> >> +struct vfio_eeh_pe_err { >> >> >> >> + __u32 type; >> >> >> >> + __u32 func; >> >> >> >> + __u64 addr; >> >> >> >> + __u64 mask; >> >> >> >> +}; >> >> >> >> + >> >> >> >> struct vfio_eeh_pe_op { >> >> >> >> __u32 argsz; >> >> >> >> __u32 flags; >> >> >> >> __u32 op; >> >> >> >> + union { >> >> >> >> + struct vfio_eeh_pe_err err; >> >> >> >> + }; >> >> >> >> }; >> >> >> >> >> >> >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ >> >> >> >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { >> >> >> >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ >> >> >> >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ >> >> >> >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ >> >> >> >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ >> >> >> >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ >> >> >> >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 >> >> >> > >> >> >> >These all seem to duplicate (under slightly different names) the >> >> >> >defines you just added in arch/powerpc/include/asm/eeh.h. It would >> >> >> >make more sense to only have the uapi copy, surely. >> >> >> > >> >> >> >> >> >> David, Thanks for your time. The defines in eeh.h is used not only >> >> >> for mapping those constants defined in vfio.h, but also for validating >> >> >> arguments from debugfs entry "err_injct". So it'd better to keep >> >> >> them separate: one set is used by userspace, another set is used >> >> >> by host kernel. >> >> > >> >> >I don't understand why this is an argument for duplicating them. They >> >> >have the same meanings and values, why can't you use the uapi versions >> >> >everywhere? >> >> > >> >> >> >> We can drop the constants defined in eeh.h and just use those defines >> >> in vfio.h. But I prefer separate constants for some ovbious reasons >> >> except the one I mentioned above :) >> >> >> >> - We already had separate sets of constants for EEH and VFIO. For example, >> >> VFIO_EEH_PE_STATE_* in include/uapi/vfio.h always have counter part in >> >> arch/powerpc/asm/include/eeh.h >> > >> >Oh.. I see your point. >> > >> >Yes, the EEH internal code shouldn't need to use the VFIO defined >> >constants. >> > >> >But it's still silly to duplicate, I think the correct think would be >> >to expose the EEH constants in uapi, and use those for VFIO as well. >> > >> >> I'm not sure. If Alex.W agree on this, I can have one patch to address >> it in future :-) > >I agree with David, there should be a uapi eeh header and the vfio >documentation should define that as the valid set of functions. > >> >> - The UAPI interface is expected to be stable, or back-compatible when >> >> introducing changes to it. The defines (at least their names) in eeh.h >> >> doesn't have to be stable. >> > >> >But is there any reason not to make them stable. The VFIO case shows >> >that we need to lock down an interface somewhere. I don't see that >> >there's an advantage to having a not locked down interface in addition >> >to a locked down one. >> > >> >> I was saying that the UAPI interace should be stable. I don't see >> there're any reasons the UAPI interface will be changed in near >> future :) > >When I asked previously whether a translation layer was needed between >the vfio uapi and kernel internal eeh api I was told that the values >were defined by the spapr spec and therefore not changeable. If that's >true, we shouldn't need to worry about exporting them via uapi. Thanks, > Well. It sounds a good idea. I'll add one separate patch, which would be PATCH[3/3] in next revision to address it. Currently, there are not too much constants, which can be put into arch/powerpc/include/uapi/asm/eeh.h. The following list are the candidates from include/api/vfio.h, which can be moved to uapi/asm/eeh.h. The names of those constants will be changed from "VFIO_EEH_*" to "EEH_*" and then user space will have some code as folows to use them. That looks a bit weird. Do we really want moving those constants to uapi/asm/eeh.h and rename them accordingly? #include <stdio.h> #include <sys/ioctl.h> #include <eeh.h> #include <vfio.h> struct vfio_eeh_pe_op pe_op = { .argsz = sizeof(pe_op), .flags = 0 }; pe_op.op = VFIO_EEH_PE_INJECT_ERR; /* Its prefix is "VFIO_EEH_* */ pe_op.err.type = EEH_ERR_TYPE_32; /* It starts with "EEH_*" */ pe_op.err.func = EEH_ERR_FUNC_LD_CFG_ADDR; pe_op.err.addr = 0ul; pe_op.err.mask = 0ul; ioctl(container, VFIO_EEH_PE_OP, &pe_op); --- #define VFIO_EEH_PE_STATE_NORMAL 0 /* PE in functional state */ #define VFIO_EEH_PE_STATE_RESET 1 /* PE reset in progress */ #define VFIO_EEH_PE_STATE_STOPPED 2 /* Stopped DMA and IO */ #define VFIO_EEH_PE_STATE_STOPPED_DMA 4 /* Stopped DMA only */ #define VFIO_EEH_PE_STATE_UNAVAIL 5 /* State unavailable */ #define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ #define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ #define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ #define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 #define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ #define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 #define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ #define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 #define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ #define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 #define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ #define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 #define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ #define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 #define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ #define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 #define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 #define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 #define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ #define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 #define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 #define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 Thanks, Gavin >Alex > >> >> - I would like keep the code between EEH/VFIO independent to each other >> >> enough. >> >> >> >> Thanks, >> >> Gavin >> >> >> >> >> By the way, Alex Williamson, could you please ACK this if you don't >> >> >> find obvious errors in the patch. I really want this to be merged >> >> >> to 4.1. >> >> >> >> >> >> Thanks, >> >> >> Gavin >> >> >> >> >> >> >> >> >> >> >> > >> >> >> >> >> > >> >-- >> >David Gibson | I'll have my music baroque, and my code >> >david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ >> > | _way_ _around_! >> >http://www.ozlabs.org/~dgibson >> >> > > > >_______________________________________________ >Linuxppc-dev mailing list >Linuxppc-dev@lists.ozlabs.org >https://lists.ozlabs.org/listinfo/linuxppc-dev ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-23 22:47 ` Gavin Shan @ 2015-03-24 1:22 ` David Gibson 2015-03-24 2:41 ` Gavin Shan 0 siblings, 1 reply; 14+ messages in thread From: David Gibson @ 2015-03-24 1:22 UTC (permalink / raw) To: Gavin Shan; +Cc: aik, linuxppc-dev, Alex Williamson, agraf, kvm [-- Attachment #1: Type: text/plain, Size: 13107 bytes --] On Tue, Mar 24, 2015 at 09:47:54AM +1100, Gavin Shan wrote: > On Mon, Mar 23, 2015 at 10:14:59AM -0600, Alex Williamson wrote: > >On Mon, 2015-03-23 at 16:20 +1100, Gavin Shan wrote: > >> On Mon, Mar 23, 2015 at 04:10:20PM +1100, David Gibson wrote: > >> >On Mon, Mar 23, 2015 at 04:03:59PM +1100, Gavin Shan wrote: > >> >> On Mon, Mar 23, 2015 at 02:43:03PM +1100, David Gibson wrote: > >> >> >On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: > >> >> >> On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: > >> >> >> >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: > >> >> >> >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) > >> >> >> >> to inject the specified EEH error, which is represented by > >> >> >> >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. > >> >> >> >> > >> >> >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> > >> >> >> > > >> >> >> >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> > >> >> >> > > >> >> >> >> --- > >> >> >> >> Documentation/vfio.txt | 12 ++++++++++++ > >> >> >> >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ > >> >> >> >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- > >> >> >> >> 3 files changed, 57 insertions(+), 1 deletion(-) > >> >> >> >> > >> >> >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt > >> >> >> >> index 96978ec..c6e11a3 100644 > >> >> >> >> --- a/Documentation/vfio.txt > >> >> >> >> +++ b/Documentation/vfio.txt > >> >> >> >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: > >> >> >> >> > >> >> >> >> .... > >> >> >> >> > >> >> >> >> + /* Inject EEH error, which is expected to be caused by 32-bits > >> >> >> >> + * config load. > >> >> >> >> + */ > >> >> >> >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; > >> >> >> >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; > >> >> >> >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; > >> >> >> >> + pe_op.err.addr = 0ul; > >> >> >> >> + pe_op.err.mask = 0ul; > >> >> >> >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); > >> >> >> >> + > >> >> >> >> + .... > >> >> >> >> + > >> >> >> >> /* When 0xFF's returned from reading PCI config space or IO BARs > >> >> >> >> * of the PCI device. Check the PE's state to see if that has been > >> >> >> >> * frozen. > >> >> >> >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c > >> >> >> >> index 5fa42db..38edeb4 100644 > >> >> >> >> --- a/drivers/vfio/vfio_spapr_eeh.c > >> >> >> >> +++ b/drivers/vfio/vfio_spapr_eeh.c > >> >> >> >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, > >> >> >> >> case VFIO_EEH_PE_CONFIGURE: > >> >> >> >> ret = eeh_pe_configure(pe); > >> >> >> >> break; > >> >> >> >> + case VFIO_EEH_PE_INJECT_ERR: > >> >> >> >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); > >> >> >> >> + if (op.argsz < minsz) > >> >> >> >> + return -EINVAL; > >> >> >> >> + if (copy_from_user(&op, (void __user *)arg, minsz)) > >> >> >> >> + return -EFAULT; > >> >> >> >> + > >> >> >> >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, > >> >> >> >> + op.err.addr, op.err.mask); > >> >> >> >> + break; > >> >> >> >> default: > >> >> >> >> ret = -EINVAL; > >> >> >> >> } > >> >> >> >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h > >> >> >> >> index 82889c3..f68e962 100644 > >> >> >> >> --- a/include/uapi/linux/vfio.h > >> >> >> >> +++ b/include/uapi/linux/vfio.h > >> >> >> >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { > >> >> >> >> * - unfreeze IO/DMA for frozen PE; > >> >> >> >> * - read PE state; > >> >> >> >> * - reset PE; > >> >> >> >> - * - configure PE. > >> >> >> >> + * - configure PE; > >> >> >> >> + * - inject EEH error. > >> >> >> >> */ > >> >> >> >> +struct vfio_eeh_pe_err { > >> >> >> >> + __u32 type; > >> >> >> >> + __u32 func; > >> >> >> >> + __u64 addr; > >> >> >> >> + __u64 mask; > >> >> >> >> +}; > >> >> >> >> + > >> >> >> >> struct vfio_eeh_pe_op { > >> >> >> >> __u32 argsz; > >> >> >> >> __u32 flags; > >> >> >> >> __u32 op; > >> >> >> >> + union { > >> >> >> >> + struct vfio_eeh_pe_err err; > >> >> >> >> + }; > >> >> >> >> }; > >> >> >> >> > >> >> >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ > >> >> >> >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { > >> >> >> >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ > >> >> >> >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ > >> >> >> >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ > >> >> >> >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ > >> >> >> >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ > >> >> >> >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 > >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 > >> >> >> > > >> >> >> >These all seem to duplicate (under slightly different names) the > >> >> >> >defines you just added in arch/powerpc/include/asm/eeh.h. It would > >> >> >> >make more sense to only have the uapi copy, surely. > >> >> >> > > >> >> >> > >> >> >> David, Thanks for your time. The defines in eeh.h is used not only > >> >> >> for mapping those constants defined in vfio.h, but also for validating > >> >> >> arguments from debugfs entry "err_injct". So it'd better to keep > >> >> >> them separate: one set is used by userspace, another set is used > >> >> >> by host kernel. > >> >> > > >> >> >I don't understand why this is an argument for duplicating them. They > >> >> >have the same meanings and values, why can't you use the uapi versions > >> >> >everywhere? > >> >> > > >> >> > >> >> We can drop the constants defined in eeh.h and just use those defines > >> >> in vfio.h. But I prefer separate constants for some ovbious reasons > >> >> except the one I mentioned above :) > >> >> > >> >> - We already had separate sets of constants for EEH and VFIO. For example, > >> >> VFIO_EEH_PE_STATE_* in include/uapi/vfio.h always have counter part in > >> >> arch/powerpc/asm/include/eeh.h > >> > > >> >Oh.. I see your point. > >> > > >> >Yes, the EEH internal code shouldn't need to use the VFIO defined > >> >constants. > >> > > >> >But it's still silly to duplicate, I think the correct think would be > >> >to expose the EEH constants in uapi, and use those for VFIO as well. > >> > > >> > >> I'm not sure. If Alex.W agree on this, I can have one patch to address > >> it in future :-) > > > >I agree with David, there should be a uapi eeh header and the vfio > >documentation should define that as the valid set of functions. > > > >> >> - The UAPI interface is expected to be stable, or back-compatible when > >> >> introducing changes to it. The defines (at least their names) in eeh.h > >> >> doesn't have to be stable. > >> > > >> >But is there any reason not to make them stable. The VFIO case shows > >> >that we need to lock down an interface somewhere. I don't see that > >> >there's an advantage to having a not locked down interface in addition > >> >to a locked down one. > >> > > >> > >> I was saying that the UAPI interace should be stable. I don't see > >> there're any reasons the UAPI interface will be changed in near > >> future :) > > > >When I asked previously whether a translation layer was needed between > >the vfio uapi and kernel internal eeh api I was told that the values > >were defined by the spapr spec and therefore not changeable. If that's > >true, we shouldn't need to worry about exporting them via uapi. Thanks, > > > > Well. It sounds a good idea. I'll add one separate patch, which would be > PATCH[3/3] in next revision to address it. Currently, there are not > too much constants, which can be put into arch/powerpc/include/uapi/asm/eeh.h. > The following list are the candidates from include/api/vfio.h, which can be > moved to uapi/asm/eeh.h. The names of those constants will be changed from > "VFIO_EEH_*" to "EEH_*" and then user space will have some code as folows > to use them. That looks a bit weird. Do we really want moving those constants > to uapi/asm/eeh.h and rename them accordingly? I think so, yes. I think the patch series will be cleaner if you do the exporting of the EEH constants in 1/3 then build your other patches on top of that. > #include <stdio.h> > #include <sys/ioctl.h> > #include <eeh.h> > #include <vfio.h> > > struct vfio_eeh_pe_op pe_op = { .argsz = sizeof(pe_op), .flags = 0 }; > > pe_op.op = VFIO_EEH_PE_INJECT_ERR; /* Its prefix is "VFIO_EEH_* */ > pe_op.err.type = EEH_ERR_TYPE_32; /* It starts with "EEH_*" */ I think that's fine. > pe_op.err.func = EEH_ERR_FUNC_LD_CFG_ADDR; > pe_op.err.addr = 0ul; > pe_op.err.mask = 0ul; > ioctl(container, VFIO_EEH_PE_OP, &pe_op); > > --- > > #define VFIO_EEH_PE_STATE_NORMAL 0 /* PE in functional state */ > #define VFIO_EEH_PE_STATE_RESET 1 /* PE reset in progress */ > #define VFIO_EEH_PE_STATE_STOPPED 2 /* Stopped DMA and IO */ > #define VFIO_EEH_PE_STATE_STOPPED_DMA 4 /* Stopped DMA only */ > #define VFIO_EEH_PE_STATE_UNAVAIL 5 /* State unavailable */ > > #define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ > #define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ > #define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ > #define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 > #define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ > #define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 > #define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ > #define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 > #define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ > #define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 > #define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ > #define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 > #define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ > #define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 > #define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ > #define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 > #define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 > #define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 > #define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ > #define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 > #define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 > #define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 > > Thanks, > Gavin > > >Alex > > > >> >> - I would like keep the code between EEH/VFIO independent to each other > >> >> enough. > >> >> > >> >> Thanks, > >> >> Gavin > >> >> > >> >> >> By the way, Alex Williamson, could you please ACK this if you don't > >> >> >> find obvious errors in the patch. I really want this to be merged > >> >> >> to 4.1. > >> >> >> > >> >> >> Thanks, > >> >> >> Gavin > >> >> >> > >> >> >> > >> >> >> > >> >> > > >> >> > >> >> > >> > > >> > >> > > > > > > > >_______________________________________________ > >Linuxppc-dev mailing list > >Linuxppc-dev@lists.ozlabs.org > >https://lists.ozlabs.org/listinfo/linuxppc-dev > -- David Gibson | I'll have my music baroque, and my code david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ | _way_ _around_! http://www.ozlabs.org/~dgibson [-- Attachment #2: Type: application/pgp-signature, Size: 819 bytes --] ^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: [PATCH v3 2/2] drivers/vfio: Support EEH error injection 2015-03-24 1:22 ` David Gibson @ 2015-03-24 2:41 ` Gavin Shan 0 siblings, 0 replies; 14+ messages in thread From: Gavin Shan @ 2015-03-24 2:41 UTC (permalink / raw) To: David Gibson; +Cc: kvm, aik, Gavin Shan, agraf, linuxppc-dev, Alex Williamson On Tue, Mar 24, 2015 at 12:22:25PM +1100, David Gibson wrote: >On Tue, Mar 24, 2015 at 09:47:54AM +1100, Gavin Shan wrote: >> On Mon, Mar 23, 2015 at 10:14:59AM -0600, Alex Williamson wrote: >> >On Mon, 2015-03-23 at 16:20 +1100, Gavin Shan wrote: >> >> On Mon, Mar 23, 2015 at 04:10:20PM +1100, David Gibson wrote: >> >> >On Mon, Mar 23, 2015 at 04:03:59PM +1100, Gavin Shan wrote: >> >> >> On Mon, Mar 23, 2015 at 02:43:03PM +1100, David Gibson wrote: >> >> >> >On Mon, Mar 23, 2015 at 12:56:36PM +1100, Gavin Shan wrote: >> >> >> >> On Mon, Mar 23, 2015 at 12:39:45PM +1100, David Gibson wrote: >> >> >> >> >On Sat, Mar 21, 2015 at 06:58:45AM +1100, Gavin Shan wrote: >> >> >> >> >> The patch adds one more EEH sub-command (VFIO_EEH_PE_INJECT_ERR) >> >> >> >> >> to inject the specified EEH error, which is represented by >> >> >> >> >> (struct vfio_eeh_pe_err), to the indicated PE for testing purpose. >> >> >> >> >> >> >> >> >> >> Signed-off-by: Gavin Shan <gwshan@linux.vnet.ibm.com> >> >> >> >> > >> >> >> >> >Reviewed-by: David Gibson <david@gibson.dropbear.id.au> >> >> >> >> > >> >> >> >> >> --- >> >> >> >> >> Documentation/vfio.txt | 12 ++++++++++++ >> >> >> >> >> drivers/vfio/vfio_spapr_eeh.c | 10 ++++++++++ >> >> >> >> >> include/uapi/linux/vfio.h | 36 +++++++++++++++++++++++++++++++++++- >> >> >> >> >> 3 files changed, 57 insertions(+), 1 deletion(-) >> >> >> >> >> >> >> >> >> >> diff --git a/Documentation/vfio.txt b/Documentation/vfio.txt >> >> >> >> >> index 96978ec..c6e11a3 100644 >> >> >> >> >> --- a/Documentation/vfio.txt >> >> >> >> >> +++ b/Documentation/vfio.txt >> >> >> >> >> @@ -385,6 +385,18 @@ The code flow from the example above should be slightly changed: >> >> >> >> >> >> >> >> >> >> .... >> >> >> >> >> >> >> >> >> >> + /* Inject EEH error, which is expected to be caused by 32-bits >> >> >> >> >> + * config load. >> >> >> >> >> + */ >> >> >> >> >> + pe_op.op = VFIO_EEH_PE_INJECT_ERR; >> >> >> >> >> + pe_op.err.type = VFIO_EEH_ERR_TYPE_32; >> >> >> >> >> + pe_op.err.func = VFIO_EEH_ERR_FUNC_LD_CFG_ADDR; >> >> >> >> >> + pe_op.err.addr = 0ul; >> >> >> >> >> + pe_op.err.mask = 0ul; >> >> >> >> >> + ioctl(container, VFIO_EEH_PE_OP, &pe_op); >> >> >> >> >> + >> >> >> >> >> + .... >> >> >> >> >> + >> >> >> >> >> /* When 0xFF's returned from reading PCI config space or IO BARs >> >> >> >> >> * of the PCI device. Check the PE's state to see if that has been >> >> >> >> >> * frozen. >> >> >> >> >> diff --git a/drivers/vfio/vfio_spapr_eeh.c b/drivers/vfio/vfio_spapr_eeh.c >> >> >> >> >> index 5fa42db..38edeb4 100644 >> >> >> >> >> --- a/drivers/vfio/vfio_spapr_eeh.c >> >> >> >> >> +++ b/drivers/vfio/vfio_spapr_eeh.c >> >> >> >> >> @@ -85,6 +85,16 @@ long vfio_spapr_iommu_eeh_ioctl(struct iommu_group *group, >> >> >> >> >> case VFIO_EEH_PE_CONFIGURE: >> >> >> >> >> ret = eeh_pe_configure(pe); >> >> >> >> >> break; >> >> >> >> >> + case VFIO_EEH_PE_INJECT_ERR: >> >> >> >> >> + minsz = offsetofend(struct vfio_eeh_pe_op, err.mask); >> >> >> >> >> + if (op.argsz < minsz) >> >> >> >> >> + return -EINVAL; >> >> >> >> >> + if (copy_from_user(&op, (void __user *)arg, minsz)) >> >> >> >> >> + return -EFAULT; >> >> >> >> >> + >> >> >> >> >> + ret = eeh_pe_inject_err(pe, op.err.type, op.err.func, >> >> >> >> >> + op.err.addr, op.err.mask); >> >> >> >> >> + break; >> >> >> >> >> default: >> >> >> >> >> ret = -EINVAL; >> >> >> >> >> } >> >> >> >> >> diff --git a/include/uapi/linux/vfio.h b/include/uapi/linux/vfio.h >> >> >> >> >> index 82889c3..f68e962 100644 >> >> >> >> >> --- a/include/uapi/linux/vfio.h >> >> >> >> >> +++ b/include/uapi/linux/vfio.h >> >> >> >> >> @@ -468,12 +468,23 @@ struct vfio_iommu_spapr_tce_info { >> >> >> >> >> * - unfreeze IO/DMA for frozen PE; >> >> >> >> >> * - read PE state; >> >> >> >> >> * - reset PE; >> >> >> >> >> - * - configure PE. >> >> >> >> >> + * - configure PE; >> >> >> >> >> + * - inject EEH error. >> >> >> >> >> */ >> >> >> >> >> +struct vfio_eeh_pe_err { >> >> >> >> >> + __u32 type; >> >> >> >> >> + __u32 func; >> >> >> >> >> + __u64 addr; >> >> >> >> >> + __u64 mask; >> >> >> >> >> +}; >> >> >> >> >> + >> >> >> >> >> struct vfio_eeh_pe_op { >> >> >> >> >> __u32 argsz; >> >> >> >> >> __u32 flags; >> >> >> >> >> __u32 op; >> >> >> >> >> + union { >> >> >> >> >> + struct vfio_eeh_pe_err err; >> >> >> >> >> + }; >> >> >> >> >> }; >> >> >> >> >> >> >> >> >> >> #define VFIO_EEH_PE_DISABLE 0 /* Disable EEH functionality */ >> >> >> >> >> @@ -490,6 +501,29 @@ struct vfio_eeh_pe_op { >> >> >> >> >> #define VFIO_EEH_PE_RESET_HOT 6 /* Assert hot reset */ >> >> >> >> >> #define VFIO_EEH_PE_RESET_FUNDAMENTAL 7 /* Assert fundamental reset */ >> >> >> >> >> #define VFIO_EEH_PE_CONFIGURE 8 /* PE configuration */ >> >> >> >> >> +#define VFIO_EEH_PE_INJECT_ERR 9 /* Inject EEH error */ >> >> >> >> >> +#define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ >> >> >> >> >> +#define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 >> >> >> >> >> +#define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 >> >> >> >> > >> >> >> >> >These all seem to duplicate (under slightly different names) the >> >> >> >> >defines you just added in arch/powerpc/include/asm/eeh.h. It would >> >> >> >> >make more sense to only have the uapi copy, surely. >> >> >> >> > >> >> >> >> >> >> >> >> David, Thanks for your time. The defines in eeh.h is used not only >> >> >> >> for mapping those constants defined in vfio.h, but also for validating >> >> >> >> arguments from debugfs entry "err_injct". So it'd better to keep >> >> >> >> them separate: one set is used by userspace, another set is used >> >> >> >> by host kernel. >> >> >> > >> >> >> >I don't understand why this is an argument for duplicating them. They >> >> >> >have the same meanings and values, why can't you use the uapi versions >> >> >> >everywhere? >> >> >> > >> >> >> >> >> >> We can drop the constants defined in eeh.h and just use those defines >> >> >> in vfio.h. But I prefer separate constants for some ovbious reasons >> >> >> except the one I mentioned above :) >> >> >> >> >> >> - We already had separate sets of constants for EEH and VFIO. For example, >> >> >> VFIO_EEH_PE_STATE_* in include/uapi/vfio.h always have counter part in >> >> >> arch/powerpc/asm/include/eeh.h >> >> > >> >> >Oh.. I see your point. >> >> > >> >> >Yes, the EEH internal code shouldn't need to use the VFIO defined >> >> >constants. >> >> > >> >> >But it's still silly to duplicate, I think the correct think would be >> >> >to expose the EEH constants in uapi, and use those for VFIO as well. >> >> > >> >> >> >> I'm not sure. If Alex.W agree on this, I can have one patch to address >> >> it in future :-) >> > >> >I agree with David, there should be a uapi eeh header and the vfio >> >documentation should define that as the valid set of functions. >> > >> >> >> - The UAPI interface is expected to be stable, or back-compatible when >> >> >> introducing changes to it. The defines (at least their names) in eeh.h >> >> >> doesn't have to be stable. >> >> > >> >> >But is there any reason not to make them stable. The VFIO case shows >> >> >that we need to lock down an interface somewhere. I don't see that >> >> >there's an advantage to having a not locked down interface in addition >> >> >to a locked down one. >> >> > >> >> >> >> I was saying that the UAPI interace should be stable. I don't see >> >> there're any reasons the UAPI interface will be changed in near >> >> future :) >> > >> >When I asked previously whether a translation layer was needed between >> >the vfio uapi and kernel internal eeh api I was told that the values >> >were defined by the spapr spec and therefore not changeable. If that's >> >true, we shouldn't need to worry about exporting them via uapi. Thanks, >> > >> >> Well. It sounds a good idea. I'll add one separate patch, which would be >> PATCH[3/3] in next revision to address it. Currently, there are not >> too much constants, which can be put into arch/powerpc/include/uapi/asm/eeh.h. >> The following list are the candidates from include/api/vfio.h, which can be >> moved to uapi/asm/eeh.h. The names of those constants will be changed from >> "VFIO_EEH_*" to "EEH_*" and then user space will have some code as folows >> to use them. That looks a bit weird. Do we really want moving those constants >> to uapi/asm/eeh.h and rename them accordingly? > >I think so, yes. I think the patch series will be cleaner if you do >the exporting of the EEH constants in 1/3 then build your other >patches on top of that. > Thanks for confirm, David. I'll have one patch (1/3) to address this in next revision. Thanks, Gavin >> #include <stdio.h> >> #include <sys/ioctl.h> >> #include <eeh.h> >> #include <vfio.h> >> >> struct vfio_eeh_pe_op pe_op = { .argsz = sizeof(pe_op), .flags = 0 }; >> >> pe_op.op = VFIO_EEH_PE_INJECT_ERR; /* Its prefix is "VFIO_EEH_* */ >> pe_op.err.type = EEH_ERR_TYPE_32; /* It starts with "EEH_*" */ > >I think that's fine. > >> pe_op.err.func = EEH_ERR_FUNC_LD_CFG_ADDR; >> pe_op.err.addr = 0ul; >> pe_op.err.mask = 0ul; >> ioctl(container, VFIO_EEH_PE_OP, &pe_op); >> >> --- >> >> #define VFIO_EEH_PE_STATE_NORMAL 0 /* PE in functional state */ >> #define VFIO_EEH_PE_STATE_RESET 1 /* PE reset in progress */ >> #define VFIO_EEH_PE_STATE_STOPPED 2 /* Stopped DMA and IO */ >> #define VFIO_EEH_PE_STATE_STOPPED_DMA 4 /* Stopped DMA only */ >> #define VFIO_EEH_PE_STATE_UNAVAIL 5 /* State unavailable */ >> >> #define VFIO_EEH_ERR_TYPE_32 0 /* 32-bits EEH error type */ >> #define VFIO_EEH_ERR_TYPE_64 1 /* 64-bits EEH error type */ >> #define VFIO_EEH_ERR_FUNC_LD_MEM_ADDR 0 /* Memory load */ >> #define VFIO_EEH_ERR_FUNC_LD_MEM_DATA 1 >> #define VFIO_EEH_ERR_FUNC_LD_IO_ADDR 2 /* IO load */ >> #define VFIO_EEH_ERR_FUNC_LD_IO_DATA 3 >> #define VFIO_EEH_ERR_FUNC_LD_CFG_ADDR 4 /* Config load */ >> #define VFIO_EEH_ERR_FUNC_LD_CFG_DATA 5 >> #define VFIO_EEH_ERR_FUNC_ST_MEM_ADDR 6 /* Memory store */ >> #define VFIO_EEH_ERR_FUNC_ST_MEM_DATA 7 >> #define VFIO_EEH_ERR_FUNC_ST_IO_ADDR 8 /* IO store */ >> #define VFIO_EEH_ERR_FUNC_ST_IO_DATA 9 >> #define VFIO_EEH_ERR_FUNC_ST_CFG_ADDR 10 /* Config store */ >> #define VFIO_EEH_ERR_FUNC_ST_CFG_DATA 11 >> #define VFIO_EEH_ERR_FUNC_DMA_RD_ADDR 12 /* DMA read */ >> #define VFIO_EEH_ERR_FUNC_DMA_RD_DATA 13 >> #define VFIO_EEH_ERR_FUNC_DMA_RD_MASTER 14 >> #define VFIO_EEH_ERR_FUNC_DMA_RD_TARGET 15 >> #define VFIO_EEH_ERR_FUNC_DMA_WR_ADDR 16 /* DMA write */ >> #define VFIO_EEH_ERR_FUNC_DMA_WR_DATA 17 >> #define VFIO_EEH_ERR_FUNC_DMA_WR_MASTER 18 >> #define VFIO_EEH_ERR_FUNC_DMA_WR_TARGET 19 >> >> Thanks, >> Gavin >> >> >Alex >> > >> >> >> - I would like keep the code between EEH/VFIO independent to each other >> >> >> enough. >> >> >> >> >> >> Thanks, >> >> >> Gavin >> >> >> >> >> >> >> By the way, Alex Williamson, could you please ACK this if you don't >> >> >> >> find obvious errors in the patch. I really want this to be merged >> >> >> >> to 4.1. >> >> >> >> >> >> >> >> Thanks, >> >> >> >> Gavin >> >> >> >> >> >> >> >> >> >> >> >> >> >> >> > >> >> >> >> >> >> >> >> > >> >> >> >> >> > >> > >> > >> >_______________________________________________ >> >Linuxppc-dev mailing list >> >Linuxppc-dev@lists.ozlabs.org >> >https://lists.ozlabs.org/listinfo/linuxppc-dev >> > >-- >David Gibson | I'll have my music baroque, and my code >david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_ > | _way_ _around_! >http://www.ozlabs.org/~dgibson ^ permalink raw reply [flat|nested] 14+ messages in thread
end of thread, other threads:[~2015-03-24 2:42 UTC | newest] Thread overview: 14+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2015-03-20 19:58 [PATCH v3 0/2] EEH Error Injection Support for VFIO Devices Gavin Shan 2015-03-20 19:58 ` [PATCH v3 1/2] powerpc/eeh: Introduce eeh_pe_inject_err() Gavin Shan 2015-03-23 1:33 ` David Gibson 2015-03-20 19:58 ` [PATCH v3 2/2] drivers/vfio: Support EEH error injection Gavin Shan 2015-03-23 1:39 ` David Gibson 2015-03-23 1:56 ` Gavin Shan 2015-03-23 3:43 ` David Gibson 2015-03-23 5:03 ` Gavin Shan 2015-03-23 5:10 ` David Gibson 2015-03-23 5:20 ` Gavin Shan 2015-03-23 16:14 ` Alex Williamson 2015-03-23 22:47 ` Gavin Shan 2015-03-24 1:22 ` David Gibson 2015-03-24 2:41 ` Gavin Shan
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).