* Re: Problems writing to intel P3700 NVMe drive [not found] ` <alpine.LNX.2.00.1505282059400.15930@localhost.lm.intel.com> @ 2015-06-03 20:10 ` Keith Busch 2015-06-03 20:20 ` Bjorn Helgaas 0 siblings, 1 reply; 3+ messages in thread From: Keith Busch @ 2015-06-03 20:10 UTC (permalink / raw) To: Keith Busch, Bjorn Helgaas; +Cc: Pavilion Storage, linux-nvme, linux-pci +cc the pci list Hi Bjorn, A while back, there were a few proposals on changing the pci driver's default MPS tuning from the existing do-nothing policy to something safe so end-users don't need to remember kernel parameters as described below. Is this still active, or can we kick that back to life if not? Thanks, Keith On Thu, 28 May 2015, Keith Busch wrote: > Perhaps not a desirable/permanent solution, but for everyone's benefit, > we can work-around the problem with kernel parameter: > > pci=pcie_bus_safe > > Using 'pcie_bus_perf' instead may also work, though your original > solution of just using x86 sounded perfectly reasonable to me. :) > > On Thu, 28 May 2015, Pavilion Storage wrote: >> Thanks to Keith Busch for the providing the clues. >> The problem was that the PCIe MPSS on the drive and the CPU root >> complex did not match and that caused a problem. Setting it to a >> common value fixed the problem. >> Kishore ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Problems writing to intel P3700 NVMe drive 2015-06-03 20:10 ` Problems writing to intel P3700 NVMe drive Keith Busch @ 2015-06-03 20:20 ` Bjorn Helgaas 2015-06-03 21:37 ` Keith Busch 0 siblings, 1 reply; 3+ messages in thread From: Bjorn Helgaas @ 2015-06-03 20:20 UTC (permalink / raw) To: Keith Busch; +Cc: Pavilion Storage, linux-nvme, linux-pci@vger.kernel.org On Wed, Jun 3, 2015 at 3:10 PM, Keith Busch <keith.busch@intel.com> wrote: > +cc the pci list > > Hi Bjorn, > > A while back, there were a few proposals on changing the pci driver's > default MPS tuning from the existing do-nothing policy to something > safe so end-users don't need to remember kernel parameters as described > below. Is this still active, or can we kick that back to life if not? I'd love to see some activity there. Somebody else asked me about this a few weeks ago, and I sent him this list of some of the things I think are wrong with the current situation: - MPS configuration is done in pcie_bus_configure_settings(). This is not arch-specific, but it is done by arch code, and only arm, powerpc, tile, and x86 call it. - MPS configuration should be done before a driver claims a device. This is currently wrong on arm. - I would rather have MPS configuration done somewhere in the pci_scan_single_device() path. That way it would be done on every arch and in every hotplug path. - I know MPS depends on more than just the individual device and its upstream bridge, so it's not as simple as some other properties. But if we have a correctly-configured tree, we should be able to figure out what needs to change if we add one more device to it. - I do not believe it is safe for drivers to manipulate MPS behind the back of the PCI core, so pcie_set_mps() should probably not be exported. 5f39e6705faa ("PCI: Disable MPS configuration by default") turned off most the tuning we used to do. I expect that if we turn it back on, we'll trip over a few issues. That in itself is not so much of a problem, but before we turn things back on, I want to have better infrastructure in place that makes it easier to diagnose and fix those issues. For example, I'd like to see just enough clues in dmesg to enable us to see what the kernel's doing and verify that it's correct. And I want it do be done cleanly on every arch for every device, with hot-added ones being handled the same as those present at boot. > On Thu, 28 May 2015, Keith Busch wrote: >> >> Perhaps not a desirable/permanent solution, but for everyone's benefit, >> we can work-around the problem with kernel parameter: >> >> pci=pcie_bus_safe >> >> Using 'pcie_bus_perf' instead may also work, though your original >> solution of just using x86 sounded perfectly reasonable to me. :) >> >> On Thu, 28 May 2015, Pavilion Storage wrote: >>> >>> Thanks to Keith Busch for the providing the clues. >>> The problem was that the PCIe MPSS on the drive and the CPU root >>> complex did not match and that caused a problem. Setting it to a >>> common value fixed the problem. >>> Kishore ^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Problems writing to intel P3700 NVMe drive 2015-06-03 20:20 ` Bjorn Helgaas @ 2015-06-03 21:37 ` Keith Busch 0 siblings, 0 replies; 3+ messages in thread From: Keith Busch @ 2015-06-03 21:37 UTC (permalink / raw) To: Bjorn Helgaas Cc: Keith Busch, Pavilion Storage, linux-nvme, linux-pci@vger.kernel.org On Wed, 3 Jun 2015, Bjorn Helgaas wrote: > On Wed, Jun 3, 2015 at 3:10 PM, Keith Busch <keith.busch@intel.com> wrote: >> A while back, there were a few proposals on changing the pci driver's >> default MPS tuning from the existing do-nothing policy to something >> safe so end-users don't need to remember kernel parameters as described >> below. Is this still active, or can we kick that back to life if not? > > I'd love to see some activity there. Somebody else asked me about > this a few weeks ago, and I sent him this list of some of the things I > think are wrong with the current situation: Thanks for the update. I got in contact with some comrades already working on this, so we'll post an updated proposal when it's ready. This feature is getting more attention with broader hot plug pci-e storage usage (among other reasons), so I'm sure we'll give this its due consideration this time. ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2015-06-03 21:38 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <CAFMq3wEyk9PMbD8Cgn5trJyDjaOKpMzR-TNzbx0zXN=Mms573Q@mail.gmail.com>
[not found] ` <00c601d098cd$41255e30$c3701a90$@storageio.com>
[not found] ` <CAFMq3wE7PzKs=dYxxUhFsHeTkR0CT8mh6NVfrk-9mKxdisoDTg@mail.gmail.com>
[not found] ` <00cd01d098d1$94047010$bc0d5030$@storageio.com>
[not found] ` <CAFMq3wEa=kxznE77oFdg03QBHqszmH5-Vvmmm9pW1nkPUZi_Og@mail.gmail.com>
[not found] ` <CA+At7pWf6Ge3pc7=aeiqz=iWxtYJSPVdM_W2XW9pL4d22q432Q@mail.gmail.com>
[not found] ` <000a01d098ee$b52ce0c0$1f86a240$@storageio.com>
[not found] ` <CAFMq3wH67yMqWGcde77kiyWJ6_ieZjVyhc8L-oq1e_b4x5mS7w@mail.gmail.com>
[not found] ` <alpine.LNX.2.00.1505282059400.15930@localhost.lm.intel.com>
2015-06-03 20:10 ` Problems writing to intel P3700 NVMe drive Keith Busch
2015-06-03 20:20 ` Bjorn Helgaas
2015-06-03 21:37 ` Keith Busch
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox