* [PATCH v2 0/3] spi: A better solution for cros_ec_spi reliability
@ 2019-05-13 20:18 Douglas Anderson
2019-05-13 20:18 ` [PATCH v2 1/3] spi: Allow SPI devices to force transfers on a realtime thread Douglas Anderson
0 siblings, 1 reply; 2+ messages in thread
From: Douglas Anderson @ 2019-05-13 20:18 UTC (permalink / raw)
To: Mark Brown, Benson Leung, Enric Balletbo i Serra
Cc: linux-rockchip, drinkcat, Guenter Roeck, briannorris, mka,
Douglas Anderson, linux-kernel, linux-spi
This series is a much better solution for getting the Chrome OS EC to
talk reliably and replaces commit 37a186225a0c ("platform/chrome:
cros_ec_spi: Transfer messages at high priority").
Specifically note that even though the above commit made things
better, we still saw some failures.
The majority of these failures were because we were competing for time
with dm-crypt which also scheduled work on HIGHPRI workqueues. While
we can consider reverting the change that made dm-crypt run its work
at HIGHPRI, the argument in commit a1b89132dc4f ("dm crypt: use
WQ_HIGHPRI for the IO and crypt workqueues") is somewhat compelling.
It does make sense for IO to be scheduled at a priority that's higher
than the default user priority.
It turns out that we could also see problems because loop devices also
run at high priority. See the set_user_nice() in
loop_prepare_queue().
Looking in more detail, it can be seen that the high priority
workqueue isn't actually that high of a priority. It runs at MIN_NICE
which is _fairly_ high priority but still below all real time
priority. We can do better by using realtime priority. That makes
sense because cros_ec_spi actually needs to run quickly for
correctness. As I understand this is exactly what real time priority
is for. Note that there is a discussion going on about the dm-crypt
priority [1].
We also had other problems with the previous patch because sometimes
we'd end up on the SPI pumping thread and had our priority downgraded.
Both the competition with other high priority things and the priority
downgrading are fixed by this new series.
After this series I can run the following test on Chrome OS (which
mounts /var as stateful encrypted) with no errors:
dd if=/dev/zero of=/var/log/foo.txt bs=4M count=512&
while true; do
ectool version > /dev/null;
done
Special thanks to Guenter Roeck for pointing out the "realtime"
feature of the SPI framework so I didn't re-invent the wheel. I have
no idea how I missed it. :-/
Also note: if you want some history on investigation done here, feel
free to peruse the Chrome OS bug [2].
[1] https://lkml.kernel.org/r/CAD=FV=VOAjgdrvkK8YKPP-8zqwPpo39rA43JH2BCeYLB0UkgAQ@mail.gmail.com
[2] https://crbug.com/948742
Changes in v2:
- Now only force transfers to the thread for devices that want it.
- Squashed patch #1 and #2 together.
- Renamed variable to "force_rt_transfers".
Douglas Anderson (3):
spi: Allow SPI devices to force transfers on a realtime thread
platform/chrome: cros_ec_spi: Force transfers to realtime priority
Revert "platform/chrome: cros_ec_spi: Transfer messages at high
priority"
drivers/platform/chrome/cros_ec_spi.c | 81 +++------------------------
drivers/spi/spi.c | 49 +++++++++++++---
include/linux/spi/spi.h | 5 ++
3 files changed, 52 insertions(+), 83 deletions(-)
--
2.21.0.1020.gf2820cf01a-goog
^ permalink raw reply [flat|nested] 2+ messages in thread
* [PATCH v2 1/3] spi: Allow SPI devices to force transfers on a realtime thread
2019-05-13 20:18 [PATCH v2 0/3] spi: A better solution for cros_ec_spi reliability Douglas Anderson
@ 2019-05-13 20:18 ` Douglas Anderson
0 siblings, 0 replies; 2+ messages in thread
From: Douglas Anderson @ 2019-05-13 20:18 UTC (permalink / raw)
To: Mark Brown, Benson Leung, Enric Balletbo i Serra
Cc: linux-rockchip, drinkcat, Guenter Roeck, briannorris, mka,
Douglas Anderson, linux-kernel, linux-spi
For communication with some SPI devices it may be necessary (for
correctness) not to get interrupted during a transfer. One example is
the EC (Embedded Controller) on Chromebooks. The Chrome OS EC will
drop a transfer if more than ~8ms passes between the chip select being
asserted and the transfer finishing.
The best way to make transfers to devices like this succeed is to run
the transfers at realtime priority. ...but, since there is no easy
way in the kernel to just temporarily bump the priority of the current
thread the current best way to do this is to schedule work on another
thread and give that thread a boosted priority.
The SPI framework already has another thread and, in fact, that other
thread is already made realtime priority in some cases. Let's add an
ability for a SPI device driver to request that this thread always be
used and that it's realtime priority.
NOTE: Forcing all transfers to the message pumping thread will add
extra overhead to each transfer. On some systems this may lower your
overall bus throughput and may increase CPU utilization. You only
want to use this feature when it's important that a transfer not get
interrupt once started.
Signed-off-by: Douglas Anderson <dianders@chromium.org>
---
Another option is to change this patch to allow a SPI device driver to
request that the pumping thread be realtime but not to _force_ all
messages onto that thread. In the case of cros_ec this means a bunch
of code duplication (we need to create a thread to do basically the
same thing as the SPI core) but that would make sense if the SPI
framework doesn't expect other use cases like this and doesn't want to
carry the baggage in the SPI core.
Changes in v2:
- Now only force transfers to the thread for devices that want it.
- Squashed patch #1 and #2 together.
- Renamed variable to "force_rt_transfers".
drivers/spi/spi.c | 49 +++++++++++++++++++++++++++++++++--------
include/linux/spi/spi.h | 5 +++++
2 files changed, 45 insertions(+), 9 deletions(-)
diff --git a/drivers/spi/spi.c b/drivers/spi/spi.c
index 8eb7460dd744..911961bb230d 100644
--- a/drivers/spi/spi.c
+++ b/drivers/spi/spi.c
@@ -1217,6 +1217,7 @@ EXPORT_SYMBOL_GPL(spi_finalize_current_transfer);
*/
static void __spi_pump_messages(struct spi_controller *ctlr, bool in_kthread)
{
+ struct spi_message *msg;
unsigned long flags;
bool was_busy = false;
int ret;
@@ -1276,10 +1277,16 @@ static void __spi_pump_messages(struct spi_controller *ctlr, bool in_kthread)
return;
}
- /* Extract head of queue */
- ctlr->cur_msg =
- list_first_entry(&ctlr->queue, struct spi_message, queue);
+ /* If device for this message needs a realtime thread; queue it */
+ msg = list_first_entry(&ctlr->queue, struct spi_message, queue);
+ if (msg->spi->force_rt_transfers & !in_kthread) {
+ kthread_queue_work(&ctlr->kworker, &ctlr->pump_messages);
+ spin_unlock_irqrestore(&ctlr->queue_lock, flags);
+ return;
+ }
+ /* Extract head of queue */
+ ctlr->cur_msg = msg;
list_del_init(&ctlr->cur_msg->queue);
if (ctlr->busy)
was_busy = true;
@@ -1364,10 +1371,32 @@ static void spi_pump_messages(struct kthread_work *work)
__spi_pump_messages(ctlr, true);
}
-static int spi_init_queue(struct spi_controller *ctlr)
+/**
+ * spi_set_thread_rt - set the controller to pump at realtime priority
+ * @ctlr: controller to boost priority of
+ *
+ * This can be called because the controller requested realtime priority
+ * (by setting the ->rt value before calling spi_register_controller()) or
+ * because a device on the bus said that its transfers needed realtime
+ * priority.
+ *
+ * NOTE: at the moment if any device on a bus says it needs realtime then
+ * the thread will be at realtime priority for all transfers on that
+ * controller. If this eventually becomes a problem we may see if we can
+ * find a way to boost the priority only temporarily during relevant
+ * transfers.
+ */
+static void spi_set_thread_rt(struct spi_controller *ctlr)
{
struct sched_param param = { .sched_priority = MAX_RT_PRIO - 1 };
+ dev_info(&ctlr->dev,
+ "will run message pump with realtime priority\n");
+ sched_setscheduler(ctlr->kworker_task, SCHED_FIFO, ¶m);
+}
+
+static int spi_init_queue(struct spi_controller *ctlr)
+{
ctlr->running = false;
ctlr->busy = false;
@@ -1387,11 +1416,8 @@ static int spi_init_queue(struct spi_controller *ctlr)
* request and the scheduling of the message pump thread. Without this
* setting the message pump thread will remain at default priority.
*/
- if (ctlr->rt) {
- dev_info(&ctlr->dev,
- "will run message pump with realtime priority\n");
- sched_setscheduler(ctlr->kworker_task, SCHED_FIFO, ¶m);
- }
+ if (ctlr->rt)
+ spi_set_thread_rt(ctlr);
return 0;
}
@@ -2982,6 +3008,11 @@ int spi_setup(struct spi_device *spi)
spi_set_cs(spi, false);
+ if (spi->force_rt_transfers && !spi->controller->rt) {
+ spi->controller->rt = true;
+ spi_set_thread_rt(spi->controller);
+ }
+
dev_dbg(&spi->dev, "setup mode %d, %s%s%s%s%u bits/w, %u Hz max --> %d\n",
(int) (spi->mode & (SPI_CPOL | SPI_CPHA)),
(spi->mode & SPI_CS_HIGH) ? "cs_high, " : "",
diff --git a/include/linux/spi/spi.h b/include/linux/spi/spi.h
index 053abd22ad31..4d6ea3366480 100644
--- a/include/linux/spi/spi.h
+++ b/include/linux/spi/spi.h
@@ -109,6 +109,10 @@ void spi_statistics_add_transfer_stats(struct spi_statistics *stats,
* This may be changed by the device's driver, or left at the
* default (0) indicating protocol words are eight bit bytes.
* The spi_transfer.bits_per_word can override this for each transfer.
+ * @force_rt_transfers: Transfers for this device should always be done
+ * at realtime priority. NOTE that this might add some latency in
+ * starting the transfer (since we may need to context switch) but
+ * once the transfer starts it will be done at high priority.
* @irq: Negative, or the number passed to request_irq() to receive
* interrupts from this device.
* @controller_state: Controller's runtime state
@@ -143,6 +147,7 @@ struct spi_device {
u32 max_speed_hz;
u8 chip_select;
u8 bits_per_word;
+ bool force_rt_transfers;
u32 mode;
#define SPI_CPHA 0x01 /* clock phase */
#define SPI_CPOL 0x02 /* clock polarity */
--
2.21.0.1020.gf2820cf01a-goog
^ permalink raw reply related [flat|nested] 2+ messages in thread
end of thread, other threads:[~2019-05-13 20:18 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2019-05-13 20:18 [PATCH v2 0/3] spi: A better solution for cros_ec_spi reliability Douglas Anderson
2019-05-13 20:18 ` [PATCH v2 1/3] spi: Allow SPI devices to force transfers on a realtime thread Douglas Anderson
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).