* [PATCH] Fix effect aborting in ff-memless @ 2016-06-19 12:44 Manuel Reimer 2016-06-20 17:33 ` Dmitry Torokhov 0 siblings, 1 reply; 8+ messages in thread From: Manuel Reimer @ 2016-06-19 12:44 UTC (permalink / raw) To: linux-input; +Cc: Cameron Gutman, jikos, dmitry.torokhov Hello, while debugging a problem with hid-sony I got stuck with a problem actually caused by ff-memless. In some situations effect aborting is delayed, so it may be triggered seconds after all devices have been destroyed, which causes the kernel to panic. The aborting request actually gets received here: https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L467 This "aborting" flag is then handled here: https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L376 But before this line is reached, there is a time check to check if the effect actually is due to be started: https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L359 This time check now causes a problem if the effect, which is meant to be *aborted* was scheduled to be *started* some time in future and the device is destroyed before this time is reached. My patch fixes this by setting the trigger times to "now" after setting the aborting flag. This way the following code deals with aborting the effect immediately without setting a timer for it. There may be other ways to fix this, so I would be happy to get some feedback. Signed-off-by: Manuel Reimer <mail@m-reimer.de> --- a/drivers/input/ff-memless.c 2016-05-13 16:06:29.722685021 +0200 +++ b/drivers/input/ff-memless.c 2016-06-19 14:25:39.790375270 +0200 @@ -463,9 +463,11 @@ static int ml_ff_playback(struct input_d } else { pr_debug("initiated stop\n"); - if (test_bit(FF_EFFECT_PLAYING, &state->flags)) + if (test_bit(FF_EFFECT_PLAYING, &state->flags)) { __set_bit(FF_EFFECT_ABORTING, &state->flags); - else + state->play_at = jiffies; + state->adj_at = jiffies; + } else __clear_bit(FF_EFFECT_STARTED, &state->flags); } ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] Fix effect aborting in ff-memless 2016-06-19 12:44 [PATCH] Fix effect aborting in ff-memless Manuel Reimer @ 2016-06-20 17:33 ` Dmitry Torokhov 2016-06-21 3:24 ` Dmitry Torokhov 2016-06-25 13:28 ` Manuel Reimer 0 siblings, 2 replies; 8+ messages in thread From: Dmitry Torokhov @ 2016-06-20 17:33 UTC (permalink / raw) To: Manuel Reimer; +Cc: linux-input, Cameron Gutman, jikos On Sun, Jun 19, 2016 at 02:44:35PM +0200, Manuel Reimer wrote: > Hello, > > while debugging a problem with hid-sony I got stuck with a problem > actually caused by ff-memless. In some situations effect aborting is > delayed, so it may be triggered seconds after all devices have been > destroyed, which causes the kernel to panic. > > The aborting request actually gets received here: > https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L467 > This "aborting" flag is then handled here: > https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L376 > But before this line is reached, there is a time check to check if > the effect actually is due to be started: > https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L359 > > This time check now causes a problem if the effect, which is meant > to be *aborted* was scheduled to be *started* some time in future > and the device is destroyed before this time is reached. I am not clear how this can happen. If effect hasn't actually started playing (i.e. we have FF_EFFECT_STARTED bit set, but FF_EFFECT_PLAYING is not yet set), then when stopping effect we do not need to do anything except clear FF_EFFECT_STARTED (since we did not touch the hardware yet). Now, if FF_EFFECT_PLAYING is set, that means that play_at time is in the past and we won't be skipping this effect in ml_get_combo_effect(). Could you please post a stack trace of the crash you observed? > > My patch fixes this by setting the trigger times to "now" after > setting the aborting flag. This way the following code deals with > aborting the effect immediately without setting a timer for it. > > There may be other ways to fix this, so I would be happy to get some > feedback. > > > Signed-off-by: Manuel Reimer <mail@m-reimer.de> > > --- a/drivers/input/ff-memless.c 2016-05-13 16:06:29.722685021 +0200 > +++ b/drivers/input/ff-memless.c 2016-06-19 14:25:39.790375270 +0200 > @@ -463,9 +463,11 @@ static int ml_ff_playback(struct input_d > } else { > pr_debug("initiated stop\n"); > > - if (test_bit(FF_EFFECT_PLAYING, &state->flags)) > + if (test_bit(FF_EFFECT_PLAYING, &state->flags)) { > __set_bit(FF_EFFECT_ABORTING, &state->flags); > - else > + state->play_at = jiffies; > + state->adj_at = jiffies; > + } else > __clear_bit(FF_EFFECT_STARTED, &state->flags); > } > Thanks. -- Dmitry ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] Fix effect aborting in ff-memless 2016-06-20 17:33 ` Dmitry Torokhov @ 2016-06-21 3:24 ` Dmitry Torokhov 2016-06-25 12:23 ` Manuel Reimer 2016-06-25 13:28 ` Manuel Reimer 1 sibling, 1 reply; 8+ messages in thread From: Dmitry Torokhov @ 2016-06-21 3:24 UTC (permalink / raw) To: Manuel Reimer; +Cc: linux-input, Cameron Gutman, jikos On Mon, Jun 20, 2016 at 10:33:03AM -0700, Dmitry Torokhov wrote: > On Sun, Jun 19, 2016 at 02:44:35PM +0200, Manuel Reimer wrote: > > Hello, > > > > while debugging a problem with hid-sony I got stuck with a problem > > actually caused by ff-memless. In some situations effect aborting is > > delayed, so it may be triggered seconds after all devices have been > > destroyed, which causes the kernel to panic. > > > > The aborting request actually gets received here: > > https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L467 > > This "aborting" flag is then handled here: > > https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L376 > > But before this line is reached, there is a time check to check if > > the effect actually is due to be started: > > https://github.com/torvalds/linux/blob/master/drivers/input/ff-memless.c#L359 > > > > This time check now causes a problem if the effect, which is meant > > to be *aborted* was scheduled to be *started* some time in future > > and the device is destroyed before this time is reached. > > I am not clear how this can happen. If effect hasn't actually started > playing (i.e. we have FF_EFFECT_STARTED bit set, but FF_EFFECT_PLAYING > is not yet set), then when stopping effect we do not need to do anything > except clear FF_EFFECT_STARTED (since we did not touch the hardware > yet). > > Now, if FF_EFFECT_PLAYING is set, that means that play_at time is in > the past and we won't be skipping this effect in ml_get_combo_effect(). > > Could you please post a stack trace of the crash you observed? Actually, I wonder if the patch below will fix the issue you are seeing. Thanks. -- Dmitry Input: fix flushing of FF effects when unregistering devices From: Dmitry Torokhov <dmitry.torokhov@gmail.com> Signed-off-by: Dmitry Torokhov <dmitry.torokhov@gmail.com> --- drivers/input/input.c | 10 +++++++--- include/linux/input.h | 8 ++++++-- 2 files changed, 13 insertions(+), 5 deletions(-) diff --git a/drivers/input/input.c b/drivers/input/input.c index d95c34e..220aa17 100644 --- a/drivers/input/input.c +++ b/drivers/input/input.c @@ -144,7 +144,7 @@ static void input_pass_values(struct input_dev *dev, count = input_to_handler(handle, vals, count); } else { list_for_each_entry_rcu(handle, &dev->h_list, d_node) - if (handle->open) { + if (handle->active) { count = input_to_handler(handle, vals, count); if (!count) break; @@ -552,7 +552,7 @@ static void __input_release_device(struct input_handle *handle) synchronize_rcu(); list_for_each_entry(handle, &dev->h_list, d_node) - if (handle->open && handle->handler->start) + if (handle->active && handle->handler->start) handle->handler->start(handle); } } @@ -613,6 +613,8 @@ int input_open_device(struct input_handle *handle) } } + handle->active = handle->open > 0; + out: mutex_unlock(&dev->mutex); return retval; @@ -663,6 +665,8 @@ void input_close_device(struct input_handle *handle) synchronize_rcu(); } + handle->active = handle->open > 0; + mutex_unlock(&dev->mutex); } EXPORT_SYMBOL(input_close_device); @@ -716,7 +720,7 @@ static void input_disconnect_device(struct input_dev *dev) input_dev_release_keys(dev); list_for_each_entry(handle, &dev->h_list, d_node) - handle->open = 0; + handle->active = false; spin_unlock_irq(&dev->event_lock); } diff --git a/include/linux/input.h b/include/linux/input.h index 1e96769..87ec2f7 100644 --- a/include/linux/input.h +++ b/include/linux/input.h @@ -307,8 +307,10 @@ struct input_handler { /** * struct input_handle - links input device with an input handler * @private: handler-specific data - * @open: counter showing whether the handle is 'open', i.e. should deliver - * events from its device + * @open: counter showing whether the handle is 'open', i.e. there is + * a consumer for the events from input device + * @active: indicates that input events should be delivered from input + * device to input handler through this handle * @name: name given to the handle by handler that created it * @dev: input device the handle is attached to * @handler: handler that works with the device through this handle @@ -321,6 +323,8 @@ struct input_handle { void *private; int open; + bool active; + const char *name; struct input_dev *dev; ^ permalink raw reply related [flat|nested] 8+ messages in thread
* Re: [PATCH] Fix effect aborting in ff-memless 2016-06-21 3:24 ` Dmitry Torokhov @ 2016-06-25 12:23 ` Manuel Reimer 2016-06-25 15:31 ` Dmitry Torokhov 0 siblings, 1 reply; 8+ messages in thread From: Manuel Reimer @ 2016-06-25 12:23 UTC (permalink / raw) To: Dmitry Torokhov; +Cc: linux-input, Cameron Gutman, jikos On 06/21/2016 05:24 AM, Dmitry Torokhov wrote: > Actually, I wonder if the patch below will fix the issue you are seeing. At first: Thank you for helping with fixing this issue. And sorry for the late answer. I did a complete kernel rebuild with your patch and now I can say, that your patch doesn't make any difference. The problem still exists. Manuel ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] Fix effect aborting in ff-memless 2016-06-25 12:23 ` Manuel Reimer @ 2016-06-25 15:31 ` Dmitry Torokhov 0 siblings, 0 replies; 8+ messages in thread From: Dmitry Torokhov @ 2016-06-25 15:31 UTC (permalink / raw) To: Manuel Reimer; +Cc: linux-input, Cameron Gutman, jikos On Sat, Jun 25, 2016 at 02:23:44PM +0200, Manuel Reimer wrote: > On 06/21/2016 05:24 AM, Dmitry Torokhov wrote: > >Actually, I wonder if the patch below will fix the issue you are seeing. > > At first: Thank you for helping with fixing this issue. > > And sorry for the late answer. > > I did a complete kernel rebuild with your patch and now I can say, > that your patch doesn't make any difference. The problem still > exists. I see. I suspect my patch is still needed to ensure that dev->close() is called when we forcibly remove input device but I'll need to experiment a bit with it. Thanks. -- Dmitry ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] Fix effect aborting in ff-memless 2016-06-20 17:33 ` Dmitry Torokhov 2016-06-21 3:24 ` Dmitry Torokhov @ 2016-06-25 13:28 ` Manuel Reimer 2016-06-25 15:29 ` Dmitry Torokhov 1 sibling, 1 reply; 8+ messages in thread From: Manuel Reimer @ 2016-06-25 13:28 UTC (permalink / raw) To: Dmitry Torokhov; +Cc: linux-input, Cameron Gutman, jikos On 06/20/2016 07:33 PM, Dmitry Torokhov wrote: > I am not clear how this can happen. If effect hasn't actually started > playing (i.e. we have FF_EFFECT_STARTED bit set, but FF_EFFECT_PLAYING > is not yet set), then when stopping effect we do not need to do anything > except clear FF_EFFECT_STARTED (since we did not touch the hardware > yet). That's true, but I think my crash works a bit different. What I'm doing is "hammering" the code with "start playback" events. Maybe not common in normal use, but it shouldn't be possible to crash the kernel with something like this. For the crash to happen, the uploaded effect has to have a replay delay of some seconds. With this, what is actually happening is: - The first playback request is accepted in ml_ff_playback (nonzero value to ml_ff_playback). --> FF_EFFECT_STARTED is set, FF_EFFECT_PLAYING not set --> play_at is in the future - Some time later (play_at reached) the effect actually is started --> Now both, FF_EFFECT_STARTED and FF_EFFECT_PLAYING are set --> play_at is in the past - Now a new playback request for the same effect is accepted in ml_ff_playback --> Still both, FF_EFFECT_STARTED and FF_EFFECT_PLAYING set --> play_at is now in the future!!! => That's the time where the USB plug is pulled => Kernel is trying to stop possible running effects Current situation: - The effect is playing and hasn't stopped playing so far - The same effect is scheduled to be playing again (play_at in future) - Now we get a "stop playback" request (zero value to ml_ff_playback) --> FF_EFFECT_PLAYING still set --> FF_EFFECT_ABORTING will be set --> play_at still in the future - The "FF_EFFECT_ABORTING" request now isn't handled directly in ml_get_combo_effect, as it is filtered out "play_at in future". - A timer is set (as play_at is in future) - Some time later the aborting is triggered with all memory already freed in both, ff-memless and hid-sony, modules. With this knowledge, it now is *very* easy to reproduce this one: - Open fftest on the device - press "4" and "enter", immediately pre-enter a "4" after that - As soon as the controller starts to vibrate, press enter and disconnect the controller - Wait some seconds Crashes *every time* with the hid-sony module. Produces ugly output on dmesg with xpad module. Seems like some strings are read from freed memory in this case. My patch fixes this by setting play_at to "now", so the "FF_EFFECT_ABORTING" is handled immediately in this case. --> No crash Manuel ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] Fix effect aborting in ff-memless 2016-06-25 13:28 ` Manuel Reimer @ 2016-06-25 15:29 ` Dmitry Torokhov 2016-06-27 18:31 ` Anssi Hannula 0 siblings, 1 reply; 8+ messages in thread From: Dmitry Torokhov @ 2016-06-25 15:29 UTC (permalink / raw) To: Manuel Reimer, Anssi Hannula; +Cc: linux-input, Cameron Gutman, jikos On Sat, Jun 25, 2016 at 03:28:05PM +0200, Manuel Reimer wrote: > On 06/20/2016 07:33 PM, Dmitry Torokhov wrote: > >I am not clear how this can happen. If effect hasn't actually started > >playing (i.e. we have FF_EFFECT_STARTED bit set, but FF_EFFECT_PLAYING > >is not yet set), then when stopping effect we do not need to do anything > >except clear FF_EFFECT_STARTED (since we did not touch the hardware > >yet). > > That's true, but I think my crash works a bit different. > > What I'm doing is "hammering" the code with "start playback" events. > Maybe not common in normal use, but it shouldn't be possible to > crash the kernel with something like this. > > > For the crash to happen, the uploaded effect has to have a replay > delay of some seconds. With this, what is actually happening is: > > - The first playback request is accepted in ml_ff_playback > (nonzero value to ml_ff_playback). > --> FF_EFFECT_STARTED is set, FF_EFFECT_PLAYING not set > --> play_at is in the future > - Some time later (play_at reached) the effect actually is started > --> Now both, FF_EFFECT_STARTED and FF_EFFECT_PLAYING are set > --> play_at is in the past > - Now a new playback request for the same effect is accepted > in ml_ff_playback > --> Still both, FF_EFFECT_STARTED and FF_EFFECT_PLAYING set > --> play_at is now in the future!!! > > => That's the time where the USB plug is pulled > => Kernel is trying to stop possible running effects > Current situation: > - The effect is playing and hasn't stopped playing so far > - The same effect is scheduled to be playing again > (play_at in future) > > - Now we get a "stop playback" request (zero value to ml_ff_playback) > --> FF_EFFECT_PLAYING still set --> FF_EFFECT_ABORTING will be set > --> play_at still in the future > - The "FF_EFFECT_ABORTING" request now isn't handled directly in > ml_get_combo_effect, as it is filtered out "play_at in future". > - A timer is set (as play_at is in future) > - Some time later the aborting is triggered with all memory already > freed in both, ff-memless and hid-sony, modules. > > > > With this knowledge, it now is *very* easy to reproduce this one: > - Open fftest on the device > - press "4" and "enter", immediately pre-enter a "4" after that > - As soon as the controller starts to vibrate, press enter and > disconnect the controller > - Wait some seconds > > Crashes *every time* with the hid-sony module. > Produces ugly output on dmesg with xpad module. Seems like some > strings are read from freed memory in this case. > > My patch fixes this by setting play_at to "now", so the > "FF_EFFECT_ABORTING" is handled immediately in this case. --> No > crash I see. I however wonder what the proper behavior should be when we basically request to restart the effect. If start delay is 0 then new effect parameters would immediately be applied, so I guess it would be reasonable to say that if we apply new start_delay to the effect then currently playing instance should be stopped... Anssi, what is your take here? Thanks. -- Dmitry ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] Fix effect aborting in ff-memless 2016-06-25 15:29 ` Dmitry Torokhov @ 2016-06-27 18:31 ` Anssi Hannula 0 siblings, 0 replies; 8+ messages in thread From: Anssi Hannula @ 2016-06-27 18:31 UTC (permalink / raw) To: Dmitry Torokhov; +Cc: Manuel Reimer, linux-input, Cameron Gutman, jikos 25.06.2016, 18:29, Dmitry Torokhov kirjoitti: > On Sat, Jun 25, 2016 at 03:28:05PM +0200, Manuel Reimer wrote: >> On 06/20/2016 07:33 PM, Dmitry Torokhov wrote: >>> I am not clear how this can happen. If effect hasn't actually started >>> playing (i.e. we have FF_EFFECT_STARTED bit set, but FF_EFFECT_PLAYING >>> is not yet set), then when stopping effect we do not need to do anything >>> except clear FF_EFFECT_STARTED (since we did not touch the hardware >>> yet). >> >> That's true, but I think my crash works a bit different. >> >> What I'm doing is "hammering" the code with "start playback" events. >> Maybe not common in normal use, but it shouldn't be possible to >> crash the kernel with something like this. >> >> >> For the crash to happen, the uploaded effect has to have a replay >> delay of some seconds. With this, what is actually happening is: >> >> - The first playback request is accepted in ml_ff_playback >> (nonzero value to ml_ff_playback). >> --> FF_EFFECT_STARTED is set, FF_EFFECT_PLAYING not set >> --> play_at is in the future >> - Some time later (play_at reached) the effect actually is started >> --> Now both, FF_EFFECT_STARTED and FF_EFFECT_PLAYING are set >> --> play_at is in the past >> - Now a new playback request for the same effect is accepted >> in ml_ff_playback >> --> Still both, FF_EFFECT_STARTED and FF_EFFECT_PLAYING set >> --> play_at is now in the future!!! >> >> => That's the time where the USB plug is pulled >> => Kernel is trying to stop possible running effects >> Current situation: >> - The effect is playing and hasn't stopped playing so far >> - The same effect is scheduled to be playing again >> (play_at in future) >> >> - Now we get a "stop playback" request (zero value to ml_ff_playback) >> --> FF_EFFECT_PLAYING still set --> FF_EFFECT_ABORTING will be set >> --> play_at still in the future >> - The "FF_EFFECT_ABORTING" request now isn't handled directly in >> ml_get_combo_effect, as it is filtered out "play_at in future". >> - A timer is set (as play_at is in future) >> - Some time later the aborting is triggered with all memory already >> freed in both, ff-memless and hid-sony, modules. >> >> >> >> With this knowledge, it now is *very* easy to reproduce this one: >> - Open fftest on the device >> - press "4" and "enter", immediately pre-enter a "4" after that >> - As soon as the controller starts to vibrate, press enter and >> disconnect the controller >> - Wait some seconds >> >> Crashes *every time* with the hid-sony module. >> Produces ugly output on dmesg with xpad module. Seems like some >> strings are read from freed memory in this case. >> >> My patch fixes this by setting play_at to "now", so the >> "FF_EFFECT_ABORTING" is handled immediately in this case. --> No >> crash > > I see. I however wonder what the proper behavior should be when we > basically request to restart the effect. If start delay is 0 then new > effect parameters would immediately be applied, so I guess it would be > reasonable to say that if we apply new start_delay to the effect then > currently playing instance should be stopped... > > Anssi, what is your take here? I agree with you, starting a delayed effect should immediately stop the effect if it is already running. So I guess e.g. ml_ff_playback() should set FF_EFFECT_ABORTING in such a case, and ml_get_combo_effect() should process FF_EFFECT_ABORTING effects regardless of their ->play_at being in the future (the latter would fix the original issue too, AFAICS). I guess an easier-to-follow state bit set would be e.g. the following: FF_EFFECT_STARTED: always reflects started/stopped state FF_EFFECT_PLAYING: always reflects hardware state FF_EFFECT_DIRTY: effect requires immediate refresh (stop or modify) which would make it clearer where each is set/cleared as there is no more need for "delayed" handling of STARTED/PLAYING via ABORTING or clearing FF_EFFECT_PLAYING in various places to force an effect refresh. -- Anssi Hannula ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2016-06-27 18:56 UTC | newest] Thread overview: 8+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2016-06-19 12:44 [PATCH] Fix effect aborting in ff-memless Manuel Reimer 2016-06-20 17:33 ` Dmitry Torokhov 2016-06-21 3:24 ` Dmitry Torokhov 2016-06-25 12:23 ` Manuel Reimer 2016-06-25 15:31 ` Dmitry Torokhov 2016-06-25 13:28 ` Manuel Reimer 2016-06-25 15:29 ` Dmitry Torokhov 2016-06-27 18:31 ` Anssi Hannula
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).