linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Jiang Liu <liuj97@gmail.com>
To: "Rafael J. Wysocki" <rjw@sisk.pl>
Cc: linux-acpi@vger.kernel.org, Toshi Kani <toshi.kani@hp.com>,
	Hanjun Guo <guohanjun@huawei.com>,
	Vasilis Liaskovitis <vasilis.liaskovitis@profitbricks.com>,
	isimatu.yasuaki@jp.fujitsu.com, wency@cn.fujitsu.com,
	lenb@kernel.org, gregkh@linuxfoundation.org,
	linux-kernel@vger.kernel.org, linux-mm@kvack.org,
	Tang Chen <tangchen@cn.fujitsu.com>
Subject: Re: [RFC PATCH v3 0/3] acpi: Introduce prepare_remove device operation
Date: Fri, 07 Dec 2012 00:56:22 +0800	[thread overview]
Message-ID: <50C0CE36.1060009@gmail.com> (raw)
In-Reply-To: <75241306.UQIr1RW8Qh@vostro.rjw.lan>

On 11/29/2012 06:15 PM, Rafael J. Wysocki wrote:
> On Wednesday, November 28, 2012 11:41:36 AM Toshi Kani wrote:
>> On Wed, 2012-11-28 at 19:05 +0800, Hanjun Guo wrote:
>>> On 2012/11/24 1:50, Vasilis Liaskovitis wrote:
>>>> As discussed in https://patchwork.kernel.org/patch/1581581/
>>>> the driver core remove function needs to always succeed. This means we need
>>>> to know that the device can be successfully removed before acpi_bus_trim / 
>>>> acpi_bus_hot_remove_device are called. This can cause panics when OSPM-initiated
>>>> or SCI-initiated eject of memory devices fail e.g with:
>>>> echo 1 >/sys/bus/pci/devices/PNP0C80:XX/eject
>>>>
>>>> since the ACPI core goes ahead and ejects the device regardless of whether the
>>>> the memory is still in use or not.
>>>>
>>>> For this reason a new acpi_device operation called prepare_remove is introduced.
>>>> This operation should be registered for acpi devices whose removal (from kernel
>>>> perspective) can fail.  Memory devices fall in this category.
>>>>
>>>> acpi_bus_remove() is changed to handle removal in 2 steps:
>>>> - preparation for removal i.e. perform part of removal that can fail. Should
>>>>   succeed for device and all its children.
>>>> - if above step was successfull, proceed to actual device removal
>>>
>>> Hi Vasilis,
>>> We met the same problem when we doing computer node hotplug, It is a good idea
>>> to introduce prepare_remove before actual device removal.
>>>
>>> I think we could do more in prepare_remove, such as rollback. In most cases, we can
>>> offline most of memory sections except kernel used pages now, should we rollback
>>> and online the memory sections when prepare_remove failed ?
>>
>> I think hot-plug operation should have all-or-nothing semantics.  That
>> is, an operation should either complete successfully, or rollback to the
>> original state.
> 
> That's correct.
> 
>>> As you may know, the ACPI based hotplug framework we are working on already addressed
>>> this problem, and the way we slove this problem is a bit like yours.
>>>
>>> We introduce hp_ops in struct acpi_device_ops:
>>> struct acpi_device_ops {
>>> 	acpi_op_add add;
>>> 	acpi_op_remove remove;
>>> 	acpi_op_start start;
>>> 	acpi_op_bind bind;
>>> 	acpi_op_unbind unbind;
>>> 	acpi_op_notify notify;
>>> #ifdef	CONFIG_ACPI_HOTPLUG
>>> 	struct acpihp_dev_ops *hp_ops;
>>> #endif	/* CONFIG_ACPI_HOTPLUG */
>>> };
>>>
>>> in hp_ops, we divide the prepare_remove into six small steps, that is:
>>> 1) pre_release(): optional step to mark device going to be removed/busy
>>> 2) release(): reclaim device from running system
>>> 3) post_release(): rollback if cancelled by user or error happened
>>> 4) pre_unconfigure(): optional step to solve possible dependency issue
>>> 5) unconfigure(): remove devices from running system
>>> 6) post_unconfigure(): free resources used by devices
>>>
>>> In this way, we can easily rollback if error happens.
>>> How do you think of this solution, any suggestion ? I think we can achieve
>>> a better way for sharing ideas. :)
>>
>> Yes, sharing idea is good. :)  I do not know if we need all 6 steps (I
>> have not looked at all your changes yet..), but in my mind, a hot-plug
>> operation should be composed with the following 3 phases.
>>
>> 1. Validate phase - Verify if the request is a supported operation.  All
>> known restrictions are verified at this phase.  For instance, if a
>> hot-remove request involves kernel memory, it is failed in this phase.
>> Since this phase makes no change, no rollback is necessary to fail.  
> 
> Actually, we can't do it this way, because the conditions may change between
> the check and the execution.  So the first phase needs to involve execution
> to some extent, although only as far as it remains reversible.
Hi Rafael,
	A possible way to solve this issue is:
1) mark device busy
2) check condition and mark device as normal if condition check fails.
3) reclaim the device and mark device as normal if reclaim fails.
4) remove the device.

> 
>> 2. Execute phase - Perform hot-add / hot-remove operation that can be
>> rolled-back in case of error or cancel.
> 
> I would just merge 1 and 2.
> 
>> 3. Commit phase - Perform the final hot-add / hot-remove operation that
>> cannot be rolled-back.  No error / cancel is allowed in this phase.  For
>> instance, eject operation is performed at this phase.  
> 
> Yup.
> 
> Thanks,
> Rafael
> 
> 

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  parent reply	other threads:[~2012-12-06 16:56 UTC|newest]

Thread overview: 92+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-11-23 17:50 [RFC PATCH v3 0/3] acpi: Introduce prepare_remove device operation Vasilis Liaskovitis
2012-11-23 17:50 ` [RFC PATCH v3 1/3] acpi: Introduce prepare_remove operation in acpi_device_ops Vasilis Liaskovitis
2012-11-27  0:10   ` Toshi Kani
2012-11-27 18:36     ` Vasilis Liaskovitis
2012-11-27 23:18     ` Rafael J. Wysocki
2012-11-23 17:50 ` [RFC PATCH v3 2/3] acpi_memhotplug: Add prepare_remove operation Vasilis Liaskovitis
2012-11-24 16:23   ` Wen Congyang
2012-11-23 17:50 ` [RFC PATCH v3 3/3] acpi_memhotplug: Allow eject to proceed on rebind scenario Vasilis Liaskovitis
2012-11-24 16:20   ` Wen Congyang
2012-11-26  8:36     ` Vasilis Liaskovitis
2012-11-26  9:11       ` Wen Congyang
2012-11-27  0:19         ` Toshi Kani
2012-11-27 18:32           ` Vasilis Liaskovitis
2012-11-27 22:03             ` Toshi Kani
2012-11-27 23:41               ` Rafael J. Wysocki
2012-11-28 16:01                 ` Toshi Kani
2012-11-28 18:40                   ` Rafael J. Wysocki
2012-11-28 21:02                     ` Toshi Kani
2012-11-28 21:40                       ` Rafael J. Wysocki
2012-11-28 21:40                         ` Toshi Kani
2012-11-28 22:01                           ` Rafael J. Wysocki
2012-11-28 22:04                             ` Toshi Kani
2012-11-28 22:21                               ` Rafael J. Wysocki
2012-11-28 22:16                                 ` Toshi Kani
2012-11-28 22:39                                   ` Rafael J. Wysocki
2012-11-28 22:46                                     ` Greg KH
2012-11-28 23:05                                       ` Rafael J. Wysocki
2012-11-28 23:10                                         ` Greg KH
2012-11-28 23:31                                           ` Rafael J. Wysocki
2012-11-28 23:49                       ` Rafael J. Wysocki
2012-11-29  1:02                         ` Toshi Kani
2012-11-29  1:15                           ` Toshi Kani
2012-11-29 10:03                             ` Rafael J. Wysocki
2012-11-29 11:30                               ` Vasilis Liaskovitis
2012-11-29 16:57                                 ` Rafael J. Wysocki
2012-11-29 17:56                                 ` Toshi Kani
2012-11-29 20:25                                   ` Rafael J. Wysocki
2012-11-29 20:38                                     ` Toshi Kani
2012-11-29 21:23                                       ` Rafael J. Wysocki
2012-11-29 21:46                                         ` Toshi Kani
2012-11-29 22:11                                           ` Rafael J. Wysocki
2012-11-29 23:17                                             ` Toshi Kani
2012-11-30  0:13                                               ` Rafael J. Wysocki
2012-11-30  1:09                                                 ` Toshi Kani
2012-11-29 16:43                               ` Toshi Kani
2012-11-29 11:04                             ` Vasilis Liaskovitis
2012-11-29 17:44                               ` Toshi Kani
2012-12-06  9:30                                 ` Vasilis Liaskovitis
2012-12-06 12:50                                   ` Rafael J. Wysocki
2012-12-06 15:41                                     ` Toshi Kani
2012-12-06 20:32                                       ` Rafael J. Wysocki
2012-11-28 11:05 ` [RFC PATCH v3 0/3] acpi: Introduce prepare_remove device operation Hanjun Guo
2012-11-28 18:41   ` Toshi Kani
2012-11-29  4:48     ` Hanjun Guo
2012-11-29 22:27       ` Toshi Kani
2012-12-03  4:25         ` Hanjun Guo
2012-12-04  0:10           ` Toshi Kani
2012-12-04  9:16             ` Hanjun Guo
2012-12-04 23:23               ` Toshi Kani
2012-12-05 12:10                 ` Hanjun Guo
2012-12-05 22:31                   ` Toshi Kani
2012-12-06 16:47                 ` Jiang Liu
2012-12-07  2:25                   ` Toshi Kani
2012-12-06 16:40             ` Jiang Liu
2012-12-06 20:30               ` Rafael J. Wysocki
2012-12-07  2:57               ` Toshi Kani
2012-12-07  5:57                 ` Jiang Liu
2012-12-08  1:08                   ` Toshi Kani
2012-12-11 14:34                     ` Jiang Liu
2012-12-13 14:42                       ` Toshi Kani
2012-12-13 15:15                         ` Jiang Liu
2012-12-15  1:19                           ` Toshi Kani
2012-11-29 10:15     ` Rafael J. Wysocki
2012-11-29 11:36       ` Vasilis Liaskovitis
2012-12-06 16:59         ` Jiang Liu
2012-11-29 17:03       ` Toshi Kani
2012-11-29 20:30         ` Rafael J. Wysocki
2012-11-29 20:39           ` Toshi Kani
2012-11-29 20:56             ` Toshi Kani
2012-11-29 21:25               ` Rafael J. Wysocki
2012-12-06 17:10                 ` Jiang Liu
2012-12-06 17:07           ` Jiang Liu
2012-12-06 17:01         ` Jiang Liu
2012-12-06 16:56       ` Jiang Liu [this message]
2012-12-06 16:00     ` Jiang Liu
2012-12-06 16:03       ` Toshi Kani
2012-12-06 16:25         ` Jiang Liu
2012-12-06 16:31           ` Toshi Kani
2012-12-06 16:52             ` Jiang Liu
2012-12-06 17:09               ` Toshi Kani
2012-12-06 17:30                 ` Jiang Liu
2012-12-06 17:28                   ` Toshi Kani

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=50C0CE36.1060009@gmail.com \
    --to=liuj97@gmail.com \
    --cc=gregkh@linuxfoundation.org \
    --cc=guohanjun@huawei.com \
    --cc=isimatu.yasuaki@jp.fujitsu.com \
    --cc=lenb@kernel.org \
    --cc=linux-acpi@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=rjw@sisk.pl \
    --cc=tangchen@cn.fujitsu.com \
    --cc=toshi.kani@hp.com \
    --cc=vasilis.liaskovitis@profitbricks.com \
    --cc=wency@cn.fujitsu.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).