From: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
To: Dan Williams <dan.j.williams@intel.com>
Cc: linux-scsi@vger.kernel.org, linux-ide@vger.kernel.org
Subject: Re: [libsas PATCH v12 04/11] sysfs: handle 'parent deleted before child added'
Date: Thu, 22 Mar 2012 07:39:59 -0700 [thread overview]
Message-ID: <20120322143959.GF19835@kroah.com> (raw)
In-Reply-To: <20120322063214.22036.77957.stgit@dwillia2-linux.jf.intel.com>
On Wed, Mar 21, 2012 at 11:32:14PM -0700, Dan Williams wrote:
> In scsi at least two cases of the parent device being deleted before the
> child is added have been observed.
>
> 1/ scsi is performing async scans and the device is removed prior to the
> async can thread running.
>
> 2/ libsas discovery event running after the parent port has been torn
> down.
>
> Result in crash signatures like:
> BUG: unable to handle kernel NULL pointer dereference at 0000000000000098
> IP: [<ffffffff8115e100>] sysfs_create_dir+0x32/0xb6
> ...
> Process scsi_scan_8 (pid: 5417, threadinfo ffff88080bd16000, task ffff880801b8a0b0)
> Stack:
> 00000000fffffffe ffff880813470628 ffff88080bd17cd0 ffff88080614b7e8
> ffff88080b45c108 00000000fffffffe ffff88080bd17d20 ffffffff8125e4a8
> ffff88080bd17cf0 ffffffff81075149 ffff88080bd17d30 ffff88080614b7e8
> Call Trace:
> [<ffffffff8125e4a8>] kobject_add_internal+0x120/0x1e3
> [<ffffffff81075149>] ? trace_hardirqs_on+0xd/0xf
> [<ffffffff8125e641>] kobject_add_varg+0x41/0x50
> [<ffffffff8125e70b>] kobject_add+0x64/0x66
> [<ffffffff8131122b>] device_add+0x12d/0x63a
>
> These scenarios need to be cleaned up, but in the meantime the system
> need not crash if this ordering occurs. Instead report:
>
> kobject_add_internal failed for target8:0:16 (error: -2 parent: end_device-8:0:24)
> Pid: 2942, comm: scsi_scan_8 Not tainted 3.3.0-rc7-isci+ #2
> Call Trace:
> [<ffffffff8125e551>] kobject_add_internal+0x1c1/0x1f3
> [<ffffffff81075149>] ? trace_hardirqs_on+0xd/0xf
> [<ffffffff8125e659>] kobject_add_varg+0x41/0x50
> [<ffffffff8125e723>] kobject_add+0x64/0x66
> [<ffffffff8131124b>] device_add+0x12d/0x63a
> [<ffffffff8125e0ef>] ? kobject_put+0x4c/0x50
> [<ffffffff8132f370>] scsi_sysfs_add_sdev+0x4e/0x28a
> [<ffffffff8132dce3>] do_scan_async+0x9c/0x145
>
> Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> ---
> fs/sysfs/dir.c | 3 +++
> lib/kobject.c | 7 ++++---
> 2 files changed, 7 insertions(+), 3 deletions(-)
>
> diff --git a/fs/sysfs/dir.c b/fs/sysfs/dir.c
> index 7fdf6a7..86521ee 100644
> --- a/fs/sysfs/dir.c
> +++ b/fs/sysfs/dir.c
> @@ -714,6 +714,9 @@ int sysfs_create_dir(struct kobject * kobj)
> else
> parent_sd = &sysfs_root;
>
> + if (!parent_sd)
> + return -ENOENT;
> +
> if (sysfs_ns_type(parent_sd))
> ns = kobj->ktype->namespace(kobj);
> type = sysfs_read_ns_type(kobj);
So what happens if this is true? Does this patch fix the oops? What
kernels should this be applied to where this problem has been seen?
> diff --git a/lib/kobject.c b/lib/kobject.c
> index c33d7a1..e5f86c0 100644
> --- a/lib/kobject.c
> +++ b/lib/kobject.c
> @@ -192,13 +192,14 @@ static int kobject_add_internal(struct kobject *kobj)
>
> /* be noisy on error issues */
> if (error == -EEXIST)
> - printk(KERN_ERR "%s failed for %s with "
> + pr_err("%s failed for %s with "
> "-EEXIST, don't try to register things with "
> "the same name in the same directory.\n",
> __func__, kobject_name(kobj));
> else
> - printk(KERN_ERR "%s failed for %s (%d)\n",
> - __func__, kobject_name(kobj), error);
> + pr_err("%s failed for %s (error: %d parent: %s)\n",
> + __func__, kobject_name(kobj), error,
> + parent ? kobject_name(parent) : "'none'");
> dump_stack();
> } else
> kobj->state_in_sysfs = 1;
These changes have nothing to do with the above fix, so why include them
here?
And note, I hate pr_err(), what's wrong with printk() in this instance?
greg k-h
next prev parent reply other threads:[~2012-03-22 14:39 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-03-22 6:31 [libsas PATCH v12 00/11] libsas eh, discovery, suspend, and fixes Dan Williams
2012-03-22 6:31 ` [libsas PATCH v12 01/11] libsas: cleanup spurious calls to scsi_schedule_eh Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 02/11] libsas: trim sas_task of slow path infrastructure Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 03/11] libata: reset once Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 04/11] sysfs: handle 'parent deleted before child added' Dan Williams
2012-03-22 14:39 ` Greg Kroah-Hartman [this message]
2012-03-22 16:27 ` Williams, Dan J
2012-03-22 22:51 ` Stefan Richter
2012-03-22 23:11 ` Williams, Dan J
2012-03-22 23:26 ` Stefan Richter
2012-03-23 18:43 ` Dan Williams
2012-03-23 20:54 ` Greg Kroah-Hartman
2012-03-23 21:15 ` Williams, Dan J
2012-03-22 14:47 ` James Bottomley
2012-03-22 16:34 ` Williams, Dan J
2012-03-22 6:32 ` [libsas PATCH v12 05/11] scsi_transport_sas: fix delete vs scan race Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 06/11] libsas: unify domain_device sas_rphy lifetimes Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 07/11] libsas: fix false positive 'device attached' conditions Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 08/11] libsas: fix ata_eh clobbering ex_phys via smp_ata_check_ready Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 09/11] libsas, libata: fix start of life for a sas ata_port Dan Williams
2012-03-22 6:32 ` [libsas PATCH v12 10/11] scsi, sd: limit the scope of the async probe domain Dan Williams
2012-03-22 14:20 ` Alan Stern
2012-03-22 19:09 ` Williams, Dan J
2012-03-22 6:32 ` [libsas PATCH v12 11/11] libsas: suspend / resume support Dan Williams
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120322143959.GF19835@kroah.com \
--to=gregkh@linuxfoundation.org \
--cc=dan.j.williams@intel.com \
--cc=linux-ide@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.