* Re: OOps in 2.6.0-test4-mm3-1
[not found] ` <20030830014309.GA898@matchmail.com>
@ 2003-08-30 3:09 ` Andrew Morton
2003-08-30 17:00 ` Andrew Morton
2003-08-30 23:20 ` Mike Fedyk
0 siblings, 2 replies; 4+ messages in thread
From: Andrew Morton @ 2003-08-30 3:09 UTC (permalink / raw)
To: Mike Fedyk; +Cc: linux-kernel, linux-scsi
Mike Fedyk <mfedyk@matchmail.com> wrote:
>
> It's vanilla mm3-1 with this one patch added from Neil Brown. I don't think
> it has anything to do with it (it looks like a driver issue to me). But it
> can't hurt to mention it.
>
No, it is not an MD thing.
You need two patches. It's up to the scsi guys to decide if they're the
right way to go. I think they are.
Some drivers such as aha1542 and aic7xxx_old will call scsi_register() and
then, if some succeeding operations fails they will call scsi_unregister(),
without an intervening scsi_set_host().
This causes an oops in scsi_put_device(), because kobj->parent is NULL.
In other words, scsi_register() immediately followed by scsi_unregister()
is guaranteed to oops.
The patch makes scsi_host_dev_release() more robust against this usage
pattern.
drivers/scsi/hosts.c | 8 +++++++-
1 files changed, 7 insertions(+), 1 deletion(-)
diff -puN drivers/scsi/hosts.c~aha1542-oops-fix drivers/scsi/hosts.c
--- 25/drivers/scsi/hosts.c~aha1542-oops-fix 2003-08-29 19:48:37.000000000 -0700
+++ 25-akpm/drivers/scsi/hosts.c 2003-08-29 20:02:49.000000000 -0700
@@ -158,7 +158,13 @@ static void scsi_host_dev_release(struct
scsi_proc_hostdir_rm(shost->hostt);
scsi_destroy_command_freelist(shost);
- put_device(parent);
+ /*
+ * Some drivers (eg aha1542) do scsi_register()/scsi_unregister()
+ * during probing without performing a scsi_set_device() in between.
+ * In this case dev->parent is NULL.
+ */
+ if (parent)
+ put_device(parent);
kfree(shost);
}
_
and
scsi_unregister() unconditionally does list_del(&shost->sht_legacy_list).
But scsi_register() leaves that list_head uninitialised if scsi_host_alloc()
returned NULL.
In other words: scsi_unregister() is guaranteed to oops if scsi_host_alloc()
fails.
Fix it by initialising the list_head in scsi_register().
drivers/scsi/hosts.c | 6 +++++-
1 files changed, 5 insertions(+), 1 deletion(-)
diff -puN drivers/scsi/hosts.c~scsi_unregister-oops-fix drivers/scsi/hosts.c
--- 25/drivers/scsi/hosts.c~scsi_unregister-oops-fix 2003-08-29 20:02:53.000000000 -0700
+++ 25-akpm/drivers/scsi/hosts.c 2003-08-29 20:02:53.000000000 -0700
@@ -297,8 +297,12 @@ struct Scsi_Host *scsi_register(struct s
dump_stack();
}
- if (shost)
+ if (shost) {
list_add_tail(&shost->sht_legacy_list, &sht->legacy_hosts);
+ } else {
+ /* Do this to keep scsi_unregister() happy */
+ INIT_LIST_HEAD(&shost->sht_legacy_list);
+ }
return shost;
}
_
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OOps in 2.6.0-test4-mm3-1
2003-08-30 3:09 ` OOps in 2.6.0-test4-mm3-1 Andrew Morton
@ 2003-08-30 17:00 ` Andrew Morton
2003-08-30 23:20 ` Mike Fedyk
1 sibling, 0 replies; 4+ messages in thread
From: Andrew Morton @ 2003-08-30 17:00 UTC (permalink / raw)
To: linux-scsi
Andrew Morton <akpm@osdl.org> wrote:
>
> diff -puN drivers/scsi/hosts.c~scsi_unregister-oops-fix drivers/scsi/hosts.c
> --- 25/drivers/scsi/hosts.c~scsi_unregister-oops-fix 2003-08-29 20:02:53.000000000 -0700
> +++ 25-akpm/drivers/scsi/hosts.c 2003-08-29 20:02:53.000000000 -0700
> @@ -297,8 +297,12 @@ struct Scsi_Host *scsi_register(struct s
> dump_stack();
> }
>
> - if (shost)
> + if (shost) {
> list_add_tail(&shost->sht_legacy_list, &sht->legacy_hosts);
> + } else {
> + /* Do this to keep scsi_unregister() happy */
> + INIT_LIST_HEAD(&shost->sht_legacy_list);
> + }
> return shost;
> }
Manfred points out that this one is crap. The scsi_host_dev_release()
patch is sufficient.
But I did get an oops in a separate place with just that patch (doing
modprobe aha1542 with no hardware) butthat seems to have gone away now.
Sigh.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OOps in 2.6.0-test4-mm3-1
2003-08-30 3:09 ` OOps in 2.6.0-test4-mm3-1 Andrew Morton
2003-08-30 17:00 ` Andrew Morton
@ 2003-08-30 23:20 ` Mike Fedyk
2003-08-30 23:36 ` Andrew Morton
1 sibling, 1 reply; 4+ messages in thread
From: Mike Fedyk @ 2003-08-30 23:20 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, linux-scsi
On Fri, Aug 29, 2003 at 08:09:26PM -0700, Andrew Morton wrote:
> Mike Fedyk <mfedyk@matchmail.com> wrote:
> >
> > It's vanilla mm3-1 with this one patch added from Neil Brown. I don't think
> > it has anything to do with it (it looks like a driver issue to me). But it
> > can't hurt to mention it.
> Some drivers such as aha1542 and aic7xxx_old will call scsi_register() and
> then, if some succeeding operations fails they will call scsi_unregister(),
> without an intervening scsi_set_host().
>
> This causes an oops in scsi_put_device(), because kobj->parent is NULL.
>
> In other words, scsi_register() immediately followed by scsi_unregister()
> is guaranteed to oops.
>
> The patch makes scsi_host_dev_release() more robust against this usage
> pattern.
Ok, I'll give that patch a try. Though, is there any reason why
2.6.0-test2-mm1 doesn't oops too? (that was the previous kernel on that
machine)
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OOps in 2.6.0-test4-mm3-1
2003-08-30 23:20 ` Mike Fedyk
@ 2003-08-30 23:36 ` Andrew Morton
0 siblings, 0 replies; 4+ messages in thread
From: Andrew Morton @ 2003-08-30 23:36 UTC (permalink / raw)
To: Mike Fedyk; +Cc: linux-kernel, linux-scsi
Mike Fedyk <mfedyk@matchmail.com> wrote:
>
> Though, is there any reason why
> 2.6.0-test2-mm1 doesn't oops too?
Of course. drivers/scsi/hosts.c was changed and it broke.
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2003-08-30 23:36 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <20030828235649.61074690.akpm@osdl.org>
[not found] ` <20030830014309.GA898@matchmail.com>
2003-08-30 3:09 ` OOps in 2.6.0-test4-mm3-1 Andrew Morton
2003-08-30 17:00 ` Andrew Morton
2003-08-30 23:20 ` Mike Fedyk
2003-08-30 23:36 ` Andrew Morton
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox