public inbox for linux-scsi@vger.kernel.org
* Re: BUG in dma-mapping.h:218 // MESH SCSI driver not working
       [not found]           ` <4A7553E7.6030804@scarlet.be>
@ 2009-08-02 23:13             ` Benjamin Herrenschmidt
  2009-08-05  1:04               ` FUJITA Tomonori
  0 siblings, 1 reply; 3+ messages in thread
From: Benjamin Herrenschmidt @ 2009-08-02 23:13 UTC (permalink / raw)
  To: Stef Simoens; +Cc: linuxppc-dev, FUJITA Tomonori, James Bottomley, linux-scsi

On Sun, 2009-08-02 at 10:52 +0200, Stef Simoens wrote:
> Hey Benjamin,
> 
Thanks for the bisection. I'll have a look when I'm back from skiing :-)
In the meantime, maybe Fujita has an idea ?

Mesh is an old crappy piece of HW with an old driver full of dark
secrets that Paulus wrote eons ago, so I'd rather avoid cracking it
open :-)
 
Cheers,
Ben.

> Benjamin Herrenschmidt wrote:
> > On Thu, 2009-07-30 at 01:42 +0200, Stef Simoens wrote:
> >   
> > > What would be the best approach?
> > > - if the kernel boots, it's obviously 'good'
> > > - but what if the kernel hits the 'BUG', should I apply your patch
> > > then? If it doesn't work with your patch, would it be 'bad' then? 
> > >     
> A few reboots later...
> As you said, during my bisecting, at a certain moment I needed your
> patch (I booted, got a problem, patched the tree with your patch,
> rebooted, it worked).
> 
> Then, git says:
> f078727b250c2653fc9a564f15547c17ebac3f99 is first bad commit
> commit f078727b250c2653fc9a564f15547c17ebac3f99
> Author: FUJITA Tomonori <fujita.tomonori@lab.ntt.co.jp>
> Date:   Sun Dec 14 01:23:45 2008 +0900
> 
>     [SCSI] remove scsi_req_map_sg
>     
>     No one uses scsi_execute_async with data transfer now. We can
>     remove scsi_req_map_sg.
>     
>     Only scsi_eh_lock_door uses scsi_execute_async. scsi_eh_lock_door
>     doesn't handle sense and the callback. So we can remove
>     scsi_io_context too.
>     
>     Signed-off-by: FUJITA Tomonori <fujita.tomonori@lab.ntt.co.jp>
>     Signed-off-by: James Bottomley <James.Bottomley@HansenPartnership.com>
> 
> :040000 040000 c4621d866c1ee5fd8d30e44f702a4966b8ebdc44 3ffca800399e52ef12f1001721c0c7ff431efafd M    drivers
> :040000 040000 805c02c4ad3c63c45dffa18f413e92bfca99caf6 6fb58bb5fb19c4198fa7d626d6241086655c6307 M    include
> 
> At this point, the reason for the crash is different than in v2.6.30.
> I noted the following (I hope I wrote down the most important
> stuff):
> driver 'sd' needs update
> mesh: lost arbitration  
> sd 0:0:0:0 sda read CAPACITY failed
> sd ...
> sd 0:0:0:0 sdb read CAPACITY failed
> sd ...
> mice  
> sd ...
> mice: PS/2 ...
> TCP cubic ... 
> Initializing XFRM ...
> NET ... protocol 17
> XFS ...
> VFS : unable to mount root FS
> 
> If you want more input ... please let me know.
> -- 
> Kr,
> Stef Simoens
> plain text document attachment (git-bisect-log)
> git bisect start
> # good: [8e0ee43bc2c3e19db56a4adaa9a9b04ce885cd84] Linux 2.6.29
> git bisect good 8e0ee43bc2c3e19db56a4adaa9a9b04ce885cd84
> # bad: [577c9c456f0e1371cbade38eaf91ae8e8a308555] Linux 2.6.30-rc1
> git bisect bad 577c9c456f0e1371cbade38eaf91ae8e8a308555
> # bad: [5658ae9007490c18853fbf112f1b3516f5949e62] V4L/DVB (10342): gspca - stv06xx: Add ctrl caching to the vv6410.
> git bisect bad 5658ae9007490c18853fbf112f1b3516f5949e62
> # good: [08abe18af1f78ee80c3c3a5ac47c3e0ae0beadf6] Merge branch 'master' of /home/davem/src/GIT/linux-2.6/
> git bisect good 08abe18af1f78ee80c3c3a5ac47c3e0ae0beadf6
> # good: [6e15cf04860074ad032e88c306bea656bbdd0f22] Merge branch 'core/percpu' into percpu-cpumask-x86-for-linus-2
> git bisect good 6e15cf04860074ad032e88c306bea656bbdd0f22
> # bad: [eedf2c5296a8dfaaf9aec1a938c1d3bd73159a30] Merge git://git.kernel.org/pub/scm/linux/kernel/git/arjan/linux-2.6-async-for-30
> git bisect bad eedf2c5296a8dfaaf9aec1a938c1d3bd73159a30
> # good: [0870352bc6e0dee485c86a0c99dd60e7089c8917] Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/linville/wireless-2.6
> git bisect good 0870352bc6e0dee485c86a0c99dd60e7089c8917
> # good: [febb02bdfe5e2c6ceaa0a38d8b7afca3d98f415a] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/ieee1394/linux1394-2.6
> git bisect good febb02bdfe5e2c6ceaa0a38d8b7afca3d98f415a
> # bad: [21283916322f579a580e413652cdefbfa3ec676f] [SCSI] zfcp: remove undefined subtype for status read response
> git bisect bad 21283916322f579a580e413652cdefbfa3ec676f
> # good: [6864abd8b730435d6ae9cb061095229a5a85153f] [SCSI] osd: Kconfig file for in-tree builds
> git bisect good 6864abd8b730435d6ae9cb061095229a5a85153f
> # bad: [b3f1f9aa082b2ab86dec4db3d8b1566af345387e] [SCSI] ses: code_set == 1 is tested twice
> git bisect bad b3f1f9aa082b2ab86dec4db3d8b1566af345387e
> # bad: [ea41e41588c248ee8b8162869c1e1c0565a4b3f6] [SCSI] scsi_dh_rdac: Retry for Quiescence in Progress in rdac device handler
> git bisect bad ea41e41588c248ee8b8162869c1e1c0565a4b3f6
> # bad: [f078727b250c2653fc9a564f15547c17ebac3f99] [SCSI] remove scsi_req_map_sg
> git bisect bad f078727b250c2653fc9a564f15547c17ebac3f99
> # good: [78a42ce8fb2604c459e9ebb2a4f2d546b8250111] [SCSI] osst: make all the buffer the same size
> git bisect good 78a42ce8fb2604c459e9ebb2a4f2d546b8250111
> # good: [26243043f207b3faa00594a33e10b2103205f27b] [SCSI] osst: replace scsi_execute_async with the block layer API
> git bisect good 26243043f207b3faa00594a33e10b2103205f27b



* Re: BUG in dma-mapping.h:218 // MESH SCSI driver not working
  2009-08-02 23:13             ` BUG in dma-mapping.h:218 // MESH SCSI driver not working Benjamin Herrenschmidt
@ 2009-08-05  1:04               ` FUJITA Tomonori
  2009-08-05  1:11                 ` Benjamin Herrenschmidt
  0 siblings, 1 reply; 3+ messages in thread
From: FUJITA Tomonori @ 2009-08-05  1:04 UTC (permalink / raw)
  To: benh
  Cc: fujita.tomonori, James.Bottomley, linuxppc-dev, linux-scsi,
	stef.simoens

On Mon, 03 Aug 2009 09:13:37 +1000
Benjamin Herrenschmidt <benh@kernel.crashing.org> wrote:

> On Sun, 2009-08-02 at 10:52 +0200, Stef Simoens wrote:
> > Hey Benjamin,
> > 
> Thanks for the bisection. I'll have a look when I'm back from skiing :-)
> In the meantime, maybe Fujita has an idea ?

The commit just removes the unused functions. So I'm not sure how this
patch could cause any regression.

It looks like READ CAPACITY fails. We use a kmalloc'ed buffer for READ
CAPACITY, so I'm not sure about the alignment issue that you mentioned
earlier in this thread.

You said your machine with a MESH appears to work. Did you confirm it?


> Mesh is an old crappy piece of HW with an old driver full of dark
> secrets that Paulus wrote eons ago, so I'd rather avoid cracking it
> open :-)
>  
> Cheers,
> Ben.
> 


* Re: BUG in dma-mapping.h:218 // MESH SCSI driver not working
  2009-08-05  1:04               ` FUJITA Tomonori
@ 2009-08-05  1:11                 ` Benjamin Herrenschmidt
  0 siblings, 0 replies; 3+ messages in thread
From: Benjamin Herrenschmidt @ 2009-08-05  1:11 UTC (permalink / raw)
  To: FUJITA Tomonori; +Cc: James.Bottomley, linuxppc-dev, linux-scsi, stef.simoens

On Wed, 2009-08-05 at 10:04 +0900, FUJITA Tomonori wrote:
> 
> It looks like READ CAPACITY fails. We use a kmalloc'ed buffer for READ
> CAPACITY so I'm not sure about an alignment issue that you mentioned
> in this thread earlier.
> 
> You said your machine with a MESH appears to work. Did you confirm it?
> 
Not yet. It's a fishy machine that needs other patches to get back to
working condition, and I haven't had time yet (everybody's sick at home, so
I've mostly been out of the office, and the machine is there).

I'm pretty sure the MESH will have issues though if the DMA buffers
aren't aligned to at least 16 (or maybe 32) bytes. I don't think it's
a cache alignment issue; I suspect it's an issue with the DBDMA engine
queue on those chips (though it -could- be cache coherency bugs too,
you never know with those old Apple home-made chipsets).

I remember we had problems in the past with IDENTIFY iirc, which would
work normally as kmalloc() would return something cache line aligned...
until one enabled SLAB debugging.

Cheers,
Ben.
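
The alignment concern raised in the message above can be illustrated with a
small userspace sketch. This is not kernel code and not the MESH driver's
logic; it only shows how one might check whether a buffer address meets an
assumed alignment, and how to obtain a buffer that is guaranteed to. The
32-byte figure and the names DMA_ALIGN_ASSUMED, is_aligned, round_up_pow2 and
alloc_dma_like are hypothetical; the thread itself only says "16 (or maybe
32)".

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Assumed requirement for illustration; the real value is not confirmed. */
#define DMA_ALIGN_ASSUMED ((size_t)32)

/* True if ptr is a multiple of align; align must be a power of two. */
bool is_aligned(const void *ptr, size_t align)
{
    assert(align != 0 && (align & (align - 1)) == 0);
    return ((uintptr_t)ptr & (align - 1)) == 0;
}

/* Round len up to the next multiple of align (a power of two). */
size_t round_up_pow2(size_t len, size_t align)
{
    return (len + align - 1) & ~(align - 1);
}

/*
 * Allocate a buffer whose start address is aligned to DMA_ALIGN_ASSUMED.
 * The size is rounded up because C11 aligned_alloc() expects the size to
 * be a multiple of the alignment. Release the result with free().
 */
void *alloc_dma_like(size_t len)
{
    return aligned_alloc(DMA_ALIGN_ASSUMED,
                         round_up_pow2(len, DMA_ALIGN_ASSUMED));
}
```

A small reply buffer (READ CAPACITY(10) returns 8 bytes) obtained from a
general-purpose allocator may or may not pass is_aligned(buf, 32), depending
on the allocator and its debugging options, which is the kind of variability
described above for kmalloc() with SLAB debugging enabled.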

