MO-Drive under 2.4.3

public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed

* MO-Drive under 2.4.3
@ 2001-04-13  8:47 Detlev Offenbach
  2001-04-13 13:08 ` Alan Cox
  0 siblings, 1 reply; 10+ messages in thread
From: Detlev Offenbach @ 2001-04-13  8:47 UTC (permalink / raw)
  To: linux-kernel

Hi out there,

I have a problem using my MO-Drive under kernel 2.4.3. I have several disks 
formated with a VFAT filesystem. Under kernel 2.2.19 everything works fine. 
Under kernel 2.4.3 I cannot write anything to the disk without hanging the 
complete system so that I have to use the reset button. For disks with an 
ext2 filesystem it works okay.

Regards
Detlev
-- 
Detlev Offenbach
detlev@offenbach.fs.uunet.de

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-13  8:47 MO-Drive under 2.4.3 Detlev Offenbach
@ 2001-04-13 13:08 ` Alan Cox
  2001-04-14  9:45   ` Detlev Offenbach
  2001-04-14 13:00   ` Daniel Kobras
  0 siblings, 2 replies; 10+ messages in thread
From: Alan Cox @ 2001-04-13 13:08 UTC (permalink / raw)
  To: Detlev Offenbach; +Cc: linux-kernel

> I have a problem using my MO-Drive under kernel 2.4.3. I have several disks 
> formated with a VFAT filesystem. Under kernel 2.2.19 everything works fine. 
> Under kernel 2.4.3 I cannot write anything to the disk without hanging the 
> complete system so that I have to use the reset button. For disks with an 
> ext2 filesystem it works okay.

This is a bug in the scsi layer. linux-scsi@vger.kernel.org, not that any of
the scsi maintainers seem to care about it right now.




^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-13 13:08 ` Alan Cox
@ 2001-04-14  9:45   ` Detlev Offenbach
  2001-04-14 14:25     ` Alan Cox
  2001-04-14 13:00   ` Daniel Kobras
  1 sibling, 1 reply; 10+ messages in thread
From: Detlev Offenbach @ 2001-04-14  9:45 UTC (permalink / raw)
  To: Alan Cox; +Cc: linux-kernel

Hi Alan,

thanks for the reply.

Am Freitag, 13. April 2001 15:08 schrieb Alan Cox:
> > I have a problem using my MO-Drive under kernel 2.4.3. I have several
> > disks formated with a VFAT filesystem. Under kernel 2.2.19 everything
> > works fine. Under kernel 2.4.3 I cannot write anything to the disk
> > without hanging the complete system so that I have to use the reset
> > button. For disks with an ext2 filesystem it works okay.
>
> This is a bug in the scsi layer. linux-scsi@vger.kernel.org, not that any
> of the scsi maintainers seem to care about it right now.

Does this mean I can forget about using kernel 2.4.x? :-(( Can you give me a 
hint where to look. Maybe I can fix it myself.

Regards
Detlev
-- 
Detlev Offenbach
detlev@offenbach.fs.uunet.de

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-14  9:45   ` Detlev Offenbach
@ 2001-04-14 14:25     ` Alan Cox
  0 siblings, 0 replies; 10+ messages in thread
From: Alan Cox @ 2001-04-14 14:25 UTC (permalink / raw)
  To: Detlev Offenbach; +Cc: Alan Cox, linux-kernel

> > This is a bug in the scsi layer. linux-scsi@vger.kernel.org, not that any
> > of the scsi maintainers seem to care about it right now.
> 
> Does this mean I can forget about using kernel 2.4.x? :-(( Can you give me a 
> hint where to look. Maybe I can fix it myself.

The FAT layer asks for a 512 byte block, the scsi layer in 2.2 would then 
reblock the data to handle this internally. In 2.4 the scsi layer errors this
then all the error handling gets tangled up (and doesnt work) and the machine
oopses.

So you either need to teach lots of file systems about handling blocks that
are smaller than the physical layer, or the block layer to do reblocking in
some cases (at an obvious performance hit).

And someone needs to fix the error handling so it errors rather than exploding

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-13 13:08 ` Alan Cox
  2001-04-14  9:45   ` Detlev Offenbach
@ 2001-04-14 13:00   ` Daniel Kobras
  2001-04-14 14:29     ` Alan Cox
  1 sibling, 1 reply; 10+ messages in thread
From: Daniel Kobras @ 2001-04-14 13:00 UTC (permalink / raw)
  To: Alan Cox; +Cc: linux-kernel

On Fri, Apr 13, 2001 at 02:08:41PM +0100, Alan Cox wrote:
> > I have a problem using my MO-Drive under kernel 2.4.3. I have several disks 
> > formated with a VFAT filesystem. Under kernel 2.2.19 everything works fine. 
> > Under kernel 2.4.3 I cannot write anything to the disk without hanging the 
> > complete system so that I have to use the reset button. For disks with an 
> > ext2 filesystem it works okay.
> 
> This is a bug in the scsi layer. linux-scsi@vger.kernel.org, not that any of
> the scsi maintainers seem to care about it right now.

Err..., now I'm confused. Last time this issue popped up, it was my
understanding that it's generic_file_{read,write}'s limitation to filesystems
with logical_blksize >= hw_blksize that makes MOs fail with VFAT. Now, is
this all moot, or is the SCSI thing just an additional problem?

Regards,

Daniel.

-- 
	GNU/Linux Audio Mechanics - http://www.glame.de
	      Cutting Edge Office - http://www.c10a02.de
	      GPG Key ID 89BF7E2B - http://www.keyserver.net

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-14 13:00   ` Daniel Kobras
@ 2001-04-14 14:29     ` Alan Cox
  2001-04-21 23:37       ` Daniel Kobras
  0 siblings, 1 reply; 10+ messages in thread
From: Alan Cox @ 2001-04-14 14:29 UTC (permalink / raw)
  To: Daniel Kobras; +Cc: Alan Cox, linux-kernel

> > This is a bug in the scsi layer. linux-scsi@vger.kernel.org, not that any of
> > the scsi maintainers seem to care about it right now.
> 
> Err..., now I'm confused. Last time this issue popped up, it was my
> understanding that it's generic_file_{read,write}'s limitation to filesystems
> with logical_blksize >= hw_blksize that makes MOs fail with VFAT. Now, is
> this all moot, or is the SCSI thing just an additional problem?

generic_file_* doesnt handle metadata issues - its too high up


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-14 14:29     ` Alan Cox
@ 2001-04-21 23:37       ` Daniel Kobras
  2001-04-21 23:59         ` Alan Cox
  0 siblings, 1 reply; 10+ messages in thread
From: Daniel Kobras @ 2001-04-21 23:37 UTC (permalink / raw)
  To: Alan Cox; +Cc: linux-kernel

On Sat, Apr 14, 2001 at 03:29:45PM +0100, Alan Cox wrote:
> > > This is a bug in the scsi layer. linux-scsi@vger.kernel.org, not that any of
> > > the scsi maintainers seem to care about it right now.
> > 
> > Err..., now I'm confused. Last time this issue popped up, it was my
> > understanding that it's generic_file_{read,write}'s limitation to filesystems
> > with logical_blksize >= hw_blksize that makes MOs fail with VFAT. Now, is
> > this all moot, or is the SCSI thing just an additional problem?
> 
> generic_file_* doesnt handle metadata issues - its too high up

As much as SCSI is too low down, so I still don't see why this should be a
SCSI issue. Assuming requests are no smaller than hw_blksize looks rather sane
for a low-level driver. It's still enforced in ll_rw_blk(). I'd rather blame
the callers when they bypass the check via submit_bh().

More to the topic, FAT in 2.2 contained lots of special code to deal with
bigblocks. In 2.4, this code got removed in favour of the generic
functions in fs/buffer.c, to the effect that operations that happen to not
deref a NULL pointer now request 2k blocks using 512 byte-calculated block
numbers. I see two options:

a) Put in lots of bigblock special case code in FAT;
b) teach submit_bh() or generic_make_request() to transparently reblock 
   bhs < hw_blksize and remove most special cases from FAT. Specifically,
   it ought to stop pretending in sb->s_blocksize to use 2k blocks when
   the fs is really tied to 512 byte blocks.

I tend to favour b), but which one is more likely to be accepted?

Regards,

Daniel.

-- 
	GNU/Linux Audio Mechanics - http://www.glame.de
	      Cutting Edge Office - http://www.c10a02.de
	      GPG Key ID 89BF7E2B - http://www.keyserver.net

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-21 23:37       ` Daniel Kobras
@ 2001-04-21 23:59         ` Alan Cox
  2001-04-22  2:36           ` Jens Axboe
  0 siblings, 1 reply; 10+ messages in thread
From: Alan Cox @ 2001-04-21 23:59 UTC (permalink / raw)
  To: Daniel Kobras; +Cc: Alan Cox, linux-kernel

> a) Put in lots of bigblock special case code in FAT;
> b) teach submit_bh() or generic_make_request() to transparently reblock 
>    bhs < hw_blksize and remove most special cases from FAT. Specifically,
>    it ought to stop pretending in sb->s_blocksize to use 2k blocks when
>    the fs is really tied to 512 byte blocks.
> 
> I tend to favour b), but which one is more likely to be accepted?

Al Viro suggested c) which was to transparently make it a loopback mount of
the raw device and let a loopback layer do the work.


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-21 23:59         ` Alan Cox
@ 2001-04-22  2:36           ` Jens Axboe
  2001-05-01 16:01             ` Daniel Kobras
  0 siblings, 1 reply; 10+ messages in thread
From: Jens Axboe @ 2001-04-22  2:36 UTC (permalink / raw)
  To: Alan Cox; +Cc: Daniel Kobras, linux-kernel

On Sun, Apr 22 2001, Alan Cox wrote:
> > a) Put in lots of bigblock special case code in FAT;
> > b) teach submit_bh() or generic_make_request() to transparently reblock 
> >    bhs < hw_blksize and remove most special cases from FAT. Specifically,
> >    it ought to stop pretending in sb->s_blocksize to use 2k blocks when
> >    the fs is really tied to 512 byte blocks.
> > 
> > I tend to favour b), but which one is more likely to be accepted?
> 
> Al Viro suggested c) which was to transparently make it a loopback mount of
> the raw device and let a loopback layer do the work.

... which is basically the same thing, in that we need to support writes
< hardware block size to devices. This is never going to be an efficient
mechanism, the read gathering required for a 512b write on a 2048b media
is scary. Think cd-rw 64kB blocksize for write. Ugh.

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: MO-Drive under 2.4.3
  2001-04-22  2:36           ` Jens Axboe
@ 2001-05-01 16:01             ` Daniel Kobras
  0 siblings, 0 replies; 10+ messages in thread
From: Daniel Kobras @ 2001-05-01 16:01 UTC (permalink / raw)
  To: Jens Axboe; +Cc: Alan Cox, linux-kernel

On Sun, Apr 22, 2001 at 04:36:18AM +0200, Jens Axboe wrote:
> On Sun, Apr 22 2001, Alan Cox wrote:
> > > a) Put in lots of bigblock special case code in FAT;
> > > b) teach submit_bh() or generic_make_request() to transparently reblock 
> > >    bhs < hw_blksize and remove most special cases from FAT. Specifically,
> > >    it ought to stop pretending in sb->s_blocksize to use 2k blocks when
> > >    the fs is really tied to 512 byte blocks.
> > > 
> > > I tend to favour b), but which one is more likely to be accepted?
> > 
> > Al Viro suggested c) which was to transparently make it a loopback mount of
> > the raw device and let a loopback layer do the work.
> 
> ... which is basically the same thing, in that we need to support writes
> < hardware block size to devices. This is never going to be an efficient
> mechanism, the read gathering required for a 512b write on a 2048b media
> is scary. Think cd-rw 64kB blocksize for write. Ugh.

Here's my stab at reblocking support. I've tested with VFAT filesystems on a
SCSI magneto-optical with 2k hw block size. losetup /dev/loop0 /dev/sda1 &&
mount /dev/loop0 /mnt/mo. read/write/mmap appear to work, but obviously the code
is in dire need for optimisations. What's the reasoning behind never caching bhs
to the underlying blkdev? Using getblk&friends instead of allocating each bh
ourselves and dropping it right afterwards would make the code a lot less
horrible. But I fear someone must have had good reason not to do so. Does
anyone know details? And moreover, does anyone see problems with this change
other than performance woes?

Regards,

Daniel.

--[snip]--

--- loop.c.vanilla	Mon Apr 30 21:08:05 2001
+++ loop.c	Tue May  1 17:05:32 2001
@@ -49,6 +49,10 @@
  *   problem above. Encryption modules that used to rely on the old scheme
  *   should just call ->i_mapping->bmap() to calculate the physical block
  *   number.
+ *
+ * Added reblocking support for misdesigned filesystems (read: FAT) that make
+ * strong assumptions about the max block size of the underlying device.
+ * Daniel Kobras <kobras@linux.de>, May 1st, 2001.
  */ 
 
 #include <linux/config.h>
@@ -381,9 +385,16 @@
 		loop_add_bh(lo, bh);
 }
 
+static void loop_end_io_sync(struct buffer_head *bh, int uptodate)
+{
+	mark_buffer_uptodate(bh, uptodate);
+	unlock_buffer(bh);
+}
+
 static struct buffer_head *loop_get_buffer(struct loop_device *lo,
 					   struct buffer_head *rbh)
 {
+	int bs;
 	struct buffer_head *bh;
 
 	do {
@@ -396,7 +407,6 @@
 	} while (1);
 	memset(bh, 0, sizeof(*bh));
 
-	bh->b_size = rbh->b_size;
 	bh->b_dev = rbh->b_rdev;
 	spin_lock_irq(&lo->lo_lock);
 	bh->b_rdev = lo->lo_device;
@@ -411,10 +421,21 @@
 	bh->b_page = alloc_page(GFP_BUFFER);
 	bh->b_data = kmap(bh->b_page);
 
+	init_waitqueue_head(&bh->b_wait);
+
 	bh->b_end_io = loop_end_io_transfer;
 	bh->b_rsector = rbh->b_rsector + (lo->lo_offset >> 9);
-	init_waitqueue_head(&bh->b_wait);
 
+	bs = loop_get_bs(lo);
+	
+	if (rbh->b_size >= bs) {
+		bh->b_size = rbh->b_size;
+		return bh;
+	}
+
+	bh->b_size = bs;
+	bh->b_rsector &= ~((bh->b_size>>9) - 1);
+	
 	return bh;
 }
 
@@ -468,9 +489,31 @@
 	bh->b_private = rbh;
 	IV = loop_get_iv(lo, bh->b_rsector);
 	if (rw == WRITE) {
-		set_bit(BH_Dirty, &bh->b_state);
-		if (lo_do_transfer(lo, WRITE, bh->b_data, rbh->b_data, bh->b_size, IV))
-			goto err;
+		if (bh->b_size == rbh->b_size) {
+			set_bit(BH_Dirty, &bh->b_state);
+			if (lo_do_transfer(lo, WRITE, bh->b_data, rbh->b_data,				                   rbh->b_size, IV))
+				goto err;
+		} else {
+			/* FIXME. This, of course, is outright ridiculous.
+			 * We obviously want to cache the bhs.  The naive
+			 * approch simply uses getblk() in loop_get_buffer(),
+			 * and we'd just call bread() below.  But surely the
+			 * current code has compelling reasons not to do so?
+			 * loop eating up too many bhs?
+			 */
+			unsigned long off;
+			bh->b_end_io = loop_end_io_sync;
+			generic_make_request(READ, bh);
+			lock_buffer(bh);
+			if (!buffer_uptodate(bh))
+				goto err;
+			bh->b_end_io = loop_end_io_transfer;
+			set_bit(BH_Dirty, &bh->b_state);
+			off = (rbh->b_rsector - bh->b_rsector) << 9;
+			if (lo_do_transfer(lo, WRITE, bh->b_data+off,
+	        	                   rbh->b_data, rbh->b_size, IV))
+				goto err;
+		}
 	}
 
 	generic_make_request(rw, bh);
@@ -503,9 +546,11 @@
 	} else {
 		struct buffer_head *rbh = bh->b_private;
 		unsigned long IV = loop_get_iv(lo, rbh->b_rsector);
-
-		ret = lo_do_transfer(lo, READ, bh->b_data, rbh->b_data,
-				     bh->b_size, IV);
+		unsigned long off = 0;
+		if (bh->b_size != rbh->b_size)
+			off = (rbh->b_rsector - bh->b_rsector) << 9;
+		ret = lo_do_transfer(lo, READ, bh->b_data+off, rbh->b_data,
+				     rbh->b_size, IV);
 
 		rbh->b_end_io(rbh, !ret);
 		loop_put_buffer(bh);

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2001-05-01 16:03 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2001-04-13  8:47 MO-Drive under 2.4.3 Detlev Offenbach
2001-04-13 13:08 ` Alan Cox
2001-04-14  9:45   ` Detlev Offenbach
2001-04-14 14:25     ` Alan Cox
2001-04-14 13:00   ` Daniel Kobras
2001-04-14 14:29     ` Alan Cox
2001-04-21 23:37       ` Daniel Kobras
2001-04-21 23:59         ` Alan Cox
2001-04-22  2:36           ` Jens Axboe
2001-05-01 16:01             ` Daniel Kobras

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox