public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
* RE: [linux-iscsi-devel] [question] deferred sense
@ 2005-01-05 17:47 Salyzyn, Mark
  2005-01-05 21:51 ` James Bottomley
  0 siblings, 1 reply; 13+ messages in thread
From: Salyzyn, Mark @ 2005-01-05 17:47 UTC (permalink / raw)
  To: linux-scsi

James Bottomley sez:
> The most common case for deferred sense is using a writeback cache.
> Here the device takes the write into its cache and returns GOOD.
Later
> trying to write it to the device it gets a medium error and now tries
to
> send deferred sense saying "you remember that write I told you worked,
> well actually..."  by this time, we've already acknowledged the write
to
> the block layer and any JFS will proceed on the assumption that it
> completed.  There's no way we can go back in time and apply this error
> to an already completed transaction.  However, the device has violated
> the journalling assumptions, and the FS will be corrupt.
> 
> So does anyone have any better suggestions for handling deferred
sense?

Change the assumptions in Journalling filesystems and make it work in a
deferred sense environment.

Don't delete the Journal until a safe point of time when the writeback
is confirmed either by underlying knowledge, time (flush interval
passed), amount of data (dirty cache is forced flushed), or a read-back
(resulting from the deferred error investigation). RAID cards and
external enclosures do this all the time, but to be fair with usually
more knowledge of all the cache handling.

Sincerely -- Mark Salyzyn

^ permalink raw reply	[flat|nested] 13+ messages in thread
* RE: [linux-iscsi-devel] [question] deferred sense
@ 2005-01-06 14:23 Salyzyn, Mark
  2005-01-06 14:38 ` James Bottomley
  0 siblings, 1 reply; 13+ messages in thread
From: Salyzyn, Mark @ 2005-01-06 14:23 UTC (permalink / raw)
  To: James Bottomley; +Cc: SCSI Mailing List

James Bottomley sez:
> Well ... this proposal would change every filesystem in the kernel for
> something that occurs rather rarely (I've never actually seen it
> happen), so it is rather a lot of work for not very much benefit.
> However, if you have actual proposals, lets see them.

Proposal would be difficult and illegal for me; I am bound in my
engineering contract to a proprietary world :-( I can just not clean my
room well enough.

However, you would see a similar problem being generated by a need for
quick and hot system failover and cache synchronization, bubbling all
the way up to the Linux disk cache; not just at the physical DASD level.
If the file-system is immune to a deferred failure from cached systems
(disk, RAID or linux cache), it will be hardened to deal with component
to complete system failover.

Brute force, with performance penalty, is to periodically flush cache
(SCSI_SYNC command 0x35 being the weapon of choice), and when that
completes the Journal is cleansed. Added (performance) value, is to
survive a failover or deferred failure without the need to do this
synchronization.

(Note: Adaptec's branch of the aacraid driver supports SCSI_SYNC)

Sincerely -- Mark Salyzyn

^ permalink raw reply	[flat|nested] 13+ messages in thread
* RE: [linux-iscsi-devel] [question] deferred sense
@ 2005-01-06 14:57 Salyzyn, Mark
  2005-01-06 15:50 ` James Bottomley
  0 siblings, 1 reply; 13+ messages in thread
From: Salyzyn, Mark @ 2005-01-06 14:57 UTC (permalink / raw)
  To: James Bottomley; +Cc: SCSI Mailing List

James Bottomley sez: 
> If you're thinking of having a writeback
> cache in a raid card on a host, then you'll find all the documentation
> for clustering will explicitly warn you to switch it to write through
(I
> know ours does).

The key to higher performance is a cluster solution that permits cache
on, and in doing so, the issues of deferred failure is similar. I am not
talking pie in the sky here; there are proprietary solutions (from low
end all the way up to the `world scale cache' of the Yotta-Yotta's of
the world)

The point is deferred failure is not the only user of these changes to
the file-system layer. In the low end, this kind of technology is being
added to Windoze and has been part of Netware (including complete system
memory synchronization between failover machines) for years; there are
customers of redundant solutions that require this.

In the Public Domain, simplicity usually drives adoption (just measure
the pain Sistina had asking for a new atomic SCSI command). I am not
sure the General Purpose customer wants to 'pay' for unused complexity
and the accompanying doubt of stability ...

Sincerely -- Mark Salyzyn

^ permalink raw reply	[flat|nested] 13+ messages in thread

end of thread, other threads:[~2005-01-11 11:44 UTC | newest]

Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <41DB21D7.5080904@us.ibm.com>
     [not found] ` <20050104234700.GA18343@visi.com>
     [not found]   ` <20050105092144.GB26793@lst.de>
     [not found]     ` <20050105152112.GA8472@visi.com>
2005-01-05 15:23       ` [linux-iscsi-devel] [question] deferred sense Christoph Hellwig
2005-01-05 16:44         ` James Bottomley
2005-01-06 16:37           ` Tom Coughlan
2005-01-07  0:06             ` Douglas Gilbert
2005-01-11 11:44               ` Douglas Gilbert
2005-01-05 17:47 Salyzyn, Mark
2005-01-05 21:51 ` James Bottomley
2005-01-05 22:58   ` Matthew Wilcox
2005-01-06 23:09   ` Bryan Henderson
  -- strict thread matches above, loose matches on Subject: below --
2005-01-06 14:23 Salyzyn, Mark
2005-01-06 14:38 ` James Bottomley
2005-01-06 14:57 Salyzyn, Mark
2005-01-06 15:50 ` James Bottomley

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox