public inbox for linux-bcache@vger.kernel.org
 help / color / mirror / Atom feed
* layering question.
@ 2015-08-04 16:20 A. James Lewis
  2015-08-04 17:01 ` Jens-U. Mozdzen
  2015-08-05  6:28 ` Kai Krakow
  0 siblings, 2 replies; 14+ messages in thread
From: A. James Lewis @ 2015-08-04 16:20 UTC (permalink / raw)
  To: linux-bcache

Hi all...

I've heard rumours that layering bcache with other block device drivers 
might not be recommended... I wonder what the truth really is... perhaps 
someone can advise.

I was planning to use 2 SSD's... combined with 4 large spinning drives 
to create a large filesystem with BTRFS...  my questions are as follows.

1. Is there a way to use 2 SSD's directly, or would it be OK to use MD 
to stripe them?... then used the MD array as the cache device?

2. I would be using BTRFS, so would it be better to create 4 separate 
bcache devices each attached to the single cache device, and then use 
BTRFS to raid 4 bcache devices... obviously this would be more flexible, 
or would I need to make an MD raid of the 4 devices, and then use that 
to create a single bcache device and build a BTRFS filesystem on top of 
that.

Hope that's clear, any clarification would be appreciated...

Also, there's talk about a pending on-disk cache format change some time 
around 3.19, but no details... is this over with, or still pending?

James

^ permalink raw reply	[flat|nested] 14+ messages in thread
* Re: layering question.
@ 2015-08-07 16:24 Jens-U. Mozdzen
  0 siblings, 0 replies; 14+ messages in thread
From: Jens-U. Mozdzen @ 2015-08-07 16:24 UTC (permalink / raw)
  To: linux-bcache

Hi James,

Zitat von "A. James Lewis" <james@fsck.co.uk>:
> OK, but in that case bcache is not between your MD RAID and it's  
> disks, so if your disks are dropping out of the MD array, that has  
> to be either an independent problem, or a very complex bug.

My guess is that it's a rather simple timeout / locking problem, which  
leads to an expiring timer in the MD code. And bcache has a well-known  
history for locking problems, according to the mailing list.

Regards,
Jens

> James
>
>
> On 07/08/15 16:36, Jens-U. Mozdzen wrote:
>> Hi James,
>>
>> Zitat von "A. James Lewis" <james@fsck.co.uk>:
>>> That's interesting, are you putting your MD on top of multiple  
>>> bcache devices... rather than bcache on top of an MD device... I  
>>> wonder what the rationale behind this is?
>>
>> Hi James, no such thing here...
>>
>> bcache is running on top of two MD-RAIDs - RAID6 with 7 spinning  
>> drives and RAID1 with two SSDs.
>>
>> The stack is, from bottom to top:
>>
>> - MD-RAID6 data, MD-RAID1 cache
>> - bcache (/dev/bcache0, used as an LVM PV)
>> - LVM
>> - many LVs
>> - DRBD on top of most of the LVs
>> - Ext4 on each of the DRBD devices
>> - SCST / NFS / SMB sharing these file systems
>>
>> In the referenced incidents, SCST reports that (many) writes failed  
>> due to time-out, and MD reports a single disk faulty. No other  
>> traces in syslog, especially no stalled processes, locking problems  
>> or kernel bugs.
>>
>> The i/o pattern is highly parallel reads and writes, mostly via SCST.
>>
>> Regards,
>> Jens

^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2015-08-07 16:24 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-08-04 16:20 layering question A. James Lewis
2015-08-04 17:01 ` Jens-U. Mozdzen
2015-08-04 17:16   ` A. James Lewis
2015-08-05  6:56     ` Jens-U. Mozdzen
2015-08-05  6:28 ` Kai Krakow
2015-08-05  7:04   ` Jens-U. Mozdzen
2015-08-05 23:10     ` Kai Krakow
2015-08-06  0:54       ` A. James Lewis
2015-08-06 23:12         ` Kai Krakow
2015-08-07 12:43           ` Jens-U. Mozdzen
2015-08-07 14:38             ` A. James Lewis
2015-08-07 15:36               ` Jens-U. Mozdzen
2015-08-07 16:16                 ` A. James Lewis
  -- strict thread matches above, loose matches on Subject: below --
2015-08-07 16:24 Jens-U. Mozdzen

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox