public inbox for linux-bcache@vger.kernel.org
 help / color / mirror / Atom feed
* full ssd for bcache or part?
@ 2013-10-23 14:49 Vasiliy Tolstov
       [not found] ` <CACaajQscOOajT2A6MNpstALph8D=jqdvqifynYxxR4gTXrGTGQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Vasiliy Tolstov @ 2013-10-23 14:49 UTC (permalink / raw)
  To: linux-bcache-u79uwXL29TY76Z2rM5mHXA

As i known ssd slowdown then its full. Does it matter for bcache?
Does i need to for example from 240 Gb drive create partition with
200Gb for cache and 40 unused?

-- 
Vasiliy Tolstov,
e-mail: v.tolstov-+9FY0jupvH6HXe+LvDLADg@public.gmane.org
jabber: vase-+9FY0jupvH6HXe+LvDLADg@public.gmane.org

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: full ssd for bcache or part?
       [not found] ` <CACaajQscOOajT2A6MNpstALph8D=jqdvqifynYxxR4gTXrGTGQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2013-10-23 16:15   ` Matthew Patton
       [not found]     ` <f46b2a85-6116-4dac-8acd-03007110cf4b-2ueSQiBKiTY7tOexoI0I+QC/G2K4zDHf@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: Matthew Patton @ 2013-10-23 16:15 UTC (permalink / raw)
  To: Vasiliy Tolstov, linux-bcache-u79uwXL29TY76Z2rM5mHXA

25 to 30 percent reserve is strongly recommended for any write heavy workload. The performance improvement will blow your mind.

A 240GB drive is a 256 with 7 percent reserve. Partition your device to 185 GB. 

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: full ssd for bcache or part?
       [not found]     ` <f46b2a85-6116-4dac-8acd-03007110cf4b-2ueSQiBKiTY7tOexoI0I+QC/G2K4zDHf@public.gmane.org>
@ 2013-10-23 16:55       ` Vasiliy Tolstov
  2013-10-24  2:24       ` Brad Campbell
  1 sibling, 0 replies; 6+ messages in thread
From: Vasiliy Tolstov @ 2013-10-23 16:55 UTC (permalink / raw)
  To: Matthew Patton; +Cc: linux-bcache-u79uwXL29TY76Z2rM5mHXA

2013/10/23 Matthew Patton <pattonme-/E1597aS9LQAvxtiuMwx3w@public.gmane.org>:
> 25 to 30 percent reserve is strongly recommended for any write heavy workload. The performance improvement will blow your mind.
>
> A 240GB drive is a 256 with 7 percent reserve. Partition your device to 185 GB.


Thanks!

-- 
Vasiliy Tolstov,
e-mail: v.tolstov-+9FY0jupvH6HXe+LvDLADg@public.gmane.org
jabber: vase-+9FY0jupvH6HXe+LvDLADg@public.gmane.org

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: full ssd for bcache or part?
       [not found]     ` <f46b2a85-6116-4dac-8acd-03007110cf4b-2ueSQiBKiTY7tOexoI0I+QC/G2K4zDHf@public.gmane.org>
  2013-10-23 16:55       ` Vasiliy Tolstov
@ 2013-10-24  2:24       ` Brad Campbell
       [not found]         ` <526884F8.4080000-+nnirC7rrGZibQn6LdNjmg@public.gmane.org>
  1 sibling, 1 reply; 6+ messages in thread
From: Brad Campbell @ 2013-10-24  2:24 UTC (permalink / raw)
  To: Matthew Patton, Vasiliy Tolstov,
	linux-bcache-u79uwXL29TY76Z2rM5mHXA

On 10/24/2013 12:15 AM, Matthew Patton wrote:
> 25 to 30 percent reserve is strongly recommended for any write heavy workload. The performance improvement will blow your mind.
>
> A 240GB drive is a 256 with 7 percent reserve. Partition your device to 185 GB.

I thought (and maybe I'm wrong) that a good chunk of reserved space was 
essential to allow the drive to efficiently manage its read-modify-write 
cycles. It was my understanding that bcache was explicitly designed to 
fill erase block sized chunks sequentially and discard them in whole 
units, negating the requirement for the drive to actually perform RMW 
cycles and therefore negating the requirement for the extra free space.

Have I got it wrong?

Regards,
Brad

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: full ssd for bcache or part?
       [not found]         ` <526884F8.4080000-+nnirC7rrGZibQn6LdNjmg@public.gmane.org>
@ 2013-10-24  3:40           ` matthew patton
       [not found]             ` <1382586002.46067.YahooMailNeo-XYahOdtEMNnuQS8rMknbopOW+3bF1jUfVpNB7YpNyf8@public.gmane.org>
  0 siblings, 1 reply; 6+ messages in thread
From: matthew patton @ 2013-10-24  3:40 UTC (permalink / raw)
  To: Brad Campbell, Vasiliy Tolstov,
	linux-bcache-u79uwXL29TY76Z2rM5mHXA@public.gmane.org


>I thought (and maybe I'm wrong) that a good chunk of reserved space was essential to allow the drive to efficiently manage its read-modify-write cycles.

Correct. The industry ~standard of 7% (basically the difference between GiB and GB) is woefully inadequate for any kind of steady write load. ALL enterprise SSDs use north of 20% and and I've seen as high as 50%.

> It was my understanding that bcache was explicitly designed to fill erase block sized
> chunks sequentially and discard them in whole units,
> negating the requirement for the drive to actually perform RMW cycles


RMW is an essential and inalienable of how an SSD works. Every manufacturer can use different page and erase block sizes. And much of the time they don't publish the specs publicly. So while Kent may have gone to deliberate length to optimize the way BCache does IO by using aligned, suitably large chunks (eg. 128KB-512KB) he has zero control over what the firmware decides to do.

BTW, did you undo the retarded disk label that Linux has used for decades which is guaranteed to cause mis-aligned I/O? I expect BCache will start it's data area at 1MB offset from where the device starts. But it can't do much to remedy the  problem if you didn't align the partition or LV you handed BCache correctly to begin with.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: full ssd for bcache or part?
       [not found]             ` <1382586002.46067.YahooMailNeo-XYahOdtEMNnuQS8rMknbopOW+3bF1jUfVpNB7YpNyf8@public.gmane.org>
@ 2013-10-24  4:08               ` Brad Campbell
  0 siblings, 0 replies; 6+ messages in thread
From: Brad Campbell @ 2013-10-24  4:08 UTC (permalink / raw)
  To: matthew patton, Vasiliy Tolstov,
	linux-bcache-u79uwXL29TY76Z2rM5mHXA@public.gmane.org

On 10/24/2013 11:40 AM, matthew patton wrote:
>
>> I thought (and maybe I'm wrong) that a good chunk of reserved space was essential to allow the drive to efficiently manage its read-modify-write cycles.
>
> Correct. The industry ~standard of 7% (basically the difference between GiB and GB) is woefully inadequate for any kind of steady write load. ALL enterprise SSDs use north of 20% and and I've seen as high as 50%.

I must admit all my 250G SSDs are partitioned down to 200G.

>> It was my understanding that bcache was explicitly designed to fill erase block sized
>> chunks sequentially and discard them in whole units,
>> negating the requirement for the drive to actually perform RMW cycles
>
>
> RMW is an essential and inalienable of how an SSD works.

Well, yes... but. I'd have though if you don't give the drive a reason 
to perform an RMW, then it would be a reasonable assumption that perhaps 
it won't actually do one. Perhaps I give the firmware authors too much 
credit.

> Every manufacturer can use different page and erase block sizes. And much of the time they don't publish the specs publicly. So while Kent may have gone to deliberate length to optimize the way BCache does IO by using aligned, suitably large chunks (eg. 128KB-512KB) he has zero control over what the firmware decides to do.


This is essentially true, however making your storage bucket size big 
enough to believe it holds at least 1 full erase block would be a 
reasonable assumption. Oops.. ass-u-me..

> BTW, did you undo the retarded disk label that Linux has used for decades which is guaranteed to cause mis-aligned I/O? I expect BCache will start it's data area at 1MB offset from where the device starts. But it can't do much to remedy the  problem if you didn't align the partition or LV you handed BCache correctly to begin with.

I'm pretty sure all my drives are properly aligned. I learned that very 
quickly when I started using the WD "advanced format" drives, except for 
SSD's I align on 1M insteak of 4k.

I don't actually use bcache in production as when I did my last storage 
upgrade I just could not get it reliable (well before it hit the 
mainline kernel). I just keep tabs on it with the intention of using it 
when it develops the ability to mirror writeback data. In the mean time 
I'm just running on a RAID10 of 6 SSD's.

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2013-10-24  4:08 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-10-23 14:49 full ssd for bcache or part? Vasiliy Tolstov
     [not found] ` <CACaajQscOOajT2A6MNpstALph8D=jqdvqifynYxxR4gTXrGTGQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-10-23 16:15   ` Matthew Patton
     [not found]     ` <f46b2a85-6116-4dac-8acd-03007110cf4b-2ueSQiBKiTY7tOexoI0I+QC/G2K4zDHf@public.gmane.org>
2013-10-23 16:55       ` Vasiliy Tolstov
2013-10-24  2:24       ` Brad Campbell
     [not found]         ` <526884F8.4080000-+nnirC7rrGZibQn6LdNjmg@public.gmane.org>
2013-10-24  3:40           ` matthew patton
     [not found]             ` <1382586002.46067.YahooMailNeo-XYahOdtEMNnuQS8rMknbopOW+3bF1jUfVpNB7YpNyf8@public.gmane.org>
2013-10-24  4:08               ` Brad Campbell

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox