public inbox for linux-bcache@vger.kernel.org
 help / color / mirror / Atom feed
* Loosing transactions
@ 2013-01-23 20:14 Pierre Beck
       [not found] ` <51004490.704-MZZvbRqs/9F0RdzJJlgK+g@public.gmane.org>
  0 siblings, 1 reply; 9+ messages in thread
From: Pierre Beck @ 2013-01-23 20:14 UTC (permalink / raw)
  To: linux-bcache-u79uwXL29TY76Z2rM5mHXA

Hi,

something is not working as advertised :-)

I have a test setup for power loss behaviour evaluation. Recently a 
batch of SSDs was of interest and following them, naturally, bcache.

The test is simple: format an ext4 fs on the target device, copy over an 
empty mysql db and server with ACID compliant config (defaults, innodb 
table), then write inserts with a python script and output the latest 
insert id. Watch via SSH, then cut power. I was positively surprised 
that the consumer SSDs obey flushes and don't loose transactions (stored 
transaction was in fact always one or two ahead of output). Intel 
520series, Samsung 840 Pro and Corsair Neutron GTX, all 256 GB, in case 
you're wondering. The Intel 520 was alot faster btw., I think Sandforce 
did a really good job performance-wise. Testing an OCZ Vector failed, 
BIOS hang during detection.

Using an external Ext4 Journal with data=journal yielded SSD-like write 
performance with writebacks to an ST3000DM001 at the same level thanks 
to re-ordering, not loosing transactions as well.

Adding bcache, tests immediately failed, in both writeback and 
writethrough modes. Watching writethrough mode, the performance of the 
HDD looked odd, because waiting for cache flushes it should not exceed 1 
MiB/s, yet I saw 30 MiB/s. So cache flushes are simply eaten somewhere.

dmesg says this at boot time:

Jan 23 19:23:37 dr-nick kernel: [    2.948131] sd 2:0:0:0: [sdb] 
5860533168 512-byte logical blocks: (3.00 TB/2.72 TiB)
Jan 23 19:23:37 dr-nick kernel: [    2.948135] sd 2:0:0:0: [sdb] 
4096-byte physical blocks
Jan 23 19:23:37 dr-nick kernel: [    2.948185] sd 2:0:0:0: [sdb] Write 
Protect is off
Jan 23 19:23:37 dr-nick kernel: [    2.948189] sd 2:0:0:0: [sdb] Mode 
Sense: 00 3a 00 00
Jan 23 19:23:37 dr-nick kernel: [    2.948212] sd 2:0:0:0: [sdb] Write 
cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jan 23 19:23:37 dr-nick kernel: [    2.948914] sd 3:0:0:0: [sdc] 
468862128 512-byte logical blocks: (240 GB/223 GiB)
Jan 23 19:23:37 dr-nick kernel: [    2.948986] sd 3:0:0:0: [sdc] Write 
Protect is off
Jan 23 19:23:37 dr-nick kernel: [    2.948990] sd 3:0:0:0: [sdc] Mode 
Sense: 00 3a 00 00
Jan 23 19:23:37 dr-nick kernel: [    2.949013] sd 3:0:0:0: [sdc] Write 
cache: enabled, read cache: enabled, doesn't support DPO or FUA

and bcache journal recovery looks like this:

Jan 23 19:24:58 dr-nick kernel: [   96.909115] bcache: 
btree_journal_read() done
Jan 23 19:24:58 dr-nick kernel: [   97.112616] bcache: btree_check() done
Jan 23 19:24:58 dr-nick kernel: [   97.113322] bcache: journal replay 
done, 103 keys in 2 entries, seq 6175-6176
Jan 23 19:24:58 dr-nick kernel: [   97.118998] bcache: Caching sdb as 
bcache0 on set f5f0cd6d-0f77-49d3-ab2d-2203ffff1668
Jan 23 19:24:58 dr-nick kernel: [   97.119125] bcache: registered cache 
device sdc

I wonder if there's some cache flushing method missing in bcache that 
other device mappers use to work around the missing support for FUA 
(queue draining?).

Any ideas where to start debugging?

Greetings,

Pierre Beck

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2013-01-30 20:18 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-01-23 20:14 Loosing transactions Pierre Beck
     [not found] ` <51004490.704-MZZvbRqs/9F0RdzJJlgK+g@public.gmane.org>
2013-01-24 23:35   ` Kent Overstreet
     [not found]     ` <20130124233559.GO26407-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2013-01-28 14:45       ` Pierre Beck
     [not found]         ` <51068F01.9060000-MZZvbRqs/9F0RdzJJlgK+g@public.gmane.org>
2013-01-29 19:01           ` Kent Overstreet
     [not found]             ` <20130129190133.GL26407-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2013-01-29 19:09               ` Kent Overstreet
     [not found]                 ` <20130129190942.GM26407-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2013-01-29 20:16                   ` Pierre Beck
     [not found]                     ` <51082E02.7000908-MZZvbRqs/9F0RdzJJlgK+g@public.gmane.org>
2013-01-30 19:02                       ` Kent Overstreet
     [not found]                         ` <20130130190220.GS26407-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2013-01-30 20:05                           ` Pierre Beck
     [not found]                             ` <51097D0A.6040204-MZZvbRqs/9F0RdzJJlgK+g@public.gmane.org>
2013-01-30 20:18                               ` Kent Overstreet

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox