zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn; qiulg@ecloutech.com.cn

linux-bcache.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn; qiulg@ecloutech.com.cn
@ 2017-09-30 10:50 qiulg
  2017-09-30 12:18 ` Coly Li
  0 siblings, 1 reply; 7+ messages in thread
From: qiulg @ 2017-09-30 10:50 UTC (permalink / raw)
  To: linux-bcache

Hi,
 
Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
Here are the results during test:
 
# lsscsi
-- the sdj device is gone.
 
# ls -l /dev/bcache4
brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
-- the bcache4 device is still there
 
# dd if=/dev/bcache4 of=/dev/null bs=1M count=100
dd: reading `/dev/bcache4': Input/output error
0+0 records in
0+0 records out
0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
-- this is resonable
 
# fdisk -l /dev/bcache4
-- fdisk has no result. This is resonable
 
# dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
100+0 records in
100+0 records out
104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
-- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
 
# dd if=/dev/bcache4 of=/dev/null bs=1M count=100
dd: reading `/dev/bcache4': Input/output error
46+1 records in
46+1 records out
48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
-- now read still shows error but 48MB data read out.
 
# fdisk -l /dev/bcache4
Note: sector size is 4096 (not 512)
Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
255 heads, 63 sectors/track, 7296 cylinders
Units = cylinders of 16065 * 4096 = 65802240 bytes
Sector size (logical/physical): 4096 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disk identifier: 0x00000000
-- fdisk got answer. Strange. 480GB is the backing device size.
 
# dd if=/dev/zero of=/dev/bcache4 bs=1M
dd: writing `/dev/bcache4': No space left on device
457863+0 records in
457862+0 records out
480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
-- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
-- during dd, the /var/log/message shows error
...
Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
...
 
I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
 
Appreciated for any help.
 
Richard Qiu

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn; qiulg@ecloutech.com.cn
  2017-09-30 10:50 zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn; qiulg@ecloutech.com.cn qiulg
@ 2017-09-30 12:18 ` Coly Li
  2017-10-01 14:29   ` /dev/bcacheX can still be written when plug out backing device qiulg
  0 siblings, 1 reply; 7+ messages in thread
From: Coly Li @ 2017-09-30 12:18 UTC (permalink / raw)
  To: qiulg; +Cc: linux-bcache

On 2017/9/30 下午6:50, qiulg wrote:
> Hi,
>  
> Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
> Here are the results during test:
>  
> # lsscsi
> -- the sdj device is gone.
>  
> # ls -l /dev/bcache4
> brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
> -- the bcache4 device is still there
>  
> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
> dd: reading `/dev/bcache4': Input/output error
> 0+0 records in
> 0+0 records out
> 0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
> -- this is resonable
>  
> # fdisk -l /dev/bcache4
> -- fdisk has no result. This is resonable
>  
> # dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
> 100+0 records in
> 100+0 records out
> 104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
> -- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
>  
> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
> dd: reading `/dev/bcache4': Input/output error
> 46+1 records in
> 46+1 records out
> 48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
> -- now read still shows error but 48MB data read out.
>  
> # fdisk -l /dev/bcache4
> Note: sector size is 4096 (not 512)
> Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
> 255 heads, 63 sectors/track, 7296 cylinders
> Units = cylinders of 16065 * 4096 = 65802240 bytes
> Sector size (logical/physical): 4096 bytes / 4096 bytes
> I/O size (minimum/optimal): 4096 bytes / 4096 bytes
> Disk identifier: 0x00000000
> -- fdisk got answer. Strange. 480GB is the backing device size.
>  
> # dd if=/dev/zero of=/dev/bcache4 bs=1M
> dd: writing `/dev/bcache4': No space left on device
> 457863+0 records in
> 457862+0 records out
> 480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
> -- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
> -- during dd, the /var/log/message shows error
> ...
> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
> Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
> Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
> ...
>  
> I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
>  
> Appreciated for any help.
>  
Hi,

Could you please try oflag=direct and observe what is the difference ?

Buffered I/O is asynchronized, application is not able to get an
immediate -EIO.


-- 
Coly Li

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: /dev/bcacheX can still be written when plug out backing device
  2017-09-30 12:18 ` Coly Li
@ 2017-10-01 14:29   ` qiulg
  2017-10-01 14:50     ` Coly Li
  0 siblings, 1 reply; 7+ messages in thread
From: qiulg @ 2017-10-01 14:29 UTC (permalink / raw)
  To: Coly Li
  Cc: linux-bcache, 朱菁, 秦晓强,
	邱利公

Thanks Coly. Sorry for the wrong subject, changed. 

Tridd dd with oflag=direct, the bcache device can still be written without backing device, but this time it finished at about 7.7GB.

# dd if=/dev/zero of=/dev/bcache4 oflag=direct 
 dd: writing to `/dev/bcache4': Input/output error
15052824+0 records in
15052823+0 records out
7707045376 bytes (7.7 GB) copied, 1022.63 s, 7.5 MB/s 

Richard 

------------------ Original ------------------
From:  "Coly Li"<i@coly.li>;
Date:  Sat, Sep 30, 2017 08:18 PM
To:  "qiulg"<qiulg@ecloudtech.com.cn>;
Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>;
Subject:  Re: zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn;qiulg@ecloutech.com.cn
 
On 2017/9/30 下午6:50, qiulg wrote:
> Hi,
>  
> Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
> Here are the results during test:
>  
> # lsscsi
> -- the sdj device is gone.
>  
> # ls -l /dev/bcache4
> brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
> -- the bcache4 device is still there
>  
> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
> dd: reading `/dev/bcache4': Input/output error
> 0+0 records in
> 0+0 records out
> 0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
> -- this is resonable
>  
> # fdisk -l /dev/bcache4
> -- fdisk has no result. This is resonable
>  
> # dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
> 100+0 records in
> 100+0 records out
> 104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
> -- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
>  
> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
> dd: reading `/dev/bcache4': Input/output error
> 46+1 records in
> 46+1 records out
> 48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
> -- now read still shows error but 48MB data read out.
>  
> # fdisk -l /dev/bcache4
> Note: sector size is 4096 (not 512)
> Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
> 255 heads, 63 sectors/track, 7296 cylinders
> Units = cylinders of 16065 * 4096 = 65802240 bytes
> Sector size (logical/physical): 4096 bytes / 4096 bytes
> I/O size (minimum/optimal): 4096 bytes / 4096 bytes
> Disk identifier: 0x00000000
> -- fdisk got answer. Strange. 480GB is the backing device size.
>  
> # dd if=/dev/zero of=/dev/bcache4 bs=1M
> dd: writing `/dev/bcache4': No space left on device
> 457863+0 records in
> 457862+0 records out
> 480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
> -- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
> -- during dd, the /var/log/message shows error
> ...
> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
> Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
> Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
> ...
>  
> I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
>  
> Appreciated for any help.
>  
Hi,

Could you please try oflag=direct and observe what is the difference ?

Buffered I/O is asynchronized, application is not able to get an
immediate -EIO.


-- 
Coly Li

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: /dev/bcacheX can still be written when plug out backing device
  2017-10-01 14:29   ` /dev/bcacheX can still be written when plug out backing device qiulg
@ 2017-10-01 14:50     ` Coly Li
  2017-10-01 15:51       ` qiulg
  0 siblings, 1 reply; 7+ messages in thread
From: Coly Li @ 2017-10-01 14:50 UTC (permalink / raw)
  To: qiulg; +Cc: linux-bcache, 朱菁, 秦晓强

On 2017/10/1 下午10:29, qiulg wrote:
> Thanks Coly. Sorry for the wrong subject, changed. 
> 
> Tridd dd with oflag=direct, the bcache device can still be written without backing device, but this time it finished at about 7.7GB.
> 
> # dd if=/dev/zero of=/dev/bcache4 oflag=direct 
>  dd: writing to `/dev/bcache4': Input/output error
> 15052824+0 records in
> 15052823+0 records out
> 7707045376 bytes (7.7 GB) copied, 1022.63 s, 7.5 MB/s 
> 

Aha, it is backing device...I was blind...
The situation you hit is the backing device failure, which is not
properly handled by bcache yet. (indeed cache device failure is not
properly handled neither)

You are not the first one complains on this. I also receive such a
similar feature request, but it is on the tail of my task list. If
someone may contribute patches, I am glad to test.

Thanks.

Coly

> ------------------ Original ------------------
> From:  "Coly Li"<i@coly.li>;
> Date:  Sat, Sep 30, 2017 08:18 PM
> To:  "qiulg"<qiulg@ecloudtech.com.cn>;
> Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>;
> Subject:  Re: zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn;qiulg@ecloutech.com.cn
>  
> On 2017/9/30 下午6:50, qiulg wrote:
>> Hi,
>>  
>> Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
>> Here are the results during test:
>>  
>> # lsscsi
>> -- the sdj device is gone.
>>  
>> # ls -l /dev/bcache4
>> brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
>> -- the bcache4 device is still there
>>  
>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>> dd: reading `/dev/bcache4': Input/output error
>> 0+0 records in
>> 0+0 records out
>> 0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
>> -- this is resonable
>>  
>> # fdisk -l /dev/bcache4
>> -- fdisk has no result. This is resonable
>>  
>> # dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
>> 100+0 records in
>> 100+0 records out
>> 104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
>> -- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
>>  
>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>> dd: reading `/dev/bcache4': Input/output error
>> 46+1 records in
>> 46+1 records out
>> 48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
>> -- now read still shows error but 48MB data read out.
>>  
>> # fdisk -l /dev/bcache4
>> Note: sector size is 4096 (not 512)
>> Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
>> 255 heads, 63 sectors/track, 7296 cylinders
>> Units = cylinders of 16065 * 4096 = 65802240 bytes
>> Sector size (logical/physical): 4096 bytes / 4096 bytes
>> I/O size (minimum/optimal): 4096 bytes / 4096 bytes
>> Disk identifier: 0x00000000
>> -- fdisk got answer. Strange. 480GB is the backing device size.
>>  
>> # dd if=/dev/zero of=/dev/bcache4 bs=1M
>> dd: writing `/dev/bcache4': No space left on device
>> 457863+0 records in
>> 457862+0 records out
>> 480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
>> -- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
>> -- during dd, the /var/log/message shows error
>> ...
>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
>> Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
>> Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
>> ...
>>  
>> I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
>>  
>> Appreciated for any help.
>>  
> Hi,
> 
> Could you please try oflag=direct and observe what is the difference ?
> 
> Buffered I/O is asynchronized, application is not able to get an
> immediate -EIO.
> 

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: /dev/bcacheX can still be written when plug out backing device
  2017-10-01 14:50     ` Coly Li
@ 2017-10-01 15:51       ` qiulg
  2017-10-01 15:57         ` Coly Li
  0 siblings, 1 reply; 7+ messages in thread
From: qiulg @ 2017-10-01 15:51 UTC (permalink / raw)
  To: Coly Li; +Cc: linux-bcache, 朱菁, 秦晓强

That's bad news. :( 
What is expected is freezing write I/O and return error to application when cache device or backing device failed. Hope the patch can be released soon.  

Thanks. 
Qiu
 
------------------ Original ------------------
From:  "Coly Li"<i@coly.li>;
Date:  Sun, Oct 1, 2017 10:50 PM
To:  "qiulg"<qiulg@ecloudtech.com.cn>;
Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>; "朱菁"<zhujing@ecloudtech.com.cn>; "秦晓强"<qinxq@ecloudtech.com.cn>;
Subject:  Re: /dev/bcacheX can still be written when plug out backing device
 
On 2017/10/1 下午10:29, qiulg wrote:
> Thanks Coly. Sorry for the wrong subject, changed. 
> 
> Tridd dd with oflag=direct, the bcache device can still be written without backing device, but this time it finished at about 7.7GB.
> 
> # dd if=/dev/zero of=/dev/bcache4 oflag=direct 
>  dd: writing to `/dev/bcache4': Input/output error
> 15052824+0 records in
> 15052823+0 records out
> 7707045376 bytes (7.7 GB) copied, 1022.63 s, 7.5 MB/s 
> 

Aha, it is backing device...I was blind...
The situation you hit is the backing device failure, which is not
properly handled by bcache yet. (indeed cache device failure is not
properly handled neither)

You are not the first one complains on this. I also receive such a
similar feature request, but it is on the tail of my task list. If
someone may contribute patches, I am glad to test.

Thanks.

Coly

> ------------------ Original ------------------
> From:  "Coly Li"<i@coly.li>;
> Date:  Sat, Sep 30, 2017 08:18 PM
> To:  "qiulg"<qiulg@ecloudtech.com.cn>;
> Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>;
> Subject:  Re: zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn;qiulg@ecloutech.com.cn
>  
> On 2017/9/30 下午6:50, qiulg wrote:
>> Hi,
>>  
>> Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
>> Here are the results during test:
>>  
>> # lsscsi
>> -- the sdj device is gone.
>>  
>> # ls -l /dev/bcache4
>> brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
>> -- the bcache4 device is still there
>>  
>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>> dd: reading `/dev/bcache4': Input/output error
>> 0+0 records in
>> 0+0 records out
>> 0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
>> -- this is resonable
>>  
>> # fdisk -l /dev/bcache4
>> -- fdisk has no result. This is resonable
>>  
>> # dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
>> 100+0 records in
>> 100+0 records out
>> 104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
>> -- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
>>  
>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>> dd: reading `/dev/bcache4': Input/output error
>> 46+1 records in
>> 46+1 records out
>> 48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
>> -- now read still shows error but 48MB data read out.
>>  
>> # fdisk -l /dev/bcache4
>> Note: sector size is 4096 (not 512)
>> Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
>> 255 heads, 63 sectors/track, 7296 cylinders
>> Units = cylinders of 16065 * 4096 = 65802240 bytes
>> Sector size (logical/physical): 4096 bytes / 4096 bytes
>> I/O size (minimum/optimal): 4096 bytes / 4096 bytes
>> Disk identifier: 0x00000000
>> -- fdisk got answer. Strange. 480GB is the backing device size.
>>  
>> # dd if=/dev/zero of=/dev/bcache4 bs=1M
>> dd: writing `/dev/bcache4': No space left on device
>> 457863+0 records in
>> 457862+0 records out
>> 480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
>> -- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
>> -- during dd, the /var/log/message shows error
>> ...
>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
>> Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
>> Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
>> ...
>>  
>> I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
>>  
>> Appreciated for any help.
>>  
> Hi,
> 
> Could you please try oflag=direct and observe what is the difference ?
> 
> Buffered I/O is asynchronized, application is not able to get an
> immediate -EIO.
>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: /dev/bcacheX can still be written when plug out backing device
  2017-10-01 15:51       ` qiulg
@ 2017-10-01 15:57         ` Coly Li
  2017-10-01 16:00           ` qiulg
  0 siblings, 1 reply; 7+ messages in thread
From: Coly Li @ 2017-10-01 15:57 UTC (permalink / raw)
  To: qiulg; +Cc: linux-bcache, 朱菁, 秦晓强

On 2017/10/1 下午11:51, qiulg wrote:
> That's bad news. :( 
> What is expected is freezing write I/O and return error to application when cache device or backing device failed. Hope the patch can be released soon.  
> 

Yes this is one of the expected behavior.
Another expectation is after backing device is failed, bcache complains,
but still permits cache device to cache a pre-defined amount of data,
then freeing write I/O.

Once the patch is available, I will let you know :-)

Coly


> Thanks. 
> Qiu
>  
> ------------------ Original ------------------
> From:  "Coly Li"<i@coly.li>;
> Date:  Sun, Oct 1, 2017 10:50 PM
> To:  "qiulg"<qiulg@ecloudtech.com.cn>;
> Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>; "朱菁"<zhujing@ecloudtech.com.cn>; "秦晓强"<qinxq@ecloudtech.com.cn>;
> Subject:  Re: /dev/bcacheX can still be written when plug out backing device
>  
> On 2017/10/1 下午10:29, qiulg wrote:
>> Thanks Coly. Sorry for the wrong subject, changed. 
>>
>> Tridd dd with oflag=direct, the bcache device can still be written without backing device, but this time it finished at about 7.7GB.
>>
>> # dd if=/dev/zero of=/dev/bcache4 oflag=direct 
>>  dd: writing to `/dev/bcache4': Input/output error
>> 15052824+0 records in
>> 15052823+0 records out
>> 7707045376 bytes (7.7 GB) copied, 1022.63 s, 7.5 MB/s 
>>
> 
> Aha, it is backing device...I was blind...
> The situation you hit is the backing device failure, which is not
> properly handled by bcache yet. (indeed cache device failure is not
> properly handled neither)
> 
> You are not the first one complains on this. I also receive such a
> similar feature request, but it is on the tail of my task list. If
> someone may contribute patches, I am glad to test.
> 
> Thanks.
> 
> Coly
> 
>> ------------------ Original ------------------
>> From:  "Coly Li"<i@coly.li>;
>> Date:  Sat, Sep 30, 2017 08:18 PM
>> To:  "qiulg"<qiulg@ecloudtech.com.cn>;
>> Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>;
>> Subject:  Re: zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn;qiulg@ecloutech.com.cn
>>  
>> On 2017/9/30 下午6:50, qiulg wrote:
>>> Hi,
>>>  
>>> Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
>>> Here are the results during test:
>>>  
>>> # lsscsi
>>> -- the sdj device is gone.
>>>  
>>> # ls -l /dev/bcache4
>>> brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
>>> -- the bcache4 device is still there
>>>  
>>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>>> dd: reading `/dev/bcache4': Input/output error
>>> 0+0 records in
>>> 0+0 records out
>>> 0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
>>> -- this is resonable
>>>  
>>> # fdisk -l /dev/bcache4
>>> -- fdisk has no result. This is resonable
>>>  
>>> # dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
>>> 100+0 records in
>>> 100+0 records out
>>> 104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
>>> -- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
>>>  
>>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>>> dd: reading `/dev/bcache4': Input/output error
>>> 46+1 records in
>>> 46+1 records out
>>> 48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
>>> -- now read still shows error but 48MB data read out.
>>>  
>>> # fdisk -l /dev/bcache4
>>> Note: sector size is 4096 (not 512)
>>> Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
>>> 255 heads, 63 sectors/track, 7296 cylinders
>>> Units = cylinders of 16065 * 4096 = 65802240 bytes
>>> Sector size (logical/physical): 4096 bytes / 4096 bytes
>>> I/O size (minimum/optimal): 4096 bytes / 4096 bytes
>>> Disk identifier: 0x00000000
>>> -- fdisk got answer. Strange. 480GB is the backing device size.
>>>  
>>> # dd if=/dev/zero of=/dev/bcache4 bs=1M
>>> dd: writing `/dev/bcache4': No space left on device
>>> 457863+0 records in
>>> 457862+0 records out
>>> 480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
>>> -- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
>>> -- during dd, the /var/log/message shows error
>>> ...
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
>>> Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
>>> Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
>>> ...
>>>  
>>> I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
>>>  
>>> Appreciated for any help.
>>>  
>> Hi,
>>
>> Could you please try oflag=direct and observe what is the difference ?
>>
>> Buffered I/O is asynchronized, application is not able to get an
>> immediate -EIO.
>>
> 


-- 
Coly Li

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: /dev/bcacheX can still be written when plug out backing device
  2017-10-01 15:57         ` Coly Li
@ 2017-10-01 16:00           ` qiulg
  0 siblings, 0 replies; 7+ messages in thread
From: qiulg @ 2017-10-01 16:00 UTC (permalink / raw)
  To: Coly Li; +Cc: linux-bcache, 朱菁, 秦晓强

Thanks a lot Coly!

Qiu
  
------------------ Original ------------------
From:  "Coly Li"<i@coly.li>;
Date:  Sun, Oct 1, 2017 11:57 PM
To:  "qiulg"<qiulg@ecloudtech.com.cn>;
Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>; "朱菁"<zhujing@ecloudtech.com.cn>; "秦晓强"<qinxq@ecloudtech.com.cn>;
Subject:  Re: /dev/bcacheX can still be written when plug out backing device
 
On 2017/10/1 下午11:51, qiulg wrote:
> That's bad news. :( 
> What is expected is freezing write I/O and return error to application when cache device or backing device failed. Hope the patch can be released soon.  
> 

Yes this is one of the expected behavior.
Another expectation is after backing device is failed, bcache complains,
but still permits cache device to cache a pre-defined amount of data,
then freeing write I/O.

Once the patch is available, I will let you know :-)

Coly


> Thanks. 
> Qiu
>  
> ------------------ Original ------------------
> From:  "Coly Li"<i@coly.li>;
> Date:  Sun, Oct 1, 2017 10:50 PM
> To:  "qiulg"<qiulg@ecloudtech.com.cn>;
> Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>; "朱菁"<zhujing@ecloudtech.com.cn>; "秦晓强"<qinxq@ecloudtech.com.cn>;
> Subject:  Re: /dev/bcacheX can still be written when plug out backing device
>  
> On 2017/10/1 下午10:29, qiulg wrote:
>> Thanks Coly. Sorry for the wrong subject, changed. 
>>
>> Tridd dd with oflag=direct, the bcache device can still be written without backing device, but this time it finished at about 7.7GB.
>>
>> # dd if=/dev/zero of=/dev/bcache4 oflag=direct 
>>  dd: writing to `/dev/bcache4': Input/output error
>> 15052824+0 records in
>> 15052823+0 records out
>> 7707045376 bytes (7.7 GB) copied, 1022.63 s, 7.5 MB/s 
>>
> 
> Aha, it is backing device...I was blind...
> The situation you hit is the backing device failure, which is not
> properly handled by bcache yet. (indeed cache device failure is not
> properly handled neither)
> 
> You are not the first one complains on this. I also receive such a
> similar feature request, but it is on the tail of my task list. If
> someone may contribute patches, I am glad to test.
> 
> Thanks.
> 
> Coly
> 
>> ------------------ Original ------------------
>> From:  "Coly Li"<i@coly.li>;
>> Date:  Sat, Sep 30, 2017 08:18 PM
>> To:  "qiulg"<qiulg@ecloudtech.com.cn>;
>> Cc:  "linux-bcache"<linux-bcache@vger.kernel.org>;
>> Subject:  Re: zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn;qiulg@ecloutech.com.cn
>>  
>> On 2017/9/30 下午6:50, qiulg wrote:
>>> Hi,
>>>  
>>> Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
>>> Here are the results during test:
>>>  
>>> # lsscsi
>>> -- the sdj device is gone.
>>>  
>>> # ls -l /dev/bcache4
>>> brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
>>> -- the bcache4 device is still there
>>>  
>>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>>> dd: reading `/dev/bcache4': Input/output error
>>> 0+0 records in
>>> 0+0 records out
>>> 0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
>>> -- this is resonable
>>>  
>>> # fdisk -l /dev/bcache4
>>> -- fdisk has no result. This is resonable
>>>  
>>> # dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
>>> 100+0 records in
>>> 100+0 records out
>>> 104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
>>> -- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
>>>  
>>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>>> dd: reading `/dev/bcache4': Input/output error
>>> 46+1 records in
>>> 46+1 records out
>>> 48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
>>> -- now read still shows error but 48MB data read out.
>>>  
>>> # fdisk -l /dev/bcache4
>>> Note: sector size is 4096 (not 512)
>>> Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
>>> 255 heads, 63 sectors/track, 7296 cylinders
>>> Units = cylinders of 16065 * 4096 = 65802240 bytes
>>> Sector size (logical/physical): 4096 bytes / 4096 bytes
>>> I/O size (minimum/optimal): 4096 bytes / 4096 bytes
>>> Disk identifier: 0x00000000
>>> -- fdisk got answer. Strange. 480GB is the backing device size.
>>>  
>>> # dd if=/dev/zero of=/dev/bcache4 bs=1M
>>> dd: writing `/dev/bcache4': No space left on device
>>> 457863+0 records in
>>> 457862+0 records out
>>> 480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
>>> -- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
>>> -- during dd, the /var/log/message shows error
>>> ...
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
>>> Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
>>> Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
>>> ...
>>>  
>>> I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
>>>  
>>> Appreciated for any help.
>>>  
>> Hi,
>>
>> Could you please try oflag=direct and observe what is the difference ?
>>
>> Buffered I/O is asynchronized, application is not able to get an
>> immediate -EIO.
>>
> 


-- 
Coly Li

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2017-10-01 16:00 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-09-30 10:50 zhujing@ecloutech.com.cn; qinxq@ecloutech.com.cn; qiulg@ecloutech.com.cn qiulg
2017-09-30 12:18 ` Coly Li
2017-10-01 14:29   ` /dev/bcacheX can still be written when plug out backing device qiulg
2017-10-01 14:50     ` Coly Li
2017-10-01 15:51       ` qiulg
2017-10-01 15:57         ` Coly Li
2017-10-01 16:00           ` qiulg

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).