qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v4 0/7] block: Add retry for werror=/rerror= mechanism
@ 2020-12-15 12:30 Jiahui Cen
  2020-12-15 12:30 ` [PATCH v4 1/7] qapi/block-core: Add retry option for error action Jiahui Cen
                   ` (9 more replies)
  0 siblings, 10 replies; 12+ messages in thread
From: Jiahui Cen @ 2020-12-15 12:30 UTC (permalink / raw)
  To: qemu-devel
  Cc: Kevin Wolf, cenjiahui, zhang.zhanghailiang, qemu-block,
	Michael S. Tsirkin, Markus Armbruster, Max Reitz, Stefan Hajnoczi,
	fangying1, John Snow

A VM in the cloud environment may use a virutal disk as the backend storage,
and there are usually filesystems on the virtual block device. When backend
storage is temporarily down, any I/O issued to the virtual block device
will cause an error. For example, an error occurred in ext4 filesystem would
make the filesystem readonly. In production environment, a cloud backend
storage can be soon recovered. For example, an IP-SAN may be down due to
network failure and will be online soon after network is recovered. However,
the error in the filesystem may not be recovered unless a device reattach
or system restart. Thus an I/O retry mechanism is in need to implement a
self-healing system.

This patch series propose to extend the werror=/rerror= mechanism to add
a 'retry' feature. It can automatically retry failed I/O requests on error
without sending error back to guest, and guest can get back running smoothly
when I/O is recovred.

v3->v4:
* Adapt to werror=/rerror= mechanism.

v2->v3:
* Add a doc to describe I/O hang.

v1->v2:
* Rebase to fix compile problems.
* Fix incorrect remove of rehandle list.
* Provide rehandle pause interface.

REF: https://lists.gnu.org/archive/html/qemu-devel/2020-10/msg06560.html

Signed-off-by: Jiahui Cen <cenjiahui@huawei.com>
Signed-off-by: Ying Fang <fangying1@huawei.com>

Jiahui Cen (7):
  qapi/block-core: Add retry option for error action
  block-backend: Introduce retry timer
  block-backend: Add device specific retry callback
  block-backend: Enable retry action on errors
  block-backend: Add timeout support for retry
  block: Add error retry param setting
  virtio_blk: Add support for retry on errors

 block/block-backend.c          | 66 ++++++++++++++++++++
 blockdev.c                     | 52 +++++++++++++++
 hw/block/block.c               | 10 +++
 hw/block/virtio-blk.c          | 19 +++++-
 include/hw/block/block.h       |  7 ++-
 include/sysemu/block-backend.h | 10 +++
 qapi/block-core.json           |  4 +-
 7 files changed, 162 insertions(+), 6 deletions(-)

-- 
2.28.0



^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2021-01-27 17:19 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2020-12-15 12:30 [PATCH v4 0/7] block: Add retry for werror=/rerror= mechanism Jiahui Cen
2020-12-15 12:30 ` [PATCH v4 1/7] qapi/block-core: Add retry option for error action Jiahui Cen
2021-01-27 17:16   ` Eric Blake
2020-12-15 12:30 ` [PATCH v4 2/7] block-backend: Introduce retry timer Jiahui Cen
2020-12-15 12:30 ` [PATCH v4 3/7] block-backend: Add device specific retry callback Jiahui Cen
2020-12-15 12:30 ` [PATCH v4 4/7] block-backend: Enable retry action on errors Jiahui Cen
2020-12-15 12:30 ` [PATCH v4 5/7] block-backend: Add timeout support for retry Jiahui Cen
2020-12-15 12:30 ` [PATCH v4 6/7] block: Add error retry param setting Jiahui Cen
2020-12-15 12:30 ` [PATCH v4 7/7] virtio_blk: Add support for retry on errors Jiahui Cen
2020-12-21  7:57 ` [PATCH v4 0/7] block: Add retry for werror=/rerror= mechanism Jiahui Cen
2021-01-05  9:33 ` Ping: " Jiahui Cen
2021-01-25  3:23 ` Ying Fang

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).