* [PATCH] libceph: set last_piece in ceph_msg_data_pages_cursor_init() correctly
@ 2014-08-08 11:18 Ilya Dryomov
2014-08-08 17:46 ` Sage Weil
2014-08-08 20:59 ` Alex Elder
0 siblings, 2 replies; 3+ messages in thread
From: Ilya Dryomov @ 2014-08-08 11:18 UTC (permalink / raw)
To: ceph-devel
Determining ->last_piece based on the value of ->page_offset + length
is incorrect because length here is the length of the entire message.
->last_piece set to false even if page array data item length is <=
PAGE_SIZE, which results in invalid length passed to
ceph_tcp_{send,recv}page() and causes various asserts to fire.
# cat pages-cursor-init.sh
#!/bin/bash
rbd create --size 10 --image-format 2 foo
FOO_DEV=$(rbd map foo)
dd if=/dev/urandom of=$FOO_DEV bs=1M &>/dev/null
rbd snap create foo@snap
rbd snap protect foo@snap
rbd clone foo@snap bar
# rbd_resize calls librbd rbd_resize(), size is in bytes
./rbd_resize bar $(((4 << 20) + 512))
rbd resize --size 10 bar
BAR_DEV=$(rbd map bar)
# trigger a 512-byte copyup -- 512-byte page array data item
dd if=/dev/urandom of=$BAR_DEV bs=1M count=1 seek=5
The problem exists only in ceph_msg_data_pages_cursor_init(),
ceph_msg_data_pages_advance() does the right thing. The size_t cast is
unnecessary.
Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
---
net/ceph/messenger.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/net/ceph/messenger.c b/net/ceph/messenger.c
index e51cad0db580..b2f571dd933d 100644
--- a/net/ceph/messenger.c
+++ b/net/ceph/messenger.c
@@ -901,7 +901,7 @@ static void ceph_msg_data_pages_cursor_init(struct ceph_msg_data_cursor *cursor,
BUG_ON(page_count > (int)USHRT_MAX);
cursor->page_count = (unsigned short)page_count;
BUG_ON(length > SIZE_MAX - cursor->page_offset);
- cursor->last_piece = (size_t)cursor->page_offset + length <= PAGE_SIZE;
+ cursor->last_piece = cursor->page_offset + cursor->resid <= PAGE_SIZE;
}
static struct page *
--
1.7.10.4
^ permalink raw reply related [flat|nested] 3+ messages in thread* Re: [PATCH] libceph: set last_piece in ceph_msg_data_pages_cursor_init() correctly
2014-08-08 11:18 [PATCH] libceph: set last_piece in ceph_msg_data_pages_cursor_init() correctly Ilya Dryomov
@ 2014-08-08 17:46 ` Sage Weil
2014-08-08 20:59 ` Alex Elder
1 sibling, 0 replies; 3+ messages in thread
From: Sage Weil @ 2014-08-08 17:46 UTC (permalink / raw)
To: Ilya Dryomov; +Cc: ceph-devel
On Fri, 8 Aug 2014, Ilya Dryomov wrote:
> Determining ->last_piece based on the value of ->page_offset + length
> is incorrect because length here is the length of the entire message.
> ->last_piece set to false even if page array data item length is <=
> PAGE_SIZE, which results in invalid length passed to
> ceph_tcp_{send,recv}page() and causes various asserts to fire.
>
> # cat pages-cursor-init.sh
> #!/bin/bash
> rbd create --size 10 --image-format 2 foo
> FOO_DEV=$(rbd map foo)
> dd if=/dev/urandom of=$FOO_DEV bs=1M &>/dev/null
> rbd snap create foo@snap
> rbd snap protect foo@snap
> rbd clone foo@snap bar
> # rbd_resize calls librbd rbd_resize(), size is in bytes
> ./rbd_resize bar $(((4 << 20) + 512))
> rbd resize --size 10 bar
> BAR_DEV=$(rbd map bar)
> # trigger a 512-byte copyup -- 512-byte page array data item
> dd if=/dev/urandom of=$BAR_DEV bs=1M count=1 seek=5
>
> The problem exists only in ceph_msg_data_pages_cursor_init(),
> ceph_msg_data_pages_advance() does the right thing. The size_t cast is
> unnecessary.
>
> Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
Looks good!
Reviewed-by: Sage Weil <sage@redhat.com>
> ---
> net/ceph/messenger.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/net/ceph/messenger.c b/net/ceph/messenger.c
> index e51cad0db580..b2f571dd933d 100644
> --- a/net/ceph/messenger.c
> +++ b/net/ceph/messenger.c
> @@ -901,7 +901,7 @@ static void ceph_msg_data_pages_cursor_init(struct ceph_msg_data_cursor *cursor,
> BUG_ON(page_count > (int)USHRT_MAX);
> cursor->page_count = (unsigned short)page_count;
> BUG_ON(length > SIZE_MAX - cursor->page_offset);
> - cursor->last_piece = (size_t)cursor->page_offset + length <= PAGE_SIZE;
> + cursor->last_piece = cursor->page_offset + cursor->resid <= PAGE_SIZE;
> }
>
> static struct page *
> --
> 1.7.10.4
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
>
^ permalink raw reply [flat|nested] 3+ messages in thread* Re: [PATCH] libceph: set last_piece in ceph_msg_data_pages_cursor_init() correctly
2014-08-08 11:18 [PATCH] libceph: set last_piece in ceph_msg_data_pages_cursor_init() correctly Ilya Dryomov
2014-08-08 17:46 ` Sage Weil
@ 2014-08-08 20:59 ` Alex Elder
1 sibling, 0 replies; 3+ messages in thread
From: Alex Elder @ 2014-08-08 20:59 UTC (permalink / raw)
To: Ilya Dryomov, ceph-devel
On 08/08/2014 06:18 AM, Ilya Dryomov wrote:
> Determining ->last_piece based on the value of ->page_offset + length
> is incorrect because length here is the length of the entire message.
> ->last_piece set to false even if page array data item length is <=
> PAGE_SIZE, which results in invalid length passed to
> ceph_tcp_{send,recv}page() and causes various asserts to fire.
I glanced at this this morning and thought it might have
a problem. Now that I've had the time to look at it a
little more closely I conclude it's fine.
I'm going to explain my reasoning, which just reaffirms what
you've explained. (Doing this helps me brush up on things,
I'm starting to get rusty...)
A message has a list of data items, and a data item is
broken into pieces, each of which (for a pages data item)
holds data from a single page. A cursor keeps the data item
now being processed in cursor->data, and cursor->resid is the
number of un-consumed bytes in the current data item. We want
to set cursor->last_piece when the next piece to process is
the last in the *data item*, not the last in the *message*,
so we need to compare against cursor->resid, not the message
length.
Before your change, any message with more than one data
item including a page array data item that was not last
in the list would have problems.
So in summary: Looks good.
Reviewed-by: Alex Elder <elder@linaro.org>
> # cat pages-cursor-init.sh
> #!/bin/bash
> rbd create --size 10 --image-format 2 foo
> FOO_DEV=$(rbd map foo)
> dd if=/dev/urandom of=$FOO_DEV bs=1M &>/dev/null
> rbd snap create foo@snap
> rbd snap protect foo@snap
> rbd clone foo@snap bar
> # rbd_resize calls librbd rbd_resize(), size is in bytes
> ./rbd_resize bar $(((4 << 20) + 512))
> rbd resize --size 10 bar
> BAR_DEV=$(rbd map bar)
> # trigger a 512-byte copyup -- 512-byte page array data item
> dd if=/dev/urandom of=$BAR_DEV bs=1M count=1 seek=5
>
> The problem exists only in ceph_msg_data_pages_cursor_init(),
> ceph_msg_data_pages_advance() does the right thing. The size_t cast is
> unnecessary.
>
> Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
> ---
> net/ceph/messenger.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/net/ceph/messenger.c b/net/ceph/messenger.c
> index e51cad0db580..b2f571dd933d 100644
> --- a/net/ceph/messenger.c
> +++ b/net/ceph/messenger.c
> @@ -901,7 +901,7 @@ static void ceph_msg_data_pages_cursor_init(struct ceph_msg_data_cursor *cursor,
> BUG_ON(page_count > (int)USHRT_MAX);
> cursor->page_count = (unsigned short)page_count;
> BUG_ON(length > SIZE_MAX - cursor->page_offset);
> - cursor->last_piece = (size_t)cursor->page_offset + length <= PAGE_SIZE;
> + cursor->last_piece = cursor->page_offset + cursor->resid <= PAGE_SIZE;
> }
>
> static struct page *
>
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2014-08-08 20:59 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-08-08 11:18 [PATCH] libceph: set last_piece in ceph_msg_data_pages_cursor_init() correctly Ilya Dryomov
2014-08-08 17:46 ` Sage Weil
2014-08-08 20:59 ` Alex Elder
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.