From: Mark Nelson <mark.nelson@inktank.com>
To: Stefan Priebe <s.priebe@profihost.ag>
Cc: Yehuda Sadeh <yehuda@inktank.com>,
Brian Behlendorf <behlendorf1@llnl.gov>,
Sage Weil <sage@inktank.com>,
Jeff Mitchell <jeffrey.mitchell@gmail.com>,
Henry C Chang <henry.cy.chang@gmail.com>,
Aleksey Leonov <aleonov@nazarianin.com>,
ceph-devel <ceph-devel@vger.kernel.org>
Subject: Re: test osd on zfs
Date: Wed, 17 Apr 2013 15:16:33 -0500
Message-ID: <516F0321.2@inktank.com>
In-Reply-To: <516EF34E.5000000@profihost.ag>
I'll let Brian talk about the virtues of ZFS, but from my perspective
it's an interesting option, as there are a lot of folks banging on it for
NFS servers and it has some interesting capabilities. I have no idea
how well it will work in practice, but if we can show that Ceph can run
on it, at least people can try it out and give us feedback.
Mark
On 04/17/2013 02:09 PM, Stefan Priebe wrote:
> Sorry to disturb, but what is the reason / advantage of using zfs for ceph?
>
> Greets,
> Stefan
> On 17.04.2013 21:07, Yehuda Sadeh wrote:
>> On Wed, Apr 17, 2013 at 11:57 AM, Brian Behlendorf
>> <behlendorf1@llnl.gov> wrote:
>>>
>>> Here's a patch for the ERANGE error (lightly tested). Sage's patch looks
>>> good but only covers one of the two code paths for xattrs. With zfs they
>>> may be stored either as a system attribute, which is usually close to the
>>> dnode on disk (zfs set xattr=sa pool/dataset), or in their own object,
>>> which is how it's implemented on Solaris (zfs set xattr=on pool/dataset).
>>> The second method is still the default for compatibility reasons even
>>> though it's slower. Sage's patch only covered the SA case.
>>>
>>>
>>>> Well, looking at the code again it's not going to work, as setxattr is
>>>> going to fail with ERANGE.
>>>
>>> Why? We support an arbitrary number of maximum-sized xattrs (65536).
>>> What am I missing here?
>>>
>>> Incidentally, does anybody know of a good xattr test suite we could
>>> add to our regression tests?
>>>
>>> Thanks,
>>> Brian
>>>
>>> diff --git a/module/zfs/zpl_xattr.c b/module/zfs/zpl_xattr.c
>>> index c03764f..9f4d63c 100644
>>> --- a/module/zfs/zpl_xattr.c
>>> +++ b/module/zfs/zpl_xattr.c
>>> @@ -225,6 +225,11 @@ zpl_xattr_get_dir(struct inode *ip, const char *name, void *value,
>>>                  goto out;
>>>          }
>>>
>>> +        if (size < i_size_read(xip)) {
>>> +                error = -ERANGE;
>>> +                goto out;
>>> +        }
>>> +
>>>          error = zpl_read_common(xip, value, size, 0, UIO_SYSSPACE, 0, cr);
>>>  out:
>>>          if (xip)
>>> @@ -263,7 +268,10 @@ zpl_xattr_get_sa(struct inode *ip, const char *name, void *value, size_t size)
>>>          if (!size)
>>>                  return (nv_size);
>>>
>>> -        memcpy(value, nv_value, MIN(size, nv_size));
>>>
>>> +        if (size < nv_size)
>>> +                return (-ERANGE);
>>
>> Note that zpl_xattr_get_sa() is called by __zpl_xattr_get(), which can
>> also be called by zpl_xattr_get() to test for xattr existence. So the
>> patch needs to make sure that zpl_xattr_set() doesn't fail when it gets
>> -ERANGE.
>>
>>> +
>>> +        memcpy(value, nv_value, size);
>>>
>>>          return (MIN(size, nv_size));
>>
>> No need for MIN() here.
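
The buffer-size contract discussed in the quoted patch can be sketched in user space. This is only a rough model under stated assumptions, not the ZFS code: xattr_copy_out() and xattr_exists() are hypothetical names, and stored / stored_len stand in for nv_value / nv_size. It shows the three outcomes the thread talks about: a zero-sized request returns the value's length, a buffer that is too small yields -ERANGE instead of a truncated copy, and a large enough buffer receives the whole value. The -ENODATA result for a missing attribute is an assumption added for the existence probe.

```c
#include <errno.h>
#include <stddef.h>
#include <string.h>
#include <sys/types.h>

/*
 * Hypothetical user-space model of the copy-out step; not the ZFS
 * function itself. 'stored' and 'stored_len' stand in for nv_value
 * and nv_size. A NULL 'stored' models a missing attribute (assumption).
 */
static ssize_t
xattr_copy_out(void *value, size_t size, const void *stored, size_t stored_len)
{
	if (stored == NULL)
		return (-ENODATA);

	/* A zero-sized request only asks how large the value is. */
	if (size == 0)
		return ((ssize_t)stored_len);

	/* Refuse to truncate: report ERANGE instead of copying a prefix. */
	if (size < stored_len)
		return (-ERANGE);

	/* Copy exactly the stored length, even if the buffer is larger. */
	memcpy(value, stored, stored_len);
	return ((ssize_t)stored_len);
}

/* Existence probe: ask for the size with no buffer, so ERANGE cannot occur. */
static int
xattr_exists(const void *stored, size_t stored_len)
{
	return (xattr_copy_out(NULL, 0, stored, stored_len) >= 0);
}
```

Because the probe passes a size of zero, it returns before the ERANGE check, which is one way a set path could test for existence without tripping over the new error. Returning stored_len directly also reflects the point that MIN() is no longer needed once the short-buffer case has been rejected.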
>>
>>
>> Yehuda
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>