* [PATCH] rbd: revalidate_disk upon rbd resize
@ 2013-04-10 16:06 Laurent Barbe
2013-04-10 19:30 ` Alex Elder
0 siblings, 1 reply; 11+ messages in thread
From: Laurent Barbe @ 2013-04-10 16:06 UTC (permalink / raw)
To: ceph-devel; +Cc: linux-kernel, Laurent Barbe
If rbd disk is open and rbd resize is done, new size is not visible by
filesystem.
Like is done in virtio-blk and dm driver, revalidate_disk() permits to
update the bd_inode size.
Signed-off-by: Laurent Barbe <laurent@ksperis.com>
---
drivers/block/rbd.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/drivers/block/rbd.c b/drivers/block/rbd.c
index f556f8a..1963025 100644
--- a/drivers/block/rbd.c
+++ b/drivers/block/rbd.c
@@ -2293,6 +2293,7 @@ static void rbd_update_mapping_size(struct rbd_device *rbd_dev)
dout("setting size to %llu sectors", (unsigned long long) size);
rbd_dev->mapping.size = (u64) size;
set_capacity(rbd_dev->disk, size);
+ revalidate_disk(rbd_dev->disk);
}
/*
--
1.7.10.4
^ permalink raw reply related [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-10 16:06 Laurent Barbe
@ 2013-04-10 19:30 ` Alex Elder
2013-04-11 2:53 ` Alex Elder
0 siblings, 1 reply; 11+ messages in thread
From: Alex Elder @ 2013-04-10 19:30 UTC (permalink / raw)
To: Laurent Barbe; +Cc: ceph-devel, linux-kernel
On 04/10/2013 11:06 AM, Laurent Barbe wrote:
> If rbd disk is open and rbd resize is done, new size is not visible by
> filesystem.
> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
> update the bd_inode size.
Looks good to me. I'll take this in via the ceph-client tree.
Thanks a lot.
Reviewed-by: Alex Elder <elder@inktank.com>
> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
> ---
> drivers/block/rbd.c | 1 +
> 1 file changed, 1 insertion(+)
>
> diff --git a/drivers/block/rbd.c b/drivers/block/rbd.c
> index f556f8a..1963025 100644
> --- a/drivers/block/rbd.c
> +++ b/drivers/block/rbd.c
> @@ -2293,6 +2293,7 @@ static void rbd_update_mapping_size(struct rbd_device *rbd_dev)
> dout("setting size to %llu sectors", (unsigned long long) size);
> rbd_dev->mapping.size = (u64) size;
> set_capacity(rbd_dev->disk, size);
> + revalidate_disk(rbd_dev->disk);
> }
>
> /*
>
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-10 19:30 ` Alex Elder
@ 2013-04-11 2:53 ` Alex Elder
[not found] ` <CADksbcpbhp8-Db2bqQc_fFwmT-iObYdjK+rXJF5DiZ9wu1sSQA@mail.gmail.com>
0 siblings, 1 reply; 11+ messages in thread
From: Alex Elder @ 2013-04-11 2:53 UTC (permalink / raw)
To: Laurent Barbe; +Cc: ceph-devel, linux-kernel
On 04/10/2013 02:30 PM, Alex Elder wrote:
> On 04/10/2013 11:06 AM, Laurent Barbe wrote:
>> If rbd disk is open and rbd resize is done, new size is not visible by
>> filesystem.
>> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
>> update the bd_inode size.
>
> Looks good to me. I'll take this in via the ceph-client tree.
> Thanks a lot.
Unfortunately this leads to a lock inversion. I'm going to
think about how to go about resolving it, so I won't be
committing it just yet.
-Alex
>
> Reviewed-by: Alex Elder <elder@inktank.com>
>
>> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
>> ---
>> drivers/block/rbd.c | 1 +
>> 1 file changed, 1 insertion(+)
>>
>> diff --git a/drivers/block/rbd.c b/drivers/block/rbd.c
>> index f556f8a..1963025 100644
>> --- a/drivers/block/rbd.c
>> +++ b/drivers/block/rbd.c
>> @@ -2293,6 +2293,7 @@ static void rbd_update_mapping_size(struct rbd_device *rbd_dev)
>> dout("setting size to %llu sectors", (unsigned long long) size);
>> rbd_dev->mapping.size = (u64) size;
>> set_capacity(rbd_dev->disk, size);
>> + revalidate_disk(rbd_dev->disk);
>> }
>>
>> /*
>>
>
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
[not found] ` <CADksbcpbhp8-Db2bqQc_fFwmT-iObYdjK+rXJF5DiZ9wu1sSQA@mail.gmail.com>
@ 2013-04-11 13:27 ` Alex Elder
2013-04-11 13:59 ` Laurent Barbe
0 siblings, 1 reply; 11+ messages in thread
From: Alex Elder @ 2013-04-11 13:27 UTC (permalink / raw)
To: Laurent Barbe; +Cc: ceph-devel, linux-kernel
On 04/11/2013 03:47 AM, Laurent Barbe wrote:
> Thanks Alex,
>
> What do you mean about "lock inversion", deadlock ?
> Is it the mutex_lock in block_dev.c:revalidate_disk() and the mutex
> in rbd_dev_refresh ?
Yes.
When we refresh an rbd device we get the rbd device header
semaphore, and with this patch we then acquire the bdev's
mutex.
Meanhwhile, via blkdev_close() __blkdev_put() acquires the
bdev->bd_mutex, and eventually we get down to creating an
image object request. If it's a write, we need to get the
snapshot context, and to do that we get the rbd device
header mutex.
So we're acquiring the locks in two different orders, and
that's what I meant by "lock inversion," and yes, that
can lead to deadlock.
There may be a simple fix--like holding off calling
revalidate_disk() until after we release the lock,
most likely in rbd_dev_refresh(). But I just haven't
really thought it through yet.
-Alex
>
> 2013/4/11 Alex Elder <elder@inktank.com>
>
>> On 04/10/2013 02:30 PM, Alex Elder wrote:
>>> On 04/10/2013 11:06 AM, Laurent Barbe wrote:
>>>> If rbd disk is open and rbd resize is done, new size is not visible by
>>>> filesystem.
>>>> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
>>>> update the bd_inode size.
>>>
>>> Looks good to me. I'll take this in via the ceph-client tree.
>>> Thanks a lot.
>>
>> Unfortunately this leads to a lock inversion. I'm going to
>> think about how to go about resolving it, so I won't be
>> committing it just yet.
>>
>> -Alex
>>
>>>
>>> Reviewed-by: Alex Elder <elder@inktank.com>
>>>
>>>> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
>>>> ---
>>>> drivers/block/rbd.c | 1 +
>>>> 1 file changed, 1 insertion(+)
>>>>
>>>> diff --git a/drivers/block/rbd.c b/drivers/block/rbd.c
>>>> index f556f8a..1963025 100644
>>>> --- a/drivers/block/rbd.c
>>>> +++ b/drivers/block/rbd.c
>>>> @@ -2293,6 +2293,7 @@ static void rbd_update_mapping_size(struct
>> rbd_device *rbd_dev)
>>>> dout("setting size to %llu sectors", (unsigned long long) size);
>>>> rbd_dev->mapping.size = (u64) size;
>>>> set_capacity(rbd_dev->disk, size);
>>>> + revalidate_disk(rbd_dev->disk);
>>>> }
>>>>
>>>> /*
>>>>
>>>
>>
>>
>
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-11 13:27 ` Alex Elder
@ 2013-04-11 13:59 ` Laurent Barbe
0 siblings, 0 replies; 11+ messages in thread
From: Laurent Barbe @ 2013-04-11 13:59 UTC (permalink / raw)
To: Alex Elder; +Cc: ceph-devel, linux-kernel
Ok,
thank you for your explanation, I'll look a little later.
2013/4/11 Alex Elder <elder@inktank.com>:
> On 04/11/2013 03:47 AM, Laurent Barbe wrote:
>> Thanks Alex,
>>
>> What do you mean about "lock inversion", deadlock ?
>> Is it the mutex_lock in block_dev.c:revalidate_disk() and the mutex
>> in rbd_dev_refresh ?
>
> Yes.
>
> When we refresh an rbd device we get the rbd device header
> semaphore, and with this patch we then acquire the bdev's
> mutex.
>
> Meanhwhile, via blkdev_close() __blkdev_put() acquires the
> bdev->bd_mutex, and eventually we get down to creating an
> image object request. If it's a write, we need to get the
> snapshot context, and to do that we get the rbd device
> header mutex.
>
> So we're acquiring the locks in two different orders, and
> that's what I meant by "lock inversion," and yes, that
> can lead to deadlock.
>
> There may be a simple fix--like holding off calling
> revalidate_disk() until after we release the lock,
> most likely in rbd_dev_refresh(). But I just haven't
> really thought it through yet.
>
> -Alex
>
>>
>> 2013/4/11 Alex Elder <elder@inktank.com>
>>
>>> On 04/10/2013 02:30 PM, Alex Elder wrote:
>>>> On 04/10/2013 11:06 AM, Laurent Barbe wrote:
>>>>> If rbd disk is open and rbd resize is done, new size is not visible by
>>>>> filesystem.
>>>>> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
>>>>> update the bd_inode size.
>>>>
>>>> Looks good to me. I'll take this in via the ceph-client tree.
>>>> Thanks a lot.
>>>
>>> Unfortunately this leads to a lock inversion. I'm going to
>>> think about how to go about resolving it, so I won't be
>>> committing it just yet.
>>>
>>> -Alex
>>>
>>>>
>>>> Reviewed-by: Alex Elder <elder@inktank.com>
>>>>
>>>>> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
>>>>> ---
>>>>> drivers/block/rbd.c | 1 +
>>>>> 1 file changed, 1 insertion(+)
>>>>>
>>>>> diff --git a/drivers/block/rbd.c b/drivers/block/rbd.c
>>>>> index f556f8a..1963025 100644
>>>>> --- a/drivers/block/rbd.c
>>>>> +++ b/drivers/block/rbd.c
>>>>> @@ -2293,6 +2293,7 @@ static void rbd_update_mapping_size(struct
>>> rbd_device *rbd_dev)
>>>>> dout("setting size to %llu sectors", (unsigned long long) size);
>>>>> rbd_dev->mapping.size = (u64) size;
>>>>> set_capacity(rbd_dev->disk, size);
>>>>> + revalidate_disk(rbd_dev->disk);
>>>>> }
>>>>>
>>>>> /*
>>>>>
>>>>
>>>
>>>
>>
>
^ permalink raw reply [flat|nested] 11+ messages in thread
* [PATCH] rbd: revalidate_disk upon rbd resize
@ 2013-04-12 13:52 Laurent Barbe
2013-04-12 13:59 ` Laurent Barbe
2013-04-15 17:24 ` Alex Elder
0 siblings, 2 replies; 11+ messages in thread
From: Laurent Barbe @ 2013-04-12 13:52 UTC (permalink / raw)
To: elder; +Cc: ceph-devel, Laurent Barbe
If rbd disk is open and rbd resize is done, new size is not visible by
filesystem.
Like is done in virtio-blk and dm driver, revalidate_disk() permits to
update the bd_inode size.
Signed-off-by: Laurent Barbe <laurent@ksperis.com>
---
drivers/block/rbd.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/drivers/block/rbd.c b/drivers/block/rbd.c
index 4daa400..5f88de9 100644
--- a/drivers/block/rbd.c
+++ b/drivers/block/rbd.c
@@ -1883,6 +1883,7 @@ static int rbd_dev_refresh(struct rbd_device *rbd_dev, u64 *hver)
else
ret = rbd_dev_v2_refresh(rbd_dev, hver);
mutex_unlock(&ctl_mutex);
+ revalidate_disk(rbd_dev->disk);
return ret;
}
--
1.7.10.4
^ permalink raw reply related [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-12 13:52 [PATCH] rbd: revalidate_disk upon rbd resize Laurent Barbe
@ 2013-04-12 13:59 ` Laurent Barbe
2013-04-15 17:24 ` Alex Elder
1 sibling, 0 replies; 11+ messages in thread
From: Laurent Barbe @ 2013-04-12 13:59 UTC (permalink / raw)
To: Alex Elder; +Cc: ceph-devel, Laurent Barbe
Hello,
I have juste move revalidate_disk() in rbd_dev_refresh() outside the mutex.
It seems ok for me.
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-12 13:52 [PATCH] rbd: revalidate_disk upon rbd resize Laurent Barbe
2013-04-12 13:59 ` Laurent Barbe
@ 2013-04-15 17:24 ` Alex Elder
2013-04-22 17:10 ` Alex Elder
1 sibling, 1 reply; 11+ messages in thread
From: Alex Elder @ 2013-04-15 17:24 UTC (permalink / raw)
To: Laurent Barbe; +Cc: ceph-devel
On 04/12/2013 08:52 AM, Laurent Barbe wrote:
> If rbd disk is open and rbd resize is done, new size is not visible by
> filesystem.
> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
> update the bd_inode size.
>
> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
This one looks good (too). And this time I've tested
it just to be sure I get no lockdep warnings. Unless
I see any strangeness in my sanity testing (which I
don't expect) I'll commit this to the testing branch
later today. Thanks a lot.
Reviewed-by: Alex Elder <elder@inktank.com>
> ---
> drivers/block/rbd.c | 1 +
> 1 file changed, 1 insertion(+)
>
> diff --git a/drivers/block/rbd.c b/drivers/block/rbd.c
> index 4daa400..5f88de9 100644
> --- a/drivers/block/rbd.c
> +++ b/drivers/block/rbd.c
> @@ -1883,6 +1883,7 @@ static int rbd_dev_refresh(struct rbd_device *rbd_dev, u64 *hver)
> else
> ret = rbd_dev_v2_refresh(rbd_dev, hver);
> mutex_unlock(&ctl_mutex);
> + revalidate_disk(rbd_dev->disk);
>
> return ret;
> }
>
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-15 17:24 ` Alex Elder
@ 2013-04-22 17:10 ` Alex Elder
2013-04-22 17:14 ` Sage Weil
0 siblings, 1 reply; 11+ messages in thread
From: Alex Elder @ 2013-04-22 17:10 UTC (permalink / raw)
To: Laurent Barbe; +Cc: ceph-devel
On 04/15/2013 12:24 PM, Alex Elder wrote:
> On 04/12/2013 08:52 AM, Laurent Barbe wrote:
>> If rbd disk is open and rbd resize is done, new size is not visible by
>> filesystem.
>> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
>> update the bd_inode size.
>>
>> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
>
> This one looks good (too). And this time I've tested
> it just to be sure I get no lockdep warnings. Unless
> I see any strangeness in my sanity testing (which I
> don't expect) I'll commit this to the testing branch
> later today. Thanks a lot.
>
> Reviewed-by: Alex Elder <elder@inktank.com>
I have committed this to the "testing" branch of the
ceph-client git repository.
-Alex
. . .
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-22 17:10 ` Alex Elder
@ 2013-04-22 17:14 ` Sage Weil
2013-04-22 17:35 ` Alex Elder
0 siblings, 1 reply; 11+ messages in thread
From: Sage Weil @ 2013-04-22 17:14 UTC (permalink / raw)
To: Alex Elder; +Cc: Laurent Barbe, ceph-devel
On Mon, 22 Apr 2013, Alex Elder wrote:
> On 04/15/2013 12:24 PM, Alex Elder wrote:
> > On 04/12/2013 08:52 AM, Laurent Barbe wrote:
> >> If rbd disk is open and rbd resize is done, new size is not visible by
> >> filesystem.
> >> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
> >> update the bd_inode size.
> >>
> >> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
> >
> > This one looks good (too). And this time I've tested
> > it just to be sure I get no lockdep warnings. Unless
> > I see any strangeness in my sanity testing (which I
> > don't expect) I'll commit this to the testing branch
> > later today. Thanks a lot.
> >
> > Reviewed-by: Alex Elder <elder@inktank.com>
>
> I have committed this to the "testing" branch of the
> ceph-client git repository.
Is this the one that had the lockdep issue?
sage
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: [PATCH] rbd: revalidate_disk upon rbd resize
2013-04-22 17:14 ` Sage Weil
@ 2013-04-22 17:35 ` Alex Elder
0 siblings, 0 replies; 11+ messages in thread
From: Alex Elder @ 2013-04-22 17:35 UTC (permalink / raw)
To: Sage Weil; +Cc: Laurent Barbe, ceph-devel
On 04/22/2013 12:14 PM, Sage Weil wrote:
> On Mon, 22 Apr 2013, Alex Elder wrote:
>> On 04/15/2013 12:24 PM, Alex Elder wrote:
>>> On 04/12/2013 08:52 AM, Laurent Barbe wrote:
>>>> If rbd disk is open and rbd resize is done, new size is not visible by
>>>> filesystem.
>>>> Like is done in virtio-blk and dm driver, revalidate_disk() permits to
>>>> update the bd_inode size.
>>>>
>>>> Signed-off-by: Laurent Barbe <laurent@ksperis.com>
>>>
>>> This one looks good (too). And this time I've tested
>>> it just to be sure I get no lockdep warnings. Unless
>>> I see any strangeness in my sanity testing (which I
>>> don't expect) I'll commit this to the testing branch
>>> later today. Thanks a lot.
>>>
>>> Reviewed-by: Alex Elder <elder@inktank.com>
>>
>> I have committed this to the "testing" branch of the
>> ceph-client git repository.
>
> Is this the one that had the lockdep issue?
It did, but moving where it got called (into
rbd_dev_refresh(), outside the control mutex)
resolved that.
-Alex
^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2013-04-22 17:35 UTC | newest]
Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-04-12 13:52 [PATCH] rbd: revalidate_disk upon rbd resize Laurent Barbe
2013-04-12 13:59 ` Laurent Barbe
2013-04-15 17:24 ` Alex Elder
2013-04-22 17:10 ` Alex Elder
2013-04-22 17:14 ` Sage Weil
2013-04-22 17:35 ` Alex Elder
-- strict thread matches above, loose matches on Subject: below --
2013-04-10 16:06 Laurent Barbe
2013-04-10 19:30 ` Alex Elder
2013-04-11 2:53 ` Alex Elder
[not found] ` <CADksbcpbhp8-Db2bqQc_fFwmT-iObYdjK+rXJF5DiZ9wu1sSQA@mail.gmail.com>
2013-04-11 13:27 ` Alex Elder
2013-04-11 13:59 ` Laurent Barbe
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.