* [PATCH for-next] RDMA/rxe: Fix "Replace red-black trees by xarrays"
@ 2022-04-10 22:39 Bob Pearson
2022-04-11 3:06 ` Bart Van Assche
0 siblings, 1 reply; 5+ messages in thread
From: Bob Pearson @ 2022-04-10 22:39 UTC (permalink / raw)
To: bvanassche, jgg, zyjzyj2000, linux-rdma, linux-scsi, yi.zhang; +Cc: Bob Pearson
The referenced commit causes lockdep warnings by using the
default spin_lock in xa_alloc_cyclic and xa_erase which
include calls to xa_lock()/xa_unlock() while at the same time
explicitly calling xa_lock_irqsave() in rxe_pool_get_index().
The latter is required to handle some object lookups correctly. The
immediate fix is to explicitly use xa_lock_irqsave() everywhere.
This commit replaces xa_alloc_cyclic() by __xa_alloc_cyclic() and
xa_erase() by __xa_erase() and explicitly lock these calls with
xa_lock_irqsave().
This commit will be reverted later when the read side operations
in rxe_pool.c will be converted to rcu_read_locks which will not
require locking the write side operations with irqsave locks.
This commit fixes the "WARNING: Inconsistent lock state" bug in
blktests. The recent revert patch from Bart fixes the other
bug in blktests with very long delays.
Fixes: 3225717f6dfa ("RDMA/rxe: Replace red-black trees by carrays")
Signed-off-by: Bob Pearson <rpearsonhpe@gmail.com>
---
drivers/infiniband/sw/rxe/rxe_pool.c | 18 +++++++++++++++---
1 file changed, 15 insertions(+), 3 deletions(-)
diff --git a/drivers/infiniband/sw/rxe/rxe_pool.c b/drivers/infiniband/sw/rxe/rxe_pool.c
index 87066d04ed18..440f96af213b 100644
--- a/drivers/infiniband/sw/rxe/rxe_pool.c
+++ b/drivers/infiniband/sw/rxe/rxe_pool.c
@@ -118,7 +118,9 @@ void rxe_pool_cleanup(struct rxe_pool *pool)
void *rxe_alloc(struct rxe_pool *pool)
{
+ struct xarray *xa = &pool->xa;
struct rxe_pool_elem *elem;
+ unsigned long flags;
void *obj;
int err;
@@ -138,8 +140,10 @@ void *rxe_alloc(struct rxe_pool *pool)
elem->obj = obj;
kref_init(&elem->ref_cnt);
- err = xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
+ xa_lock_irqsave(xa, flags);
+ err = __xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
&pool->next, GFP_KERNEL);
+ xa_unlock_irqrestore(xa, flags);
if (err)
goto err_free;
@@ -154,6 +158,8 @@ void *rxe_alloc(struct rxe_pool *pool)
int __rxe_add_to_pool(struct rxe_pool *pool, struct rxe_pool_elem *elem)
{
+ struct xarray *xa = &pool->xa;
+ unsigned long flags;
int err;
if (WARN_ON(pool->flags & RXE_POOL_ALLOC))
@@ -166,8 +172,10 @@ int __rxe_add_to_pool(struct rxe_pool *pool, struct rxe_pool_elem *elem)
elem->obj = (u8 *)elem - pool->elem_offset;
kref_init(&elem->ref_cnt);
- err = xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
+ xa_lock_irqsave(xa, flags);
+ err = __xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
&pool->next, GFP_KERNEL);
+ xa_unlock_irqrestore(xa, flags);
if (err)
goto err_cnt;
@@ -200,8 +208,12 @@ static void rxe_elem_release(struct kref *kref)
{
struct rxe_pool_elem *elem = container_of(kref, typeof(*elem), ref_cnt);
struct rxe_pool *pool = elem->pool;
+ struct xarray *xa = &pool->xa;
+ unsigned long flags;
- xa_erase(&pool->xa, elem->index);
+ xa_lock_irqsave(xa, flags);
+ __xa_erase(&pool->xa, elem->index);
+ xa_unlock_irqrestore(xa, flags);
if (pool->cleanup)
pool->cleanup(elem);
--
2.32.0
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH for-next] RDMA/rxe: Fix "Replace red-black trees by xarrays"
2022-04-10 22:39 [PATCH for-next] RDMA/rxe: Fix "Replace red-black trees by xarrays" Bob Pearson
@ 2022-04-11 3:06 ` Bart Van Assche
2022-04-11 3:13 ` Bob Pearson
2022-04-11 3:15 ` Zhu Yanjun
0 siblings, 2 replies; 5+ messages in thread
From: Bart Van Assche @ 2022-04-11 3:06 UTC (permalink / raw)
To: Bob Pearson, jgg, zyjzyj2000, linux-rdma, linux-scsi, yi.zhang
On 4/10/22 15:39, Bob Pearson wrote:
> Fixes: 3225717f6dfa ("RDMA/rxe: Replace red-black trees by carrays")
^^^^^^^
xarrays?
> @@ -138,8 +140,10 @@ void *rxe_alloc(struct rxe_pool *pool)
> elem->obj = obj;
> kref_init(&elem->ref_cnt);
>
> - err = xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
> + xa_lock_irqsave(xa, flags);
> + err = __xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
> &pool->next, GFP_KERNEL);
> + xa_unlock_irqrestore(xa, flags);
Please take a look at the xas_unlock_type() and xas_lock_type() calls in
__xas_nomem(). I think that the above change will trigger a GFP_KERNEL
allocation with interrupts disabled. My understanding is that GFP_KERNEL
allocations may sleep and hence that the above code may cause
__xas_nomem() to sleep with interrupts disabled. I don't think that is
allowed.
Thanks,
Bart.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH for-next] RDMA/rxe: Fix "Replace red-black trees by xarrays"
2022-04-11 3:06 ` Bart Van Assche
@ 2022-04-11 3:13 ` Bob Pearson
2022-04-11 11:38 ` Jason Gunthorpe
2022-04-11 3:15 ` Zhu Yanjun
1 sibling, 1 reply; 5+ messages in thread
From: Bob Pearson @ 2022-04-11 3:13 UTC (permalink / raw)
To: Bart Van Assche, jgg, zyjzyj2000, linux-rdma, linux-scsi,
yi.zhang
On 4/10/22 22:06, Bart Van Assche wrote:
> On 4/10/22 15:39, Bob Pearson wrote:
>> Fixes: 3225717f6dfa ("RDMA/rxe: Replace red-black trees by carrays")
> ^^^^^^^
> xarrays?
>
>> @@ -138,8 +140,10 @@ void *rxe_alloc(struct rxe_pool *pool)
>> elem->obj = obj;
>> kref_init(&elem->ref_cnt);
>> - err = xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
>> + xa_lock_irqsave(xa, flags);
>> + err = __xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
>> &pool->next, GFP_KERNEL);
>> + xa_unlock_irqrestore(xa, flags);
>
> Please take a look at the xas_unlock_type() and xas_lock_type() calls in __xas_nomem(). I think that the above change will trigger a GFP_KERNEL allocation with interrupts disabled. My understanding is that GFP_KERNEL allocations may sleep and hence that the above code may cause __xas_nomem() to sleep with interrupts disabled. I don't think that is allowed.
>
> Thanks,
>
> Bart.
You're right. I missed that. Zhu wants to write the patch so hopefully he's on top of that.
For now we could use GFP_ATOMIC.
Bob
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH for-next] RDMA/rxe: Fix "Replace red-black trees by xarrays"
2022-04-11 3:06 ` Bart Van Assche
2022-04-11 3:13 ` Bob Pearson
@ 2022-04-11 3:15 ` Zhu Yanjun
1 sibling, 0 replies; 5+ messages in thread
From: Zhu Yanjun @ 2022-04-11 3:15 UTC (permalink / raw)
To: Bart Van Assche
Cc: Bob Pearson, Jason Gunthorpe, RDMA mailing list, linux-scsi,
Yi Zhang
On Mon, Apr 11, 2022 at 11:06 AM Bart Van Assche <bvanassche@acm.org> wrote:
>
> On 4/10/22 15:39, Bob Pearson wrote:
> > Fixes: 3225717f6dfa ("RDMA/rxe: Replace red-black trees by carrays")
> ^^^^^^^
> xarrays?
>
> > @@ -138,8 +140,10 @@ void *rxe_alloc(struct rxe_pool *pool)
> > elem->obj = obj;
> > kref_init(&elem->ref_cnt);
> >
> > - err = xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
> > + xa_lock_irqsave(xa, flags);
> > + err = __xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
> > &pool->next, GFP_KERNEL);
> > + xa_unlock_irqrestore(xa, flags);
>
> Please take a look at the xas_unlock_type() and xas_lock_type() calls in
> __xas_nomem(). I think that the above change will trigger a GFP_KERNEL
> allocation with interrupts disabled. My understanding is that GFP_KERNEL
> allocations may sleep and hence that the above code may cause
> __xas_nomem() to sleep with interrupts disabled. I don't think that is
> allowed.
Thanks. I will send V2.
Zhu Yanjun
>
> Thanks,
>
> Bart.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH for-next] RDMA/rxe: Fix "Replace red-black trees by xarrays"
2022-04-11 3:13 ` Bob Pearson
@ 2022-04-11 11:38 ` Jason Gunthorpe
0 siblings, 0 replies; 5+ messages in thread
From: Jason Gunthorpe @ 2022-04-11 11:38 UTC (permalink / raw)
To: Bob Pearson; +Cc: Bart Van Assche, zyjzyj2000, linux-rdma, linux-scsi, yi.zhang
On Sun, Apr 10, 2022 at 10:13:16PM -0500, Bob Pearson wrote:
> On 4/10/22 22:06, Bart Van Assche wrote:
> > On 4/10/22 15:39, Bob Pearson wrote:
> >> Fixes: 3225717f6dfa ("RDMA/rxe: Replace red-black trees by carrays")
> > ^^^^^^^
> > xarrays?
> >
> >> @@ -138,8 +140,10 @@ void *rxe_alloc(struct rxe_pool *pool)
> >> elem->obj = obj;
> >> kref_init(&elem->ref_cnt);
> >> - err = xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
> >> + xa_lock_irqsave(xa, flags);
> >> + err = __xa_alloc_cyclic(&pool->xa, &elem->index, elem, pool->limit,
> >> &pool->next, GFP_KERNEL);
> >> + xa_unlock_irqrestore(xa, flags);
> >
> > Please take a look at the xas_unlock_type() and xas_lock_type() calls in __xas_nomem(). I think that the above change will trigger a GFP_KERNEL allocation with interrupts disabled. My understanding is that GFP_KERNEL allocations may sleep and hence that the above code may cause __xas_nomem() to sleep with interrupts disabled. I don't think that is allowed.
> >
> > Thanks,
> >
> > Bart.
>
> You're right. I missed that. Zhu wants to write the patch so hopefully he's on top of that.
> For now we could use GFP_ATOMIC.
Yes, you cannot use irq_save varients here. You have to know your
calling context is non-atomic already and use the irq wrapper.
Jason
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2022-04-11 11:38 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-04-10 22:39 [PATCH for-next] RDMA/rxe: Fix "Replace red-black trees by xarrays" Bob Pearson
2022-04-11 3:06 ` Bart Van Assche
2022-04-11 3:13 ` Bob Pearson
2022-04-11 11:38 ` Jason Gunthorpe
2022-04-11 3:15 ` Zhu Yanjun
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox