public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Guoqing Jiang <guoqing.jiang@linux.dev>
To: Zhu Yanjun <zyjzyj2000@gmail.com>
Cc: syzbot <syzbot+eba589d8f49c73d356da@syzkaller.appspotmail.com>,
	jgg@ziepe.ca, leon@kernel.org, linux-kernel@vger.kernel.org,
	linux-rdma@vger.kernel.org, netdev@vger.kernel.org,
	syzkaller-bugs@googlegroups.com
Subject: Re: [syzbot] [rdma?] INFO: trying to register non-static key in skb_dequeue (2)
Date: Tue, 23 May 2023 13:56:51 +0800	[thread overview]
Message-ID: <e2a3a27e-9c12-f180-4bb6-1906aa1a1844@linux.dev> (raw)
In-Reply-To: <CAD=hENc72B+gLLd_Xn7w8bd_qDw=mFd5sC0RKEsHpNA=85a9KA@mail.gmail.com>



On 5/23/23 13:52, Zhu Yanjun wrote:
> On Tue, May 23, 2023 at 1:44 PM Guoqing Jiang <guoqing.jiang@linux.dev> wrote:
>>
>>
>> On 5/23/23 13:18, Zhu Yanjun wrote:
>>> On Tue, May 23, 2023 at 1:08 PM Zhu Yanjun <zyjzyj2000@gmail.com> wrote:
>>>> On Tue, May 23, 2023 at 12:29 PM Zhu Yanjun <zyjzyj2000@gmail.com> wrote:
>>>>> On Tue, May 23, 2023 at 12:10 PM Guoqing Jiang <guoqing.jiang@linux.dev> wrote:
>>>>>>
>>>>>> On 5/23/23 12:02, Zhu Yanjun wrote:
>>>>>>> On Tue, May 23, 2023 at 11:47 AM Zhu Yanjun <zyjzyj2000@gmail.com> wrote:
>>>>>>>> On Tue, May 23, 2023 at 10:26 AM Guoqing Jiang <guoqing.jiang@linux.dev> wrote:
>>>>>>>>> On 5/23/23 10:13, syzbot wrote:
>>>>>>>>>> Hello,
>>>>>>>>>>
>>>>>>>>>> syzbot tried to test the proposed patch but the build/boot failed:
>>>>>>>>>>
>>>>>>>>>> failed to apply patch:
>>>>>>>>>> checking file drivers/infiniband/sw/rxe/rxe_qp.c
>>>>>>>>>> patch: **** unexpected end of file in patch
>>>>>>>> This is not the root cause. The fix is not good.
>>>>>>> This problem is about "INFO: trying to register non-static key. The
>>>>>>> code is fine but needs lockdep annotation, or maybe"
>>>>> This warning is from "lock is not initialized". This is a
>>>>> use-before-initialized problem.
>>>>> The correct fix is to initialize the lock that is complained before it is used.
>>>>>
>>>>> Zhu Yanjun
>>>> Based on the call trace, the followings are the order of this call trace.
>>>>
>>>> 291 /* called by the create qp verb */
>>>> 292 int rxe_qp_from_init(struct rxe_dev *rxe, struct rxe_qp *qp,
>>>> struct rxe_pd *pd,
>>>> 297 {
>>>>               ...
>>>> 317         rxe_qp_init_misc(rxe, qp, init);
>>>>               ...
>>>> 322
>>>> 323         err = rxe_qp_init_resp(rxe, qp, init, udata, uresp);
>>>> 324         if (err)
>>>> 325                 goto err2;   <--- error
>>>>
>>>>               ...
>>>>
>>>> 334 err2:
>>>> 335         rxe_queue_cleanup(qp->sq.queue); <--- Goto here
>>>> 336         qp->sq.queue = NULL;
>>>>
>>>> In rxe_qp_init_resp, the error occurs before skb_queue_head_init.
>>>> So this call trace appeared.
>>> 250 static int rxe_qp_init_resp(struct rxe_dev *rxe, struct rxe_qp *qp,
>>> 254 {
>>>                           ...
>>> 264
>>> 265                 type = QUEUE_TYPE_FROM_CLIENT;
>>> 266                 qp->rq.queue = rxe_queue_init(rxe, &qp->rq.max_wr,
>>> 267                                         wqe_size, type);
>>> 268                 if (!qp->rq.queue)
>>> 269                         return -ENOMEM;    <---Error here
>>> 270
>>>
>>> ...
>>>
>>> 282         skb_queue_head_init(&qp->resp_pkts); <-this is not called.
>>> ...
>>> This will make spin_lock of resp_pkts is used before initialized.
>> IMHO, the above is same as
>>
>>> Which is caused by  "skb_queue_head_init(&qp->resp_pkts)" is not called
>>> given rxe_qp_init_resp returns error, but the cleanup still trigger the
>>> chain.
>>>
>>> rxe_qp_do_cleanup -> rxe_completer -> drain_resp_pkts ->
>>> skb_dequeue(&qp->resp_pkts)
>> my previous analysis. If not, could you provide another better way to
>> fix it?
> Move the initialization to the beginning. This can fix this problem.
> See below:
>
> "
> diff --git a/drivers/infiniband/sw/rxe/rxe_qp.c
> b/drivers/infiniband/sw/rxe/rxe_qp.c
> index c5451a4488ca..22ef6188d7b1 100644
> --- a/drivers/infiniband/sw/rxe/rxe_qp.c
> +++ b/drivers/infiniband/sw/rxe/rxe_qp.c
> @@ -176,6 +176,9 @@ static void rxe_qp_init_misc(struct rxe_dev *rxe,
> struct rxe_qp *qp,
>          spin_lock_init(&qp->rq.producer_lock);
>          spin_lock_init(&qp->rq.consumer_lock);
>
> +       skb_queue_head_init(&qp->req_pkts);
> +       skb_queue_head_init(&qp->resp_pkts);
> +
>          atomic_set(&qp->ssn, 0);
>          atomic_set(&qp->skb_out, 0);
>   }
> @@ -234,8 +237,6 @@ static int rxe_qp_init_req(struct rxe_dev *rxe,
> struct rxe_qp *qp,
>          qp->req.opcode          = -1;
>          qp->comp.opcode         = -1;
>
> -       skb_queue_head_init(&qp->req_pkts);
> -
>          rxe_init_task(&qp->req.task, qp, rxe_requester);
>          rxe_init_task(&qp->comp.task, qp, rxe_completer);
>
> @@ -279,8 +280,6 @@ static int rxe_qp_init_resp(struct rxe_dev *rxe,
> struct rxe_qp *qp,
>                  }
>          }
>
> -       skb_queue_head_init(&qp->resp_pkts);
> -
>          rxe_init_task(&qp->resp.task, qp, rxe_responder);
>
>          qp->resp.opcode         = OPCODE_NONE;
> "

It is weird to me that init them in init_misc instead of init_req/resp, 
given they
are dedicated/used for the different purpose. But just my 0.02$.

Guoqing

  reply	other threads:[~2023-05-23  5:57 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-05-03  8:37 [syzbot] [rdma?] INFO: trying to register non-static key in skb_dequeue (2) syzbot
2023-05-18  9:20 ` syzbot
2023-05-23  2:07   ` Guoqing Jiang
2023-05-23  2:13     ` syzbot
2023-05-23  2:25       ` Guoqing Jiang
2023-05-23  2:55         ` syzbot
2023-05-23  3:47         ` Zhu Yanjun
2023-05-23  3:58           ` Guoqing Jiang
2023-05-23  4:02           ` Zhu Yanjun
2023-05-23  4:10             ` Guoqing Jiang
2023-05-23  4:29               ` Zhu Yanjun
2023-05-23  5:08                 ` Zhu Yanjun
2023-05-23  5:18                   ` Zhu Yanjun
2023-05-23  5:44                     ` Guoqing Jiang
2023-05-23  5:52                       ` Zhu Yanjun
2023-05-23  5:56                         ` Guoqing Jiang [this message]
2023-05-23  6:04                           ` Zhu Yanjun
2023-05-23  5:50                 ` Guoqing Jiang
2023-05-23  5:55                   ` Zhu Yanjun
2023-05-23  6:00                     ` Guoqing Jiang
2023-05-23  6:07                       ` Zhu Yanjun
2023-05-23  6:11                         ` Guoqing Jiang
2023-05-23  6:40                           ` Zhu Yanjun
     [not found] <20230518112255.4516-1-hdanton@sina.com>
2023-05-18 11:44 ` syzbot

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=e2a3a27e-9c12-f180-4bb6-1906aa1a1844@linux.dev \
    --to=guoqing.jiang@linux.dev \
    --cc=jgg@ziepe.ca \
    --cc=leon@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-rdma@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=syzbot+eba589d8f49c73d356da@syzkaller.appspotmail.com \
    --cc=syzkaller-bugs@googlegroups.com \
    --cc=zyjzyj2000@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox