All of lore.kernel.org
 help / color / mirror / Atom feed
From: Stephen Hemminger <stephen@networkplumber.org>
To: Sarosh Arif <sarosh.arif@emumba.com>
Cc: "Mattias Rönnblom" <hofors@lysator.liu.se>,
	bugzilla@dpdk.org, dev <dev@dpdk.org>
Subject: Re: [Bug 1030] rte_malloc() and rte_free() get stuck when used with signal handler
Date: Wed, 5 Oct 2022 10:30:59 -0700	[thread overview]
Message-ID: <20221005103059.112892cd@hermes.local> (raw)
In-Reply-To: <CABoZmYN9Ls+RCLyLwiF5DM==ynKspmY36Wx+4ABxwTtXXK4Fbg@mail.gmail.com>

On Mon, 13 Jun 2022 14:48:45 +0500
Sarosh Arif <sarosh.arif@emumba.com> wrote:

> Thank you for help, I'll do it this way.
> 
> On Sat, Jun 11, 2022 at 9:25 PM Mattias Rönnblom <hofors@lysator.liu.se> wrote:
> >
> > On 2022-06-10 08:04, Sarosh Arif wrote:  
> > > On Thu, Jun 9, 2022 at 8:26 PM Stephen Hemminger
> > > <stephen@networkplumber.org> wrote:  
> > >>
> > >> On Thu, 09 Jun 2022 12:47:43 +0000
> > >> bugzilla@dpdk.org wrote:
> > >>  
> > >>> https://bugs.dpdk.org/show_bug.cgi?id=1030
> > >>>
> > >>>              Bug ID: 1030
> > >>>             Summary: rte_malloc() and rte_free() get stuck when used with
> > >>>                      signal handler
> > >>>             Product: DPDK
> > >>>             Version: 22.03
> > >>>            Hardware: All
> > >>>                  OS: Linux
> > >>>              Status: UNCONFIRMED
> > >>>            Severity: normal
> > >>>            Priority: Normal
> > >>>           Component: core
> > >>>            Assignee: dev@dpdk.org
> > >>>            Reporter: sarosh.arif@emumba.com
> > >>>    Target Milestone: ---
> > >>>
> > >>> Created attachment 205  
> > >>>    --> https://bugs.dpdk.org/attachment.cgi?id=205&action=edit  
> > >>> calls rte_malloc and rte_free in the handler and main code
> > >>>
> > >>> I have a dpdk based application which uses rte_malloc() and rte_free()
> > >>> frequently in it's main code. The general method to close the application is
> > >>> though sending SIGINT. The application has a signal handler written for cleanup
> > >>> purposes before closing the application. The handler also uses rte_free() to
> > >>> release some of the memory during cleanup. The application gets stuck in a
> > >>> deadlock.
> > >>>
> > >>>
> > >>> Upon investigation I found out that both rte_free() and rte_malloc() use
> > >>> rte_spinlock_lock() function to place a lock on heap. While this lock is placed
> > >>> and the application receives SIGINT, it goes into the handler without releasing
> > >>> the lock. Since the handler itself calls rte_free() which tries to acquire the
> > >>> lock it gets stuck.
> > >>>
> > >>>
> > >>> I have attached a sample application to reproduce this problem.
> > >>>
> > >>>
> > >>> Steps to reproduce this problem:
> > >>>
> > >>> 1. compile the code provided in attachment with any version of dpdk
> > >>> 2. run the compiled binary
> > >>> 3. press ctrl+c till the prints stop
> > >>>
> > >>> Actual Results:
> > >>> The application gets stuck in either rte_free() or rte_malloc()
> > >>>
> > >>> Expected Results:
> > >>> Application should allocate and free the memory without getting stuck
> > >>>  
> > >>
> > >> rte_malloc and rte_free are not async sigsafe()
> > >>  
> > > Oh, I did not know that. This should be mentioned in the documentation.  
> >
> > Is there anything except <rte_atomic.h> that is/should be async-signal-safe?
> >  
> > >> but then again regular glibc is not either.  
> > > Memory allocated with glibc malloc() is freed by itself upon closing
> > > the application. My application runs as a secondary process, and it
> > > needs to use rte_malloc() specifically because the memory should be
> > > shared between the two processes. If I don't free it upon closure it
> > > would just be leaked. Is there any other solution for it?  
> >
> > The standard solution is that the signal handler using some appropriate,
> > async-signal-safe way talks to the main thread, which then goes on to
> > cleanly terminate the application.
> >
> > A write() to an fd, or an atomic store to a flag are two options.  

Patch is pending (why is it not merged?) to describe what is signal safe.
https://patchwork.dpdk.org/project/dpdk/patch/20220711230448.557715-1-stephen@networkplumber.org/

  reply	other threads:[~2022-10-05 17:31 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-06-09 12:47 [Bug 1030] rte_malloc() and rte_free() get stuck when used with signal handler bugzilla
2022-06-09 15:25 ` Stephen Hemminger
2022-06-10  6:04   ` Sarosh Arif
2022-06-11 16:25     ` Mattias Rönnblom
2022-06-13  9:48       ` Sarosh Arif
2022-10-05 17:30         ` Stephen Hemminger [this message]
2022-11-16  9:37 ` bugzilla

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20221005103059.112892cd@hermes.local \
    --to=stephen@networkplumber.org \
    --cc=bugzilla@dpdk.org \
    --cc=dev@dpdk.org \
    --cc=hofors@lysator.liu.se \
    --cc=sarosh.arif@emumba.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.