netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Jarek Poplawski <jarkao2@gmail.com>
To: David Miller <davem@davemloft.net>
Cc: andres@anarazel.de, arun@linux.vnet.ibm.com, tglx@linutronix.de,
	shemminger@vyatta.com, netdev@vger.kernel.org,
	linux-kernel@vger.kernel.org
Subject: Re: Soft-Lockup/Race in networking in 2.6.31-rc1+195 ( possibly?caused by netem)
Date: Sat, 4 Jul 2009 00:56:40 +0200	[thread overview]
Message-ID: <20090703225640.GA3639@ami.dom.local> (raw)
In-Reply-To: <20090703.132220.57384838.davem@davemloft.net>

On Fri, Jul 03, 2009 at 01:22:20PM -0700, David Miller wrote:
> From: Jarek Poplawski <jarkao2@gmail.com>
> Date: Fri, 3 Jul 2009 12:03:01 +0000
> 
> > On Fri, Jul 03, 2009 at 01:26:21PM +0200, Andres Freund wrote:
> >> On Friday 03 July 2009 08:12:13 Jarek Poplawski wrote:
> >> > On Fri, Jul 03, 2009 at 03:31:31AM +0200, Andres Freund wrote:
> >> > ...
> >> >
> >> > > Ok. I finally see the light. I bisected the issue down to
> >> > > eea08f32adb3f97553d49a4f79a119833036000a :  timers: Logic to move non
> >> > > pinned timers
> >> > >
> >> > > Disabling timer migration like provided in the earlier commit stops the
> >> > > issue from occuring.
> >> > >
> >> > > That it is related to timers is sensible in the light of my findings,
> >> > > that I could trigger the issue only when using delay in netem - that is
> >> > > the codepath using qdisc_watchdog...
> >> >
> >> > Andres, thanks for your work and time. It saved me a lot of searching,
> >> > because I wasn't able to trigger this on my old box.
> >> Thanks. It allowed me to go through some of my remaining paperwork ;-)
> >> 
> >> Does anybody of you have an idea where the problem actually resides? 
> > 
> > Do you mean possibly broken timers are not enough?
> 
> Well, if you look at that commit the bisect pointed to Jarek, it is a
> change which starts causing a situation which never happened before.
> Namely, timers added on one cpu can be migrated and fire on another.
> 
> So this could be exposing races in the networking that technically
> always existed.

I'm not sure I get your point; could you give some example?
Actually, I've suspected races in timers code.

Jarek P.

  reply	other threads:[~2009-07-03 22:57 UTC|newest]

Thread overview: 66+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-07-03  1:31 Soft-Lockup/Race in networking in 2.6.31-rc1+195 ( possibly caused by netem) Andres Freund
2009-07-03  6:12 ` Soft-Lockup/Race in networking in 2.6.31-rc1+195 ( possibly?caused " Jarek Poplawski
2009-07-03 11:26   ` Andres Freund
2009-07-03 12:03     ` Jarek Poplawski
2009-07-03 12:30       ` Andres Freund
2009-07-03 20:22       ` David Miller
2009-07-03 22:56         ` Jarek Poplawski [this message]
2009-07-04  1:55           ` David Miller
2009-07-04  6:36             ` Jarek Poplawski
2009-07-04 15:18     ` Jarek Poplawski
2009-07-06  4:53       ` Joao Correia
2009-07-06  8:14         ` Jarek Poplawski
2009-07-06 11:28           ` Joao Correia
2009-07-06 14:19         ` Jarek Poplawski
2009-07-06 16:13           ` Andres Freund
2009-07-06 16:31             ` Jarek Poplawski
2009-07-06 17:23               ` Joao Correia
2009-07-06 17:26                 ` Andres Freund
2009-07-07  6:50                   ` Jarek Poplawski
2009-07-07 10:40                     ` Joao Correia
2009-07-07 10:47                       ` Andres Freund
     [not found]                         ` <a5d9929e0907070403n698c9eb9p53a5bb07bafcc169@mail.gmail.com>
2009-07-07 11:05                           ` Fwd: " Joao Correia
2009-07-07 13:18                       ` Jarek Poplawski
2009-07-07 13:22                         ` Andres Freund
2009-07-07 13:29                           ` Jarek Poplawski
2009-07-07 13:34                             ` Andres Freund
2009-07-07 13:57                               ` Jarek Poplawski
2009-07-07 16:11                                 ` Andres Freund
2009-07-08  8:08                                   ` Jarek Poplawski
2009-07-08  8:29                                     ` Andres Freund
2009-07-08  9:13                                       ` Jarek Poplawski
2009-07-08 21:44                                         ` Joao Correia
2009-07-08 22:07                                           ` Jarek Poplawski
2009-07-08 22:27                                             ` Joao Correia
2009-07-08 22:42                                               ` Jarek Poplawski
2009-07-08 22:48                                                 ` Joao Correia
2009-07-08 22:23                                     ` Andres Freund
2009-07-08 22:48                                       ` Jarek Poplawski
2009-07-09 10:31                                         ` Thomas Gleixner
2009-07-09 10:44                                           ` Jarek Poplawski
2009-07-09 12:03                                             ` Thomas Gleixner
2009-07-09 13:22                                               ` Jarek Poplawski
2009-07-09 14:15                                                 ` Thomas Gleixner
2009-07-09 14:24                                                   ` Jarek Poplawski
2009-07-09 14:25                                                     ` Joao Correia
2009-07-09 14:28                                                     ` Thomas Gleixner
2009-07-09 15:28                                                       ` Andres Freund
2009-07-09 16:01                                                         ` Thomas Gleixner
2009-07-09 16:46                                                           ` Andres Freund
2009-07-09 17:44                                                             ` Thomas Gleixner
2009-07-09 21:19                                                               ` Joao Correia
2009-07-07 13:20                       ` Jarek Poplawski
2009-07-06 17:24               ` Andres Freund
  -- strict thread matches above, loose matches on Subject: below --
2009-06-30 23:20 Soft-Lockup/Race in networking in 2.6.31-rc1+195 (possibly caused " Andres Freund
2009-07-01 18:39 ` Jarek Poplawski
2009-07-01 21:22   ` Andres Freund
2009-07-02  0:37     ` Andres Freund
2009-07-02  9:30       ` Jarek Poplawski
2009-07-02 10:12         ` Jarek Poplawski
2009-07-02 10:51           ` Joao Correia
2009-07-02 11:09             ` Jarek Poplawski
2009-07-02 11:11           ` Andres Freund
2009-07-02 11:43             ` Jarek Poplawski
2009-07-02 11:43             ` Andres Freund
2009-07-02 11:54               ` Jarek Poplawski
2009-07-02 11:59                 ` Andres Freund

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090703225640.GA3639@ami.dom.local \
    --to=jarkao2@gmail.com \
    --cc=andres@anarazel.de \
    --cc=arun@linux.vnet.ibm.com \
    --cc=davem@davemloft.net \
    --cc=linux-kernel@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=shemminger@vyatta.com \
    --cc=tglx@linutronix.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).