public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Serious problem with ticket spinlocks on ia64
@ 2010-08-27 13:37 Petr Tesarik
  2010-08-27 13:48 ` Hedi Berriche
  2010-08-27 16:08 ` Luck, Tony
  0 siblings, 2 replies; 36+ messages in thread
From: Petr Tesarik @ 2010-08-27 13:37 UTC (permalink / raw)
  To: linux-ia64; +Cc: linux-kernel, Tony Luck, Hedi Berriche

Hi everybody,

SGI has recently experienced failures with the new ticket spinlock
implementation. Hedi Berriche sent me a simple test case that can
trigger the failure on the siglock. To debug the issue, I wrote a
small module that watches writes to current->sighand->siglock and
records the values.

I observed that the __ticket_spin_lock() primitive fails when the
tail wraps around to zero. I reconstructed the following:

CPU 7 holds the spinlock
CPU 5 wants to acquire the spinlock
Spinlock value is 0xfffcffff
	 (now serving 0x7fffe, next ticket 0x7ffff)

CPU 7 executes st2.rel to release the spinlock.
At the same time CPU 5 executes a fetchadd4.acq.
The resulting lock value is 0xfffe0000 (correct), and CPU 5 has
recorded its ticket number (0x7fff).

Consequently, the first spinlock loop iteration succeeds, and CPU 5
now holds the spinlock.

Next, CPU 5 releases the spinlock with st2.rel, changing the lock
value to 0x0 (correct).

SO FAR SO GOOD.

Now, CPU 4, CPU 5 and CPU 7 all want to acquire the lock again.
Interestingly, CPU 5 and CPU 7 are both granted the same ticket,
and the spinlock value (as seen from the debug fault handler) is
0x0 after single-stepping over the fetchadd4.acq, in both cases.
CPU 4 correctly sets the spinlock value to 0x1.

I don't know if the simultaneos acquire attempt and release are
necessary to trigger the bug, but I noted it here.

I've only seen this happen when the spinlock wraps around to zero,
but I don't know whether it cannot happen otherwise.

In any case, there seems to be a serious problem with memory
ordering, and I'm not an expert to tell exactly what it is.

Any ideas?

Petr Tesarik
L3 International
Novell, Inc.

^ permalink raw reply	[flat|nested] 36+ messages in thread

end of thread, other threads:[~2010-09-10  2:55 UTC | newest]

Thread overview: 36+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-08-27 13:37 Serious problem with ticket spinlocks on ia64 Petr Tesarik
2010-08-27 13:48 ` Hedi Berriche
2010-08-27 14:09   ` Petr Tesarik
2010-08-27 14:31     ` Hedi Berriche
2010-08-27 14:40       ` Petr Tesarik
2010-08-27 14:52         ` Hedi Berriche
2010-08-27 16:37           ` Petr Tesarik
2010-08-27 16:08 ` Luck, Tony
2010-08-27 17:16   ` Petr Tesarik
2010-08-27 18:20     ` Hedi Berriche
2010-08-27 19:40     ` Petr Tesarik
2010-08-27 20:29   ` Luck, Tony
2010-08-27 20:41     ` Petr Tesarik
2010-08-27 21:03     ` Petr Tesarik
2010-08-27 21:11       ` Luck, Tony
2010-08-27 22:13         ` Petr Tesarik
2010-08-27 23:26           ` Luck, Tony
2010-08-27 23:55             ` Luck, Tony
2010-08-28  0:28               ` Hedi Berriche
2010-08-28  5:01                 ` Luck, Tony
2010-08-30 18:17                   ` Luck, Tony
2010-08-30 21:41                     ` Petr Tesarik
2010-08-30 22:43                       ` Tony Luck
2010-08-31 22:17                         ` Tony Luck
2010-09-01 23:09                           ` Tony Luck
2010-09-02  0:26                             ` Hedi Berriche
2010-09-03  0:06                               ` Tony Luck
2010-09-03  9:04                                 ` Petr Tesarik
2010-09-03 14:35                                   ` Petr Tesarik
2010-09-03 14:52                                     ` Petr Tesarik
2010-09-03 15:50                                       ` Tony Luck
2010-09-06 14:47                                         ` Petr Tesarik
2010-09-07 13:17                                           ` Petr Tesarik
2010-09-07 17:35                                             ` Tony Luck
2010-09-08 15:55                                               ` Tony Luck
2010-09-10  2:55                                     ` Dave Jones

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox