public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Q: sys_futex() && timespec_valid()
@ 2010-06-25 19:20 Oleg Nesterov
  2010-06-25 19:42 ` Darren Hart
  2010-06-25 19:59 ` Thomas Gleixner
  0 siblings, 2 replies; 15+ messages in thread
From: Oleg Nesterov @ 2010-06-25 19:20 UTC (permalink / raw)
  To: Darren Hart, Ingo Molnar, Linus Torvalds, Peter Zijlstra,
	Thomas Gleixner
  Cc: Andreas Schwab, Danny Feng, Jakub Jelinek, Ulrich Drepper,
	linux-kernel

Hello.

Another stupid question about the trivial problem I am going to ask,
just to report the authoritative answer back to bugzilla. The problem
is, personally I am not sure we should/can add the user-visible change
required by glibc maintainers, and I am in no position to suggest them
to fix the user-space code instead.


In short, glibc developers believe that sys_futex(ts) is buggy and
needs the fix to return -ETIMEDOUT instead of -EINVAL in case when
ts->tv_sec < 0 and the timeout is absolute.

Ignoring the possible cleanups/microoptimizations, something like this:

--- x/kernel/futex.c
+++ x/kernel/futex.c
@@ -2625,6 +2625,16 @@ SYSCALL_DEFINE6(futex, u32 __user *, uad
 		      cmd == FUTEX_WAIT_REQUEUE_PI)) {
 		if (copy_from_user(&ts, utime, sizeof(ts)) != 0)
 			return -EFAULT;
+
+		// absolute timeout
+		if (cmd != FUTEX_WAIT) {
+			if (ts->tv_nsec >= NSEC_PER_SEC)
+				return -EINVAL;
+			if (ts->tv_sec < 0)
+				return -ETIMEDOUT;
+		}
+
+
 		if (!timespec_valid(&ts))
 			return -EINVAL;
 
------------------------------------------------------------------------

Otherwise, pthread_rwlock_timedwrlock(ts) hangs spinning in user-space
forever if ts->tv_sec < 0.

To clarify: this depends on libc version and arch.

This happens because pthread_rwlock_timedwrlock(rwlock, ts) on x86_64
roughly does:

	for (;;) {
		if (fast_path_succeeds(rwlock))
			return 0;

		if (ts->tv_nsec >= NSEC_PER_SEC)
			return EINVAL;

		errcode = sys_futex(FUTEX_WAIT_BITSET_PRIVATE, ts);
		if (errcode == ETIMEDOUT)
			return ETIMEDOUT;
	}

and since the kernel return EINVAL due to !timespec_valid(ts), the
code above loops forever.

(btw, we have same problem with EFAULT, and this is considered as
 a caller's problem).

IOW, pthread_rwlock_timedwrlock() assumes that in this case
sys_futex() can return nothing interesting except 0 or ETIMEDOUT.
I guess pthread_rwlock_timedwrlock() is not alone, but I didn't check.



So, the question: do you think we can change sys_futex() to make
glibc happy?

Or, do you think it is user-space who should check tv_sec < 0 if
it wants ETIMEDOUT with the negative timeout ?

Thanks,

Oleg.


^ permalink raw reply	[flat|nested] 15+ messages in thread

end of thread, other threads:[~2010-06-28 16:05 UTC | newest]

Thread overview: 15+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-06-25 19:20 Q: sys_futex() && timespec_valid() Oleg Nesterov
2010-06-25 19:42 ` Darren Hart
2010-06-25 19:49   ` Ulrich Drepper
2010-06-25 20:11     ` Thomas Gleixner
2010-06-28 13:58     ` Oleg Nesterov
2010-06-28 14:37       ` Jakub Jelinek
2010-06-28 15:02         ` Oleg Nesterov
2010-06-25 19:56   ` Mathieu Desnoyers
2010-06-25 19:59 ` Thomas Gleixner
2010-06-25 20:04   ` Ulrich Drepper
2010-06-25 20:25     ` Thomas Gleixner
2010-06-28 15:15     ` Linus Torvalds
2010-06-28 15:29       ` Andreas Schwab
2010-06-28 15:33         ` Thomas Gleixner
2010-06-28 16:04         ` Linus Torvalds

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox