* [RFC PATCH] poll() in 32-bit applications does not handle timeout of -1 properly on 64-bit kernels
@ 2012-02-07 0:05 Josh Hunt
2012-02-07 0:38 ` Al Viro
0 siblings, 1 reply; 3+ messages in thread
From: Josh Hunt @ 2012-02-07 0:05 UTC (permalink / raw)
To: Al Viro, linux-fsdevel@vger.kernel.org,
linux-kernel@vger.kernel.org
[-- Attachment #1: Type: text/plain, Size: 1218 bytes --]
We've hit an issue where our 32-bit applications, when running on a
64-bit kernel, using poll() and passing in a value of -1 for the timeout
return after ~49 days (2^32 msec). Instead of waiting indefinitely as it
is stated they should. Reproducing the issue is trivial. I've
instrumented the kernel and found we are hitting the case where poll()
believes we've passed in a positive number and thus creates a timespec,
etc. Currently poll() is defined in userspace as:
int poll(struct pollfd *ufds, nfds_t nfds, int timeout);
but in the kernel timeout is of type long.
I can think of a few ways to solve this. One, which is the patch I've
attached, is to change the type of timeout to int in the kernel. I'm not
certain the ramifications this may have since it's changing a syscall's
arguments which may be a big no-no :) Another way I am proposing is by
bounds checking. Currently we do the following:
if (timeout_msecs >= 0) {
to = &end_time;
poll_select_set_timeout(to, timeout_msecs / MSEC_PER_SEC,
NSEC_PER_MSEC * (timeout_msecs % MSEC_PER_SEC));
}
We could add an upper bound on timeout_msecs to say < 0xffffffff. I'm
not sure if either is acceptable though.
Josh
[-- Attachment #2: poll-timeout-try1.patch --]
[-- Type: text/x-patch, Size: 1683 bytes --]
commit 7f2c4ee58b66f51b5e2c79fc42db84ca4d6c0a18
Author: Josh Hunt <johunt@akamai.com>
Date: Mon Feb 6 15:21:04 2012 -0800
poll() in 32-bit applications does not handle timeout of -1 properly on 64-bit kernels
We have observed our 32-bit applications running on 64-bit kernels which
pass a value of -1 to poll()'s timeout argument do not wait infinitely as
it should. Instead it winds up waiting ~49 days which is approx the # of msecs
in 2^32 (or 32-bit negative 1.) With this patch both 32 and 64 bit applications
now support handling of -1 as an argument for timeout.
Signed-off-by: Josh Hunt <johunt@akamai.com>
diff --git a/fs/select.c b/fs/select.c
index d33418f..e782258 100644
--- a/fs/select.c
+++ b/fs/select.c
@@ -912,7 +912,7 @@ static long do_restart_poll(struct restart_block *restart_block)
}
SYSCALL_DEFINE3(poll, struct pollfd __user *, ufds, unsigned int, nfds,
- long, timeout_msecs)
+ int, timeout_msecs)
{
struct timespec end_time, *to = NULL;
int ret;
diff --git a/include/linux/syscalls.h b/include/linux/syscalls.h
index 515669f..8ec1153 100644
--- a/include/linux/syscalls.h
+++ b/include/linux/syscalls.h
@@ -624,7 +624,7 @@ asmlinkage long sys_socketpair(int, int, int, int __user *);
asmlinkage long sys_socketcall(int call, unsigned long __user *args);
asmlinkage long sys_listen(int, int);
asmlinkage long sys_poll(struct pollfd __user *ufds, unsigned int nfds,
- long timeout);
+ int timeout);
asmlinkage long sys_select(int n, fd_set __user *inp, fd_set __user *outp,
fd_set __user *exp, struct timeval __user *tvp);
asmlinkage long sys_old_select(struct sel_arg_struct __user *arg);
^ permalink raw reply related [flat|nested] 3+ messages in thread
* Re: [RFC PATCH] poll() in 32-bit applications does not handle timeout of -1 properly on 64-bit kernels
2012-02-07 0:05 [RFC PATCH] poll() in 32-bit applications does not handle timeout of -1 properly on 64-bit kernels Josh Hunt
@ 2012-02-07 0:38 ` Al Viro
0 siblings, 0 replies; 3+ messages in thread
From: Al Viro @ 2012-02-07 0:38 UTC (permalink / raw)
To: Josh Hunt; +Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org
On Mon, Feb 06, 2012 at 06:05:30PM -0600, Josh Hunt wrote:
> We've hit an issue where our 32-bit applications, when running on a
> 64-bit kernel, using poll() and passing in a value of -1 for the timeout
> return after ~49 days (2^32 msec). Instead of waiting indefinitely as it
> is stated they should. Reproducing the issue is trivial. I've
> instrumented the kernel and found we are hitting the case where poll()
> believes we've passed in a positive number and thus creates a timespec,
> etc. Currently poll() is defined in userspace as:
>
> int poll(struct pollfd *ufds, nfds_t nfds, int timeout);
>
> but in the kernel timeout is of type long.
>
> I can think of a few ways to solve this. One, which is the patch I've
> attached, is to change the type of timeout to int in the kernel. I'm not
> certain the ramifications this may have since it's changing a syscall's
> arguments which may be a big no-no :) Another way I am proposing is by
> bounds checking. Currently we do the following:
>
> if (timeout_msecs >= 0) {
> to = &end_time;
> poll_select_set_timeout(to, timeout_msecs / MSEC_PER_SEC,
> NSEC_PER_MSEC * (timeout_msecs % MSEC_PER_SEC));
> }
>
> We could add an upper bound on timeout_msecs to say < 0xffffffff. I'm
> not sure if either is acceptable though.
Or just add compat_sys_poll() with that argument being int and have it call
sys_poll(). The value will be sign-extended...
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [RFC PATCH] poll() in 32-bit applications does not handle timeout of -1 properly on 64-bit kernels
@ 2012-02-07 17:51 Josh Hunt
0 siblings, 0 replies; 3+ messages in thread
From: Josh Hunt @ 2012-02-07 17:51 UTC (permalink / raw)
To: Al Viro, linux-fsdevel@vger.kernel.org, tglx, mingo, hpa, x86,
arnd, linux-arch
Cc: linux-kernel@vger.kernel.org
[-- Attachment #1: Type: text/plain, Size: 1916 bytes --]
On 02/06/2012 06:38 PM, Al Viro wrote:
> On Mon, Feb 06, 2012 at 06:05:30PM -0600, Josh Hunt wrote:
>> We've hit an issue where our 32-bit applications, when running on a
>> 64-bit kernel, using poll() and passing in a value of -1 for the timeout
>> return after ~49 days (2^32 msec). Instead of waiting indefinitely as it
>> is stated they should. Reproducing the issue is trivial. I've
>> instrumented the kernel and found we are hitting the case where poll()
>> believes we've passed in a positive number and thus creates a timespec,
>> etc. Currently poll() is defined in userspace as:
>>
>> int poll(struct pollfd *ufds, nfds_t nfds, int timeout);
>>
>> but in the kernel timeout is of type long.
>>
>> I can think of a few ways to solve this. One, which is the patch I've
>> attached, is to change the type of timeout to int in the kernel. I'm not
>> certain the ramifications this may have since it's changing a syscall's
>> arguments which may be a big no-no :) Another way I am proposing is by
>> bounds checking. Currently we do the following:
>>
>> if (timeout_msecs >= 0) {
>> to = &end_time;
>> poll_select_set_timeout(to, timeout_msecs / MSEC_PER_SEC,
>> NSEC_PER_MSEC * (timeout_msecs % MSEC_PER_SEC));
>> }
>>
>> We could add an upper bound on timeout_msecs to say < 0xffffffff. I'm
>> not sure if either is acceptable though.
>
> Or just add compat_sys_poll() with that argument being int and have it call
> sys_poll(). The value will be sign-extended...
Al
I've implemented what you suggested by adding compat_sys_poll() with
an int argument for timeout allowing it to do the sign extension. I
wanted to point out there was an almost identical patch submitted last
year, which appears to have gotten lost in the wash:
https://lkml.org/lkml/2011/9/18/19
I am guessing there are other architectures affected by this bug. This
patch only fixes x86.
Josh
[-- Attachment #2: compat-sys-poll.patch --]
[-- Type: text/x-patch, Size: 2339 bytes --]
commit cde9eb901ccb3b5af3e501b018b90f16c53942c2
Author: Josh Hunt <johunt@akamai.com>
Date: Mon Feb 6 20:51:31 2012 -0800
compat: poll() in 32-bit applications does not handle negative timeout values properly on 64-bit kernels
We have observed our 32-bit applications running on 64-bit kernels do not
wait infinitely when passed a negative value for the timeout argument.
Instead we see poll() returning in ~49 days or 2^32 msecs, because the
timeout argument is not getting sign-extended. Implementing
compat_sys_poll() to handle this case.
Reported-by: Phil Lisiecki <lisiecki@akamai.com>
Signed-off-by: Josh Hunt <johunt@akamai.com>
Cc: <stable@vger.kernel.org>
diff --git a/arch/x86/syscalls/syscall_32.tbl b/arch/x86/syscalls/syscall_32.tbl
index ce98e28..8407150 100644
--- a/arch/x86/syscalls/syscall_32.tbl
+++ b/arch/x86/syscalls/syscall_32.tbl
@@ -174,7 +174,7 @@
165 i386 getresuid sys_getresuid16
166 i386 vm86 ptregs_vm86 sys32_vm86_warning
167 i386 query_module
-168 i386 poll sys_poll
+168 i386 poll sys_poll compat_sys_poll
169 i386 nfsservctl
170 i386 setresgid sys_setresgid16
171 i386 getresgid sys_getresgid16
diff --git a/fs/compat.c b/fs/compat.c
index fa9d721..77bd50e 100644
--- a/fs/compat.c
+++ b/fs/compat.c
@@ -1611,6 +1611,12 @@ asmlinkage long compat_sys_pselect6(int n, compat_ulong_t __user *inp,
sigsetsize);
}
+asmlinkage long compat_sys_poll(struct pollfd __user *ufds, unsigned int nfds,
+ int timeout_msecs)
+{
+ return sys_poll(ufds, nfds, timeout_msecs);
+}
+
asmlinkage long compat_sys_ppoll(struct pollfd __user *ufds,
unsigned int nfds, struct compat_timespec __user *tsp,
const compat_sigset_t __user *sigmask, compat_size_t sigsetsize)
diff --git a/include/linux/compat.h b/include/linux/compat.h
index 41c9f65..66e61e0 100644
--- a/include/linux/compat.h
+++ b/include/linux/compat.h
@@ -433,6 +433,8 @@ asmlinkage long compat_sys_pselect6(int n, compat_ulong_t __user *inp,
compat_ulong_t __user *exp,
struct compat_timespec __user *tsp,
void __user *sig);
+asmlinkage long compat_sys_poll(struct pollfd __user *ufds, unsigned int nfds,
+ int timeout_msecs);
asmlinkage long compat_sys_ppoll(struct pollfd __user *ufds,
unsigned int nfds,
struct compat_timespec __user *tsp,
^ permalink raw reply related [flat|nested] 3+ messages in thread
end of thread, other threads:[~2012-02-07 17:51 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-02-07 0:05 [RFC PATCH] poll() in 32-bit applications does not handle timeout of -1 properly on 64-bit kernels Josh Hunt
2012-02-07 0:38 ` Al Viro
-- strict thread matches above, loose matches on Subject: below --
2012-02-07 17:51 Josh Hunt
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).