From: Eric Dumazet <dada1@cosmosbay.com>
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Al Viro <viro@ftp.linux.org.uk>,
Kyle Moffett <mrmacman_g4@mac.com>,
Ulrich Drepper <drepper@redhat.com>,
Davide Libenzi <davidel@xmailserver.org>,
Alan Cox <alan@lxorguk.ukuu.org.uk>, Theodore Tso <tytso@mit.edu>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Andrew Morton <akpm@linux-foundation.org>,
Ingo Molnar <mingo@elte.hu>
Subject: Re: [patch 7/8] fdmap v2 - implement sys_socket2
Date: Sun, 10 Jun 2007 11:14:11 +0200 [thread overview]
Message-ID: <466BC0E3.4050600@cosmosbay.com> (raw)
In-Reply-To: <alpine.LFD.0.98.0706091311040.20321@woody.linux-foundation.org>
Linus Torvalds a écrit :
> (And dammit, that _is_ a *real*issue*. No races necessary, no NR_OPEN
> iterations, no even *halfway* suspect code. It's perfectly fine to do
>
> close(0);
> close(1);
> close(2);
> .. generate filenames, whatever ..
> if (open(..) < 0 || open(..) < 0 || open(..) < 0)
> die("Couldn't redirect stdin/stdout/stderr");
>
> and there's absolutely nothing wrong with this kind of setup, even if you
> could obviously have done it other ways too (ie by using "dup2()" instead
> of "close + open"),
>
This kind of setup was OK 25 years ago, before multithreading era.
You cannot reasonably expect it to work in a multithreaded program.
Anyway, I would like to give an alternative idea of the double fdmap, and
probably more *secure* .
Current fd API mandates integers (32 bits)
Lot of broken code consider a fd must be >= 0, so we currently are limited to
31 bits.
With NR_OPEN = 1024*1024 = 2^20, that give us 11 bits that we could use as a
signature.
That is, we could use O_NONSEQ as a indication to kernel to give us a
composite fd : 20 low order bits give the slot in file table, then 11 bits can
be use to make sure the fd was not stolen by malicious code.
Legacy app, (without O_NONSEQ in flags) would get POSIX compatables fd in [0,
2^20-1] range, with the lowest available fd.
If O_NONSEQ is given, kernel is free to give an fd in [0, 2^31 - 1], with a
strategy that could be the one Davide gave in its patch (with a list of
available slots). But instead of FIFO, we can use now LIFO, more cache friendly.
In fget()/fget_light()/close(), we can then use 20 bits to select the slot in
the single fdmap.
And 11 bits to check the 'signature'.
So if open( O_NONSEQFD) gave us 0x77000010, we cannot do close(0x10) or
read(0x10, ....)
Storage for these bits is already there in Davide fd_slot structure, where we
currently use one long to store 3 bits 'only'.
This should work even bumping NR_OPEN to say... 8*1024*1024, and 8 bits signature.
next prev parent reply other threads:[~2007-06-10 9:16 UTC|newest]
Thread overview: 129+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-06-06 22:30 [patch 7/8] fdmap v2 - implement sys_socket2 Davide Libenzi
2007-06-06 22:44 ` David Miller
2007-06-06 22:52 ` Davide Libenzi
2007-06-06 22:57 ` David Miller
2007-06-06 22:57 ` Ulrich Drepper
2007-06-06 23:02 ` David Miller
2007-06-06 22:59 ` Alan Cox
2007-06-06 22:58 ` Ulrich Drepper
2007-06-06 23:04 ` Davide Libenzi
2007-06-06 23:08 ` David Miller
2007-06-06 23:19 ` Alan Cox
2007-06-06 23:22 ` Ulrich Drepper
2007-06-07 10:04 ` Alan Cox
2007-06-07 11:59 ` Kyle Moffett
2007-06-07 13:12 ` Eric Dumazet
2007-06-07 15:51 ` Davide Libenzi
2007-06-07 19:49 ` Davide Libenzi
2007-06-07 20:02 ` Ulrich Drepper
2007-06-07 20:05 ` Eric Dumazet
2007-06-07 20:18 ` Ulrich Drepper
2007-06-07 21:44 ` Davide Libenzi
2007-06-07 22:03 ` Ulrich Drepper
2007-06-07 22:40 ` Davide Libenzi
2007-06-08 12:07 ` Theodore Tso
2007-06-08 13:01 ` Alan Cox
2007-06-08 18:11 ` Davide Libenzi
2007-06-08 18:26 ` Alan Cox
2007-06-08 18:43 ` Ulrich Drepper
2007-06-08 18:46 ` Al Viro
2007-06-08 18:56 ` Ulrich Drepper
2007-06-08 19:07 ` Linus Torvalds
2007-06-08 19:21 ` Davide Libenzi
2007-06-09 0:03 ` Linus Torvalds
2007-06-09 0:13 ` Davide Libenzi
2007-06-09 0:36 ` Al Viro
2007-06-09 1:19 ` Ulrich Drepper
2007-06-09 1:41 ` Al Viro
2007-06-09 2:10 ` Ulrich Drepper
2007-06-09 15:15 ` Al Viro
2007-06-09 16:26 ` Ulrich Drepper
2007-06-09 16:54 ` Al Viro
2007-06-09 17:04 ` Davide Libenzi
2007-06-09 17:08 ` Davide Libenzi
2007-06-09 17:08 ` Ulrich Drepper
2007-06-09 17:24 ` Al Viro
2007-06-09 19:27 ` Kyle Moffett
2007-06-09 20:06 ` Al Viro
2007-06-09 20:21 ` Linus Torvalds
2007-06-09 20:31 ` Davide Libenzi
2007-06-09 21:41 ` Matt Mackall
2007-06-09 22:12 ` Davide Libenzi
2007-06-09 20:49 ` Al Viro
2007-06-09 21:55 ` Matt Mackall
2007-06-09 23:33 ` Linus Torvalds
2007-06-10 3:35 ` Davide Libenzi
2007-06-10 3:49 ` Davide Libenzi
2007-06-10 3:19 ` Al Viro
2007-06-10 3:48 ` Linus Torvalds
2007-06-10 4:00 ` Al Viro
2007-06-10 4:03 ` Linus Torvalds
2007-06-10 4:06 ` Al Viro
2007-06-10 4:45 ` dean gaudet
2007-06-10 5:06 ` Linus Torvalds
2007-06-10 5:46 ` Al Viro
2007-06-10 17:23 ` Linus Torvalds
2007-06-10 6:35 ` Kari Hurtta
2007-06-10 15:21 ` Alan Cox
2007-06-10 9:14 ` Eric Dumazet [this message]
2007-06-10 15:16 ` Alan Cox
2007-06-10 18:19 ` Linus Torvalds
2007-06-10 2:40 ` Al Viro
2007-06-08 19:34 ` Alan Cox
2007-06-08 19:30 ` Alan Cox
2007-06-08 19:37 ` Davide Libenzi
2007-06-08 19:48 ` Alan Cox
2007-06-08 19:51 ` Davide Libenzi
2007-06-08 21:24 ` Alan Cox
2007-06-08 21:59 ` Davide Libenzi
2007-06-08 22:28 ` Alan Cox
2007-06-08 22:38 ` Davide Libenzi
2007-06-11 8:24 ` Xavier Bestel
2007-06-08 19:22 ` Davide Libenzi
2007-06-09 5:41 ` Paul Mackerras
2007-06-09 14:38 ` Kyle Moffett
2007-06-10 6:48 ` Paul Mackerras
2007-06-10 15:56 ` Davide Libenzi
2007-06-10 19:16 ` Davide Libenzi
2007-06-09 17:00 ` Davide Libenzi
2007-06-10 6:26 ` Paul Mackerras
2007-06-10 7:10 ` William Lee Irwin III
2007-06-10 15:52 ` Davide Libenzi
2007-06-08 18:07 ` Davide Libenzi
2007-06-08 18:35 ` Linus Torvalds
2007-06-07 21:57 ` Davide Libenzi
2007-06-08 4:38 ` Eric Dumazet
2007-06-08 5:20 ` Davide Libenzi
2007-06-07 14:25 ` Ulrich Drepper
2007-06-07 17:56 ` Eric Dumazet
2007-06-07 18:03 ` Davide Libenzi
2007-06-07 18:57 ` Eric Dumazet
2007-06-07 18:26 ` Ulrich Drepper
2007-06-07 18:39 ` Davide Libenzi
2007-06-07 18:56 ` Ulrich Drepper
2007-06-07 19:12 ` Davide Libenzi
2007-06-07 20:03 ` Andrew Morton
2007-06-08 2:55 ` Ulrich Drepper
2007-06-08 5:16 ` Davide Libenzi
2007-06-06 23:29 ` Davide Libenzi
2007-06-07 10:06 ` Alan Cox
2007-06-07 10:45 ` Eric Dumazet
2007-06-07 11:27 ` Alan Cox
2007-06-07 15:41 ` Davide Libenzi
2007-06-07 20:10 ` Linus Torvalds
2007-06-07 20:47 ` Eric Dumazet
2007-06-07 21:08 ` Linus Torvalds
2007-06-07 21:41 ` Davide Libenzi
2007-06-07 20:59 ` Guillaume Chazarain
2007-06-07 21:06 ` Guillaume Chazarain
2007-06-07 21:31 ` Ulrich Drepper
2007-06-07 22:22 ` Davide Libenzi
2007-06-07 23:42 ` Linus Torvalds
2007-06-08 0:04 ` Davide Libenzi
2007-06-08 0:59 ` Matt Mackall
2007-06-08 2:25 ` Linus Torvalds
2007-06-08 15:56 ` Jeff Dike
2007-06-07 0:29 ` Arnd Bergmann
2007-06-07 0:33 ` Davide Libenzi
-- strict thread matches above, loose matches on Subject: below --
2007-06-06 22:30 [patch 1/8] fdmap v2 - fdmap core Davide Libenzi
2007-06-07 6:54 ` Eric Dumazet
2007-06-07 7:10 ` Davide Libenzi
2007-06-07 10:39 ` [patch 7/8] fdmap v2 - implement sys_socket2 Eric Dumazet
2007-06-07 15:42 ` Davide Libenzi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=466BC0E3.4050600@cosmosbay.com \
--to=dada1@cosmosbay.com \
--cc=akpm@linux-foundation.org \
--cc=alan@lxorguk.ukuu.org.uk \
--cc=davidel@xmailserver.org \
--cc=drepper@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=mrmacman_g4@mac.com \
--cc=torvalds@linux-foundation.org \
--cc=tytso@mit.edu \
--cc=viro@ftp.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.