From: Jon Maloy <jon.maloy@ericsson.com>
To: Ying Xue <ying.xue@windriver.com>
Cc: Paul.Gortmaker@windriver.com,
tipc-discussion@lists.sourceforge.net, nhorman@tuxdriver.com,
netdev@vger.kernel.org
Subject: Re: [PATCH net-next v3] tipc: sk_recv_queue size check only for connectionless sockets
Date: Mon, 10 Dec 2012 10:49:42 -0500 [thread overview]
Message-ID: <50C60496.5040800@ericsson.com> (raw)
In-Reply-To: <50C5B5B5.9060004@ericsson.com>
On 12/10/2012 05:13 AM, Jon Maloy wrote:
> On 12/10/2012 04:23 AM, Ying Xue wrote:
>> The sk_receive_queue limit control is currently performed for all
>> arriving messages, disregarding socket and message type. But for
>> connectionless sockets this check is redundant, since the protocol
>> flow already makes queue overflow impossible.
>>
>> We move the sk_receive_queue limit control so that it's only performed
>> for connectionless sockets, i.e. SOCK_RDM and SOCK_DGRAM type sockets.
>>
>> However, as Neil Horman specified, we cannot simply force the socket
>> receive queue limit against connectionless sockets as it may create a
>> DoS vulnerability. For example, if a sender floods a receiver with
>> messages containing an invalid set of message importance bits or
>> CRITICAL importance, we will queue messages indefinitely.
>>
>> To avoid DoS attack, socket receive queue will be marked as overflow
>> if we receive messages with invalid message importances, meanwhile,
>> we also set one new threshold for CRITICAL importance messages.
>>
>> Signed-off-by: Ying Xue <ying.xue@windriver.com>
>> Signed-off-by: Jon Maloy <jon.maloy@ericsson.com>
>> Cc: Neil Horman <nhorman@tuxdriver.com>
>> Signed-off-by: Paul Gortmaker <paul.gortmaker@windriver.com>
>> ---
>> v3 changes:
>> - set new threshold for CRITICAL message
>> - defined an importance factor table to avoid multiplication and
>> division operations in rx_queue_full().
>> - changed return value of rx_queue_full() from integer to boolean.
>>
>> net/tipc/socket.c | 44 +++++++++++++++++++-------------------------
>> 1 files changed, 19 insertions(+), 25 deletions(-)
>>
>> diff --git a/net/tipc/socket.c b/net/tipc/socket.c
>> index 9b4e483..a18a757 100644
>> --- a/net/tipc/socket.c
>> +++ b/net/tipc/socket.c
>> @@ -43,7 +43,7 @@
>> #define SS_LISTENING -1 /* socket is listening */
>> #define SS_READY -2 /* socket is connectionless */
>>
>> -#define OVERLOAD_LIMIT_BASE 10000
>> +#define OVERLOAD_LIMIT_BASE 5000
>> #define CONN_TIMEOUT_DEFAULT 8000 /* default connect timeout = 8s */
>>
>> struct tipc_sock {
>> @@ -73,6 +73,13 @@ static struct proto tipc_proto;
>>
>> static int sockets_enabled;
>>
>> +static const u32 msg_importance_factor[] = {
>> + OVERLOAD_LIMIT_BASE, /* TIPC_LOW_IMPORTANCE limit */
>> + OVERLOAD_LIMIT_BASE * 2, /* TIPC_MEDIUM_IMPORTANCE limit */
>> + OVERLOAD_LIMIT_BASE * 100, /* TIPC_HIGH_IMPORTANCE limit */
>> + OVERLOAD_LIMIT_BASE * 200 /* TIPC_CRITICAL_IMPORTANCE limit */
>> + };
>> +
>> /*
>> * Revised TIPC socket locking policy:
>> *
>> @@ -1158,28 +1165,17 @@ static void tipc_data_ready(struct sock *sk, int len)
>> * rx_queue_full - determine if receive queue can accept another message
>> * @msg: message to be added to queue
>> * @queue_size: current size of queue
>> - * @base: nominal maximum size of queue
>> *
>> - * Returns 1 if queue is unable to accept message, 0 otherwise
>> + * Returns true if queue is unable to accept message, false otherwise
>> */
>> -static int rx_queue_full(struct tipc_msg *msg, u32 queue_size, u32 base)
>> +static bool rx_queue_full(struct tipc_msg *msg, u32 queue_size)
>> {
>> - u32 threshold;
>> u32 imp = msg_importance(msg);
>>
>> - if (imp == TIPC_LOW_IMPORTANCE)
>> - threshold = base;
>> - else if (imp == TIPC_MEDIUM_IMPORTANCE)
>> - threshold = base * 2;
>> - else if (imp == TIPC_HIGH_IMPORTANCE)
>> - threshold = base * 100;
>> - else
>> - return 0;
>> + if (unlikely(imp > TIPC_CRITICAL_IMPORTANCE))
>> + return true;
>
> This test is not necessary. Such messages have already been filtered out
> in tipc_recv_msg() at link level.
> The test msg_isdata(), which determines if a message should be sent up to
> the port/socket level, is also an implicit test that
> importance < TIPC_CRITICAL_IMPORTANCE.
(importance <= TIPC_CRITICAL_IMPORTANCE), of course.
This is an effect of the co-location of the user and importance fields in the
TIPC header. I.e., the importance is in reality coded into the value of
the user field.
To clarify (and improve) my previous suggestion, what I had in mind
was something like this:
recv_q_len = skb_queue_len(&sk->sk_receive_queue);
- if (unlikely(recv_q_len >= (OVERLOAD_LIMIT_BASE / 2))) {
- if (rx_queue_full(msg, recv_q_len,
- OVERLOAD_LIMIT_BASE / 2))
- return TIPC_ERR_OVERLOAD;
- }
+ imp = msg_importance(msg);
+ if (unlikely(recv_q_len > (OVERLOAD_LIMIT_BASE << (imp << 1))))
+ return TIPC_ERR_OVERLOAD;
} else {
if (msg_mcast(msg))
return TIPC_ERR_NO_PORT;
No need for any separate function at all, and guaranteed inline.
This will probably translate to one single shift-instruction extra,
and no memor access, since the compiler merge (imp << 1) with the
shift operation used to read imp from the header.
The limits for message rejection become the following,
given an OVERLOAD_LIMIT_BASE of 10000:
TIP_LOW_IMPORTANCE: 10000 (previously 10000)
TIPC_MEDIUM_IMPORTANCE 40000 (previously 20000)
TIPC_MEDIUM_IMPORTANCE 1600000 (previously 1000000)
TIPC_CRITICAL_IMPORTANCE 25600000 (previously no limit)
///jon
>
>
>>
>> - if (msg_connected(msg))
>> - threshold *= 4;
>> -
>> - return queue_size >= threshold;
>> + return queue_size >= msg_importance_factor[imp];
>
> Ok. Less optimal than my suggestion, but also lower risk until we know
> the consequences of changing the multiplication factors.
>
>> }
>>
>> /**
>> @@ -1275,7 +1271,6 @@ static u32 filter_rcv(struct sock *sk, struct sk_buff *buf)
>> {
>> struct socket *sock = sk->sk_socket;
>> struct tipc_msg *msg = buf_msg(buf);
>> - u32 recv_q_len;
>> u32 res = TIPC_OK;
>>
>> /* Reject message if it is wrong sort of message for socket */
>> @@ -1285,19 +1280,18 @@ static u32 filter_rcv(struct sock *sk, struct sk_buff *buf)
>> if (sock->state == SS_READY) {
>> if (msg_connected(msg))
>> return TIPC_ERR_NO_PORT;
>> + /* Reject SOCK_DGRAM and SOCK_RDM message if there isn't room
>> + * to queue it
>> + */
>> + if (unlikely(rx_queue_full(msg,
>> + skb_queue_len(&sk->sk_receive_queue))))
>> + return TIPC_ERR_OVERLOAD;
>> } else {
>> res = filter_connect(tipc_sk(sk), &buf);
>> if (res != TIPC_OK || buf == NULL)
>> return res;
>> }
>>
>> - /* Reject message if there isn't room to queue it */
>> - recv_q_len = skb_queue_len(&sk->sk_receive_queue);
>> - if (unlikely(recv_q_len >= (OVERLOAD_LIMIT_BASE / 2))) {
>> - if (rx_queue_full(msg, recv_q_len, OVERLOAD_LIMIT_BASE / 2))
>> - return TIPC_ERR_OVERLOAD;
>> - }
>> -
>> /* Enqueue message (finally!) */
>> TIPC_SKB_CB(buf)->handle = 0;
>> __skb_queue_tail(&sk->sk_receive_queue, buf);
>>
>
> --
> To unsubscribe from this list: send the line "unsubscribe netdev" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
------------------------------------------------------------------------------
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d
next prev parent reply other threads:[~2012-12-10 15:49 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-12-10 9:23 [PATCH net-next v3] tipc: sk_recv_queue size check only for connectionless sockets Ying Xue
2012-12-10 10:13 ` Jon Maloy
2012-12-10 15:49 ` Jon Maloy [this message]
2012-12-10 18:22 ` Neil Horman
2012-12-10 14:51 ` Neil Horman
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50C60496.5040800@ericsson.com \
--to=jon.maloy@ericsson.com \
--cc=Paul.Gortmaker@windriver.com \
--cc=netdev@vger.kernel.org \
--cc=nhorman@tuxdriver.com \
--cc=tipc-discussion@lists.sourceforge.net \
--cc=ying.xue@windriver.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).