* What's the status of TCP friends? @ 2014-03-16 9:07 Zheng Liu 2014-03-16 14:03 ` Eric Dumazet 0 siblings, 1 reply; 8+ messages in thread From: Zheng Liu @ 2014-03-16 9:07 UTC (permalink / raw) To: netdev Cc: Li Yu, David S. Miller, Eric Dumazet, Bruce Brutus Curtis, Weiping Pan Hi all, Until now the TCP friends patch set doesn't be applied. Now what's the status of TCP friends? Is it applicable to be merged into upstream kernel? Any problem that needs to be fixed? Please let me know if I can help. Thanks, - Zheng ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: What's the status of TCP friends? 2014-03-16 9:07 What's the status of TCP friends? Zheng Liu @ 2014-03-16 14:03 ` Eric Dumazet 2014-03-17 3:16 ` Zheng Liu 0 siblings, 1 reply; 8+ messages in thread From: Eric Dumazet @ 2014-03-16 14:03 UTC (permalink / raw) To: Zheng Liu Cc: netdev, Li Yu, David S. Miller, Eric Dumazet, Bruce Brutus Curtis, Weiping Pan On Sun, 2014-03-16 at 17:07 +0800, Zheng Liu wrote: > Hi all, > > Until now the TCP friends patch set doesn't be applied. Now what's the > status of TCP friends? Is it applicable to be merged into upstream > kernel? Any problem that needs to be fixed? Please let me know if I > can help. Well, last attempts showed that while the idea sounded cool, implementation opened many races and added quite a lot of complexity in fast path. We have AF_UNIX with a lot of problems in it, do we really want to bring these AF_UNIX problems to AF_INET ? I would rather spend time on AF_UNIX if it doesn't fit your needs right now, or consider jumping to KDBUS modern design. Using AF_INET for IPC is poor choice. ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: What's the status of TCP friends? 2014-03-16 14:03 ` Eric Dumazet @ 2014-03-17 3:16 ` Zheng Liu 2014-03-18 1:21 ` Hannes Frederic Sowa 0 siblings, 1 reply; 8+ messages in thread From: Zheng Liu @ 2014-03-17 3:16 UTC (permalink / raw) To: Eric Dumazet Cc: netdev, Li Yu, David S. Miller, Eric Dumazet, Bruce Brutus Curtis, Weiping Pan On Sun, Mar 16, 2014 at 07:03:39AM -0700, Eric Dumazet wrote: > On Sun, 2014-03-16 at 17:07 +0800, Zheng Liu wrote: > > Hi all, > > > > Until now the TCP friends patch set doesn't be applied. Now what's the > > status of TCP friends? Is it applicable to be merged into upstream > > kernel? Any problem that needs to be fixed? Please let me know if I > > can help. > > Well, last attempts showed that while the idea sounded cool, > implementation opened many races and added quite a lot of complexity in > fast path. Thanks for letting me know the current status of this patch set. > > We have AF_UNIX with a lot of problems in it, do we really want to bring > these AF_UNIX problems to AF_INET ? Please bear with me because I am a newbie. Out of curiosity, what's the problem in AF_UNIX? > > I would rather spend time on AF_UNIX if it doesn't fit your needs right > now, or consider jumping to KDBUS modern design. > > Using AF_INET for IPC is poor choice. The reason why we use AF_INET for IPC rather than use AF_UNIX is that we have two applications that need to communicate with each other. They could be deployed on the same server or different servers. So obviously if we use AF_INET, we just need to indicate a IP address in config file. That sounds rational and maintainable for our operation team. Regards, - Zheng ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: What's the status of TCP friends? 2014-03-17 3:16 ` Zheng Liu @ 2014-03-18 1:21 ` Hannes Frederic Sowa 2014-03-18 3:13 ` Zheng Liu 0 siblings, 1 reply; 8+ messages in thread From: Hannes Frederic Sowa @ 2014-03-18 1:21 UTC (permalink / raw) To: Zheng Liu Cc: Eric Dumazet, netdev, Li Yu, David S. Miller, Eric Dumazet, Bruce Brutus Curtis, Weiping Pan, tmorvai On Mon, Mar 17, 2014 at 11:16:05AM +0800, Zheng Liu wrote: > On Sun, Mar 16, 2014 at 07:03:39AM -0700, Eric Dumazet wrote: > > On Sun, 2014-03-16 at 17:07 +0800, Zheng Liu wrote: > > > Hi all, > > > > > > Until now the TCP friends patch set doesn't be applied. Now what's the > > > status of TCP friends? Is it applicable to be merged into upstream > > > kernel? Any problem that needs to be fixed? Please let me know if I > > > can help. > > > > Well, last attempts showed that while the idea sounded cool, > > implementation opened many races and added quite a lot of complexity in > > fast path. > > Thanks for letting me know the current status of this patch set. > > > > > We have AF_UNIX with a lot of problems in it, do we really want to bring > > these AF_UNIX problems to AF_INET ? > > Please bear with me because I am a newbie. Out of curiosity, what's the > problem in AF_UNIX? AFAIK AF_UNIX/SOCK_STREAM is fine. I currently know about two problems with AF_UNIX datagram modes: 1) Not correclty handling socket receive buffer limits. SOCK_DGRAM simply relies on the sending window only and socket receive queue is only limited by the number of packets enqueued. A DoS against another UNIX/DGRAM server is possible by not freeing its sending window if the client doesn't simply pick up its packets. 2) POLLOUT handling seems a bit messed up. This problem was reported by Tamas Morvai. We simply trigger POLLOUT too often thus waking up the sending side unnecessarily. Patch for 1) was rightfully rejected because it could easily break backwards compatibility. There were some ideas floating around which I discussed with Tamas but nothing definite for 2), as I got stuck to work on 1) and I am still unsure what side-effects could have the needed removal of the per-packet socket-receive limit, which seems to be needed for solving 2). The total amount of memory in use by AF_UNIX/DGRAM sockets would be limited by the sending buffers and rlimits, but it still seems unwise to do so. Also solving 2) could make problems regarding backwards-compatibility. > > I would rather spend time on AF_UNIX if it doesn't fit your needs right > > now, or consider jumping to KDBUS modern design. > > > > Using AF_INET for IPC is poor choice. > > The reason why we use AF_INET for IPC rather than use AF_UNIX is that we > have two applications that need to communicate with each other. They > could be deployed on the same server or different servers. So obviously > if we use AF_INET, we just need to indicate a IP address in config file. > That sounds rational and maintainable for our operation team. As soon as we are allowed to silently drop packets in AF_UNIX/DGRAM some problems would vanish, too. ;) Greetings, Hannes ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: What's the status of TCP friends? 2014-03-18 1:21 ` Hannes Frederic Sowa @ 2014-03-18 3:13 ` Zheng Liu 2014-03-18 3:52 ` David Miller 2014-03-18 4:03 ` Eric Dumazet 0 siblings, 2 replies; 8+ messages in thread From: Zheng Liu @ 2014-03-18 3:13 UTC (permalink / raw) To: Hannes Frederic Sowa Cc: Eric Dumazet, netdev, Li Yu, David S. Miller, Eric Dumazet, Bruce Brutus Curtis, Weiping Pan, tmorvai Hi Hannes, On Tue, Mar 18, 2014 at 02:21:20AM +0100, Hannes Frederic Sowa wrote: > On Mon, Mar 17, 2014 at 11:16:05AM +0800, Zheng Liu wrote: > > On Sun, Mar 16, 2014 at 07:03:39AM -0700, Eric Dumazet wrote: > > > On Sun, 2014-03-16 at 17:07 +0800, Zheng Liu wrote: > > > > Hi all, > > > > > > > > Until now the TCP friends patch set doesn't be applied. Now what's the > > > > status of TCP friends? Is it applicable to be merged into upstream > > > > kernel? Any problem that needs to be fixed? Please let me know if I > > > > can help. > > > > > > Well, last attempts showed that while the idea sounded cool, > > > implementation opened many races and added quite a lot of complexity in > > > fast path. > > > > Thanks for letting me know the current status of this patch set. > > > > > > > > We have AF_UNIX with a lot of problems in it, do we really want to bring > > > these AF_UNIX problems to AF_INET ? > > > > Please bear with me because I am a newbie. Out of curiosity, what's the > > problem in AF_UNIX? > > AFAIK AF_UNIX/SOCK_STREAM is fine. > > I currently know about two problems with AF_UNIX datagram modes: > > 1) Not correclty handling socket receive buffer limits. SOCK_DGRAM simply > relies on the sending window only and socket receive queue is only limited > by the number of packets enqueued. A DoS against another UNIX/DGRAM server > is possible by not freeing its sending window if the client doesn't simply > pick up its packets. > > 2) POLLOUT handling seems a bit messed up. This problem was reported by Tamas > Morvai. We simply trigger POLLOUT too often thus waking up the sending side > unnecessarily. > > Patch for 1) was rightfully rejected because it could easily break > backwards compatibility. > > There were some ideas floating around which I discussed with Tamas but > nothing definite for 2), as I got stuck to work on 1) and I am still > unsure what side-effects could have the needed removal of the per-packet > socket-receive limit, which seems to be needed for solving 2). The total > amount of memory in use by AF_UNIX/DGRAM sockets would be limited by > the sending buffers and rlimits, but it still seems unwise to do so. > > Also solving 2) could make problems regarding backwards-compatibility. Thanks for your explanation. > > > > I would rather spend time on AF_UNIX if it doesn't fit your needs right > > > now, or consider jumping to KDBUS modern design. > > > > > > Using AF_INET for IPC is poor choice. > > > > The reason why we use AF_INET for IPC rather than use AF_UNIX is that we > > have two applications that need to communicate with each other. They > > could be deployed on the same server or different servers. So obviously > > if we use AF_INET, we just need to indicate a IP address in config file. > > That sounds rational and maintainable for our operation team. > > As soon as we are allowed to silently drop packets in AF_UNIX/DGRAM some > problems would vanish, too. ;) That sounds great! But it might not satisfy our requirement. If we use AF_UNIX our program will not be deployed on two servers. Meanwhile AF_INET has been applied in our program to communicate with other clients. So DGRAM seems that it is not a good idea. Now our program needs a IPC mechansim that can commnucate between two servers and provide a high performance when two processes are run on the same server. That is the reason why I am interested in TCP friends. :) Regards, - Zheng ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: What's the status of TCP friends? 2014-03-18 3:13 ` Zheng Liu @ 2014-03-18 3:52 ` David Miller 2014-03-18 4:03 ` Eric Dumazet 1 sibling, 0 replies; 8+ messages in thread From: David Miller @ 2014-03-18 3:52 UTC (permalink / raw) To: gnehzuil.liu Cc: hannes, eric.dumazet, netdev, bingtian.ly, edumazet, brutus, panweiping3, tmorvai From: Zheng Liu <gnehzuil.liu@gmail.com> Date: Tue, 18 Mar 2014 11:13:46 +0800 > Now our program needs a IPC mechansim that can commnucate between > two servers and provide a high performance when two processes are > run on the same server. Using both TCP and AF_UNIX, as needed, and dynamically, is a technique which has been in use for more than two decades by X. Nothing stops you from adopting such a scheme too. Usually when I someone says "I can't" it sounds to me more like "I've decided to stop trying to think of a way to make it work." ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: What's the status of TCP friends? 2014-03-18 3:13 ` Zheng Liu 2014-03-18 3:52 ` David Miller @ 2014-03-18 4:03 ` Eric Dumazet 2014-03-18 15:27 ` Stephen Hemminger 1 sibling, 1 reply; 8+ messages in thread From: Eric Dumazet @ 2014-03-18 4:03 UTC (permalink / raw) To: Zheng Liu Cc: Hannes Frederic Sowa, netdev, Li Yu, David S. Miller, Eric Dumazet, Bruce Brutus Curtis, Weiping Pan, tmorvai On Tue, 2014-03-18 at 11:13 +0800, Zheng Liu wrote: > That sounds great! But it might not satisfy our requirement. If we use > AF_UNIX our program will not be deployed on two servers. Meanwhile > AF_INET has been applied in our program to communicate with other > clients. So DGRAM seems that it is not a good idea. Now our program > needs a IPC mechansim that can commnucate between two servers and > provide a high performance when two processes are run on the same > server. That is the reason why I am interested in TCP friends. :) TCP friends is another layer added into TCP stack, for what ? Improving performance for lazy applications ? Really, if you cared about performance, you would have added a way to use fast IPC if available. TCP friends will still be slower than the available IPC mechanisms, by an order of magnitude. So instead of spending time on this TCP friends dream, I think you should focus on existing and supported mechanisms. I for example could add zerocopy support to AF_UNIX, if you think its worth it. ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: What's the status of TCP friends? 2014-03-18 4:03 ` Eric Dumazet @ 2014-03-18 15:27 ` Stephen Hemminger 0 siblings, 0 replies; 8+ messages in thread From: Stephen Hemminger @ 2014-03-18 15:27 UTC (permalink / raw) To: Eric Dumazet Cc: Zheng Liu, Hannes Frederic Sowa, netdev, Li Yu, David S. Miller, Eric Dumazet, Bruce Brutus Curtis, Weiping Pan, tmorvai On Mon, 17 Mar 2014 21:03:20 -0700 Eric Dumazet <eric.dumazet@gmail.com> wrote: > On Tue, 2014-03-18 at 11:13 +0800, Zheng Liu wrote: > > > That sounds great! But it might not satisfy our requirement. If we use > > AF_UNIX our program will not be deployed on two servers. Meanwhile > > AF_INET has been applied in our program to communicate with other > > clients. So DGRAM seems that it is not a good idea. Now our program > > needs a IPC mechansim that can commnucate between two servers and > > provide a high performance when two processes are run on the same > > server. That is the reason why I am interested in TCP friends. :) > > TCP friends is another layer added into TCP stack, for what ? > > Improving performance for lazy applications ? > > Really, if you cared about performance, you would have added a way to > use fast IPC if available. > > TCP friends will still be slower than the available IPC mechanisms, by > an order of magnitude. > > So instead of spending time on this TCP friends dream, I think you > should focus on existing and supported mechanisms. > > I for example could add zerocopy support to AF_UNIX, if you think its > worth it. I find ZMQ to be a useful API for this. It allows abstracting away the choice of transport. You can do inprocess, unix and tcp by specifying different connection in URL like syntax. ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2014-03-18 15:27 UTC | newest] Thread overview: 8+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2014-03-16 9:07 What's the status of TCP friends? Zheng Liu 2014-03-16 14:03 ` Eric Dumazet 2014-03-17 3:16 ` Zheng Liu 2014-03-18 1:21 ` Hannes Frederic Sowa 2014-03-18 3:13 ` Zheng Liu 2014-03-18 3:52 ` David Miller 2014-03-18 4:03 ` Eric Dumazet 2014-03-18 15:27 ` Stephen Hemminger
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).