* Using AF_XDP To Modify Outgoing Packets @ 2020-03-30 16:11 Christian Deacon 2020-03-30 16:24 ` David Ahern 2020-03-30 18:10 ` Jakub Kicinski 0 siblings, 2 replies; 9+ messages in thread From: Christian Deacon @ 2020-03-30 16:11 UTC (permalink / raw) To: xdp-newbies Hey everyone, I am fairly new to XDP and AF_XDP programming. Therefore, I apologize if this question sounds silly. To my understanding, an XDP program cannot process outgoing packets since it doesn't support the TX path. However, I read that AF_XDP sockets support both receiving and transmitting packets inside the user space while achieving zero-copy. I am trying to create a C program that modifies outgoing packets on an interface and trying to do this the fastest way possible. Unfortunately, I cannot use IPTables for this. I wanted to know if there is any way to use AF_XDP sockets to modify outgoing packets on an interface. I wasn't sure if the AF_XDP sockets only support receiving traffic from the XDP program when using the redirect function or not. If this isn't possible, are there any plans to add TX path support into XDP in the future? I'm also not sure what else I can do to achieve fast packet processing for this. I looked into using standard AF_PACKET sockets. However, since that makes a copy of the packet from the kernel, I'd assume that's pretty slow and I'd have to find a way to block the original packets (probably doing something with IPTables). I might also look into DPDK, but I want to see if using AF_XDP sockets for outgoing packet processing is possible first because it looks like it's going to take a while to learn DPDK. Any help is highly appreciated and thank you for your time! ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-03-30 16:11 Using AF_XDP To Modify Outgoing Packets Christian Deacon @ 2020-03-30 16:24 ` David Ahern 2020-03-30 18:10 ` Jakub Kicinski 1 sibling, 0 replies; 9+ messages in thread From: David Ahern @ 2020-03-30 16:24 UTC (permalink / raw) To: Christian Deacon, xdp-newbies On 3/30/20 10:11 AM, Christian Deacon wrote: > If this isn't possible, are there any plans to add TX path support into > XDP in the future? I am working on it. Latest set is here: https://github.com/dsahern/linux/commits/xdp/egress-rfc5-06 does not include support for AF_XDP, but hopefully that can be added later by someone with more knowledge of socket needs. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-03-30 16:11 Using AF_XDP To Modify Outgoing Packets Christian Deacon 2020-03-30 16:24 ` David Ahern @ 2020-03-30 18:10 ` Jakub Kicinski 2020-03-30 19:28 ` Christian Deacon 1 sibling, 1 reply; 9+ messages in thread From: Jakub Kicinski @ 2020-03-30 18:10 UTC (permalink / raw) To: Christian Deacon; +Cc: xdp-newbies On Mon, 30 Mar 2020 11:11:49 -0500 Christian Deacon wrote: > Hey everyone, > > > I am fairly new to XDP and AF_XDP programming. Therefore, I apologize if > this question sounds silly. > > > To my understanding, an XDP program cannot process outgoing packets > since it doesn't support the TX path. However, I read that AF_XDP > sockets support both receiving and transmitting packets inside the user > space while achieving zero-copy. I am trying to create a C program that > modifies outgoing packets on an interface and trying to do this the > fastest way possible. Unfortunately, I cannot use IPTables for this. > > > I wanted to know if there is any way to use AF_XDP sockets to modify > outgoing packets on an interface. I wasn't sure if the AF_XDP sockets > only support receiving traffic from the XDP program when using the > redirect function or not. > > > If this isn't possible, are there any plans to add TX path support into > XDP in the future? I'm also not sure what else I can do to achieve fast > packet processing for this. I looked into using standard AF_PACKET > sockets. However, since that makes a copy of the packet from the kernel, > I'd assume that's pretty slow and I'd have to find a way to block the > original packets (probably doing something with IPTables). I might also > look into DPDK, but I want to see if using AF_XDP sockets for outgoing > packet processing is possible first because it looks like it's going to > take a while to learn DPDK. > > > Any help is highly appreciated and thank you for your time! Could you give us some more info on the use case? Where do the packets originate? The main advantage of XDP is that it processes packets before the metadata needed by the networking stack is allocated (skb). This is only possible for RX or when packets are redirected from another interface with XDP. All packets which pass through the stack anyway can be efficiently modified using cls_bpf. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-03-30 18:10 ` Jakub Kicinski @ 2020-03-30 19:28 ` Christian Deacon 2020-03-31 9:29 ` Toke Høiland-Jørgensen 0 siblings, 1 reply; 9+ messages in thread From: Christian Deacon @ 2020-03-30 19:28 UTC (permalink / raw) To: Jakub Kicinski; +Cc: xdp-newbies Hey David and Jakub, Thank you for your replies! David, it's good to know that egress support is being added to XDP and I appreciate all the work you and others are putting into this! Do you know if there is any ETA on when this will be officially available/supported? With that said, will this be faster than most solutions available now for processing/modifying outbound packets such as using standard AF_PACKET sockets, IPTables, or DPDK? Jakub, thank you for that information! As for my project, I have a program forwarding traffic to a server via IPIP packets. The destination server has multiple network namespaces along with the IPIP tunnel endpoints and the application sitting inside each namespace. As of right now, the destination machine replies back through the IPIP tunnel (to the forwarding server) and the forwarding server has to send the replies back to the client. My goal is to make it so the application sends traffic back to the client directly by spoofing the source address as the forwarding server's IP address. This would result in less load on the forwarding server along with less latency in my case. Currently, the IPIP tunnel endpoints inside the namespaces are set as the default devices and all IPIP packets go out the main interface on the destination machine. Initially, I tried creating a veth pair and put the peer inside the namespace. I then created a bridge on the main namespace and bridged the veth on the main namespace. I assigned the bridge an IP and had an SNAT rule in the IPTables POSTROUTING chain to source all traffic out as the forwarding server IP. I set the veth pair inside the network namespace as the default device on the network namespace and set the next hop to the bridge IP. The networking part of this worked fine, traffic sent out from the application (through the default route in the network namespace) was reaching the clients directly and the clients were replying back to the forwarding server. However, this still didn't work and I believe the cause is due to the application not supporting two separate interfaces (one for receiving and one for sending). Unfortunately, the application is closed-source and I doubt support for using two separate interfaces will be added. With the above said, I've been trying to look into creating a program that would receive all outgoing packets on the main interface. It would check the outer IP header's protocol to ensure it's IPPROTO_IPIP. If this is the case, it would then check if the outer IP header's source address is the same as the main interface's IP address. If this matches, it would save the outer IP header's destination address and remove the outer IP header. It would then replace the inner IP header's source address with the saved address (outer IP header's destination address) which should be the IP of the forwarding server. Afterwards, it would recalculate the IP and transport header's checksums and continue sending the packet. I believe in theory this should work. I am trying to find the best way to achieve the above. I don't believe IPTables supports changing the packet's contents to the same extent as the above. I made an XDP program yesterday that would do this, but later found out XDP doesn't support egress at the moment. I still plan to use the code for when TX path/egress support is added. I'd like to come up with another solution in the meantime to achieve the above, though. I hope this helps clear up the situation and thank you again! On 3/30/2020 1:10 PM, Jakub Kicinski wrote: > On Mon, 30 Mar 2020 11:11:49 -0500 Christian Deacon wrote: >> Hey everyone, >> >> >> I am fairly new to XDP and AF_XDP programming. Therefore, I apologize if >> this question sounds silly. >> >> >> To my understanding, an XDP program cannot process outgoing packets >> since it doesn't support the TX path. However, I read that AF_XDP >> sockets support both receiving and transmitting packets inside the user >> space while achieving zero-copy. I am trying to create a C program that >> modifies outgoing packets on an interface and trying to do this the >> fastest way possible. Unfortunately, I cannot use IPTables for this. >> >> >> I wanted to know if there is any way to use AF_XDP sockets to modify >> outgoing packets on an interface. I wasn't sure if the AF_XDP sockets >> only support receiving traffic from the XDP program when using the >> redirect function or not. >> >> >> If this isn't possible, are there any plans to add TX path support into >> XDP in the future? I'm also not sure what else I can do to achieve fast >> packet processing for this. I looked into using standard AF_PACKET >> sockets. However, since that makes a copy of the packet from the kernel, >> I'd assume that's pretty slow and I'd have to find a way to block the >> original packets (probably doing something with IPTables). I might also >> look into DPDK, but I want to see if using AF_XDP sockets for outgoing >> packet processing is possible first because it looks like it's going to >> take a while to learn DPDK. >> >> >> Any help is highly appreciated and thank you for your time! > Could you give us some more info on the use case? Where do the packets > originate? The main advantage of XDP is that it processes packets > before the metadata needed by the networking stack is allocated (skb). > This is only possible for RX or when packets are redirected from > another interface with XDP. All packets which pass through the stack > anyway can be efficiently modified using cls_bpf. > ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-03-30 19:28 ` Christian Deacon @ 2020-03-31 9:29 ` Toke Høiland-Jørgensen 2020-03-31 23:22 ` Christian Deacon 0 siblings, 1 reply; 9+ messages in thread From: Toke Høiland-Jørgensen @ 2020-03-31 9:29 UTC (permalink / raw) To: Christian Deacon, Jakub Kicinski; +Cc: xdp-newbies Christian Deacon <gamemann@gflclan.com> writes: > Hey David and Jakub, > > Thank you for your replies! > > David, it's good to know that egress support is being added to XDP and I > appreciate all the work you and others are putting into this! Do you > know if there is any ETA on when this will be officially > available/supported? With that said, will this be faster than most > solutions available now for processing/modifying outbound packets such > as using standard AF_PACKET sockets, IPTables, or DPDK? > > Jakub, thank you for that information! As for my project, I have a > program forwarding traffic to a server via IPIP packets. The destination > server has multiple network namespaces along with the IPIP tunnel > endpoints and the application sitting inside each namespace. As of right > now, the destination machine replies back through the IPIP tunnel (to > the forwarding server) and the forwarding server has to send the replies > back to the client. My goal is to make it so the application sends > traffic back to the client directly by spoofing the source address as > the forwarding server's IP address. This would result in less load on > the forwarding server along with less latency in my case. Currently, the > IPIP tunnel endpoints inside the namespaces are set as the default > devices and all IPIP packets go out the main interface on the > destination machine. > > Initially, I tried creating a veth pair and put the peer inside the > namespace. I then created a bridge on the main namespace and bridged the > veth on the main namespace. I assigned the bridge an IP and had an SNAT > rule in the IPTables POSTROUTING chain to source all traffic out as the > forwarding server IP. I set the veth pair inside the network namespace > as the default device on the network namespace and set the next hop to > the bridge IP. The networking part of this worked fine, traffic sent out > from the application (through the default route in the network > namespace) was reaching the clients directly and the clients were > replying back to the forwarding server. However, this still didn't work > and I believe the cause is due to the application not supporting two > separate interfaces (one for receiving and one for sending). > Unfortunately, the application is closed-source and I doubt support for > using two separate interfaces will be added. > > With the above said, I've been trying to look into creating a program > that would receive all outgoing packets on the main interface. It would > check the outer IP header's protocol to ensure it's IPPROTO_IPIP. If > this is the case, it would then check if the outer IP header's source > address is the same as the main interface's IP address. If this matches, > it would save the outer IP header's destination address and remove the > outer IP header. It would then replace the inner IP header's source > address with the saved address (outer IP header's destination address) > which should be the IP of the forwarding server. Afterwards, it would > recalculate the IP and transport header's checksums and continue sending > the packet. I believe in theory this should work. > > I am trying to find the best way to achieve the above. I don't believe > IPTables supports changing the packet's contents to the same extent as > the above. > > I made an XDP program yesterday that would do this, but later found out > XDP doesn't support egress at the moment. I still plan to use the code > for when TX path/egress support is added. I'd like to come up with > another solution in the meantime to achieve the above, though. I think you could do this with the TC hook? You can install BPF programs there that have then same ability to modify the program as XDP does. And since the packets are coming from an application, you don't gain any speedup from XDP anyway (since the kernel has already built its packet data structures). -Toke ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-03-31 9:29 ` Toke Høiland-Jørgensen @ 2020-03-31 23:22 ` Christian Deacon 2020-04-01 8:33 ` Toke Høiland-Jørgensen 2020-04-01 9:55 ` Jesper Dangaard Brouer 0 siblings, 2 replies; 9+ messages in thread From: Christian Deacon @ 2020-03-31 23:22 UTC (permalink / raw) To: Toke Høiland-Jørgensen; +Cc: xdp-newbies Hey Toke, Thank you for your reply and information! After looking into the TC Hook, it looks like this will work for my case. I'll try to get a program made in the next few days or so. Thank you for the help! On 3/31/2020 4:29 AM, Toke Høiland-Jørgensen wrote: > Christian Deacon <gamemann@gflclan.com> writes: > >> Hey David and Jakub, >> >> Thank you for your replies! >> >> David, it's good to know that egress support is being added to XDP and I >> appreciate all the work you and others are putting into this! Do you >> know if there is any ETA on when this will be officially >> available/supported? With that said, will this be faster than most >> solutions available now for processing/modifying outbound packets such >> as using standard AF_PACKET sockets, IPTables, or DPDK? >> >> Jakub, thank you for that information! As for my project, I have a >> program forwarding traffic to a server via IPIP packets. The destination >> server has multiple network namespaces along with the IPIP tunnel >> endpoints and the application sitting inside each namespace. As of right >> now, the destination machine replies back through the IPIP tunnel (to >> the forwarding server) and the forwarding server has to send the replies >> back to the client. My goal is to make it so the application sends >> traffic back to the client directly by spoofing the source address as >> the forwarding server's IP address. This would result in less load on >> the forwarding server along with less latency in my case. Currently, the >> IPIP tunnel endpoints inside the namespaces are set as the default >> devices and all IPIP packets go out the main interface on the >> destination machine. >> >> Initially, I tried creating a veth pair and put the peer inside the >> namespace. I then created a bridge on the main namespace and bridged the >> veth on the main namespace. I assigned the bridge an IP and had an SNAT >> rule in the IPTables POSTROUTING chain to source all traffic out as the >> forwarding server IP. I set the veth pair inside the network namespace >> as the default device on the network namespace and set the next hop to >> the bridge IP. The networking part of this worked fine, traffic sent out >> from the application (through the default route in the network >> namespace) was reaching the clients directly and the clients were >> replying back to the forwarding server. However, this still didn't work >> and I believe the cause is due to the application not supporting two >> separate interfaces (one for receiving and one for sending). >> Unfortunately, the application is closed-source and I doubt support for >> using two separate interfaces will be added. >> >> With the above said, I've been trying to look into creating a program >> that would receive all outgoing packets on the main interface. It would >> check the outer IP header's protocol to ensure it's IPPROTO_IPIP. If >> this is the case, it would then check if the outer IP header's source >> address is the same as the main interface's IP address. If this matches, >> it would save the outer IP header's destination address and remove the >> outer IP header. It would then replace the inner IP header's source >> address with the saved address (outer IP header's destination address) >> which should be the IP of the forwarding server. Afterwards, it would >> recalculate the IP and transport header's checksums and continue sending >> the packet. I believe in theory this should work. >> >> I am trying to find the best way to achieve the above. I don't believe >> IPTables supports changing the packet's contents to the same extent as >> the above. >> >> I made an XDP program yesterday that would do this, but later found out >> XDP doesn't support egress at the moment. I still plan to use the code >> for when TX path/egress support is added. I'd like to come up with >> another solution in the meantime to achieve the above, though. > I think you could do this with the TC hook? You can install BPF programs > there that have then same ability to modify the program as XDP does. And > since the packets are coming from an application, you don't gain any > speedup from XDP anyway (since the kernel has already built its packet > data structures). > > -Toke > ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-03-31 23:22 ` Christian Deacon @ 2020-04-01 8:33 ` Toke Høiland-Jørgensen 2020-04-01 9:55 ` Jesper Dangaard Brouer 1 sibling, 0 replies; 9+ messages in thread From: Toke Høiland-Jørgensen @ 2020-04-01 8:33 UTC (permalink / raw) To: Christian Deacon; +Cc: xdp-newbies Christian Deacon <gamemann@gflclan.com> writes: > Hey Toke, > > > Thank you for your reply and information! > > > After looking into the TC Hook, it looks like this will work for my > case. I'll try to get a program made in the next few days or so. > > Thank you for the help! Awesome! You're welcome :) -Toke ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-03-31 23:22 ` Christian Deacon 2020-04-01 8:33 ` Toke Høiland-Jørgensen @ 2020-04-01 9:55 ` Jesper Dangaard Brouer 2020-04-02 8:26 ` Christian Deacon 1 sibling, 1 reply; 9+ messages in thread From: Jesper Dangaard Brouer @ 2020-04-01 9:55 UTC (permalink / raw) To: Christian Deacon; +Cc: brouer, Toke Høiland-Jørgensen, xdp-newbies On Tue, 31 Mar 2020 18:22:53 -0500 Christian Deacon <gamemann@gflclan.com> wrote: > After looking into the TC Hook, it looks like this will work for my > case. I'll try to get a program made in the next few days or so. If you want code examples of TC egress hook with BPF look at[1]: [1] https://github.com/xdp-project/xdp-cpumap-tc/ Notice that map pinning with libbpf have gotten easier (thanks to Toke). This example does work (runs in production), it shares and reuse maps between XDP and TC-bpf. The example uses the old way to define maps, and I recommend switching to the new BTF-based syntax instead (requires LLVM 10), which defines ".maps" SEC, see example[2], and notice "pinning" option, which obsoletes a lot of the map code in[1]. Attaching TC-bpf progs are different that XDP. I've hidden the detail in C-code function tc_egress_attach_bpf() (see[3]), which actually just calls the "tc" command. I was hoping that this would be replaced with a libbpf call, like we have for XDP, but that have not happened. [2] https://github.com/xdp-project/xdp-tools/blob/master/xdp-filter/xdpfilt_prog.h#L124-L131 [3] https://github.com/xdp-project/xdp-cpumap-tc/blob/master/src/common_user.c#L386 -- Best regards, Jesper Dangaard Brouer MSc.CS, Principal Kernel Engineer at Red Hat LinkedIn: http://www.linkedin.com/in/brouer google keywords: tc qdisc clsact cls_bpf egress filter ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: Using AF_XDP To Modify Outgoing Packets 2020-04-01 9:55 ` Jesper Dangaard Brouer @ 2020-04-02 8:26 ` Christian Deacon 0 siblings, 0 replies; 9+ messages in thread From: Christian Deacon @ 2020-04-02 8:26 UTC (permalink / raw) To: Jesper Dangaard Brouer; +Cc: xdp-newbies Hey Jesper, Thank you for the information and examples! They helped a lot! I've started developing the TC egress program today and I have it nearly finished. For some reason, when I modify the source address of the inner IP header after removing the outer IP header, the packet is never received from the destination host after returning 'TC_ACT_OK'. I did a tcpdump and can see the packets with the changed source IP along with all the checksums being calculated properly. They also have the correct MAC addresses in the Ethernet header. I plan to continue looking into it tomorrow and to my understanding, this mailing list is for XDP-specific threads. Therefore, I'll probably be creating a thread on the BPF mailing list if I can't figure things out (I didn't see any TC-specific mailing list). I just wanted to thank everyone who has replied for the information and help! I really appreciate it. On 4/1/2020 4:55 AM, Jesper Dangaard Brouer wrote: > On Tue, 31 Mar 2020 18:22:53 -0500 > Christian Deacon <gamemann@gflclan.com> wrote: > >> After looking into the TC Hook, it looks like this will work for my >> case. I'll try to get a program made in the next few days or so. > If you want code examples of TC egress hook with BPF look at[1]: > [1] https://github.com/xdp-project/xdp-cpumap-tc/ > > Notice that map pinning with libbpf have gotten easier (thanks to > Toke). This example does work (runs in production), it shares and reuse > maps between XDP and TC-bpf. The example uses the old way to define > maps, and I recommend switching to the new BTF-based syntax instead > (requires LLVM 10), which defines ".maps" SEC, see example[2], and > notice "pinning" option, which obsoletes a lot of the map code in[1]. > > Attaching TC-bpf progs are different that XDP. I've hidden the detail > in C-code function tc_egress_attach_bpf() (see[3]), which actually just > calls the "tc" command. I was hoping that this would be replaced with a > libbpf call, like we have for XDP, but that have not happened. > > > [2] https://github.com/xdp-project/xdp-tools/blob/master/xdp-filter/xdpfilt_prog.h#L124-L131 > [3] https://github.com/xdp-project/xdp-cpumap-tc/blob/master/src/common_user.c#L386 ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2020-04-02 8:27 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2020-03-30 16:11 Using AF_XDP To Modify Outgoing Packets Christian Deacon 2020-03-30 16:24 ` David Ahern 2020-03-30 18:10 ` Jakub Kicinski 2020-03-30 19:28 ` Christian Deacon 2020-03-31 9:29 ` Toke Høiland-Jørgensen 2020-03-31 23:22 ` Christian Deacon 2020-04-01 8:33 ` Toke Høiland-Jørgensen 2020-04-01 9:55 ` Jesper Dangaard Brouer 2020-04-02 8:26 ` Christian Deacon
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.