From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ilya Maximets Subject: Re: [PATCH] Unlink existing unused sockets at start up Date: Thu, 17 Dec 2015 14:47:49 +0300 Message-ID: <5672A0E5.4040904@samsung.com> References: <1450326062-105574-1-git-send-email-zhihong.wang@intel.com> Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit Cc: s.dyasly@samsung.com To: Zhihong Wang , dev@dpdk.org Return-path: Received: from mailout3.w1.samsung.com (mailout3.w1.samsung.com [210.118.77.13]) by dpdk.org (Postfix) with ESMTP id 192218D93 for ; Thu, 17 Dec 2015 12:47:52 +0100 (CET) Received: from eucpsbgm1.samsung.com (unknown [203.254.199.244]) by mailout3.w1.samsung.com (Oracle Communications Messaging Server 7.0.5.31.0 64bit (built May 5 2014)) with ESMTP id <0NZI00A583FQTL70@mailout3.w1.samsung.com> for dev@dpdk.org; Thu, 17 Dec 2015 11:47:50 +0000 (GMT) In-reply-to: <1450326062-105574-1-git-send-email-zhihong.wang@intel.com> List-Id: patches and discussions about DPDK List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" On 17.12.2015 07:21, Zhihong Wang wrote: > This patch unlinks existing unused sockets (which cause new bindings to fail, e.g. vHost PMD) to ensure smooth startup. > In a lot of cases DPDK applications are terminated abnormally without proper resource release. Original OVS related problem discussed previously here ( http://dpdk.org/ml/archives/dev/2015-December/030326.html ) fixed in OVS by commit 9b5422a98f817b9f2a1f8224cab7e1a8d0bbba1f Author: Ilya Maximets Date: Wed Dec 16 15:32:21 2015 +0300 ovs-lib: Try to call exit before killing. While killing OVS may not free all allocated resources. Example: Socket for vhost-user port will stay in a system after 'systemctl stop openvswitch' and opening that port after restart will fail. So, the crash of application is the last point of discussion. > Therefore, DPDK libs should be able to deal with unclean boot environment. Why are you think that recovery after crash of application is a problem of underneath library? Best regards, Ilya Maximets. > > Signed-off-by: Zhihong Wang > --- > lib/librte_vhost/vhost_user/vhost-net-user.c | 28 ++++++++++++++++++++++++---- > 1 file changed, 24 insertions(+), 4 deletions(-) > > diff --git a/lib/librte_vhost/vhost_user/vhost-net-user.c b/lib/librte_vhost/vhost_user/vhost-net-user.c > index 8b7a448..eac0721 100644 > --- a/lib/librte_vhost/vhost_user/vhost-net-user.c > +++ b/lib/librte_vhost/vhost_user/vhost-net-user.c > @@ -120,18 +120,38 @@ uds_socket(const char *path) > sockfd = socket(AF_UNIX, SOCK_STREAM, 0); > if (sockfd < 0) > return -1; > - RTE_LOG(INFO, VHOST_CONFIG, "socket created, fd:%d\n", sockfd); > + RTE_LOG(INFO, VHOST_CONFIG, "socket created, fd: %d\n", sockfd); > > memset(&un, 0, sizeof(un)); > un.sun_family = AF_UNIX; > snprintf(un.sun_path, sizeof(un.sun_path), "%s", path); > ret = bind(sockfd, (struct sockaddr *)&un, sizeof(un)); > if (ret == -1) { > - RTE_LOG(ERR, VHOST_CONFIG, "fail to bind fd:%d, remove file:%s and try again.\n", > + RTE_LOG(ERR, VHOST_CONFIG, > + "bind fd: %d to file: %s failed, checking socket...\n", > sockfd, path); > - goto err; > + ret = connect(sockfd, (struct sockaddr *)&un, sizeof(un)); > + if (ret == -1) { > + RTE_LOG(INFO, VHOST_CONFIG, > + "socket: %s is inactive, rebinding after unlink...\n", path); > + unlink(path); > + ret = bind(sockfd, (struct sockaddr *)&un, sizeof(un)); > + if (ret == -1) { > + RTE_LOG(ERR, VHOST_CONFIG, > + "bind fd: %d to file: %s failed even after unlink\n", > + sockfd, path); > + goto err; > + } > + } else { > + RTE_LOG(INFO, VHOST_CONFIG, > + "socket: %s is alive, remove it and try again\n", path); > + RTE_LOG(ERR, VHOST_CONFIG, > + "bind fd: %d to file: %s failed\n", sockfd, path); > + goto err; > + } > } > - RTE_LOG(INFO, VHOST_CONFIG, "bind to %s\n", path); > + RTE_LOG(INFO, VHOST_CONFIG, > + "bind fd: %d to file: %s successful\n", sockfd, path); > > ret = listen(sockfd, MAX_VIRTIO_BACKLOG); > if (ret == -1) >