From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jerin Jacob Subject: Re: [PATCH v6 1/4] lib/librte_ether: support device reset Date: Tue, 21 Jun 2016 16:27:52 +0530 Message-ID: <20160621105751.GA737@localhost.localdomain> References: <1466403870-6840-1-git-send-email-wenzhuo.lu@intel.com> <1466403870-6840-2-git-send-email-wenzhuo.lu@intel.com> <20160620091410.GA9323@localhost.localdomain> <20160620091714.276c186c@xeon-e3> <20160621035124.GC4903@localhost.localdomain> <6A0DE07E22DDAD4C9103DF62FEBC090903488DD1@shsmsx102.ccr.corp.intel.com> <20160621073710.GA30638@localhost.localdomain> <6A0DE07E22DDAD4C9103DF62FEBC090903488F5E@shsmsx102.ccr.corp.intel.com> <20160621085531.GA31880@localhost.localdomain> <2601191342CEEE43887BDE71AB97725836B73FD0@irsmsx105.ger.corp.intel.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Cc: "Lu, Wenzhuo" , Stephen Hemminger , "dev@dpdk.org" , "Richardson, Bruce" , "Chen, Jing D" , "Liang, Cunming" , "Wu, Jingjing" , "Zhang, Helin" , "thomas.monjalon@6wind.com" To: "Ananyev, Konstantin" Return-path: Received: from na01-bn1-obe.outbound.protection.outlook.com (mail-bn1on0065.outbound.protection.outlook.com [157.56.110.65]) by dpdk.org (Postfix) with ESMTP id 51A3BADE9 for ; Tue, 21 Jun 2016 12:58:23 +0200 (CEST) Content-Disposition: inline In-Reply-To: <2601191342CEEE43887BDE71AB97725836B73FD0@irsmsx105.ger.corp.intel.com> List-Id: patches and discussions about DPDK List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" On Tue, Jun 21, 2016 at 09:26:12AM +0000, Ananyev, Konstantin wrote: Hi Konstantin, > Hi Jerin, > > > -----Original Message----- > > From: Jerin Jacob [mailto:jerin.jacob@caviumnetworks.com] > > Sent: Tuesday, June 21, 2016 9:56 AM > > To: Lu, Wenzhuo > > Cc: Stephen Hemminger; dev@dpdk.org; Ananyev, Konstantin; Richardson, Bruce; Chen, Jing D; Liang, Cunming; Wu, Jingjing; Zhang, > > Helin; thomas.monjalon@6wind.com > > Subject: Re: [dpdk-dev] [PATCH v6 1/4] lib/librte_ether: support device reset > > > > On Tue, Jun 21, 2016 at 08:24:36AM +0000, Lu, Wenzhuo wrote: > > > Hi Jerin, > > > > Hi Wenzhuo, > > > > > > > > > > On Mon, Jun 20, 2016 at 02:24:27PM +0800, Wenzhuo Lu wrote: > > > > > > > > > Add an API to reset the device. > > > > > > > > > It's for VF device in this scenario, kernel PF + DPDK VF. > > > > > > > > > When the PF port down->up, APP should call this API to reset > > > > > > > > > VF port. Most likely, APP should call it in its management > > > > > > > > > thread and guarantee the thread safe. It means APP should stop > > > > > > > > > the rx/tx and the device, then reset the device, then recover > > > > > > > > > the device and rx/tx. > > > > > > > > > > > > > > > > Following is _a_ use-case for Device reset. But may be not be > > > > > > > > _the_ use case. IMO, We need to first say expected behavior of > > > > > > > > this API and add a use-case later. > > > > > > > > > > > > > > > > Other use-case would be, PCIe VF with functional level reset for > > > > > > > > SRIOV migration. > > > > > > > > Are we on same page? > > > > > > > > > > > > > > > > > > > > > In my experience with Linux devices, this is normally handled by > > > > > > > the device driver in the start routine. Since any use case which > > > > > > > needs this is going to do a stop/reset/start sequence, why not > > > > > > > just have the VF device driver do this in the start routine?. > > > > > > > > > > > > > > Adding yet another API and state transistion if not necessary > > > > > > > increases the complexity and required test cases for all devices. > > > > > > > > > > > > I agree with Stephen here.I think if application needs to call start > > > > > > after the device reset then we could add this logic in start itself > > > > > > rather exposing a yet another API > > > > > Do you mean changing the device_start to include all these actions, stop > > > > device -> stop queue -> re-setup queue -> start queue -> start device ? > > > > > > > > What was the expected API call sequence when you were introduced this API? > > > > > > > > Point was to have implicit device reset in the API call sequence(Wherever make > > > > sense for specific PMD) > > > I think the API call sequence depends on the implementation of the APP. Let's say if there's not this reset API, APP can use this API > > call sequence to handle the PF link down/up event, rte_eth_dev_close -> rte_eth_rx_queue_setup -> rte_eth_tx_queue_setup -> > > rte_eth_dev_start. > > > Actually our purpose is to use this reset API instead of the API call sequence. You can see the reset API is not necessary. The benefit > > is to save the code for APP. > > > > Then I am bit confused with original commit log description. > > | > > |It means APP should stop the rx/tx and the device, then reset the > > |device, then recover the device and rx/tx. > > | > > I was under impression that it a low level reset API for this device? Is > > n't it? > > > > The other issue is generalized outlook of the API, Certain PMD will not > > have PF link down/up event? Link down/up and only connected to VF and PF > > only for configuration. > > > > How about fixing it more transparently in PMD driver itself as > > PMD driver knows the PF link up/down event, Is it possible to > > recover the VF on that event if its only matter of resetting it? > > I think we already went through that discussion on the list. > Unfortunately with current dpdk design it is hardly possible. > To achieve that we need to introduce some sort of synchronisation > between IO and control APIs (locking or so). > Actually I am not sure why having a special reset function will be a problem. | |It means APP should stop the rx/tx and the device, then reset the |device, then recover the device and rx/tx. | Just to understand, If application still need to do the stop then what value addtion reset API brings on the table? > Yes, it would exist only for VFs, for PF it could be left unimplemented. > Though it definitely seems more convenient from user point of view, > they would know: to handle VF reset event, they just need to call that > particular function, not to re-implement their own. What if driver returns "not implemented" then application will have do generic rte_eth_dev_stop/rte_eth_dev_start.That way in application perspective we are NOT solving any problem. Jerin