* [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer failure
@ 2014-11-27 7:31 Dexuan Cui
2014-11-27 7:14 ` Jason Wang
0 siblings, 1 reply; 5+ messages in thread
From: Dexuan Cui @ 2014-11-27 7:31 UTC (permalink / raw)
To: gregkh, linux-kernel, driverdev-devel, olaf, apw, jasowang, kys,
vkuznets
Cc: haiyangz
In the case the user-space daemon crashes, hangs or is killed, we
need to down the semaphore, otherwise, after the daemon starts next
time, the obsolete data in fcopy_transaction.message or
fcopy_transaction.fcopy_msg will be used immediately.
Reviewed-by: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: K. Y. Srinivasan <kys@microsoft.com>
Signed-off-by: Dexuan Cui <decui@microsoft.com>
---
v2: I removed the "FCP" prefix as Greg asked.
I also updated the output message a little:
"FCP: failed to acquire the semaphore" -->
"can not acquire the semaphore: it is benign"
drivers/hv/hv_fcopy.c | 9 +++++++++
1 file changed, 9 insertions(+)
diff --git a/drivers/hv/hv_fcopy.c b/drivers/hv/hv_fcopy.c
index 23b2ce2..c518ad9 100644
--- a/drivers/hv/hv_fcopy.c
+++ b/drivers/hv/hv_fcopy.c
@@ -86,6 +86,15 @@ static void fcopy_work_func(struct work_struct *dummy)
* process the pending transaction.
*/
fcopy_respond_to_host(HV_E_FAIL);
+
+ /* In the case the user-space daemon crashes, hangs or is killed, we
+ * need to down the semaphore, otherwise, after the daemon starts next
+ * time, the obsolete data in fcopy_transaction.message or
+ * fcopy_transaction.fcopy_msg will be used immediately.
+ */
+ if (down_trylock(&fcopy_transaction.read_sema))
+ pr_debug("can not acquire the semaphore: it is benign\n");
+
}
static int fcopy_handle_handshake(u32 version)
--
1.9.1
^ permalink raw reply related [flat|nested] 5+ messages in thread* Re: [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer failure 2014-11-27 7:31 [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer failure Dexuan Cui @ 2014-11-27 7:14 ` Jason Wang 2014-11-27 8:50 ` Dexuan Cui 0 siblings, 1 reply; 5+ messages in thread From: Jason Wang @ 2014-11-27 7:14 UTC (permalink / raw) To: Dexuan Cui Cc: gregkh, linux-kernel, driverdev-devel, olaf, apw, kys, vkuznets, haiyangz ----- Original Message ----- > In the case the user-space daemon crashes, hangs or is killed, we > need to down the semaphore, otherwise, after the daemon starts next > time, the obsolete data in fcopy_transaction.message or > fcopy_transaction.fcopy_msg will be used immediately. > > Reviewed-by: Vitaly Kuznetsov <vkuznets@redhat.com> > Cc: K. Y. Srinivasan <kys@microsoft.com> > Signed-off-by: Dexuan Cui <decui@microsoft.com> > --- > > v2: I removed the "FCP" prefix as Greg asked. > > I also updated the output message a little: > "FCP: failed to acquire the semaphore" --> > "can not acquire the semaphore: it is benign" > > drivers/hv/hv_fcopy.c | 9 +++++++++ > 1 file changed, 9 insertions(+) > > diff --git a/drivers/hv/hv_fcopy.c b/drivers/hv/hv_fcopy.c > index 23b2ce2..c518ad9 100644 > --- a/drivers/hv/hv_fcopy.c > +++ b/drivers/hv/hv_fcopy.c > @@ -86,6 +86,15 @@ static void fcopy_work_func(struct work_struct *dummy) > * process the pending transaction. > */ > fcopy_respond_to_host(HV_E_FAIL); > + > + /* In the case the user-space daemon crashes, hangs or is killed, we > + * need to down the semaphore, otherwise, after the daemon starts next > + * time, the obsolete data in fcopy_transaction.message or > + * fcopy_transaction.fcopy_msg will be used immediately. > + */ Looks still racy, what happens if the daemon start before down_trylock() but after fcopy_respont_to_host() here? > + if (down_trylock(&fcopy_transaction.read_sema)) > + pr_debug("can not acquire the semaphore: it is benign\n"); typo > + > } > > static int fcopy_handle_handshake(u32 version) > -- > 1.9.1 > > -- > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > ^ permalink raw reply [flat|nested] 5+ messages in thread
* RE: [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer failure 2014-11-27 7:14 ` Jason Wang @ 2014-11-27 8:50 ` Dexuan Cui [not found] ` <F792CF86EFE20D4AB8064279AFBA51C613E641D8@HKNPRD3002MB017.064d.mgd.msft.net > 0 siblings, 1 reply; 5+ messages in thread From: Dexuan Cui @ 2014-11-27 8:50 UTC (permalink / raw) To: Jason Wang Cc: gregkh@linuxfoundation.org, linux-kernel@vger.kernel.org, driverdev-devel@linuxdriverproject.org, olaf@aepfle.de, apw@canonical.com, KY Srinivasan, vkuznets@redhat.com, Haiyang Zhang [-- Warning: decoded text below may be mangled, UTF-8 assumed --] [-- Attachment #1: Type: text/plain; charset="utf-8", Size: 2414 bytes --] > -----Original Message----- > From: Jason Wang [mailto:jasowang@redhat.com] > Sent: Thursday, November 27, 2014 15:15 PM > To: Dexuan Cui > Cc: gregkh@linuxfoundation.org; linux-kernel@vger.kernel.org; driverdev- > devel@linuxdriverproject.org; olaf@aepfle.de; apw@canonical.com; KY > Srinivasan; vkuznets@redhat.com; Haiyang Zhang > Subject: Re: [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer > failure > ----- Original Message ----- > > In the case the user-space daemon crashes, hangs or is killed, we > > need to down the semaphore, otherwise, after the daemon starts next > > time, the obsolete data in fcopy_transaction.message or > > fcopy_transaction.fcopy_msg will be used immediately. > > > > Reviewed-by: Vitaly Kuznetsov <vkuznets@redhat.com> > > Cc: K. Y. Srinivasan <kys@microsoft.com> > > Signed-off-by: Dexuan Cui <decui@microsoft.com> > > --- > > > > v2: I removed the "FCP" prefix as Greg asked. > > > > I also updated the output message a little: > > "FCP: failed to acquire the semaphore" --> > > "can not acquire the semaphore: it is benign" > > > > drivers/hv/hv_fcopy.c | 9 +++++++++ > > 1 file changed, 9 insertions(+) > > > > diff --git a/drivers/hv/hv_fcopy.c b/drivers/hv/hv_fcopy.c > > index 23b2ce2..c518ad9 100644 > > --- a/drivers/hv/hv_fcopy.c > > +++ b/drivers/hv/hv_fcopy.c > > @@ -86,6 +86,15 @@ static void fcopy_work_func(struct work_struct > *dummy) > > * process the pending transaction. > > */ > > fcopy_respond_to_host(HV_E_FAIL); > > + > > + /* In the case the user-space daemon crashes, hangs or is killed, we > > + * need to down the semaphore, otherwise, after the daemon starts > next > > + * time, the obsolete data in fcopy_transaction.message or > > + * fcopy_transaction.fcopy_msg will be used immediately. > > + */ > > Looks still racy, what happens if the daemon start before down_trylock() > but after fcopy_respont_to_host() here? Jason, Thanks for pointing this out! IMO we can resolve this by adding down_trylock() in fcopy_release(). What's your opinion? > > > + if (down_trylock(&fcopy_transaction.read_sema)) > > + pr_debug("can not acquire the semaphore: it is benign\n"); > > typo > > + > > } Sorry -- what typo do you mean? Thanks, -- Dexuan ÿôèº{.nÇ+·®+%Ëÿ±éݶ\x17¥wÿº{.nÇ+·¥{±þG«éÿ{ayº\x1dÊÚë,j\a¢f£¢·hïêÿêçz_è®\x03(éÝ¢j"ú\x1a¶^[m§ÿÿ¾\a«þG«éÿ¢¸?¨èÚ&£ø§~á¶iOæ¬z·vØ^\x14\x04\x1a¶^[m§ÿÿÃ\fÿ¶ìÿ¢¸?I¥ ^ permalink raw reply [flat|nested] 5+ messages in thread
[parent not found: <F792CF86EFE20D4AB8064279AFBA51C613E641D8@HKNPRD3002MB017.064d.mgd.msft.net >]
* RE: [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer failure [not found] ` <F792CF86EFE20D4AB8064279AFBA51C613E641D8@HKNPRD3002MB017.064d.mgd.msft.net > @ 2014-11-27 9:01 ` Jason Wang 2014-11-27 11:44 ` Dexuan Cui 0 siblings, 1 reply; 5+ messages in thread From: Jason Wang @ 2014-11-27 9:01 UTC (permalink / raw) To: Dexuan Cui Cc: gregkh@linuxfoundation.org, linux-kernel@vger.kernel.org, driverdev-devel@linuxdriverproject.org, olaf@aepfle.de, apw@canonical.com, KY Srinivasan, vkuznets@redhat.com, Haiyang Zhang On Thu, Nov 27, 2014 at 4:50 PM, Dexuan Cui <decui@microsoft.com> wrote: >> -----Original Message----- >> From: Jason Wang [mailto:jasowang@redhat.com] >> Sent: Thursday, November 27, 2014 15:15 PM >> To: Dexuan Cui >> Cc: gregkh@linuxfoundation.org; linux-kernel@vger.kernel.org; >> driverdev- >> devel@linuxdriverproject.org; olaf@aepfle.de; apw@canonical.com; KY >> Srinivasan; vkuznets@redhat.com; Haiyang Zhang >> Subject: Re: [PATCH v2] hv: hv_fcopy: drop the obsolete message on >> transfer >> failure >> ----- Original Message ----- >> > In the case the user-space daemon crashes, hangs or is killed, we >> > need to down the semaphore, otherwise, after the daemon starts >> next >> > time, the obsolete data in fcopy_transaction.message or >> > fcopy_transaction.fcopy_msg will be used immediately. >> > >> > Reviewed-by: Vitaly Kuznetsov <vkuznets@redhat.com> >> > Cc: K. Y. Srinivasan <kys@microsoft.com> >> > Signed-off-by: Dexuan Cui <decui@microsoft.com> >> > --- >> > >> > v2: I removed the "FCP" prefix as Greg asked. >> > >> > I also updated the output message a little: >> > "FCP: failed to acquire the semaphore" --> >> > "can not acquire the semaphore: it is benign" >> > >> > drivers/hv/hv_fcopy.c | 9 +++++++++ >> > 1 file changed, 9 insertions(+) >> > >> > diff --git a/drivers/hv/hv_fcopy.c b/drivers/hv/hv_fcopy.c >> > index 23b2ce2..c518ad9 100644 >> > --- a/drivers/hv/hv_fcopy.c >> > +++ b/drivers/hv/hv_fcopy.c >> > @@ -86,6 +86,15 @@ static void fcopy_work_func(struct work_struct >> *dummy) >> > * process the pending transaction. >> > */ >> > fcopy_respond_to_host(HV_E_FAIL); >> > + >> > + /* In the case the user-space daemon crashes, hangs or is >> killed, we >> > + * need to down the semaphore, otherwise, after the daemon >> starts >> next >> > + * time, the obsolete data in fcopy_transaction.message or >> > + * fcopy_transaction.fcopy_msg will be used immediately. >> > + */ >> >> Looks still racy, what happens if the daemon start before >> down_trylock() >> but after fcopy_respont_to_host() here? > Jason, > Thanks for pointing this out! > IMO we can resolve this by adding down_trylock() in fcopy_release(). > What's your opinion? Looks better and need to cancel the timeout also here? > > >> >> > + if (down_trylock(&fcopy_transaction.read_sema)) >> > + pr_debug("can not acquire the semaphore: it is benign\n"); >> >> typo >> > + >> > } > Sorry -- what typo do you mean? s/benign/begin/ ? > > Thanks, > -- Dexuan > �NrybXǧv^){.n+{zX\x17ܨ}Ơz&j:+v\azZ++zfh~iz\x1ew?&)ߢ^[f^jǫym@Aa\x7f\f0h\x0fi\x7f ^ permalink raw reply [flat|nested] 5+ messages in thread
* RE: [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer failure 2014-11-27 9:01 ` Jason Wang @ 2014-11-27 11:44 ` Dexuan Cui 0 siblings, 0 replies; 5+ messages in thread From: Dexuan Cui @ 2014-11-27 11:44 UTC (permalink / raw) To: Jason Wang Cc: gregkh@linuxfoundation.org, linux-kernel@vger.kernel.org, driverdev-devel@linuxdriverproject.org, olaf@aepfle.de, apw@canonical.com, KY Srinivasan, vkuznets@redhat.com, Haiyang Zhang [-- Warning: decoded text below may be mangled, UTF-8 assumed --] [-- Attachment #1: Type: text/plain; charset="utf-8", Size: 3439 bytes --] > -----Original Message----- > From: Jason Wang [mailto:jasowang@redhat.com] > Sent: Thursday, November 27, 2014 17:01 PM > To: Dexuan Cui > Cc: gregkh@linuxfoundation.org; linux-kernel@vger.kernel.org; driverdev- > devel@linuxdriverproject.org; olaf@aepfle.de; apw@canonical.com; KY > Srinivasan; vkuznets@redhat.com; Haiyang Zhang > Subject: RE: [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer > failure > On Thu, Nov 27, 2014 at 4:50 PM, Dexuan Cui <decui@microsoft.com> wrote: > >> -----Original Message----- > >> From: Jason Wang [mailto:jasowang@redhat.com] > >> Sent: Thursday, November 27, 2014 15:15 PM > >> To: Dexuan Cui > >> Cc: gregkh@linuxfoundation.org; linux-kernel@vger.kernel.org; > >> driverdev- > >> devel@linuxdriverproject.org; olaf@aepfle.de; apw@canonical.com; KY > >> Srinivasan; vkuznets@redhat.com; Haiyang Zhang > >> Subject: Re: [PATCH v2] hv: hv_fcopy: drop the obsolete message on > >> transfer > >> failure > >> ----- Original Message ----- > >> > In the case the user-space daemon crashes, hangs or is killed, we > >> > need to down the semaphore, otherwise, after the daemon starts > >> next > >> > time, the obsolete data in fcopy_transaction.message or > >> > fcopy_transaction.fcopy_msg will be used immediately. > >> > > >> > Reviewed-by: Vitaly Kuznetsov <vkuznets@redhat.com> > >> > Cc: K. Y. Srinivasan <kys@microsoft.com> > >> > Signed-off-by: Dexuan Cui <decui@microsoft.com> > >> > --- > >> > > >> > v2: I removed the "FCP" prefix as Greg asked. > >> > > >> > I also updated the output message a little: > >> > "FCP: failed to acquire the semaphore" --> > >> > "can not acquire the semaphore: it is benign" > >> > > >> > drivers/hv/hv_fcopy.c | 9 +++++++++ > >> > 1 file changed, 9 insertions(+) > >> > > >> > diff --git a/drivers/hv/hv_fcopy.c b/drivers/hv/hv_fcopy.c > >> > index 23b2ce2..c518ad9 100644 > >> > --- a/drivers/hv/hv_fcopy.c > >> > +++ b/drivers/hv/hv_fcopy.c > >> > @@ -86,6 +86,15 @@ static void fcopy_work_func(struct work_struct > >> *dummy) > >> > * process the pending transaction. > >> > */ > >> > fcopy_respond_to_host(HV_E_FAIL); > >> > + > >> > + /* In the case the user-space daemon crashes, hangs or is > >> killed, we > >> > + * need to down the semaphore, otherwise, after the daemon > >> starts > >> next > >> > + * time, the obsolete data in fcopy_transaction.message or > >> > + * fcopy_transaction.fcopy_msg will be used immediately. > >> > + */ > >> > >> Looks still racy, what happens if the daemon start before > >> down_trylock() > >> but after fcopy_respont_to_host() here? > > Jason, > > Thanks for pointing this out! > > IMO we can resolve this by adding down_trylock() in fcopy_release(). > > What's your opinion? > > > Looks better and need to cancel the timeout also here? OK, let me post a v3. > > > > > >> > >> > + if (down_trylock(&fcopy_transaction.read_sema)) > >> > + pr_debug("can not acquire the semaphore: it is benign\n"); > >> > >> typo > >> > + > >> > } > > Sorry -- what typo do you mean? > > s/benign/begin/ ? I meant the issue(can't get the semaphore) is benign. I think we can just remove the message, as KY suggested. Instead, I'll add a comment for it. Thanks, -- Dexuan ÿôèº{.nÇ+·®+%Ëÿ±éݶ\x17¥wÿº{.nÇ+·¥{±þG«éÿ{ayº\x1dÊÚë,j\a¢f£¢·hïêÿêçz_è®\x03(éÝ¢j"ú\x1a¶^[m§ÿÿ¾\a«þG«éÿ¢¸?¨èÚ&£ø§~á¶iOæ¬z·vØ^\x14\x04\x1a¶^[m§ÿÿÃ\fÿ¶ìÿ¢¸?I¥ ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2014-11-27 11:44 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-11-27 7:31 [PATCH v2] hv: hv_fcopy: drop the obsolete message on transfer failure Dexuan Cui
2014-11-27 7:14 ` Jason Wang
2014-11-27 8:50 ` Dexuan Cui
[not found] ` <F792CF86EFE20D4AB8064279AFBA51C613E641D8@HKNPRD3002MB017.064d.mgd.msft.net >
2014-11-27 9:01 ` Jason Wang
2014-11-27 11:44 ` Dexuan Cui
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox