* bug fix patch lost: git problem or just incorrect merge? @ 2010-05-21 15:41 James Bottomley 2010-05-21 16:45 ` Linus Torvalds 2010-05-21 22:02 ` David Miller 0 siblings, 2 replies; 6+ messages in thread From: James Bottomley @ 2010-05-21 15:41 UTC (permalink / raw) To: Linus Torvalds, David Miller; +Cc: linux-kernel, netdev, linux-scsi The patch in question is this one (upstream for a while): commit d7d05548a62c87ee55b0c81933669177f885aa8d Author: Mike Christie <michaelc@cs.wisc.edu> Date: Wed Mar 31 14:41:35 2010 -0500 [SCSI] iscsi_tcp: fix relogin/shutdown hang It's a simple one line change in iscsi_tcp.c (diff clipped): --- a/drivers/scsi/iscsi_tcp.c +++ b/drivers/scsi/iscsi_tcp.c @@ -599,7 +599,7 @@ static void iscsi_sw_tcp_conn_stop(struct iscsi_cls_conn *cls_conn, int flag) - if (sock->sk->sk_sleep && waitqueue_active(sock->sk->sk_sleep)) { + if (sock->sk->sk_sleep) { It was killed by this merge commit in the net-next tree: commit 278554bd6579206921f5d8a523649a7a57f8850d Merge: 5a147e8 cea0d76 Author: David S. Miller <davem@davemloft.net> Date: Wed May 12 00:05:35 2010 -0700 However, the curious thing is that git seems to have lost trace of the missing patch entirely: if I try to find it in linus' tree with a git log -- drivers/scsi/iscsi_tcp.c, it doesn't show up. The merge commit which killed it does list iscsi_tcp.c as a file conflict, but git show on that commit doesn't list that file in the resolution diff ... even though this is where it actually got killed. Is this a git problem ... or is it just a mismerge in the net tree? Either way, of course, we need the patch back ... James ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bug fix patch lost: git problem or just incorrect merge? 2010-05-21 15:41 bug fix patch lost: git problem or just incorrect merge? James Bottomley @ 2010-05-21 16:45 ` Linus Torvalds 2010-05-21 17:04 ` Linus Torvalds 2010-05-21 22:07 ` David Miller 2010-05-21 22:02 ` David Miller 1 sibling, 2 replies; 6+ messages in thread From: Linus Torvalds @ 2010-05-21 16:45 UTC (permalink / raw) To: James Bottomley; +Cc: David Miller, linux-kernel, netdev, linux-scsi On Fri, 21 May 2010, James Bottomley wrote: > > The patch in question is this one (upstream for a while): > > commit d7d05548a62c87ee55b0c81933669177f885aa8d > Author: Mike Christie <michaelc@cs.wisc.edu> > Date: Wed Mar 31 14:41:35 2010 -0500 > > [SCSI] iscsi_tcp: fix relogin/shutdown hang > > It's a simple one line change in iscsi_tcp.c (diff clipped): > > --- a/drivers/scsi/iscsi_tcp.c > +++ b/drivers/scsi/iscsi_tcp.c > @@ -599,7 +599,7 @@ static void iscsi_sw_tcp_conn_stop(struct iscsi_cls_conn *cls_conn, int flag) > - if (sock->sk->sk_sleep && waitqueue_active(sock->sk->sk_sleep)) { > + if (sock->sk->sk_sleep) { > > It was killed by this merge commit in the net-next tree: > > commit 278554bd6579206921f5d8a523649a7a57f8850d > Merge: 5a147e8 cea0d76 > Author: David S. Miller <davem@davemloft.net> > Date: Wed May 12 00:05:35 2010 -0700 > > However, the curious thing is that git seems to have lost trace of the > missing patch entirely No, it's there, and the bug is that David doesn't do merges well. One of the reasons I ask people to let me merge is that it results in cleaner history to not have criss-cross merges. And another is that I'm pretty good at it, and letting me make merges also means that I am more aware of problem spots. > if I try to find it in linus' tree with a git log -- > drivers/scsi/iscsi_tcp.c, it doesn't show up. That is because "git log" will see the merge, see that _all_ history came from the other side, and ignore the side that was ignored. Because clearly, that other side didn't actually contribute anything. Now, the _reason_ that other side didn't contribute anything is that David messed up the merge, and took just his own sides changes. > The merge commit which killed it does list iscsi_tcp.c as a file > conflict Yes. I re-did the merge, and the result looks like this (cut-and-paste whitespace damage, I'm just illustrating he point): diff --cc drivers/scsi/iscsi_tcp.c index 9eae04a,02143af..0000000 --- a/drivers/scsi/iscsi_tcp.c +++ b/drivers/scsi/iscsi_tcp.c @@@ -599,9 -599,9 +599,13 @@@ static void iscsi_sw_tcp_conn_stop(stru set_bit(ISCSI_SUSPEND_BIT, &conn->suspend_rx); write_unlock_bh(&tcp_sw_conn->sock->sk->sk_callback_lock); ++<<<<<<< HEAD + if (sk_sleep(sock->sk) && waitqueue_active(sk_sleep(sock->sk))) { ++======= + if (sock->sk->sk_sleep) { ++>>>>>>> cea0d76 sock->sk->sk_err = EIO; - wake_up_interruptible(sock->sk->sk_sleep); + wake_up_interruptible(sk_sleep(sock->sk)); } iscsi_conn_stop(cls_conn, flag); and David picked his side of things, not your side. The _correct_ merge would have been to take both changes, as is quite obvious if you do gitk 5a147e8...cea0d76 drivers/scsi/iscsi_tcp.c and see that the conflict comes because: - one side (David's) changed sock->sk->sk_sleep into sk_sleep(sock->sk) in commit aa395145165cb06a0d0885221bbe0ce4a564391d - the other side (your) removed the 'waitqueue_active()' part in commit d7d05548a62c87ee55b0c81933669177f885aa8d. So the end result _should_ have been this merge resolution: diff --cc drivers/scsi/iscsi_tcp.c index 9eae04a,02143af..0000000 --- a/drivers/scsi/iscsi_tcp.c +++ b/drivers/scsi/iscsi_tcp.c @@@ -599,9 -599,9 +599,9 @@@ static void iscsi_sw_tcp_conn_stop(stru set_bit(ISCSI_SUSPEND_BIT, &conn->suspend_rx); write_unlock_bh(&tcp_sw_conn->sock->sk->sk_callback_lock); - if (sk_sleep(sock->sk) && waitqueue_active(sk_sleep(sock->sk))) { - if (sock->sk->sk_sleep) { ++ if (sk_sleep(sock->sk)) { sock->sk->sk_err = EIO; - wake_up_interruptible(sock->sk->sk_sleep); + wake_up_interruptible(sk_sleep(sock->sk)); } iscsi_conn_stop(cls_conn, flag); but David just picked his side entirely. And that is also the reason for: > but git show on that commit doesn't list that file in the resolution > diff ... even though this is where it actually got killed. A merge diff ("combined diff") only shows conflicts as defined by "you resolved it to something that didn't match either side". That's a _real_ conflict. If you just end up picking one side, there is no diff. > Is this a git problem ... or is it just a mismerge in the net tree? So it's a mis-merge. You can see the commit with gitk v2.6.34.. --full-history drivers/scsi/iscsi_tcp.c which doesn't do the "ignore the side of a merge that didn't bring any new data in". Or, with any recent git, you can use "--simplify-merges" instead of full-history, which only simplifies trivial merges where neither side really touched things at all. If you do that, you'll also see why git doesn't show the uninteresting side of a merge by default. Just for fun, compare the graphs of gitk v2.6.34.. drivers/scsi/iscsi_tcp.c gitk v2.6.34.. --simplify-merges drivers/scsi/iscsi_tcp.c gitk v2.6.34.. --full-history drivers/scsi/iscsi_tcp.c and ask yourself: do you normally want to see _all_ the history, even stuff that didn't end up affecting the end result? > Either way, of course, we need the patch back ... I'll fix it up. Linus ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bug fix patch lost: git problem or just incorrect merge? 2010-05-21 16:45 ` Linus Torvalds @ 2010-05-21 17:04 ` Linus Torvalds 2010-05-21 17:23 ` James Bottomley 2010-05-21 22:07 ` David Miller 1 sibling, 1 reply; 6+ messages in thread From: Linus Torvalds @ 2010-05-21 17:04 UTC (permalink / raw) To: James Bottomley; +Cc: David Miller, linux-kernel, netdev, linux-scsi On Fri, 21 May 2010, Linus Torvalds wrote: > > > Either way, of course, we need the patch back ... > > I'll fix it up. Hmm. Pushed that out as appended, since that is the correct resolve. HOWEVER - the code still doesn't actually make any sense. It does if (sk_sleep(sock->sk)) { and that sk_sleep() today is an inline function that just does return &sk->sk_wq->wait; and testing the result of an address-of operation for NULL is almost certainly totally non-sensical. Sure, it _might_ work (maybe 'wait' is the first element in the 'sk_wq' data structure, and sk_wq is NULL), but that kind of code is always total and utterl crap regardless. So I pushed it out because I had done the resolve already, and it's no worse than it was before, but it's still a steaming buggy pile of shit. It being iscsi, I can't bring myself to care. But somebody who does, should really look at it. The most likely resolution is to remove the test entirely, since I don't think it's ever valid to have a socket that doesn't have a sk_wq (there's a _lot_ of unconditional use of sk_sleep()). Linus ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bug fix patch lost: git problem or just incorrect merge? 2010-05-21 17:04 ` Linus Torvalds @ 2010-05-21 17:23 ` James Bottomley 0 siblings, 0 replies; 6+ messages in thread From: James Bottomley @ 2010-05-21 17:23 UTC (permalink / raw) To: Linus Torvalds; +Cc: David Miller, linux-kernel, netdev, linux-scsi On Fri, 2010-05-21 at 10:04 -0700, Linus Torvalds wrote: > > On Fri, 21 May 2010, Linus Torvalds wrote: > > > > > Either way, of course, we need the patch back ... > > > > I'll fix it up. > > Hmm. Pushed that out as appended, since that is the correct resolve. Thanks! > HOWEVER - the code still doesn't actually make any sense. It does > > if (sk_sleep(sock->sk)) { > > and that sk_sleep() today is an inline function that just does > > return &sk->sk_wq->wait; > > and testing the result of an address-of operation for NULL is almost > certainly totally non-sensical. Sure, it _might_ work (maybe 'wait' is the > first element in the 'sk_wq' data structure, and sk_wq is NULL), but that > kind of code is always total and utterl crap regardless. > > So I pushed it out because I had done the resolve already, and it's no > worse than it was before, but it's still a steaming buggy pile of shit. Yes, the problem was caused by this patch commit 43815482370c510c569fd18edb57afcb0fa8cab6 Author: Eric Dumazet <eric.dumazet@gmail.com> Date: Thu Apr 29 11:01:49 2010 +0000 net: sock_def_readable() and friends RCU conversion Which moved sk_sleep() from returning the pointer to the waitqueue, which may or may not be assigned to returning a pointer to an internal waitqueue in the socket, which, obviously, can never be null. I suspect what iscsi should be doing is always sending the wakeup ... in which case with your resolution, the code is operating correctly even if the form is suboptimal. > It being iscsi, I can't bring myself to care. But somebody who does, > should really look at it. The most likely resolution is to remove the test > entirely, since I don't think it's ever valid to have a socket that > doesn't have a sk_wq (there's a _lot_ of unconditional use of sk_sleep()). I'll have Mike look at it, but I think just removing the if() will be the correct resolution. James ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bug fix patch lost: git problem or just incorrect merge? 2010-05-21 16:45 ` Linus Torvalds 2010-05-21 17:04 ` Linus Torvalds @ 2010-05-21 22:07 ` David Miller 1 sibling, 0 replies; 6+ messages in thread From: David Miller @ 2010-05-21 22:07 UTC (permalink / raw) To: torvalds; +Cc: James.Bottomley, linux-kernel, netdev, linux-scsi From: Linus Torvalds <torvalds@linux-foundation.org> Date: Fri, 21 May 2010 09:45:49 -0700 (PDT) > One of the reasons I ask people to let me merge is that it results in > cleaner history to not have criss-cross merges. And another is that I'm > pretty good at it, and letting me make merges also means that I am more > aware of problem spots. That wasn't possible in this case. This happened more than a week ago, as I needed to merge your tree into net-2.6 to resolve a conflict there. That's what took in the iscsi bug fix, and this is way before the merge window. Then I needed to pull net-2.6 into net-next-2.6 to resolve conflicts existing between those two trees. And this is why I ended up having to do the merge :-) >> Either way, of course, we need the patch back ... > > I'll fix it up. Thanks Linus. ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bug fix patch lost: git problem or just incorrect merge? 2010-05-21 15:41 bug fix patch lost: git problem or just incorrect merge? James Bottomley 2010-05-21 16:45 ` Linus Torvalds @ 2010-05-21 22:02 ` David Miller 1 sibling, 0 replies; 6+ messages in thread From: David Miller @ 2010-05-21 22:02 UTC (permalink / raw) To: James.Bottomley; +Cc: torvalds, linux-kernel, netdev, linux-scsi From: James Bottomley <James.Bottomley@suse.de> Date: Fri, 21 May 2010 10:41:55 -0500 > Is this a git problem ... or is it just a mismerge in the net tree? Mismerge, because sk->sk_sleep() doesn't exist any more I mistakenly updated the original line to do the sk_sleep() stuff. Sorry about that. ^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2010-05-21 22:07 UTC | newest] Thread overview: 6+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2010-05-21 15:41 bug fix patch lost: git problem or just incorrect merge? James Bottomley 2010-05-21 16:45 ` Linus Torvalds 2010-05-21 17:04 ` Linus Torvalds 2010-05-21 17:23 ` James Bottomley 2010-05-21 22:07 ` David Miller 2010-05-21 22:02 ` David Miller
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).