From mboxrd@z Thu Jan 1 00:00:00 1970 From: annie li Subject: Re: domU crash with kernel BUG at drivers/net/xen-netfront.c:305 Date: Fri, 03 Jan 2014 14:15:01 +0800 Message-ID: <52C65565.6030608@oracle.com> References: <52BD5FDD.6060009@gmail.com> <52C4F48F.5090003@oracle.com> <20140102120150.GA1444@zion.uk.xensource.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; Format="flowed" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <20140102120150.GA1444@zion.uk.xensource.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Wei Liu Cc: Vasily Evseenko , xen-devel@lists.xen.org List-Id: xen-devel@lists.xenproject.org On 2014/1/2 20:01, Wei Liu wrote: > On Thu, Jan 02, 2014 at 01:09:35PM +0800, annie li wrote: > [...] >>> It seems the root of problem in dom0 messages above. Is it HW failure or >>> some internal kernel structures overflow? >> From the stack, it reminds me this issue is very likely same with >> the one which has been fixed. There is something wrong with counting >> slots in netback, and then responses overlapps request in the ring, >> and grantcopy gets wrong grant reference and throws out error in >> grant_table.c. See >> http://lists.xen.org/archives/html/xen-devel/2013-09/msg01143.html >> There were some back and forth work for this issue, but seems the >> fix patch exists since v3.12-rc4. Would you like to have a try with >> newer kernel version? >> > FWIW the patch you mentioned was backported to the kernel he used. Yes, it exists in 3.10.25 he used. Based on assumption of counting slots in netback causing this issue, maybe http://www.spinics.net/lists/netdev/msg260017.html is the right fix. This patch fixed an issue caused by counting slots, and it went into net-next tree, git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-next.git Thanks Annie