From mboxrd@z Thu Jan 1 00:00:00 1970 From: Tupshin Harper Subject: Re: null pointer in domU Date: Thu, 28 Apr 2005 02:33:37 -0700 Message-ID: <4270ADF1.5060607@tupshin.com> References: <427049F9.2010605@tupshin.com> <7c06bd816eb0275c7eac49bb6efbc3ee@cl.cam.ac.uk> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <7c06bd816eb0275c7eac49bb6efbc3ee@cl.cam.ac.uk> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Keir Fraser Cc: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org Keir Fraser wrote: > domU got a response from dom0 containing an unexpected request id. > domU tried to lookup the skbuff corresponding to the id, but read > garbage (because the id was not currently in use). It crashed when it > tried to dereference the garbage skbuff pointer. > > This is rather nasty -- it's not clear whether dom0 or domU is at > fault (dom0 may have forgotten about the id, or corrupted its own > state, or the response from dom0 could be at fault). And it must be > hard to trigger as we have had no other reports of this. :-( > > -- Keir > Well thanks for the analysis :) And yes, it must be hard to trigger, because I've had multiple weeks of uptime running 8+ domUs with no problems. It might be interesting that the domU that crashed is the only one doing routing, and it also was under a moderately high traffic load at the time that it crashed. If you have any inclination to add some debug code to try to look for and analyze this problem, I'd be happy to run custom builds for either my dom0 or domU. Thanks. BTW, is there any reccomended way of detecting a crashed domU and restarting it? -Tupshin