From mboxrd@z Thu Jan 1 00:00:00 1970 From: Xiao Guangrong Subject: Re: commit 3c2e7f7de3 (KVM use NPT page attributes) causes boot failures Date: Wed, 2 Sep 2015 17:17:37 +0800 Message-ID: <55E6BEB1.6080106@linux.intel.com> References: <20150901070856.GA430@x4> <20150901072741.GB20383@gmail.com> <20150901074449.GB430@x4> <20150901083856.GD25398@gmail.com> <20150901084444.GB421@x4> <20150901085627.GF6315@gmail.com> <20150901100417.GA424@x4> <55E5A157.5040403@linux.intel.com> <20150901135636.GA423@x4> <55E6273D.6080701@linux.intel.com> <20150902035040.GA420@x4> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Cc: Ingo Molnar , Linus Torvalds , linux-kernel@vger.kernel.org, Peter Zijlstra , Thomas Gleixner , Andrew Morton , Mike Galbraith , Joerg Roedel , Paolo Bonzini , kvm@vger.kernel.org To: Markus Trippelsdorf Return-path: In-Reply-To: <20150902035040.GA420@x4> Sender: linux-kernel-owner@vger.kernel.org List-Id: kvm.vger.kernel.org On 09/02/2015 11:50 AM, Markus Trippelsdorf wrote: > On 2015.09.02 at 06:31 +0800, Xiao Guangrong wrote: >> >> >> On 09/01/2015 09:56 PM, Markus Trippelsdorf wrote: >>> On 2015.09.01 at 21:00 +0800, Xiao Guangrong wrote: >>>> >>>> Did it trigger the BUG()/BUG_ON() in mtrr2protval()/fallback_mtrr_type()? >>>> If yes, could you please print the actual value out? >>> >>> It is the BUG() in fallback_mtrr_type(). I changed it to a printk and >>> it prints 1 for the value of mtrr. >>> >>> MTRR_TYPE_WRCOMB 1 >>> >> >> Then I suspect pat is not enabled in your box, could you please check >> CONFIG_X86_PAT is selected in your .config file, pat is shown in >> /proc/cpuid, "nopat" kernel parameter is used, and dmesg | grep PAT. > > No. PAT is of course enabled and booting is successful sometimes even > with the BUG() in allback_mtrr_type(). I suspect a setup (timing) issue. Thanks for your confirmation. > > markus@x4 linux % cat .config | grep X86_PAT > CONFIG_X86_PAT=y > markus@x4 linux % dmesg | grep PAT > [ 0.000000] x86/PAT: Configuration [0-7]: WB WC UC- UC WB WC UC- WT Strange, BP processor has already set WC to PAT1, however KVM does not read it out from PAT MSR on its local CPU. Hmm... PAT default values do not include WC, it seems initing PAT on SP has not finished after module_init()? Could please apply this diff and test it again? diff --git a/arch/x86/kvm/svm.c b/arch/x86/kvm/svm.c index 189e464..d9d3a30 100644 --- a/arch/x86/kvm/svm.c +++ b/arch/x86/kvm/svm.c @@ -884,6 +884,7 @@ static u8 fallback_mtrr_type(int mtrr) case MTRR_TYPE_WRPROT: return MTRR_TYPE_UC_MINUS; default: + printk("BUG in %s, mtrr = %d.\n", __FUNCTION__, mtrr); BUG(); } } @@ -907,6 +908,8 @@ static void build_mtrr2protval(void) * guest. */ rdmsrl(MSR_IA32_CR_PAT, pat); + printk("KVM PAT: 0x%llx.\n", pat); + for (i = 0; i < 8; i++) { u8 mtrr = pat >> (8 * i); @@ -914,10 +917,17 @@ static void build_mtrr2protval(void) mtrr2protval[mtrr] = __cm_idx2pte(i); } + for (i = 0; i < 8; i++) + printk("mtrr2protval[%d]:%x.\n", i, mtrr2protval[i]); + + for (i = 0; i < 8; i++) { if (mtrr2protval[i] == MTRR2PROTVAL_INVALID) { u8 fallback = fallback_mtrr_type(i); mtrr2protval[i] = mtrr2protval[fallback]; + if (mtrr2protval[i] == MTRR2PROTVAL_INVALID) + printk("BUG in %s, mtrr2protval[%d] = %x.\n", __FUNCTION__, i, mtrr2protval[i]); + BUG_ON(mtrr2protval[i] == MTRR2PROTVAL_INVALID); } }