From mboxrd@z Thu Jan 1 00:00:00 1970
Received: from sog-mx-1.v43.ch3.sourceforge.com ([172.29.43.191] helo=mx.sourceforge.net)
 by sfs-ml-2.v29.ch3.sourceforge.com with esmtp (Exim 4.76) (envelope-from )
 id 1VRyK4-00019V-Ju for user-mode-linux-devel@lists.sourceforge.net;
 Fri, 04 Oct 2013 05:53:08 +0000
Received: from b.ns.miles-group.at ([95.130.255.144] helo=radon.swed.at)
 by sog-mx-1.v43.ch3.sourceforge.com with esmtps (TLSv1:AES256-SHA:256) (Exim 4.76)
 id 1VRyK2-0003EV-D7 for user-mode-linux-devel@lists.sourceforge.net;
 Fri, 04 Oct 2013 05:53:08 +0000
Message-ID: <524E57BA.805@nod.at>
Date: Fri, 04 Oct 2013 07:52:58 +0200
From: Richard Weinberger
MIME-Version: 1.0
References: <524C6643.2040209@gmx.de> <524DBD5D.1040203@gmx.de> <524DBFBB.1050002@nod.at> <524DC278.3020106@gmx.de> <524DC394.6030406@nod.at> <524DC675.4020201@gmx.de>
In-Reply-To: <524DC675.4020201@gmx.de>
List-Id: The user-mode Linux development list
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 8bit
Errors-To: user-mode-linux-devel-bounces@lists.sourceforge.net
Subject: Re: [uml-devel] BUG: soft lockup for a user mode linux image
To: Toralf Förster
Cc: UML devel, trinity@vger.kernel.org

On 03.10.2013 21:33, Toralf Förster wrote:
> On 10/03/2013 09:20 PM, Richard Weinberger wrote:
>> On 03.10.2013 21:16, Toralf Förster wrote:
>>> On 10/03/2013 09:04 PM, Richard Weinberger wrote:
>>>> On 03.10.2013 20:54, Toralf Förster wrote:
>>>>> On 10/02/2013 09:55 PM, Richard Weinberger wrote:
>>>>>> On Wed, Oct 2, 2013 at 8:30 PM, Toralf Förster wrote:
>>>>>>> Running trinity (1 process, no victim files, just "$> trinity -C1") for a longer time
>>>>>>> within a 32 bit user mode linux image with a recent git kernel (host: 3.11.3 guest 3.12-rc3-g...)
>>>>>>> results in this console message:
>>>>>>>
>>>>>>>  * Starting local
>>>>>>> net.core.warnings = 0                                              [ ok ]
>>>>>>> BUG: soft lockup - CPU#0 stuck for 23s! [trinity-child0:2031]
>>>>>>>
>>>>>>>
>>>>>>> and on the host, 1 of the "linux" processes eats all CPU cycles of 1 CPU core.
>>>>>>> Two back traces, taken one after the other with
>>>>>>>
>>>>>>> $> sudo gdb /home/tfoerste/devel/linux/linux 28144 -n -batch -ex bt
>>>>>>>
>>>>>>> show nearly the same position around __get_user_pages() - both are attached.
>>>>>>>
>>>>>>> I'm not surprised that trinity harms a system - I'm just wondering whether this particular picture is
>>>>>>> expected or if it points to an issue.
>>>>>>> >>>>>>> >>>>>>> FWIW the last lines of trinity log were : >>>>>>> >>>>>>> >>>>>>> [2031] [94] setsid() =3D 2031 >>>>>>> [2031] [95] setresgid(rgid=3D0xffff33e3, egid=3D0xffffff93, sgi= d=3D0x22000040) =3D -1 (Operation not permitted) >>>>>>> [2031] [96] vmsplice(fd=3D5, iov=3D0x85501e0, nr_segs=3D300, fl= ags=3D9) =3D 0x3000 >>>>>>> [2031] [97] setresuid(ruid=3D0x80549193, euid=3D0xc61041e0, sui= d=3D0xff19b6fa) =3D -1 (Operation not permitted) >>>>>>> [2031] [98] setpriority(which=3D0xff010000, who=3D0xf3737373, n= iceval=3D0x8088960c) =3D -1 (Invalid argument) >>>>>>> [2031] [99] socketcall(call=3D1, args=3D0x8550200) =3D -1 (Addr= ess family not supported by protocol) >>>>>>> [2031] [100] access(filename=3D"=EF=BF=BD", mode=3D2017) =3D -1= (Invalid argument) >>>>>>> [2031] [101] getgroups(gidsetsize=3D0, grouplist=3D0x80d0000[pa= ge_rand]) =3D 3 >>>>>>> [2031] [102] msync(start=3D0xc0100220, len=3D0, flags=3D3) =3D = -1 (Invalid argument) >>>>>>> [2031] [103] sigpending(set=3D0x40025000) =3D 0 >>>>>>> [2031] [104] signalfd4(ufd=3D383, user_mask=3D1, sizemask=3D0xa= 4200000, flags=3D0x80800) =3D -1 (Invalid argument) >>>>>>> [2031] [105] sendfile(out_fd=3D383, in_fd=3D382, offset=3D0, co= unt=3D4096) =3D -1 (Invalid argument) >>>>>>> [2031] [106] fanotify_mark(fanotify_fd=3D382, flags=3D5, mask=3D= 0x8000023, dfd=3D382, pathname=3D"/proc/1092/task/1092/fdinfo/68") =3D = -1 (Invalid argument) >>>>>>> [2031] [107] wait4(upid=3D1, stat_addr=3D4, options=3D0xd761979= b, ru=3D8) =3D -1 (Invalid argument) >>>>>>> [2031] [108] sigpending(set=3D0x80ca000[page_zeros]) =3D 0 >>>>>>> [2031] [109] setresuid(ruid=3D0xefffd6fc, euid=3D0x1bf4c92f, su= id=3D0xffff2e33) =3D -1 (Operation not permitted) >>>>>>> [2031] [110] munlock(addr=3D0x40025000, len=3D34) =3D 0 >>>>>>> [2031] [111] timer_delete(timer_id=3D0xffffffdc) =3D -1 (Invali= d argument) >>>>>>> [2031] [112] sched_get_priority_max(policy=3D0x10000040) =3D -1= (Invalid argument) >>>>>>> [2031] [113] 
syslog(type=3D0xc1000000, buf=3D1, len=3D0x82a5) =3D= -1 (Operation not permitted) >>>>>>> [2031] [114] setpriority(which=3D0xc4c806c6, who=3D0xffffff01, = niceval=3D0xffff0682) =3D -1 (Invalid argument) >>>>>>> [2031] [115] getgroups16(gidsetsize=3D0xfffe, grouplist=3D1) =3D= -1 (Bad address) >>>>>>> [2031] [116] rename(oldname=3D4, newname=3D8) =3D -1 (Bad addre= ss) >>>>>>> [2031] [117] inotify_init() =3D 654 >>>>>>> [2031] [118] getgid() =3D 100 >>>>>>> [2031] [119] fstatat64(dfd=3D382, filename=3D"/sys/devices/virt= ual/net/sit0/duplex", statbuf=3D0, flag=3D0xb545d727) =3D -1 (Invalid a= rgument) >>>>>>> [2031] [120] unlinkat(dfd=3D382, pathname=3D"/proc/sys/net/ipv4= /neigh/default/retrans_time", flag=3D0xc00ef76) =3D -1 (Invalid argumen= t) >>>>>>> [2031] [121] timerfd_create(clockid=3D0, flags=3D0) =3D 655 >>>>>>> [2031] [122] munlock(addr=3D4, len=3D0x3fff) =3D -1 (Cannot all= ocate memory) >>>>>>> [2031] [123] fremovexattr(fd=3D382, name=3D0) =3D -1 (Bad addre= ss) >>>>>>> [2031] [124] sched_get_priority_min(policy=3D0xff58bfef) =3D -1= (Invalid argument) >>>>>>> [2031] [125] mq_timedreceive(mqdes=3D397, u_msg_ptr=3D4, msg_le= n=3D5245, u_msg_prio=3D0xc0100220, u_abs_timeout=3D0xc0100220) =3D -1 (= Bad address) >>>>>>> [2031] [126] chdir(filename=3D"/proc/116/net/ptype") =3D -1 (No= t a directory) >>>>>>> [2031] [127] ssetmask(newmask=3D0x88000092) =3D 0 >>>>>>> [2031] [128] statfs(pathname=3D"/proc/6/mounts", buf=3D0) =3D -= 1 (Bad address) >>>>>>> [2031] [129] fchown16(fd=3D397, user=3D104, group=3D0x94100000)= =3D -1 (Operation not permitted) >>>>>>> [2031] [130] fchdir(fd=3D397) =3D -1 (Not a directory) >>>>>>> [2031] [131] mkdir(pathname=3D"/proc/1092/task/1092/fdinfo/316"= , mode=3D525) =3D -1 (File exists) >>>>>>> [2031] [132] fsetxattr(fd=3D386, name=3D0x856f158, value=3D0x85= 71160, size=3D0, flags=3D0) =3D -1 (Numerical result out of range) >>>>>>> [2031] [133] io_setup(nr_events=3D4095, ctxp=3D0x40266000) ^CKi= lled by signal 2. 
>>>>>>
>>>>>> Reading your gdb backtraces shows that schedule_timeout() got called
>>>>>> with a negative value.
>>>>>> Looks like an integer overflow.
>>>>>> The soft lockup might also originate from that (a very big integer which
>>>>>> did not overflow yet)
>>>>>>
>>>>>
>>>>> If the culprit is solved by this patch I'd like to send it out. But I'm
>>>>> unsure whether it catches the culprit or if it just covers up the root cause.
>>>>
>>>> I fear your patch will not fix the issue.
>>>>
>>>> Does the issue only trigger on 32bit UMLs?
>>> No idea, I only have a 32 bit system here (both host and client).
>>>
>>>> How long does it take till trinity hits it?
>>> a command like
>>>
>>> $> ssh tfoerste@trinity "rm -rf t3; mkdir t3; cd t3; trinity -C4"
>>>
>>> usually needs 10 to 15 min to trigger the issue. With just 1 trinity
>>> task (-C1) however it often needs an hour or more.
>>
>> That's good. :-)
>> You can place some printk()s into balance_dirty_pages() and observe the values
>> of period, max_pause, min_pause, etc...
>> Maybe this will give us a clue.
>
> Will do it again. I did it before already but period was the only thing
> which became negative -
> and because pages_dirtied was often in the range of 20 - 30 and period
> around -3000

Did you print it before or after
                if (current->dirty_paused_when)
                        pause -= now - current->dirty_paused_when;

I'm also wondering why
                if (pause < min_pause) {
did not trigger or did it?

> I just wondered if HZ is cast as a short int in the calculation?
> BTW printk didn't show anything in the syslog (ok, I used
> printk(KERN_ERR ...)

syslog or dmesg?

Thanks,
//richard
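
For reference, the two statements quoted above can be modelled in userspace. This is only a minimal sketch under stated assumptions: it assumes a signed long for period and pause and an unsigned long for the jiffies-like timestamps, and the names task_sketch, compute_pause and below_min_pause are hypothetical, not kernel API. It shows that a period which is already negative (such as the reported -3000) stays negative after the subtraction, so a comparison against a non-negative min_pause would be true.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the per-task field read by the quoted code. */
struct task_sketch {
    unsigned long dirty_paused_when; /* time of the last pause, 0 if none */
};

/*
 * Models the quoted statements:
 *     if (current->dirty_paused_when)
 *             pause -= now - current->dirty_paused_when;
 * The pause starts out as the period and is reduced by the time that
 * has already passed since the task was last paused.
 */
static long compute_pause(long period, unsigned long now,
                          const struct task_sketch *t)
{
    long pause = period;

    if (t->dirty_paused_when)
        pause -= (long)(now - t->dirty_paused_when);
    return pause;
}

/* Models the quoted check "if (pause < min_pause)" as a signed compare. */
static bool below_min_pause(long pause, long min_pause)
{
    return pause < min_pause;
}
```

Calling compute_pause() with a period of -3000 and any earlier dirty_paused_when gives a value below -3000, and below_min_pause() then returns true for every min_pause >= 0. Whether the kernel reached that check in the reported run is exactly the open question in this thread.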