From mboxrd@z Thu Jan 1 00:00:00 1970 From: Oded Gabbay Subject: [BUG] 3.19-rc3+ - mm: prevent endless growth of anon_vma hierarchy Date: Sun, 11 Jan 2015 10:58:31 +0200 Message-ID: <54B23B37.5080306@amd.com> Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Return-path: Received: from na01-bl2-obe.outbound.protection.outlook.com (mail-bl2on0114.outbound.protection.outlook.com [65.55.169.114]) by gabe.freedesktop.org (Postfix) with ESMTP id 3F5C66E04F for ; Sun, 11 Jan 2015 00:58:49 -0800 (PST) List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: koct9i@gmail.com, riel@redhat.com, akpm@linux-foundation.org, torvalds@linux-foundation.org Cc: jmarchan@redhat.com, dan.forrest@ssec.wisc.edu, "linux-kernel@vger.kernel.org" , dri-devel , mhocko@suse.cz, linux-mm@kvack.org, "Deucher, Alexander" List-Id: dri-devel@lists.freedesktop.org SGVsbG8sCgpUb2RheSBJIHRvb2sgbGF0ZXN0IGJyYW5jaCBmcm9tIExpbnVzIHJlcG8gdG8gY2hl Y2sgZm9yIGFtZGtmZCByZWdyZXNzaW9ucwpkdWUgdG8gbXkgZml4ZXMgcHVsbGVkIGZvciAzLjE5 LXJjNCwgYW5kIEkgZ290IGEga2VybmVsIGJ1ZyAoc2VlIGJlbG93IGRtZXNnCm91dHB1dCkuCgpJ IGRpZCBhIGJpc2VjdCBhbmQgdGhlIGZpcnN0IGJhZCBjb21taXQgaXM6Cgo3YTNlZjIwOGU2NjJm NGI2M2Q0M2EyM2Y2MWE2NGExMjljNTI1YmJjIGlzIHRoZSBmaXJzdCBiYWQgY29tbWl0CmNvbW1p dCA3YTNlZjIwOGU2NjJmNGI2M2Q0M2EyM2Y2MWE2NGExMjljNTI1YmJjCkF1dGhvcjogS29uc3Rh bnRpbiBLaGxlYm5pa292IDxrb2N0OWlAZ21haWwuY29tPgpEYXRlOiAgIFRodSBKYW4gOCAxNDoz MjoxNSAyMDE1IC0wODAwCgogICAgbW06IHByZXZlbnQgZW5kbGVzcyBncm93dGggb2YgYW5vbl92 bWEgaGllcmFyY2h5CgpUaGUgYnVnIGlzIGJlZm9yZSB0aGUgcHVsbCBvZiBsYXRlc3QgYW1ka2Zk IGZpeGVzLCBzbyBpdCBpcyBub3QgcmVsYXRlZCB0bwpteSBwdWxsIHJlcXVlc3QuIEZyb20gdGhl IGJpc2VjdCBsb2cgKGVuZCBvZiBlbWFpbCksIHlvdSBjYW4gc2VlIDMuMTktcmMzIGlzCmZpbmUu CgpUaGUgcHJvYmxlbSBvY2N1cnJlZCB3aGlsZSBydW5uaW5nIGphdmEgb3ZlciBIU0EuIFRoZSBL ZXJuZWwgaXMgNjQtYml0IGFuZAp1c2Vyc3BhY2UgcHJvY2Vzc2VzIGFyZSA2NC1iaXQgYXMgd2Vs bC4gQ1BVIGlzIEFNRCBLYXZlcmkgKEExMC03ODUwKS4KT3BlbkNML09wZW5NUCBvdmVyIEhTQSBy dW4gd2l0aG91dCBwcm9ibGVtcy4KCmRtZXNnIG91dHB1dDoKClsgIDI2Ni40OTE4NjRdIC0tLS0t LS0tLS0tLVsgY3V0IGhlcmUgXS0tLS0tLS0tLS0tLQpbICAyNjYuNDkxOTA0XSBrZXJuZWwgQlVH IGF0IG1tL3JtYXAuYzozOTkhClsgIDI2Ni40OTE5MzRdIGludmFsaWQgb3Bjb2RlOiAwMDAwIFsj MV0gU01QClsgIDI2Ni40OTE5NjJdIE1vZHVsZXMgbGlua2VkIGluOiBhbWRrZmQgYW1kX2lvbW11 X3YyIHJhZGVvbiBjZmJmaWxscmVjdApjZmJpbWdibHQgY2ZiY29weWFyZWEgZHJtX2ttc19oZWxw ZXIgdHRtIGZ1c2UKWyAgMjY2LjQ5MjA0M10gQ1BVOiAzIFBJRDogNTE1NSBDb21tOiBqYXZhIE5v dCB0YWludGVkIDMuMTkuMC1yYzMta2ZkKyAjMjQKWyAgMjY2LjQ5MjA4N10gSGFyZHdhcmUgbmFt ZTogQU1EIEJBTExJTkEvQmFsbGluYSwgQklPUwpXQkwzQjIwTl9XZWVrbHlfMTNfMTFfMiAxMS8y MC8yMDEzClsgIDI2Ni40OTIxNDFdIHRhc2s6IGZmZmY4ODAwYTNiM2M4NDAgdGk6IGZmZmY4ODAw OTE2ZjgwMDAgdGFzay50aToKZmZmZjg4MDA5MTZmODAwMApbICAyNjYuNDkyMTkxXSBSSVA6IDAw MTA6WzxmZmZmZmZmZjgxMTI2NjMwPl0gIFs8ZmZmZmZmZmY4MTEyNjYzMD5dCnVubGlua19hbm9u X3ZtYXMrMHgxMDIvMHgxNTkKWyAgMjY2LjQ5MjI0OV0gUlNQOiAwMDE4OmZmZmY4ODAwOTE2ZmJi NjggIEVGTEFHUzogMDAwMTAyODYKWyAgMjY2LjQ5MjI4NV0gUkFYOiBmZmZmODgwMDhmNmIzYmEw IFJCWDogZmZmZjg4MDA4ZjZiM2I5MCBSQ1g6IGZmZmY4ODAwYTNiM2NmMzAKWyAgMjY2LjQ5MjMz MV0gUkRYOiBmZmZmODgwMDkxNGIzYzk4IFJTSTogMDAwMDAwMDAwMDAwMDAwMSBSREk6IGZmZmY4 ODAwOTE0YjNjOTgKWyAgMjY2LjQ5MjM3Nl0gUkJQOiBmZmZmODgwMDkxNmZiYmE4IFIwODogMDAw MDAwMDAwMDAwMDAwMiBSMDk6IDAwMDAwMDAwMDAwMDAwMDAKWyAgMjY2LjQ5MjQyMV0gUjEwOiAw MDAwMDAwMDAwMDAwMDA4IFIxMTogMDAwMDAwMDAwMDAwMDAwMSBSMTI6IGZmZmY4ODAwOGY2ODYw NjgKWyAgMjY2LjQ5MjQ2NV0gUjEzOiBmZmZmODgwMDkxNGIzYzk4IFIxNDogZmZmZjg4MDA4ZjZi M2I5MCBSMTU6IGZmZmY4ODAwOGY2ODYwMDAKWyAgMjY2LjQ5MjUxM10gRlM6ICAwMDAwN2ZiODk2 NmY2NzAwKDAwMDApIEdTOmZmZmY4ODAxMWVkODAwMDAoMDAwMCkKa25sR1M6MDAwMDAwMDAwMDAw MDAwMApbICAyNjYuNDkyNTY2XSBDUzogIDAwMTAgRFM6IDAwMDAgRVM6IDAwMDAgQ1IwOiAwMDAw MDAwMDgwMDUwMDMzClsgIDI2Ni40OTI2MDFdIENSMjogMDAwMDdmNTBmYTE5MDc3MCBDUjM6IDAw MDAwMDAwMDFiMzEwMDAgQ1I0OiAwMDAwMDAwMDAwMDQwN2UwClsgIDI2Ni40OTI2NTJdIFN0YWNr OgpbICAyNjYuNDkyNjY1XSAgMDAwMDAwMDAwMDAwMDAwMCBmZmZmODgwMDhmNjg2MDc4IGZmZmY4 ODAwOTE2ZmJiYTgKZmZmZjg4MDA4ZjY4NjAwMApbICAyNjYuNDkyNzE0XSAgZmZmZjg4MDA5MTZm YmMwOCAwMDAwMDAwMDAwMDAwMDAwIDAwMDAwMDAwMDAwMDAwMDAKZmZmZjg4MDA4ZjY4NjAwMApb ICAyNjYuNDkyNzY0XSAgZmZmZjg4MDA5MTZmYmJmOCBmZmZmZmZmZjgxMTFiYTVkIDAwMDA3ZmI4 ODU5MTgwMDAKZmZmZjg4MDA4ZWRmMzAwMApbICAyNjYuNDkyODE1XSBDYWxsIFRyYWNlOgpbICAy NjYuNDkyODM0XSAgWzxmZmZmZmZmZjgxMTFiYTVkPl0gZnJlZV9wZ3RhYmxlcysweDhlLzB4Y2MK WyAgMjY2LjQ5Mjg3M10gIFs8ZmZmZmZmZmY4MTEyMjUzZT5dIGV4aXRfbW1hcCsweDg0LzB4MTE2 ClsgIDI2Ni40OTI5MDddICBbPGZmZmZmZmZmODEwM2Y3ODk+XSBtbXB1dCsweDUyLzB4ZTkKWyAg MjY2LjQ5Mjk0MF0gIFs8ZmZmZmZmZmY4MTA0MzkxOD5dIGRvX2V4aXQrMHgzY2QvMHg5YzkKWyAg MjY2LjQ5Mjk3NV0gIFs8ZmZmZmZmZmY4MTcwYzFlYz5dID8gX3Jhd19zcGluX3VubG9ja19pcnEr MHgyZC8weDMyClsgIDI2Ni40OTMwMTZdICBbPGZmZmZmZmZmODEwNDRkN2Y+XSBkb19ncm91cF9l eGl0KzB4NGMvMHhjOQpbICAyNjYuNDkzMDUxXSAgWzxmZmZmZmZmZjgxMDRlYjg3Pl0gZ2V0X3Np Z25hbCsweDU4Zi8weDViYwpbICAyNjYuNDkzMDkwXSAgWzxmZmZmZmZmZjgxMDAyMmM0Pl0gZG9f c2lnbmFsKzB4MjgvMHg1YjEKWyAgMjY2LjQ5MzEyM10gIFs8ZmZmZmZmZmY4MTcwY2EwYz5dID8g c3lzcmV0X3NpZ25hbCsweDUvMHg0MwpbICAyNjYuNDkzMTYyXSAgWzxmZmZmZmZmZjgxMDAyODgy Pl0gZG9fbm90aWZ5X3Jlc3VtZSsweDM1LzB4NjgKWyAgMjY2LjQ5MzIwMF0gIFs8ZmZmZmZmZmY4 MTcwY2M3Zj5dIGludF9zaWduYWwrMHgxMi8weDE3ClsgIDI2Ni40OTMyMzVdIENvZGU6IGU4IDAz IGI3IGY0IGZmIDQ5IDhiIDQ3IDc4IDRjIDhiIDIwIDQ4IDhkIDU4IGYwIDQ5IDgzCmVjIDEwIDQ4 IDhkIDQzIDEwIDQ4IDM5IDQ1IGM4IDc0IDU1IDQ4IDhiIDdiIDA4IDgzIGJmIDhjIDAwIDAwIDAw IDAwIDc0IDAyCjwwZj4gMGIgZTggYTQgZmQgZmYgZmYgNDggOGIgNDMgMTggNDggOGIgNTMgMTAg NDggODkgZGYgNDggODkgNDIKWyAgMjY2LjQ5MzQwNF0gUklQICBbPGZmZmZmZmZmODExMjY2MzA+ XSB1bmxpbmtfYW5vbl92bWFzKzB4MTAyLzB4MTU5ClsgIDI2Ni40OTM0NDddICBSU1AgPGZmZmY4 ODAwOTE2ZmJiNjg+ClsgIDI2Ni41MDg4NzddIC0tLVsgZW5kIHRyYWNlIDAyZDI4ZmU5YjNkZTJl MWEgXS0tLQpbICAyNjYuNTA4ODgwXSBGaXhpbmcgcmVjdXJzaXZlIGZhdWx0IGJ1dCByZWJvb3Qg aXMgbmVlZGVkIQoKCmdpdCBiaXNlY3QgbG9nOgoKZ2l0IGJpc2VjdCBzdGFydAojIGJhZDogW2E0 YWQ4OWE0Njg4MmI5MWI3ZGY5Y2ZiODNkZDIxYzA2YjgwNjVjMzBdIGZpeCB0byBxY29tIGV0aCBj dHJsCmdpdCBiaXNlY3QgYmFkIGE0YWQ4OWE0Njg4MmI5MWI3ZGY5Y2ZiODNkZDIxYzA2YjgwNjVj MzAKIyBnb29kOiBbYjE5NDBjZDIxYzBmNGFiZGNlMTAxMjUzZTg2MGZlZmY1NDcyOTFiMF0gTGlu dXggMy4xOS1yYzMKZ2l0IGJpc2VjdCBnb29kIGIxOTQwY2QyMWMwZjRhYmRjZTEwMTI1M2U4NjBm ZWZmNTQ3MjkxYjAKIyBiYWQ6IFswM2M3NTFhNWUxMGNhYWZiYjZkMWFmY2FmMWVhNjdmMjE1M2Mz MTkzXSBNZXJnZSBicmFuY2ggJ2Zvci1saW51cycKb2YgZ2l0Oi8vZ2l0Lmtlcm5lbC5vcmcvcHVi L3NjbS9saW51eC9rZXJuZWwvZ2l0L21hc29uL2xpbnV4LWJ0cmZzCmdpdCBiaXNlY3QgYmFkIDAz Yzc1MWE1ZTEwY2FhZmJiNmQxYWZjYWYxZWE2N2YyMTUzYzMxOTMKIyBiYWQ6IFs1M2RjMjBiOWEz ZDkyOGIwNzQ0ZGFkNWFlZTY1YjYxMGRlMWNjODVkXSBvY2ZzMjogZml4IHRoZSB3cm9uZwpkaXJl Y3RvcnkgcGFzc2VkIHRvIG9jZnMyX2xvb2t1cF9pbm9fZnJvbV9uYW1lKCkgd2hlbiBsaW5rIGZp bGUKZ2l0IGJpc2VjdCBiYWQgNTNkYzIwYjlhM2Q5MjhiMDc0NGRhZDVhZWU2NWI2MTBkZTFjYzg1 ZAojIGdvb2Q6IFtlODgyOWYwMDdlOTgyYTlhOGZiNDAyMzEwOTIzM2Q1ZjM0NGQ0NjU3XSBiYXRt YW4tYWR2OiBmaXggY291bnRlcgpmb3IgbXVsdGljYXN0IHN1cHBvcnRpbmcgbm9kZXMKZ2l0IGJp c2VjdCBnb29kIGU4ODI5ZjAwN2U5ODJhOWE4ZmI0MDIzMTA5MjMzZDVmMzQ0ZDQ2NTcKIyBnb29k OiBbMmFiYWQ3OWFmYTcwMGU4MzdjYjRmZWVkMTcwMTQxMjkyZTA3MjBjMF0gcWxhM3h4eDogZG9u J3QgYWxsb3cKbmV2ZXIgZW5kIGJ1c3kgbG9vcApnaXQgYmlzZWN0IGdvb2QgMmFiYWQ3OWFmYTcw MGU4MzdjYjRmZWVkMTcwMTQxMjkyZTA3MjBjMAojIGdvb2Q6IFswYWRjMTgwMzg4MGRiNzI4ZmE3 ZjhjYmFkNWIyMTRhYjY1N2U1ZTBkXSBNZXJnZSB0YWcgJ2Zvci1saW51cy0zJwpvZiBnaXQ6Ly9n aXQuY29kZS5zZi5uZXQvcC9vcGVuaXBtaS9saW51eC1pcG1pCmdpdCBiaXNlY3QgZ29vZCAwYWRj MTgwMzg4MGRiNzI4ZmE3ZjhjYmFkNWIyMTRhYjY1N2U1ZTBkCiMgZ29vZDogWzMyNDVkNmFjYWI5 ODFhMjM4OGZmYjg3N2M3ZWNjOTdlNzYzYzU5ZDRdIGV4aXQ6IGZpeCByYWNlIGJldHdlZW4Kd2Fp dF9jb25zaWRlcl90YXNrKCkgYW5kIHdhaXRfdGFza196b21iaWUoKQpnaXQgYmlzZWN0IGdvb2Qg MzI0NWQ2YWNhYjk4MWEyMzg4ZmZiODc3YzdlY2M5N2U3NjNjNTlkNAojIGJhZDogWzJkNmQ3Zjk4 Mjg0NjQ4YzVlZDExM2ZlMjJhMTMyMTQ4OTUwYjE0MGZdIG1tOiBwcm90ZWN0CnNldF9wYWdlX2Rp cnR5KCkgZnJvbSBvbmdvaW5nIHRydW5jYXRpb24KZ2l0IGJpc2VjdCBiYWQgMmQ2ZDdmOTgyODQ2 NDhjNWVkMTEzZmUyMmExMzIxNDg5NTBiMTQwZgojIGJhZDogWzdhM2VmMjA4ZTY2MmY0YjYzZDQz YTIzZjYxYTY0YTEyOWM1MjViYmNdIG1tOiBwcmV2ZW50IGVuZGxlc3MgZ3Jvd3RoCm9mIGFub25f dm1hIGhpZXJhcmNoeQpnaXQgYmlzZWN0IGJhZCA3YTNlZjIwOGU2NjJmNGI2M2Q0M2EyM2Y2MWE2 NGExMjljNTI1YmJjCiMgZmlyc3QgYmFkIGNvbW1pdDogWzdhM2VmMjA4ZTY2MmY0YjYzZDQzYTIz ZjYxYTY0YTEyOWM1MjViYmNdIG1tOiBwcmV2ZW50CmVuZGxlc3MgZ3Jvd3RoIG9mIGFub25fdm1h IGhpZXJhcmNoeQoKSSBzYXcgdGhpcyBjb21taXQgaXMgbWFya2VkIHN0YWJsZSwgc28gdGhpcyBp cyBxdWl0ZSBwcm9ibGVtYXRpYyBhcyBpdCBjb3VsZApicmVhayB1c2Vyc3BhY2UgSFNBIGFwcHMg b3V0IHRoZXJlIChhbHRob3VnaCBhbWRrZmQgd2lsbCBvbmx5IGJlIHByZXNlbnQKZnJvbSAzLjE5 KS4KCkNvdWxkIHlvdSBwbGVhc2UgdGFrZSBhIGxvb2sgYW5kIGhlbHAgc29sdmUgdGhpcyBpc3N1 ZSA/CgpUaGFua3MsCgoJT2RlZApfX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fX19fXwpkcmktZGV2ZWwgbWFpbGluZyBsaXN0CmRyaS1kZXZlbEBsaXN0cy5mcmVlZGVz a3RvcC5vcmcKaHR0cDovL2xpc3RzLmZyZWVkZXNrdG9wLm9yZy9tYWlsbWFuL2xpc3RpbmZvL2Ry aS1kZXZlbAo= From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-qg0-f43.google.com (mail-qg0-f43.google.com [209.85.192.43]) by kanga.kvack.org (Postfix) with ESMTP id 097376B0075 for ; Sun, 11 Jan 2015 03:58:50 -0500 (EST) Received: by mail-qg0-f43.google.com with SMTP id z107so14388984qgd.2 for ; Sun, 11 Jan 2015 00:58:49 -0800 (PST) Received: from na01-bl2-obe.outbound.protection.outlook.com (mail-bl2on0118.outbound.protection.outlook.com. [65.55.169.118]) by mx.google.com with ESMTPS id z110si18511920qgd.73.2015.01.11.00.58.48 for (version=TLSv1.2 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Sun, 11 Jan 2015 00:58:48 -0800 (PST) Message-ID: <54B23B37.5080306@amd.com> Date: Sun, 11 Jan 2015 10:58:31 +0200 From: Oded Gabbay MIME-Version: 1.0 Subject: [BUG] 3.19-rc3+ - mm: prevent endless growth of anon_vma hierarchy Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit Sender: owner-linux-mm@kvack.org List-ID: To: koct9i@gmail.com, mhocko@suse.cz, riel@redhat.com, akpm@linux-foundation.org, torvalds@linux-foundation.org Cc: dan.forrest@ssec.wisc.edu, jmarchan@redhat.com, "Bridgman, John" , "Elifaz, Dana" , Dave Airlie , "Deucher, Alexander" , dri-devel , "linux-kernel@vger.kernel.org" , linux-mm@kvack.org Hello, Today I took latest branch from Linus repo to check for amdkfd regressions due to my fixes pulled for 3.19-rc4, and I got a kernel bug (see below dmesg output). I did a bisect and the first bad commit is: 7a3ef208e662f4b63d43a23f61a64a129c525bbc is the first bad commit commit 7a3ef208e662f4b63d43a23f61a64a129c525bbc Author: Konstantin Khlebnikov Date: Thu Jan 8 14:32:15 2015 -0800 mm: prevent endless growth of anon_vma hierarchy The bug is before the pull of latest amdkfd fixes, so it is not related to my pull request. From the bisect log (end of email), you can see 3.19-rc3 is fine. The problem occurred while running java over HSA. The Kernel is 64-bit and userspace processes are 64-bit as well. CPU is AMD Kaveri (A10-7850). OpenCL/OpenMP over HSA run without problems. dmesg output: [ 266.491864] ------------[ cut here ]------------ [ 266.491904] kernel BUG at mm/rmap.c:399! [ 266.491934] invalid opcode: 0000 [#1] SMP [ 266.491962] Modules linked in: amdkfd amd_iommu_v2 radeon cfbfillrect cfbimgblt cfbcopyarea drm_kms_helper ttm fuse [ 266.492043] CPU: 3 PID: 5155 Comm: java Not tainted 3.19.0-rc3-kfd+ #24 [ 266.492087] Hardware name: AMD BALLINA/Ballina, BIOS WBL3B20N_Weekly_13_11_2 11/20/2013 [ 266.492141] task: ffff8800a3b3c840 ti: ffff8800916f8000 task.ti: ffff8800916f8000 [ 266.492191] RIP: 0010:[] [] unlink_anon_vmas+0x102/0x159 [ 266.492249] RSP: 0018:ffff8800916fbb68 EFLAGS: 00010286 [ 266.492285] RAX: ffff88008f6b3ba0 RBX: ffff88008f6b3b90 RCX: ffff8800a3b3cf30 [ 266.492331] RDX: ffff8800914b3c98 RSI: 0000000000000001 RDI: ffff8800914b3c98 [ 266.492376] RBP: ffff8800916fbba8 R08: 0000000000000002 R09: 0000000000000000 [ 266.492421] R10: 0000000000000008 R11: 0000000000000001 R12: ffff88008f686068 [ 266.492465] R13: ffff8800914b3c98 R14: ffff88008f6b3b90 R15: ffff88008f686000 [ 266.492513] FS: 00007fb8966f6700(0000) GS:ffff88011ed80000(0000) knlGS:0000000000000000 [ 266.492566] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 266.492601] CR2: 00007f50fa190770 CR3: 0000000001b31000 CR4: 00000000000407e0 [ 266.492652] Stack: [ 266.492665] 0000000000000000 ffff88008f686078 ffff8800916fbba8 ffff88008f686000 [ 266.492714] ffff8800916fbc08 0000000000000000 0000000000000000 ffff88008f686000 [ 266.492764] ffff8800916fbbf8 ffffffff8111ba5d 00007fb885918000 ffff88008edf3000 [ 266.492815] Call Trace: [ 266.492834] [] free_pgtables+0x8e/0xcc [ 266.492873] [] exit_mmap+0x84/0x116 [ 266.492907] [] mmput+0x52/0xe9 [ 266.492940] [] do_exit+0x3cd/0x9c9 [ 266.492975] [] ? _raw_spin_unlock_irq+0x2d/0x32 [ 266.493016] [] do_group_exit+0x4c/0xc9 [ 266.493051] [] get_signal+0x58f/0x5bc [ 266.493090] [] do_signal+0x28/0x5b1 [ 266.493123] [] ? sysret_signal+0x5/0x43 [ 266.493162] [] do_notify_resume+0x35/0x68 [ 266.493200] [] int_signal+0x12/0x17 [ 266.493235] Code: e8 03 b7 f4 ff 49 8b 47 78 4c 8b 20 48 8d 58 f0 49 83 ec 10 48 8d 43 10 48 39 45 c8 74 55 48 8b 7b 08 83 bf 8c 00 00 00 00 74 02 <0f> 0b e8 a4 fd ff ff 48 8b 43 18 48 8b 53 10 48 89 df 48 89 42 [ 266.493404] RIP [] unlink_anon_vmas+0x102/0x159 [ 266.493447] RSP [ 266.508877] ---[ end trace 02d28fe9b3de2e1a ]--- [ 266.508880] Fixing recursive fault but reboot is needed! git bisect log: git bisect start # bad: [a4ad89a46882b91b7df9cfb83dd21c06b8065c30] fix to qcom eth ctrl git bisect bad a4ad89a46882b91b7df9cfb83dd21c06b8065c30 # good: [b1940cd21c0f4abdce101253e860feff547291b0] Linux 3.19-rc3 git bisect good b1940cd21c0f4abdce101253e860feff547291b0 # bad: [03c751a5e10caafbb6d1afcaf1ea67f2153c3193] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs git bisect bad 03c751a5e10caafbb6d1afcaf1ea67f2153c3193 # bad: [53dc20b9a3d928b0744dad5aee65b610de1cc85d] ocfs2: fix the wrong directory passed to ocfs2_lookup_ino_from_name() when link file git bisect bad 53dc20b9a3d928b0744dad5aee65b610de1cc85d # good: [e8829f007e982a9a8fb4023109233d5f344d4657] batman-adv: fix counter for multicast supporting nodes git bisect good e8829f007e982a9a8fb4023109233d5f344d4657 # good: [2abad79afa700e837cb4feed170141292e0720c0] qla3xxx: don't allow never end busy loop git bisect good 2abad79afa700e837cb4feed170141292e0720c0 # good: [0adc1803880db728fa7f8cbad5b214ab657e5e0d] Merge tag 'for-linus-3' of git://git.code.sf.net/p/openipmi/linux-ipmi git bisect good 0adc1803880db728fa7f8cbad5b214ab657e5e0d # good: [3245d6acab981a2388ffb877c7ecc97e763c59d4] exit: fix race between wait_consider_task() and wait_task_zombie() git bisect good 3245d6acab981a2388ffb877c7ecc97e763c59d4 # bad: [2d6d7f98284648c5ed113fe22a132148950b140f] mm: protect set_page_dirty() from ongoing truncation git bisect bad 2d6d7f98284648c5ed113fe22a132148950b140f # bad: [7a3ef208e662f4b63d43a23f61a64a129c525bbc] mm: prevent endless growth of anon_vma hierarchy git bisect bad 7a3ef208e662f4b63d43a23f61a64a129c525bbc # first bad commit: [7a3ef208e662f4b63d43a23f61a64a129c525bbc] mm: prevent endless growth of anon_vma hierarchy I saw this commit is marked stable, so this is quite problematic as it could break userspace HSA apps out there (although amdkfd will only be present from 3.19). Could you please take a look and help solve this issue ? Thanks, Oded -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751175AbbAKI6v (ORCPT ); Sun, 11 Jan 2015 03:58:51 -0500 Received: from mail-bn1bon0144.outbound.protection.outlook.com ([157.56.111.144]:18414 "EHLO na01-bn1-obe.outbound.protection.outlook.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1750856AbbAKI6t (ORCPT ); Sun, 11 Jan 2015 03:58:49 -0500 X-WSS-ID: 0NI08XT-08-J93-02 X-M-MSG: Message-ID: <54B23B37.5080306@amd.com> Date: Sun, 11 Jan 2015 10:58:31 +0200 From: Oded Gabbay Organization: AMD User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.3.0 MIME-Version: 1.0 To: , , , , CC: , , , "Bridgman, John" , "Elifaz, Dana" , Dave Airlie , "Deucher, Alexander" , dri-devel , "linux-kernel@vger.kernel.org" , Subject: [BUG] 3.19-rc3+ - mm: prevent endless growth of anon_vma hierarchy Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit X-Originating-IP: [10.224.11.83] X-EOPAttributedMessage: 0 Authentication-Results: spf=none (sender IP is 165.204.84.222) smtp.mailfrom=Oded.Gabbay@amd.com; X-Forefront-Antispam-Report: CIP:165.204.84.222;CTRY:US;IPV:NLI;EFV:NLI;SFV:NSPM;SFS:(10019020)(979002)(6009001)(428002)(164054003)(199003)(189002)(479174004)(33656002)(106466001)(2201001)(36756003)(229853001)(101416001)(87936001)(92566002)(47776003)(77096005)(68736005)(54356999)(87266999)(50986999)(50466002)(64706001)(23676002)(65956001)(83506001)(59896002)(65806001)(65816999)(77156002)(19580395003)(86362001)(19580405001)(105586002)(575784001)(80316001)(46102003)(64126003)(97736003)(62966003)(969003)(989001)(999001)(1009001)(1019001);DIR:OUT;SFP:1102;SCL:1;SRVR:CY1PR0201MB1003;H:atltwp02.amd.com;FPR:;SPF:None;MLV:ovrnspm;PTR:InfoDomainNonexistent;A:1;MX:1;LANG:en; X-DmarcAction-Test: None X-Microsoft-Antispam: UriScan:; X-Microsoft-Antispam: BCL:0;PCL:0;RULEID:(3005003);SRVR:CY1PR0201MB1003; X-Exchange-Antispam-Report-Test: UriScan:; X-Exchange-Antispam-Report-CFA-Test: BCL:0;PCL:0;RULEID:(601004);SRVR:CY1PR0201MB1003; X-Forefront-PRVS: 045315E1EE X-Exchange-Antispam-Report-CFA-Test: BCL:0;PCL:0;RULEID:;SRVR:CY1PR0201MB1003; X-OriginatorOrg: amd4.onmicrosoft.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 11 Jan 2015 08:58:44.7858 (UTC) X-MS-Exchange-CrossTenant-Id: fde4dada-be84-483f-92cc-e026cbee8e96 X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=fde4dada-be84-483f-92cc-e026cbee8e96;Ip=[165.204.84.222] X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: CY1PR0201MB1003 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hello, Today I took latest branch from Linus repo to check for amdkfd regressions due to my fixes pulled for 3.19-rc4, and I got a kernel bug (see below dmesg output). I did a bisect and the first bad commit is: 7a3ef208e662f4b63d43a23f61a64a129c525bbc is the first bad commit commit 7a3ef208e662f4b63d43a23f61a64a129c525bbc Author: Konstantin Khlebnikov Date: Thu Jan 8 14:32:15 2015 -0800 mm: prevent endless growth of anon_vma hierarchy The bug is before the pull of latest amdkfd fixes, so it is not related to my pull request. From the bisect log (end of email), you can see 3.19-rc3 is fine. The problem occurred while running java over HSA. The Kernel is 64-bit and userspace processes are 64-bit as well. CPU is AMD Kaveri (A10-7850). OpenCL/OpenMP over HSA run without problems. dmesg output: [ 266.491864] ------------[ cut here ]------------ [ 266.491904] kernel BUG at mm/rmap.c:399! [ 266.491934] invalid opcode: 0000 [#1] SMP [ 266.491962] Modules linked in: amdkfd amd_iommu_v2 radeon cfbfillrect cfbimgblt cfbcopyarea drm_kms_helper ttm fuse [ 266.492043] CPU: 3 PID: 5155 Comm: java Not tainted 3.19.0-rc3-kfd+ #24 [ 266.492087] Hardware name: AMD BALLINA/Ballina, BIOS WBL3B20N_Weekly_13_11_2 11/20/2013 [ 266.492141] task: ffff8800a3b3c840 ti: ffff8800916f8000 task.ti: ffff8800916f8000 [ 266.492191] RIP: 0010:[] [] unlink_anon_vmas+0x102/0x159 [ 266.492249] RSP: 0018:ffff8800916fbb68 EFLAGS: 00010286 [ 266.492285] RAX: ffff88008f6b3ba0 RBX: ffff88008f6b3b90 RCX: ffff8800a3b3cf30 [ 266.492331] RDX: ffff8800914b3c98 RSI: 0000000000000001 RDI: ffff8800914b3c98 [ 266.492376] RBP: ffff8800916fbba8 R08: 0000000000000002 R09: 0000000000000000 [ 266.492421] R10: 0000000000000008 R11: 0000000000000001 R12: ffff88008f686068 [ 266.492465] R13: ffff8800914b3c98 R14: ffff88008f6b3b90 R15: ffff88008f686000 [ 266.492513] FS: 00007fb8966f6700(0000) GS:ffff88011ed80000(0000) knlGS:0000000000000000 [ 266.492566] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 266.492601] CR2: 00007f50fa190770 CR3: 0000000001b31000 CR4: 00000000000407e0 [ 266.492652] Stack: [ 266.492665] 0000000000000000 ffff88008f686078 ffff8800916fbba8 ffff88008f686000 [ 266.492714] ffff8800916fbc08 0000000000000000 0000000000000000 ffff88008f686000 [ 266.492764] ffff8800916fbbf8 ffffffff8111ba5d 00007fb885918000 ffff88008edf3000 [ 266.492815] Call Trace: [ 266.492834] [] free_pgtables+0x8e/0xcc [ 266.492873] [] exit_mmap+0x84/0x116 [ 266.492907] [] mmput+0x52/0xe9 [ 266.492940] [] do_exit+0x3cd/0x9c9 [ 266.492975] [] ? _raw_spin_unlock_irq+0x2d/0x32 [ 266.493016] [] do_group_exit+0x4c/0xc9 [ 266.493051] [] get_signal+0x58f/0x5bc [ 266.493090] [] do_signal+0x28/0x5b1 [ 266.493123] [] ? sysret_signal+0x5/0x43 [ 266.493162] [] do_notify_resume+0x35/0x68 [ 266.493200] [] int_signal+0x12/0x17 [ 266.493235] Code: e8 03 b7 f4 ff 49 8b 47 78 4c 8b 20 48 8d 58 f0 49 83 ec 10 48 8d 43 10 48 39 45 c8 74 55 48 8b 7b 08 83 bf 8c 00 00 00 00 74 02 <0f> 0b e8 a4 fd ff ff 48 8b 43 18 48 8b 53 10 48 89 df 48 89 42 [ 266.493404] RIP [] unlink_anon_vmas+0x102/0x159 [ 266.493447] RSP [ 266.508877] ---[ end trace 02d28fe9b3de2e1a ]--- [ 266.508880] Fixing recursive fault but reboot is needed! git bisect log: git bisect start # bad: [a4ad89a46882b91b7df9cfb83dd21c06b8065c30] fix to qcom eth ctrl git bisect bad a4ad89a46882b91b7df9cfb83dd21c06b8065c30 # good: [b1940cd21c0f4abdce101253e860feff547291b0] Linux 3.19-rc3 git bisect good b1940cd21c0f4abdce101253e860feff547291b0 # bad: [03c751a5e10caafbb6d1afcaf1ea67f2153c3193] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs git bisect bad 03c751a5e10caafbb6d1afcaf1ea67f2153c3193 # bad: [53dc20b9a3d928b0744dad5aee65b610de1cc85d] ocfs2: fix the wrong directory passed to ocfs2_lookup_ino_from_name() when link file git bisect bad 53dc20b9a3d928b0744dad5aee65b610de1cc85d # good: [e8829f007e982a9a8fb4023109233d5f344d4657] batman-adv: fix counter for multicast supporting nodes git bisect good e8829f007e982a9a8fb4023109233d5f344d4657 # good: [2abad79afa700e837cb4feed170141292e0720c0] qla3xxx: don't allow never end busy loop git bisect good 2abad79afa700e837cb4feed170141292e0720c0 # good: [0adc1803880db728fa7f8cbad5b214ab657e5e0d] Merge tag 'for-linus-3' of git://git.code.sf.net/p/openipmi/linux-ipmi git bisect good 0adc1803880db728fa7f8cbad5b214ab657e5e0d # good: [3245d6acab981a2388ffb877c7ecc97e763c59d4] exit: fix race between wait_consider_task() and wait_task_zombie() git bisect good 3245d6acab981a2388ffb877c7ecc97e763c59d4 # bad: [2d6d7f98284648c5ed113fe22a132148950b140f] mm: protect set_page_dirty() from ongoing truncation git bisect bad 2d6d7f98284648c5ed113fe22a132148950b140f # bad: [7a3ef208e662f4b63d43a23f61a64a129c525bbc] mm: prevent endless growth of anon_vma hierarchy git bisect bad 7a3ef208e662f4b63d43a23f61a64a129c525bbc # first bad commit: [7a3ef208e662f4b63d43a23f61a64a129c525bbc] mm: prevent endless growth of anon_vma hierarchy I saw this commit is marked stable, so this is quite problematic as it could break userspace HSA apps out there (although amdkfd will only be present from 3.19). Could you please take a look and help solve this issue ? Thanks, Oded