From mboxrd@z Thu Jan 1 00:00:00 1970 From: Bojan Smojver Subject: Re: Memory corruption on hibernate/thaw with KMS Date: Mon, 10 Oct 2011 21:37:56 +1100 Message-ID: <1318243076.1899.1.camel@shrek.rexursive.com> References: <1317103928.1981.5.camel@shrek.rexursive.com> <1318230907.2019.4.camel@shrek.rexursive.com> <20111010075306.GC3021@phenom.ffwll.local> <1318234738.2010.8.camel@shrek.rexursive.com> <1318241119.1901.1.camel@shrek.rexursive.com> <1318241565.2051.1.camel@shrek.rexursive.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Received: from beauty.rexursive.com (beauty.rexursive.com [150.101.121.179]) by gabe.freedesktop.org (Postfix) with ESMTP id 16C019ED69 for ; Mon, 10 Oct 2011 03:37:57 -0700 (PDT) In-Reply-To: <1318241565.2051.1.camel@shrek.rexursive.com> List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: intel-gfx-bounces+gcfxdi-intel-gfx=m.gmane.org@lists.freedesktop.org Errors-To: intel-gfx-bounces+gcfxdi-intel-gfx=m.gmane.org@lists.freedesktop.org To: Daniel Vetter Cc: intel-gfx@lists.freedesktop.org List-Id: intel-gfx@lists.freedesktop.org On Mon, 2011-10-10 at 21:12 +1100, Bojan Smojver wrote: > When I attempted to shut the machine down, it hung and I could see > more > kernel traces on the console. Tried again, but this time I used chvt 2; sleep 1; chvt 1 in the sequence (this seems to agitate the bug faster). Indeed, got problems quickly: --------------------------- [ 175.770300] BUG: unable to handle kernel NULL pointer dereference at 0000000000000010 [ 175.774934] IP: [] prio_tree_replace+0x4b/0x66 [ 175.779296] PGD 1f88d0067 PUD 1f88af067 PMD 0 [ 175.783593] Oops: 0002 [#1] SMP [ 175.788025] CPU 2 [ 175.788055] Modules linked in: fuse ppdev parport_pc lp parport sunrpc bnep bluetooth cpufreq_ondemand acpi_cpufreq freq_table mperf arc4 iwlagn snd_hda_codec_hdmi mac80211 snd_hda_codec_conexant uvcvideo snd_hda_intel snd_hda_codec videodev snd_hwdep media snd_seq qcserial v4l2_compat_ioctl32 usb_wwan snd_seq_device snd_pcm cfg80211 thinkpad_acpi snd_timer e1000e intel_ips iTCO_wdt iTCO_vendor_support joydev mxm_wmi snd_page_alloc snd i2c_i801 wmi rfkill pcspkr microcode soundcore ipv6 firewire_ohci sdhci_pci sdhci mmc_core firewire_core crc_itu_t i915 drm_kms_helper drm i2c_algo_bit i2c_core video [last unloaded: scsi_wait_scan] [ 175.810238] [ 175.814616] Pid: 3763, comm: gcm-apply Not tainted 3.1.0-rc9+ #105 LENOVO 4313CTO/4313CTO [ 175.819031] RIP: 0010:[] [] prio_tree_replace+0x4b/0x66 [ 175.823327] RSP: 0018:ffff8801f8bbfce8 EFLAGS: 00010207 [ 175.827099] RAX: ffff880229b84100 RBX: ffff8801f8bfb100 RCX: 0000000000000000 [ 175.829260] RDX: ffff880229b84050 RSI: ffff880229b84100 RDI: ffff88022c12b318 [ 175.831374] RBP: ffff8801f8bbfce8 R08: ffff880229b84100 R09: 0000000000000000 [ 175.833440] R10: ffff8801f8bfbd48 R11: ffff8801f8bfbd10 R12: ffff880229b84050 [ 175.835388] R13: ffff88022c12b318 R14: 0000000000000080 R15: 0000000000000000 [ 175.837373] FS: 0000000000000000(0000) GS:ffff88023bd00000(0000) knlGS:0000000000000000 [ 175.839407] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 175.841446] CR2: 0000000000000010 CR3: 00000002085a1000 CR4: 00000000000006e0 [ 175.843471] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 175.845556] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [ 175.847616] Process gcm-apply (pid: 3763, threadinfo ffff8801f8bbe000, task ffff8801a864ae40) [ 175.849738] Stack: [ 175.851809] ffff8801f8bbfd48 ffffffff8124327a ffff880229b84100 000000000000003d [ 175.853961] 000000000000003f 0000000000000000 000000000000003d ffff8801f8bfb0b0 [ 175.856109] ffff8801f8bfb100 ffff8801f8bfbd50 ffff88022c12b2f8 ffff8801f8bfbd48 [ 175.858296] Call Trace: [ 175.860435] [] prio_tree_insert+0x16b/0x216 [ 175.862599] [] vma_prio_tree_insert+0x26/0x3c [ 175.864759] [] __vma_link_file+0x64/0x66 [ 175.866899] [] vma_link+0x75/0x95 [ 175.869014] [] mmap_region+0x30a/0x46b [ 175.871114] [] do_mmap_pgoff+0x299/0x2f3 [ 175.873205] [] sys_mmap_pgoff+0x115/0x164 [ 175.875334] [] sys_mmap+0x22/0x24 [ 175.877409] [] system_call_fastpath+0x16/0x1b [ 175.879523] Code: 0f 0b 48 89 17 eb 16 48 89 4a 10 48 8b 4e 10 48 39 31 75 05 48 89 11 eb 04 48 89 51 08 48 8b 08 48 39 c1 74 0a 48 89 0a 48 8b 08 [ 175.879735] 89 51 10 48 8b 48 08 48 39 c1 74 0c 48 89 4a 08 48 8b 48 08 [ 175.884169] RIP [] prio_tree_replace+0x4b/0x66 [ 175.886469] RSP [ 175.888702] CR2: 0000000000000010 [ 176.196577] ---[ end trace 75df9d9a11de8acd ]--- [ 178.928408] PM: Marking nosave pages: 000000000009e000 - 0000000000100000 [ 178.928418] PM: Marking nosave pages: 00000000bb27c000 - 00000000bb282000 [ 178.928422] PM: Marking nosave pages: 00000000bb35f000 - 00000000bb40f000 [ 178.928430] PM: Marking nosave pages: 00000000bb46f000 - 00000000bb70f000 [ 178.928446] PM: Marking nosave pages: 00000000bb717000 - 00000000bb71f000 [ 178.928451] PM: Marking nosave pages: 00000000bb76c000 - 00000000bb7ff000 [ 178.928457] PM: Marking nosave pages: 00000000bb800000 - 0000000100000000 [ 178.930551] PM: Marking nosave pages: 00000001fc000000 - 0000000200000000 [ 178.930855] PM: Basic memory bitmaps created [ 178.930859] PM: Syncing filesystems ... done. [ 179.006362] Freezing user space processes ... [ 198.991114] Freezing of tasks failed after 20.00 seconds (1 tasks refusing to freeze, wq_busy=0): [ 198.991165] gcm-apply D 0000000000000000 0 3763 1 0x00800084 [ 198.991175] ffff8801f8bbf860 0000000000000086 0000000000000000 ffff880100000000 [ 198.991185] ffff8801a864ae40 ffff8801f8bbffd8 ffff8801f8bbffd8 0000000000012d00 [ 198.991194] ffff88022e5d1720 ffff8801a864ae40 ffff8801a864b2c0 0000000100000002 [ 198.991203] Call Trace: [ 198.991213] [] schedule+0x5a/0x5c [ 198.991218] [] rwsem_down_failed_common+0xd3/0x105 [ 198.991222] [] ? _raw_spin_unlock_irqrestore+0x17/0x19 [ 198.991225] [] rwsem_down_read_failed+0x12/0x14 [ 198.991231] [] call_rwsem_down_read_failed+0x14/0x30 [ 198.991235] [] ? down_read+0x21/0x25 [ 198.991240] [] acct_collect+0x4a/0x182 [ 198.991246] [] do_exit+0x21e/0x722 [ 198.991249] [] ? kmsg_dump+0x4b/0xd7 [ 198.991253] [] oops_end+0xbc/0xc5 [ 198.991256] [] no_context+0x203/0x212 [ 198.991259] [] __bad_area_nosemaphore+0x1cb/0x1ec [ 198.991263] [] ? search_module_extables+0x3f/0x69 [ 198.991266] [] bad_area_nosemaphore+0x13/0x15 [ 198.991270] [] do_page_fault+0x1b8/0x37e [ 198.991276] [] ? lookup_page_cgroup+0x28/0x3e [ 198.991282] [] ? dquot_file_open+0x1b/0x3e [ 198.991285] [] page_fault+0x25/0x30 [ 198.991289] [] ? prio_tree_replace+0x4b/0x66 [ 198.991292] [] prio_tree_insert+0x16b/0x216 [ 198.991297] [] vma_prio_tree_insert+0x26/0x3c [ 198.991302] [] __vma_link_file+0x64/0x66 [ 198.991305] [] vma_link+0x75/0x95 [ 198.991309] [] mmap_region+0x30a/0x46b [ 198.991312] [] do_mmap_pgoff+0x299/0x2f3 [ 198.991315] [] sys_mmap_pgoff+0x115/0x164 [ 198.991322] [] sys_mmap+0x22/0x24 [ 198.991326] [] system_call_fastpath+0x16/0x1b [ 198.991330] [ 198.991331] Restarting tasks ... done. [ 198.992966] PM: Basic memory bitmaps freed [ 198.996762] video LNXVIDEO:00: Restoring backlight state [ 204.166989] PM: Marking nosave pages: 000000000009e000 - 0000000000100000 [ 204.166994] PM: Marking nosave pages: 00000000bb27c000 - 00000000bb282000 [ 204.166997] PM: Marking nosave pages: 00000000bb35f000 - 00000000bb40f000 [ 204.167001] PM: Marking nosave pages: 00000000bb46f000 - 00000000bb70f000 [ 204.167014] PM: Marking nosave pages: 00000000bb717000 - 00000000bb71f000 [ 204.167016] PM: Marking nosave pages: 00000000bb76c000 - 00000000bb7ff000 [ 204.167021] PM: Marking nosave pages: 00000000bb800000 - 0000000100000000 [ 204.168816] PM: Marking nosave pages: 00000001fc000000 - 0000000200000000 [ 204.169060] PM: Basic memory bitmaps created [ 204.169061] PM: Syncing filesystems ... done. [ 204.242358] Freezing user space processes ... [ 224.228897] Freezing of tasks failed after 20.00 seconds (3 tasks refusing to freeze, wq_busy=0): [ 224.230914] gnome-settings- D 0000000000000000 0 1716 1553 0x00800084 [ 224.232935] ffff88022a76bcd0 0000000000000086 0000000008100073 ffffea0000000000 [ 224.234996] ffff88022d059720 ffff88022a76bfd8 ffff88022a76bfd8 0000000000012d00 [ 224.237043] ffffffff81a0d020 ffff88022d059720 00000000817c1372 0000000100000001 [ 224.239067] Call Trace: [ 224.241040] [] ? pmd_offset+0x19/0x3f [ 224.243014] [] schedule+0x5a/0x5c [ 224.244981] [] __mutex_lock_common+0x102/0x163 [ 224.246946] [] __mutex_lock_slowpath+0x1b/0x1d [ 224.248883] [] mutex_lock+0x23/0x37 [ 224.250799] [] dup_mm+0x2da/0x488 [ 224.252704] [] copy_process+0x9b1/0x119c [ 224.254592] [] ? security_file_alloc+0x16/0x18 [ 224.256473] [] do_fork+0xef/0x22d [ 224.258329] [] ? _raw_spin_lock+0xe/0x10 [ 224.260184] [] ? path_put+0x1f/0x23 [ 224.262024] [] sys_clone+0x28/0x2a [ 224.263836] [] stub_clone+0x13/0x20 [ 224.265631] [] ? system_call_fastpath+0x16/0x1b [ 224.267423] gnome-settings- D ffff88022ab2c700 0 1721 1553 0x00800084 [ 224.269227] ffff8802286c3890 0000000000000086 0000000000000000 0000000000000000 [ 224.271071] ffff88022a6a2e40 ffff8802286c3fd8 ffff8802286c3fd8 0000000000012d00 [ 224.272912] ffff880219774560 ffff88022a6a2e40 0000000000000000 0000000000000000 [ 224.274740] Call Trace: [ 224.276530] [] schedule+0x5a/0x5c [ 224.278320] [] rwsem_down_failed_common+0xd3/0x105 [ 224.280105] [] rwsem_down_read_failed+0x12/0x14 [ 224.281874] [] call_rwsem_down_read_failed+0x14/0x30 [ 224.283615] [] ? restore_args+0x30/0x30 [ 224.285331] [] ? down_read+0x21/0x25 [ 224.287026] [] ? restore_args+0x30/0x30 [ 224.288707] [] do_page_fault+0x1c5/0x37e [ 224.290389] [] ? __schedule+0x63b/0x669 [ 224.292060] [] ? __remove_hrtimer+0x5c/0x83 [ 224.293718] [] ? schedule+0x5a/0x5c [ 224.295365] [] page_fault+0x25/0x30 [ 224.297025] [] ? do_sys_poll+0x32c/0x389 [ 224.298694] [] ? do_sys_poll+0x317/0x389 [ 224.300340] [] ? poll_freewait+0xaa/0xaa [ 224.301971] [] ? __pollwait+0xcc/0xcc [ 224.303588] [] ? _raw_spin_unlock_irqrestore+0x17/0x19 [ 224.305195] [] ? select_task_rq_fair+0x3cc/0x658 [ 224.306785] [] ? _flat_send_IPI_mask+0x7b/0x84 [ 224.308363] [] ? current_fs_time+0x37/0x3e [ 224.309931] [] ? touch_atime+0xf8/0x113 [ 224.311483] [] ? get_futex_key+0x8e/0x274 [ 224.313019] [] ? futex_wake+0xfe/0x110 [ 224.314539] [] ? fsnotify+0x1eb/0x217 [ 224.316046] [] sys_poll+0x51/0xbb [ 224.317618] [] system_call_fastpath+0x16/0x1b [ 224.319148] gcm-apply D 0000000000000000 0 3763 1 0x00800084 [ 224.320644] ffff8801f8bbf860 0000000000000086 0000000000000000 ffff880100000000 [ 224.322166] ffff8801a864ae40 ffff8801f8bbffd8 ffff8801f8bbffd8 0000000000012d00 [ 224.323680] ffff88022e5d1720 ffff8801a864ae40 ffff8801a864b2c0 0000000100000002 [ 224.325181] Call Trace: [ 224.326640] [] schedule+0x5a/0x5c [ 224.328098] [] rwsem_down_failed_common+0xd3/0x105 [ 224.329552] [] ? _raw_spin_unlock_irqrestore+0x17/0x19 [ 224.330990] [] rwsem_down_read_failed+0x12/0x14 [ 224.332418] [] call_rwsem_down_read_failed+0x14/0x30 [ 224.333831] [] ? down_read+0x21/0x25 [ 224.335243] [] acct_collect+0x4a/0x182 [ 224.336650] [] do_exit+0x21e/0x722 [ 224.338052] [] ? kmsg_dump+0x4b/0xd7 [ 224.339455] [] oops_end+0xbc/0xc5 [ 224.340861] [] no_context+0x203/0x212 [ 224.342356] [] __bad_area_nosemaphore+0x1cb/0x1ec [ 224.342361] [] ? search_module_extables+0x3f/0x69 [ 224.342366] [] bad_area_nosemaphore+0x13/0x15 [ 224.342373] [] do_page_fault+0x1b8/0x37e [ 224.342378] [] ? lookup_page_cgroup+0x28/0x3e [ 224.342382] [] ? dquot_file_open+0x1b/0x3e [ 224.342385] [] page_fault+0x25/0x30 [ 224.342387] [] ? prio_tree_replace+0x4b/0x66 [ 224.342389] [] prio_tree_insert+0x16b/0x216 [ 224.342392] [] vma_prio_tree_insert+0x26/0x3c [ 224.342395] [] __vma_link_file+0x64/0x66 [ 224.342397] [] vma_link+0x75/0x95 [ 224.342399] [] mmap_region+0x30a/0x46b [ 224.342402] [] do_mmap_pgoff+0x299/0x2f3 [ 224.342404] [] sys_mmap_pgoff+0x115/0x164 [ 224.342409] [] sys_mmap+0x22/0x24 [ 224.342411] [] system_call_fastpath+0x16/0x1b [ 224.342414] [ 224.342414] Restarting tasks ... done. [ 224.344753] PM: Basic memory bitmaps freed [ 224.349953] video LNXVIDEO:00: Restoring backlight state --------------------------- So, at first glance, I would say the patch didn't fix it. Let me repeat my tests using nomodeset. -- Bojan